Abstract
Colorectal cancer (CRC) is the third most common malignant tumor. DNA damage plays a crucial role in tumorigenesis, and abnormal DNA repair pathways affect the occurrence and progression of CRC. In the current study, we aimed to construct a DNA repair-related gene (DRG) signature to predict the overall survival (OS) of patients with CRC patients. The differentially expressed DRGs (DE-DRGs) were analyzed using The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. The prognostic gene signature was identified by univariate Cox regression and least absolute shrinkage and selection operator (LASSO)-penalized Cox proportional hazards regression analysis. The predictive ability of the model was evaluated using the Kaplan–Meier curves and time-dependent receiver operating characteristic (ROC) curves. The gene set enrichment analysis (GSEA) was performed to explore the underlying biological processes and signaling pathways. ESTIMATE and CIBERSORT were implemented to estimate the tumor immune score and immune cell infiltration status between the different risk group. The half-maximal inhibitory concentration (IC50) was evaluated to representing the drug response of this signature. Nine DE-DRGs (ESCO2, AXIN2, PLK1, CDC25C, IGF1, TREX2, ALKBH2, ESR1 and MC1R) signatures was constructed to classify patients into high- and low-risk groups. The risk score was an independent prognostic indicator of OS (hazard ratio > 1, P < 0.001). The genetic alteration analysis indicated that the nine DE-DRGs in the signature were changed in 63 required samples (100%), and the major alteration was missense mutation. Function enrichment analysis revealed that the immune response and mtotic sister chromatid segregation were the main biological processes. The high-risk group had higher immune score than the low-risk group. What’s more, low-risk patients were more sensitive to selumetinib and dasatinib. The nine DE-DRGs signature was significantly associated with OS and provided a new insight for the diagnosis and treatment of CRC.
Similar content being viewed by others
Introduction
Colorectal cancer (CRC) is the third most common malignant tumor, and the account for 10% of annual cancer-related deaths worldwide in 20181.The incidence and mortality rates in men are four times higher than those in women2. More than 20% of patients with CRC are diagnosed at a progressive stage because of the inapparent early symptoms, leading to poor prognosis3. The specific mechanism of CRC is still unclear, but there are numerous risk factors identified for CRC occurrence and development, including hereditary CRC, positive family history, inflammatory bowel disease, male sex, smoking, drinking, and red meat intake, which involve complicated shifts at different molecular levels4. Although the diagnosis and treatment of CRC have made great progress, the precise treatment still faces a huge challenge. Therefore, it is essential to find new reliable prognostic biomarkers and molecular alterations to develop more effective treatments for CRC.
Human cells are exposed to multiple sources of DNA damage, including radical oxygen species, diet, ultraviolet radiation, and numerous drugs, resulting in various DNA damage responses (DDRs)5. Previous studies have identified the crucial roles played by DDR in tumorigenesis, premature aging, chronic inflammation, and apoptosis6,7. Specific alterations in the DNA repair pathway can also influence the occurrence and progression of CRCs8. To date, DDR mainly includes single-strand and double-strand repair. Numerous DNA damage repair genes, such as MSH2, MSH6, MLH1, PMS2, CHEK2, and PALB2, have an integral mechanistic effect on DNA repair pathways9. A recent study evaluated a prognostic model based on eight DNA repair-related genes (DRGs) to predict the prognosis of patients with breast cancer; the DRGs showed robust predictive power10. Another study found that ARID1A mutation in CRC is strongly related to the DNA repair pathway and affects the patients’ clinical outcomes11. Some studies have focused on the association of multiple gene signatures with the prognosis of CRC, such as immune genes and metabolism-related genes12,13. However, few studies have combined differential DRGs with the prognosis of CRC to construct a prognostic prediction model. It is highly important to explore the status of DRGs for predicting CRC outcomes.
In our study, we established a different signature based on DRGs to predict the overall survival (OS) of patients with CRC using TCGA colon and rectal cancer cohorts. We then we validated the DRGs signature in the GEO database (GSE87211, GSE103479). The CRC-specific prognostic model provides a novel direction for the diagnosis and treatment of CRC. In addition, we used GSEA and CIBERSORT to explore the underlying biological functions and immune cell infiltration status of DRGs in CRC, which may provide new insights into tumorigenesis and development.
Results
Patients features
The flowchart of this study design was displayed in Fig. 1.
TCGA colon and rectal cohorts contained 530 samples (488 patients with CRC and 42 normal samples) to identify the predictive signature. We excluded samples with incomplete clinical information (age, sex, survival time, survival status and TNM stage), and a total of 396 cancer patients were used to further estimate the independence of the predictive model. The GSE87211 dataset included 363 samples (203 rectal tumors and 160 rectal normal samples), and the GSE103479 dataset consisted of 177 patients with CRC. A total of 380 patients with CRC were acquired from GEO after merging the two datasets. Subsequently, 338 patients with complete clinical information were screened in this study. Detailed basic clinical information of these patients is presented in Table S1.
Identification of prognostic DNA repair-related DEGs in patients from TCGA
A total of 493 of 513 DRGs were found in TCGA CRC cohort (Table S2). According to the screening criteria, 118 DE-DRGs containing 9 downregulated and 109 upregulated genes were identified between 488 CRC and 42 non-cancer tissues using the Wilcoxon signed-rank test (Fig. S1, Table S3). About 36 DRGs were related to survival in the univariate Cox regression analysis (P < 0.05, Fig. S2). Finally, 12 DE-DRGs associated with OS were presented in this study (Fig. 2A–C). The protein–protein interaction network among the 12 genes was draw through STRING and we used cytoHubba to identify that ESR1, PLK1, CDKN2A and CCNB1 were the hub genes (interaction score: 0.40, Fig. 2D). The correlation strength between these genes is shown in Fig. 2E.
Construction of a DRGs prognostic signature in the TCGA cohort
A nine DE-DRGs signature was constructed through the LASSO Cox regression to minimize overfitting and further narrowed the 12 candidate genes (Fig. 3A,B). Four of these nine DRGs were protective (ESCO2, AXIN2, PLK1, CDC25C, Coef < 0), and five were related to high-risk (IGF1, TREX2, ALKBH2, ESR1, MC1R, Coef > 0) (Fig. 2C). The entire names, locations, main pathways, and related coefficients of these genes are listed in Table S4. The risk scores for each patient were defined as the linear combination of the expression levels of the nine DE-DRGs weighted by their related coefficient derived from multivariate Cox regression, as follows: risk score = (0.7945*IGF1) + (0.8003*TREX2) + (− 0.0575 *ESCO2) + (− 0.0095*AXIN2) + (− 0.0299* PLK1) + (0.0990*ALKBH2) + (− 0.0908 *CDC25C) + (0.6670*ESR1) + (0.2066*MC1R). These patients were then divided into high- (n = 223) and low-risk (n = 224) groups based on the median risk scores. Figure 3C, D presents the relationship between the distribution of the rank of risk scores and the patients’ survival status in the training set, which suggested a higher mortality with increasing risk scores. The Kaplan–Meier survival analysis revealed that these patients with high-risk scores correlated with poorer survival rates than those in the low-risk group (Fig. 3E, P < 0.001). The predictive signatures of areas under the curve (AUCs) of the time-independent ROC curve were 0.68 (95% CI: 0.58–0.76) for 1-year, 0.68 (95% CI: 0.60–0.76) 3-year, and 0.78 (95% CI: 0.68–0.86) for 5-year survival, respectively (Fig. 3F).
Independent prognostic analysis of 9 DE-DRGs signature in TCGA
To test whether the nine DE-DRGs model could be an independent predictive indicator of CRC, 396 patients with detailed clinical features including age, sex, and clinical stage were further analyzed. The expression patterns of nine DE-DRGs in the high- and low-risk groups are displayed in the heatmap depicted in Fig. 4A. The signature was significantly correlated with the cancer invasive depth (T) and survival status. These genes (ESCO2, AXIN2, PLK1, and CDC25C) were preferentially expressed in the low-risk group, whereas genes (IGF1, TREX2, ALKBH2, ESR1, and MC1R) were expressed in the high-risk group. The correlation between age, TNM stage, AJCC stage, and the risk score was statistically significant in the training set through the univariate Cox regression analysis (Fig. 4B). The multivariate Cox regression analysis revealed that the nine DE-DRGs model was identified as an independent prognostic factor (HR = 1.450, 95% CI: 1.279 − 1.642, P < 0.001) (Fig. 4C, Table 1). We then classified the patients into subgroups according to age (≤ 65 vs. > 65 years), gender (Female vs. male), T stage (T1–T2 vs. T3–T4), N stage (N0 vs. N1–N2), M stage (M0 vs. M1) and AJCC stage (stage I–II vs. stage III–IV). The stratified results suggested that the high-risk scores closely correlated with poorer survival of patients with CRC, except for T1–T2(Fig. S3A–3L).
External validation of 9 DE-DRGs signature in the GEO
Two GEO datasets were merged into a test group, and then we applied the same analysis methods to validate the stability of the predictive model. We found that the number of deaths increased with the increase in risk scores (Fig. S4A,B). Consistently, the survival rate of high-risk patients was poorer than that of low-risk patients (Fig. S4C). The AUCs of 1-,3-, and 5-year were 0.74 (95% CI: 0.60–0.88), 0.68 (95% CI: 0.59–0.76), and 0.72 (95%CI: 0.66–0.79), respectively (Fig. S4D). The heatmap of the nine DE-DRGs in the different risk groups is shown in Fig. S4E. The univariate and multivariate Cox analysis results revealed that the signature was related to OS in the GEO datasets (HR = 2.049, 95% CI: 1.537 − 2.731, P < 0.001; HR = 1.974, 95% CI: 1.475 − 2.641, P < 0.001, respectively) (Fig. S4F,G, Table 1).
Analysis of genetic alterations and function enrichment in risk model
The nine DE-DRGs in CRC were altered in 63 of 63 patients (100%), and the major alteration was missense mutation (Fig. 5A), and the CNV results showed the heterozygous CNV frequency among these genes (Fig. 5B) were 5–20% based on GSCA. The frequency of copy number deletion of ESCO2 was highest in patients with CRC. Function enrichment analysis was applied to clarify the latent biological mechanisms and pathways involved in the risk score signature. The GSEA indicated that the activation of immune responses and adaptive immune response were the main processes in the high-risk group, and chromosome segregation related processes was focused on the low-risk group (Fig. 5C,D). The nine DE-DRGs were mainly enriched in these pathways, including cell adhesion and cytokine-cytokine receptor interaction in high-risk group, and DNA replication and cell cycle were in low-risk group (Fig. 5E,F).
Correlation between TME and risk model
We found that the immune score, stromal score, and ESTIMATE score were higher in high-risk group (Fig. 6A–C, P < 0.001). The differences of 22 immune cell infiltration levels between the high- and low-risk groups was investigated (Fig. 6D, P < 0.05). The high-risk group had a significant higher abundance of T cells regulatory (Tregs). Macrophages M2 and eosinophils were higher infiltration in the low-risk group. The distribution percentage of 22 immune cells in each patient with CRC was displayed in Fig. 6E.
Correlation between GDSC drug sensitivity and risk model
We explored the drug sensitivity response of patients with CRC to chemotherapy and target therapy based on the nine DE-DRGs. These results suggested that there is correlation between the expression of ALKBH2, ESCO2, and PLK1and multiple drugs sensitivity (Fig. 7A). Furthermore, we observed that three target therapy drugs selumetinib, dasatinib and vorinostat witnessed significant differences in estimated IC50 between high- and low-risk groups (Fig. 7B–D). It suggested that low-risk patients were more sensitive to selumetinib and dasatinib but opposite results in vorinostat.
Discussion
DDR is an endogenous protective mechanism of the human body, which is crucial for maintaining intracellular homeostasis in the face of DNA damage. Dysfunctional DDR may result in genome instability and high gene mutational burden14. In recent years, more research has focused on DNA damage, DDR, and CRC tumorigenesis15,16,17,18,19. Previous data have shown that exposure of human cell DNA to intrinsic and extrinsic insults, such as oxidative stress, radiation, or chemical exposure, can directly or indirectly impact genomic instability20,21. Existing evidence indicates that the accumulation of wrongly repaired or unrepaired DNA damage influences the extracellular and intracellular environment, resulting in inflammation, genetic mutation, cell senescence, death, or even tumorigenesis22,23,24. Some new advances have been made in the studies that explored the relationship between DRGs and cancers. A previous study suggested that gene alterations in DNA repair pathways promote cancer aggressiveness and induce resistance to DNA damage cancer treatments25. DNA repair pathways are defenders of the genome, and the involved DRGs exhibit therapeutic capacity and play a vital role in the prognosis of hematologic malignant tumors26. Some studies have identified and validated some DRGs to predict the outcome of patients with breast, ovarian, and clear cell renal carcinoma, and found robust predictive ability in these carcinomas10,27,28. Hence, DRGs have the potential to be molecular biomarkers for predicting the survival outcomes and improving the diagnosis and treatment of patients with CRC.
Wang et al. have collected DNA repair genes from GSEA to build and validate a 12 gene signature (CCNB3, ISY1, CDC25C, SMC1B, MC1R, LSP1P4, RIN4, TPM1, ELL3, POLG, CD36, and NEK4) based on the expression profiles of TCGA-COAD and GEO datasets (GSE17538 and GSE38832). The authors mainly focused on the relationship between DRGs and colon cancer patients through multivariate cox regression29. Compared to this study, we attended to contrast a DRGs signature to assess the OS of patients with CRC. A novel prognostic model based on nine DE-DRGs was firstly identified and validated in TCGA and GEO database through LASSO Cox regression model, and we found the prognostic model was an independent predictive factor for OS in CRC. Our results demonstrated that the prognostic model displayed a more accurate predictive value than previous study. The risk model was closely significant with clinicopathological features and these results indicated that the signature took an advantage in predicting the survival of advanced patients with CRC. It was worth noting that nine genes were included in the TCGA and GEO database. Hence, the model had an ideal prediction impact in the training and test sets. In our study, we also referred to analyses the infiltration of tumor immune cell in different risk groups. Among these genes, PLK1, CDC25C, ESCO2, AXIN2, TREX2, ALKBH2, and MC1R were upregulated in tumor tissues, whereas IGF1 and ESR1 presented downregulation (Figs. S5 and S6). Previous studies based on these genes have been conducted at the molecular level. The role of PLK1 in carcinogenesis and tumor inhibition remains still controversial30. Experimental research has demonstrated that PLK1 acts as a tumor inhibitor when integrated with certain oncogenes (APCmin) in CRC cells, and patients with low PLK1 expression have a poor prognosis31. CDC25C participates in regulating cell cycle checkpoint G2/M transition and DNA damage repair32. A previous study explored the correlation between CRC and CDC25C, which indicated that targeted CDC25C could induce ARID1A-deficient CRC33. Studies have reported ESCO2 can downregulate MMP2 expression to inhibit CRC cell migration and tumor metastasis by mediating the epithelial–mesenchymal transition process34. Another study suggested that the transactivation of AXIN2 inhibited colon cancer cell proliferation and tumor formation by inhibiting Wnt/β-catenin signaling35. IGF1 overexpression can induce lymphangiogenesis and facilitate lymphatic metastasis in CRC cells36. Silencing ESR1 enhanced the chemosensitivity of CRC cell lines to 5-FU37. There are no studies on the mechanistic relationship between TREX2, ALKBH2, MC1R, and CRC. TREX2, a specific 3′-DNA exonuclease expressed in keratinocytes, plays a crucial role in promoting DNA damage repair, inducing cell apoptosis, arousing anti-cancer immunity, and suppressing skin carcinogenesis38. A study focused on the link between low DNA methylation of TREX2 and enhancement of gene expression and shorter survival in laryngeal cancer39. ALKBH2 overexpression inhibits gastric cancer cell proliferation and induces apoptosis and cell cycle arrest40. Moreover, ALKBH2 is associated with chemotherapy and molecular targeting. High expression promotes resistance to temozolomide chemotherapy in glioblastoma cells41. These data suggests that MC1R high expression is mediated by MITF, which is related to the RAS/ERK-signaling pathway, promoting melanocyte cell division, and enhancing the migration ability of melanoma cells42,43. However, the related regulatory mechanism between the three genes mentioned above and CRC occurrence and development need to be elucidated by further research.
Genetic alteration analysis based on GSCA reveal that these genes had very high levels of single nucleotide variants, and missense mutations ranked first. AXIN2, ESR1, PLK1, and ESCO2 change ratios were more than 20% in 63 patients with CRC. These findings suggest that this may be a potential mechanism for the induction of CRC carcinogenesis. GSEA analysis revealed that the main biological progresses of the signature were associated with immune response. In addition, some signaling pathways involved genetic materials synthesis (DNA replication, cell cycle) were enriched. What’s more, the relationship between risk model and tumor immunity was evaluated to find the potential mechanism. Then we found the significant differences in tumor immune score between high- and low-risk patients. Consequently, the high-risk group presented a higher proportion of Tregs. Previous studies had explicated that T cell was correlated with the metastasis of CRC, and DNA damage can induce type I IFN in CRC44,45. With the progress of CRC, the chemotherapeutic effects and common target therapy are extremely limited. New types targeted therapeutic drugs need to develop to alleviate the advanced CRC patients. Therefore, we predict the drug response to targeted therapy in high- and low-risk patients, and found that patients with CRC in high-risk were more sensitive to vorinostat (Histone deacetylase (HDAC) inhibitor, was used in Cutaneous T-cell lymphoma) than low-risk patients46. This result means that patients in high-risk group can benefit from the small molecular drugs. These results showed this signature can predict the sensitivety of patients to target therapy but need further investigation.
Compared with previous studies, this is the first study to integrate nine DE-DRGs into a multiple gene signature for predicting the survival of CRC, and to analyze the correlation between immune cell domination, immune-related functions, and CRC prognosis. In addtion, there was an innovation distinguished from previous studies, which we analyzed the correlation between the response of drug therapy and risk model and brought new sights on the therapeutic strategy on CRC patients. However, this study has some limitations. First, this study is a bioinformatics analysis based on public databases (TCGA and GEO) and some retrospective studies biases; therefore, more large-scale multicenter cohorts need to be further explored to validate the model. Second, rigorous basic mechanistic research on DRGs and CRC should be carried out to support the signature.
Methods and materials
Data sources
The training set of RNA-sequence datasets and the corresponding clinical characteristics of patients with CRC were acquired from TCGA database (https://portal.gdc.cancer.gov/)47. The RNA-seq data from TCGA contained 488 patients with CRC and 42 normal colorectal tissues. The mRNA expression data of GSE87211 (n = 363) and GSE103479 (n = 156) were obtained from the GEO database as validation groups (https://www.ncbi.nlm.nih.gov/geo/)48. In this study, a total of 513 DRGs were retrieved from a previous study10 and the UALCAN database, and are presented in Table S1. All data annotation and extraction were performed using the R software (version 4.0.2). The Perl program and “sva” package were applied to merge two GEO microarray datasets.
Furthermore, this study was also approved by the institutional review committee of the First Affiliated Hospital of Xi’an Jiaotong University, Shaanxi Province, Xi’an, China. Informed consent was renounced because the study did not involve specimen collection and the patients involved in the public databases have obtained ethical approval. And the study was conducted in accordance with the relevant guidelines and the regulations.
Construction and validation of a prognostic-associated DNA repair-relative gene signature
The differentially expressed genes (DEGs) associated with DNA repair in CRC were identified with the “limma” R package. The DE-DRGs were screened using the criteria: false-positive discovery (FDR) < 0.05, and Log2 | (fold change, FC) |> 1. The univariate Cox regression analysis was used to estimate survival-associated DRGs using the “survival” package (P < 0.05). The protein–protein interaction network was created using the online database “STRING” (version 11.0), and the interaction score was set as 0.40, and the cytoHubba application from Cytoscape software was utilized to identify the hub genes49. The LASSO Cox regression analysis was performed to construct a predictive signature50. The R package "glmnet" was utilized to achieve the variable selection and shrinkage of the LASSO algorithm51. The risk scores of each patient were established using the following formula: Risk score = sum (coefficients* expression of gene n)52. The "survminer" and "survival" R packages were applied to conduct the Kaplan–Meier survival curve. A time-dependent ROC curve was implemented to assess the predictive ability of the prognostic model by using the "survival ROC" R package53. The independent predictive efficiency of the prognostic signature was evaluated using univariate and multivariate Cox analyses. Differences were considered statistically significant at a bilateral P < 0.05. The hazard ratios (HRs) and 95% confidence intervals (CIs) were calculated. The merged GEO datasets with survival information were used for external validation. The same methods were used to estimate the risk scores for each case.
Genetic alterations and function enrichment analysis
The role of genetic alterations in this predictive model was explored through gene set cancer analysis (GSCA) Lite (http://bioinfo.life.hust.edu.cn/web/GSCALite/)54. To find relative biological functions and potential molecular pathways regulated by the DE-DRGs signature between the different risk groups, GSEA analysis was performed using "clusterprofler" R package55,56. The statistical significance was set at P < 0.05. ESTIMATE was performed to calculate the stromal score and immune scores, which illustrated immune cell infiltration in tumor57. In addition, CIBERSORT algorithm and “limma” package were used to analyze the differences between the different risk groups and 22 types tumor-infiltrating immune cells.
Chemotherapy response and small molecular drug prediction
The drug sensitivity response to chemotherapy and target therapy in patients with CRC were determined based on public database GDSC (Genomics Drug sensitivity in cancer)58. The half-maximal inhibitory concentration (IC50) was evaluated to represent the drug response. The GSCA was used to investigate the underlying drugs based on nine DE-DRGs. In addition, the package “pRRophetic” was applied to estimate the potential target drugs between the high- and low-risk groups.
Statistics analyses
All statistical analyses were performed using the R software (version 4.0.2). Continuous variables were expressed as mean ± SE, whereas categorical variables were summarized as frequency (n) and proportion (%). The Wilcoxon signed-rank test was used to compare the DRGs signature mRNA expression levels between cancer and non-cancer samples. Bilateral P value < 0.05 was considered statistically significant.
Conclusion
In conclusion, we developed a novel, valid and reliable prognostic model based on nine DE-DRGs. The signature was significantly correlated with OS in the training and validation cohorts, which provided a novel perspective for the diagnosis and treatment of CRC.
Data availability
The datasets generated and analyzed during the current study are available from public databases, TCGA (http://cancergenome.nih.gov/abouttcga), and GEO (https://www.ncbi.nlm.nih.gov/geo/) databases.
Abbreviations
- AUC:
-
The area under curve
- CRC:
-
Colorectal cancer
- CI:
-
Confidence interval
- DEGs:
-
Differentially expressed genes
- DE DRGs:
-
Differentially expressed DNA repair genes
- GEO:
-
Gene Expression Omnibus
- GDSC:
-
Genomics drug sensitivity in cancer
- GSCA:
-
Gene set cancer analysis
- GSEA:
-
Gene set enrichment analysis
- HR:
-
Hazard ratio
- LASSO:
-
Least absolute shrinkage and selection operator
- MITF:
-
Melanogenesis associated transcription factor
- OS:
-
Overall survival
- TCGA:
-
The Cancer Genome Atlas
References
Bray, F. et al. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 68(6), 394–424 (2018).
Dekker, E. et al. Colorectal cancer. Lancet 394(10207), 1467–1480 (2019).
Fiorentini, G. et al. Updates of colorectal cancer liver metastases therapy: review on DEBIRI. Hepat. Oncol. 7(1), 16 (2020).
Rawla, P., Sunkara, T. & Barsouk, A. Epidemiology of colorectal cancer: Incidence, mortality, survival, and risk factors. Prz. Gastroenterol. 14(2), 89–103 (2019).
Patel, J. et al. DNA damage and mitochondria in cancer and aging. Carcinogenesis (2020).
Ijsselsteijn, R., Jansen, J. G. & de Wind, N. DNA mismatch repair-dependent DNA damage responses and cancer. DNA Repair (Amst.) 93, 102923 (2020).
Nastasi, C., Mannarino, L. & D’Incalci, M. DNA damage response and immune defense. Int. J. Mol. Sci. 21, 20 (2020).
Chang, P. Y. et al. Pathogenic germline mutations of DNA repair pathway components in early-onset sporadic colorectal polyp and cancer patients. Cancers (Basel) 12, 12 (2020).
Cetin, B., Wabl, C. A. & Gumusay, O. The DNA damaging revolution. Crit. Rev. Oncol. Hematol. 156, 103117 (2020).
Zhang, D. et al. Prediction of overall survival among female patients with breast cancer using a prognostic signature based on 8 DNA repair-related genes. JAMA Netw. Open 3(10), e2014622 (2020).
Tokunaga, R. et al. The impact of ARID1A mutation on molecular characteristics in colorectal cancer. Eur. J. Cancer 140, 119–129 (2020).
Miao, Y. et al. Prognostic implications of metabolism-associated gene signatures in colorectal cancer. PeerJ 8, e9847 (2020).
Miao, Y.D. et al. Identification prognosis-associated immune genes in colon adenocarcinoma. Biosci. Rep. (2020).
Gachechiladze, M. et al. Predictive and prognostic value of DNA damage response associated kinases in solid tumors. Front. Oncol. 10, 581217 (2020).
Scarbrough, P. M. et al. A cross-cancer genetic association analysis of the DNA repair and DNA damage signaling pathways for lung, ovary, prostate, breast, and colorectal cancer. Cancer Epidemiol. Biomark. Prevent. 25(1), 193–200 (2016).
Ishikawa, S. et al. Heme induces DNA damage and hyperproliferation of colonic epithelial cells via hydrogen peroxide produced by heme oxygenase: A possible mechanism of heme-induced colon cancer. Mol. Nutr. Food Res. 54(8), 1182–1191 (2010).
Irrazabal, T. et al. Limiting oxidative DNA damage reduces microbe-induced colitis-associated colorectal cancer. Nat. Commun. 11(1), 1802 (2020).
Tortora, K. et al. DNA damage in colon mucosa of Pirc rats, an Apc-driven model of colon tumorigenesis. Toxicol. Lett. 324, 12–19 (2020).
Sharp, S. P. et al. DNA damage response genes mark the early transition from colitis to neoplasia in colitis-associated colon cancer. Gene 677, 299–307 (2018).
Barnes, J. L. et al. Carcinogens and DNA damage. Biochem. Soc. Trans. 46(5), 1213–1224 (2018).
Dumont, E. & Monari, A. Understanding DNA under oxidative stress and sensitization: The role of molecular modeling. Front. Chem. 3, 43 (2015).
Kaźmierczak-Barańska, J., Boguszewska, K. & Karwowski, B. T. Nutrition can help DNA repair in the case of aging. Nutrients 12(11), 1501 (2020).
Helm, J. S. & Rudel, R. A. Adverse outcome pathways for ionizing radiation and breast cancer involve direct and indirect DNA damage, oxidative stress, inflammation, genomic instability, and interaction with hormonal regulation of the breast. Arch. Toxicol. 94(5), 1511–1549 (2020).
Toprani, S.M., Mane, V.K. Role of DNA damage and repair mechanisms in uterine fibroid/leiomyomas: A review. Biol. Reprod. (2020).
Lodovichi, S. et al. Inhibition of DNA repair in cancer therapy: Toward a multi-target approach. Int. J. Mol. Sci. 21(18), 6684 (2020).
Rahimian, E. et al. DNA repair pathways as guardians of the genome: Therapeutic potential and possible prognostic role in hematologic neoplasms. DNA Repair (Amst.) 96, 102951 (2020).
Xiong, Y. et al. Development and validation of an individualized DNA repair-related gene signature in localized clear cell renal cell carcinoma. World J. Urol. (2020).
Sun, H. et al. Identification of a prognostic signature associated with DNA repair genes in ovarian cancer. Front. Genet. 10, 839 (2019).
Wang, X. Q. et al. Identification and validation of a novel DNA damage and DNA repair related genes based signature for colon cancer prognosis. Front. Genet. 12, 635863 (2021).
de Cárcer, G. The mitotic cancer target polo-like kinase 1: Oncogene or tumor suppressor?. Genes (Basel) 10(3), 1369 (2019).
Raab, M. et al. PLK1 has tumor-suppressive potential in APC-truncated colon cancer cells. Nat. Commun. 9(1), 1106 (2018).
Liu, K. et al. The role of CDC25C in cell cycle regulation and clinical cancer therapy: A systematic review. Cancer Cell Int. 20, 213 (2020).
Wu, C. et al. Targeting AURKA-CDC25C axis to induce synthetic lethality in ARID1A-deficient colorectal cancer cells. Nat. Commun. 9(1), 3212 (2018).
Guo, X. B. et al. ESCO2 inhibits tumor metastasis via transcriptionally repressing MMP2 in colorectal cancer. Cancer Manag. Res. 10, 6157–6166 (2018).
Yu, J. et al. CDX2 inhibits the proliferation and tumor formation of colon cancer cells by suppressing Wnt/β-catenin signaling via transactivation of GSK-3β and Axin2 expression. Cell Death Dis. 10(1), 26 (2019).
Li, Z. J. et al. Insulin-like growth factor-1 induces lymphangiogenesis and facilitates lymphatic metastasis in colorectal cancer. World J. Gastroenterol. 19(43), 7788–7794 (2013).
Ye, S. B. et al. The predictive value of estrogen receptor 1 on adjuvant chemotherapy in locally advanced colorectal cancer: A retrospective analysis with independent validation and its potential mechanism. Front. Oncol. 10, 214 (2020).
Manils, J. et al. Multifaceted role of TREX2 in the skin defense against UV-induced skin carcinogenesis. Oncotarget 6(26), 22375–22396 (2015).
Weigel, C. et al. DNA methylation at an enhancer of the three prime repair exonuclease 2 gene (TREX2) is linked to gene expression and survival in laryngeal cancer. Clin. Epigenet. 11(1), 67 (2019).
Gao, W. et al. Frequent down-regulation of hABH2 in gastric cancer and its involvement in growth of cancer cells. J. Gastroenterol. Hepatol 26(3), 577–584 (2011).
Johannessen, T. C. et al. The DNA repair protein ALKBH2 mediates temozolomide resistance in human glioblastoma cells. Neuro Oncol. 15(3), 269–278 (2013).
Li, M. et al. Enhancing the efficacy of melanocortin 1 receptor-targeted radiotherapy by pharmacologically upregulating the receptor in metastatic melanoma. Mol. Pharm. 16(9), 3904–3915 (2019).
Rosenkranz, A. A. et al. Malignant melanoma and melanocortin 1 receptor. Biochemistry (Mosc) 78(11), 1228–1237 (2013).
Lazarus, J. et al. Mathematical modeling of the metastatic colorectal cancer microenvironment defines the importance of cytotoxic lymphocyte infiltration and presence of PD-L1 on antigen presenting cells. Ann. Surg. Oncol. 26(9), 2821–2830 (2019).
Laengle, J. et al. DNA damage predicts prognosis and treatment response in colorectal liver metastases superior to immunogenic cell death and T cells. Theranostics 8(12), 3198–3213 (2018).
Remák, E. et al. Understanding relative survival outcomes for patients with cutaneous T-cell lymphoma (CTCL) subtypes mycosis fungoides and Sézary syndrome treated with mogamulizumab or vorinostat: Combining Australian real-world evidence and MAVORIC phase 3 trial data. Eur. J. Cancer 156(Suppl 1), S18 (2021).
Liu, J. et al. An integrated TCGA pan-cancer clinical data resource to drive high-quality survival outcome analytics. Cell 173(2), 400-416.e11 (2018).
Barrett, T. et al. NCBI GEO: Archive for functional genomics data sets—Update. Nucleic Acids Res. 41(database issue), D991–995 (2013).
Szklarczyk, D. et al. STRING v11: Protein–protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res. 47(D1), D607-d613 (2019).
Wang, H. et al. Precision Lasso: Accounting for correlations and linear dependencies in high-dimensional genomic data. Bioinformatics 35(7), 1181–1187 (2019).
Friedman, J., Hastie, T. & Tibshirani, R. Regularization paths for generalized linear models via coordinate descent. J. Stat. Softw. 33(1), 1–22 (2010).
Chen, H. Y. et al. A five-gene signature and clinical outcome in non-small-cell lung cancer. N. Engl. J. Med. 356(1), 11–20 (2007).
Blanche, P., Dartigues, J. F. & Jacqmin-Gadda, H. Estimating and comparing time-dependent areas under receiver operating characteristic curves for censored event times with competing risks. Stat. Med. 32(30), 5381–5397 (2013).
Liu, C. J. et al. GSCALite: A web server for gene set cancer analysis. Bioinformatics 34(21), 3771–3772 (2018).
Gene Ontology Consortium: Going forward. Nucleic Acids Res. 43(Database issue), D1049–D1056 (2015).
Kanehisa, M. & Goto, S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28(1), 27–30 (2000).
Yoshihara, K. et al. Inferring tumour purity and stromal and immune cell admixture from expression data. Nat. Commun. 4, 2612 (2013).
Yang, W. et al. Genomics of drug sensitivity in cancer (GDSC): A resource for therapeutic biomarker discovery in cancer cells. Nucleic Acids Res. 41(Database issue), D955–D961 (2013).
Acknowledgements
We thank The Cancer Genome Atlas (TCGA) project, NCBI GEO database and all members of our study team for their whole-hearted cooperation.
Funding
Supported by Key Research and Development Program of Shaanxi Province of China. (No.2020SF-114).
Author information
Authors and Affiliations
Contributions
D.L.S. and D.Z. collected and analyzed the data, wrote the manuscript. S.S.C. and J.W. analyzed the data and reviewed the manuscript. L.L.Z. and Q.H. participated in analyzing the data. D.L.S., D.Z., S.S.C. and J.W. participated in preparation of the figures and tables and interpretation of data for the work. N.D. and H.R. designed the work and revised the manuscript. All the authors read and approved the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Song, D., Zhang, D., Chen, S. et al. Identification and validation of prognosis-associated DNA repair gene signatures in colorectal cancer. Sci Rep 12, 6946 (2022). https://doi.org/10.1038/s41598-022-10561-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-10561-w
- Springer Nature Limited
This article is cited by
-
Identification of molecular subtypes and a prognostic signature based on m6A/m5C/m1A-related genes in lung adenocarcinoma
Scientific Reports (2024)
-
Multi-omics prognostic signatures of IPO11 mRNA expression and clinical outcomes in colorectal cancer using bioinformatics approaches
Health Information Science and Systems (2023)
-
An integrated analysis of microRNAs regulating DNA damage response in triple-negative breast cancer
Breast Cancer (2023)