Abstract
Background and aim
Colorectal cancer (CRC) originates from pre-existing polyps in the colon. The development of different subtypes of CRC is influenced by various genetic and epigenetic characteristics. CpG island methylator phenotype (CIMP) is found in about 15–20% of sporadic CRCs and is associated with hypermethylation of certain gene promoters. This study aims to find prognostic genes and compare their expression and methylation status as potential biomarkers in patients with serrated sessile adenomas/polyps (SSAP) and CRC, in order to evaluate which, one is a better predictor of disease.
Method
This study employed a multi-phase approach to investigate genes associated with CRC and SSAP. Initially, two gene expression datasets were analyzed using R and Limma package to identify differentially expressed genes (DEGs). Venn diagram analysis further refined the selection, revealing four genes from the Weissenberg panel with significant changes. These genes, underwent thorough in silico evaluations. Once confirmed, they proceeded to wet lab experimentation, focusing on expression and methylation status. This comprehensive methodology ensured a robust examination of the genes involved in CRC and SSAP.
Result
This study identified cancer-specific genes, with 8,351 and 1,769 genes specifically down-regulated in SSAP and CRC tissues, respectively. The down-regulated genes were associated with cell adhesion, negative regulation of cell proliferation, and drug response. Four highly downregulated genes in the Weissenberg panel, including CACNA1G, IGF2, MLH1, and SOCS1. In vitro analysis showed that they are hypermethylated in both SSAP and CRC samples while their expressions decreased only in CRC samples.
Conclusion
This suggests that the decrease in gene expression could help determine whether a polyp will become cancerous. Using both methylation status and gene expression status of genes in the Weissenberg panel in prognostic tests may lead to better prognoses for patients.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
According to the National Cancer Institute, in 2021, an estimated 1,486,920 people were living with colorectal cancer (CRC) in the United States [1]. It is generally well-known that CRC arises from pre-existing colonic polyps [2]. Premalignant epithelial polyps of the large intestine comprise several different subtypes, such as adenomatous and sessile serrated adenoma/polyp (SSAP), that are capable of development of different types of colorectal carcinomas due to their different genetic and epigenetic characteristics [3]. Three principal pathways which have been associated with these characteristics are chromosomal instability (CIN), microsatellite instability (MSI), and CpG island methylator phenotype (CIMP) pathway [4]. The latter is found in 15–20% of sporadic CRCs and it occurs when DNA methyltransferases (DNMTs) promote hypermethylation of promoter-associated CpG-rich regions of tumor suppressor and DNA mismatch repair genes such as p16, PTEN, and MLH1, leading to development and progression of CRC [5]. Various studies have indicated an association of CIMP with clinical, histopathological, and epidemiological characteristics, such as older age, female sex, tumor location in the proximal colon, poorly differentiated tumors, and high rates of BRAF and TP53 mutations [6,7,8,9,10]. Thus, the hypermethylation phenotype of certain gene promoters can be served as a biomarker for the early detection of CRC.
Detecting CRC early poses significant challenges as it often progresses asymptomatically in its initial stages, complicating timely diagnosis [11]. Moreover, CRC encompasses diverse molecular subtypes, contributing to the complexity of accurate diagnosis and prognosis determination [12]. Treatment of CRC encounters obstacles such as chemotherapy resistance, underscoring the need for more effective therapeutic strategies [13]. Identifying individuals at high risk of developing CRC remains crucial for early intervention and prevention [14]. Given these challenges, the study’s objectives of identifying prognostic genes in CRC and SSAP hold immense significance. By shedding light on potential biomarkers for improved diagnosis and treatment outcomes, the study aims to address critical gaps in current CRC management strategies [15]. SSAP, characterized by their serrated appearance and typically flat or sessile morphology, have garnered increasing attention in cancer research due to their potential role as precursor lesions for CRC [16]. Despite their relatively benign appearance, SSAPs have been associated with an increased risk of developing CRC, particularly in cases where these lesions are left undetected or inadequately removed during screening. Detecting SSAPs presents unique challenges compared to other types of colorectal polyps, as they may be subtle and easily overlooked during endoscopic examinations. Moreover, the sessile nature of these lesions can complicate their complete resection, further contributing to the risk of CRC development [17]. Therefore, improving methods for the detection and surveillance of SSAPs is imperative for effective CRC prevention strategies. While advanced imaging techniques such as chromoendoscopy and narrow-band imaging (NBI) have shown promise in enhancing SSAP detection rates, their widespread adoption and efficacy in routine clinical practice require further validation [18]. Additionally, molecular markers, including DNA methylation patterns, have emerged as potential tools for identifying high-risk SSAPs that may progress to CRC [19]. Understanding the clinical significance of SSAPs and addressing the challenges associated with their detection and management are crucial steps toward improving CRC prevention and early intervention efforts. Recent advances in high-throughput technologies have facilitated the establishment of DNA methylation panels to determine the CIMP status of specific genes in CRC tumors [20, 21]. In 2006, Weisenberger et al. introduced a quantified CIMP panel, which serves as a collection of genes widely employed as biomarkers in DNA methylation studies within cancer research. The Weissenberg panel encompasses 1505 CpG sites situated in the promoter regions of 807 genes associated with cancer. These genes were carefully selected due to their significance in cancer biology and their potential usefulness as targets for diagnosis and treatment purposes [22]. Some of the genes included in the Weissenberg panel are APC (Adenomatous polyposis coli), BRCA1 (Breast cancer 1), CDH1 (Cadherin 1), CDKN2A (Cyclin-dependent kinase inhibitor 2 A), MLH1 (MutL homolog 1), PTEN (Phosphatase and tensin homolog), TP53 (Tumor protein p53) and VHL (Von Hippel-Lindau tumor suppressor) [22, 23]. The Weissenberg panel was chosen based on its established significance in previous research related to CRC and SSAP. This panel comprises genes that have been consistently implicated in CRC pathogenesis and progression, making it a valuable tool for identifying potential prognostic markers. Additionally, the identification of differentially methylated genes can provide insights into the molecular mechanisms underlying cancer development and progression and may lead to the development of new treatments for cancer. Despite the various panels to assess CIMP status, the panel introduced by Weisenberger has been reviewed and validated in several studies [22, 24,25,26]. As far as we know, there is no correlation study between the methylation status of this panel and the expression of them at the time of polyp initiation and the spread of CRC; therefore, in the present study, we aimed to identify genes that are differentially expressed in SSAP and CRC using bioinformatics analysis. By utilizing previous studies and matching our classification with known panels, we successfully identified some common effective genes. In the next step, we have selected this final group of genes to be tested in wet lab phase.
Methods
As it is shown in Fig. 2, in the in silico phase of the study, genes involved in the pathways of interest are identified, as are genes that are not involved in these pathways. Genes that are involved in the pathways and are differentially expressed or methylated in the cells of interest are then identified. Finally, pathway analysis tools are used to identify the GO terms and pathways associated with these genes. In the in vitro phase of the study, the methylation pattern and protein-protein interaction of the genes identified in the in silico phase are evaluated. The effect of these genes on the survival rate of the cells of interest is also assessed.
In-silico analysis
The microarray data were retrieved from NCBI’s Gene Expression Omnibus database (http://www.ncbi.nlm.nih.gov/geo), with GEO accession numbers GSE7946 and GSE198692. GSE7946 is composed of 22 SSAP samples and 22 adjacent non-tumor mucosa samples from the same patients. GSE198692 is composed of 7 SSAP samples and 4 normal samples. All GSE series matrix files set were downloaded and parsed by the GEO query package [27] using R version 3.6.0, and the GSE series values were normalized with R. The linear models for microarray data (Limma) package [28] were employed to determine Differentially Expressed Genes (DEGs), which were significantly upregulated and downregulated in SSAP. DEGs were selected based on univariate tests according to the q-value (q < 0.05) and log2 fold change (log2 FC ≥ 2).
Identification of differentially expressed genes in CRC
The Gene Expression Profiling Interactive Analysis (GEPIA) database (http://gepia.cancer-pku.cn/), which is a web-based tool that provides fast and customizable functionalities based on TCGA and GTEx data, was used to compare gene expression between CRC and normal tissue. Differential expression analysis was conducted using default parameters, with statistical significance determined based on a p-value threshold of 0.05 and log2 fold change (|log2 FC| ≥ 1).
Identifying differentially expressed genes and specificity in CRC and SSAP through Venn diagram analysis
In this phase, we constructed Venn diagrams to identify genes that were specific to SSAP, specific to CRC, and genes that were common to both upregulated and downregulated genes. The parameters used for gene selection included statistical significance (q-value < 0.05) and a log2 fold change threshold of ≥ 2.
Gene ontology and pathway enrichment analysis
Gene ontology analysis of the biological processes (BP), molecular function (MF), and cellular component [6], as well as the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment for the DEGs specific to CRC, were conducted using the Database for Annotation Visualization and Integrated Discovery (DAVID; https://david.ncifcrf.gov). The statistical significance threshold was set at a p-value less than 0.05.
PPI network of Weissenberg panel
The protein-protein interactions (PPI) of IGF2, SOCS1, MLH1, and CACNA1G genes were analyzed using the STRING database (https://string-db.org/).The confidence score threshold for defining protein-protein interactions was set at 0.7. To visualize the interactions of the proteins encoded by these four genes with other proteins, the Cytoscape software (http://www.cytoscape.org/) was used.
Overall survival analysis
To evaluate the prognostic value of the panel genes, an analysis was performed using the Kaplan-Meier plotter. Patient data from the GEPIA database were utilized for this analysis [11]. The survival plots, gene expression levels, and stage plots were generated to investigate the relationship between the selected genes and common cancers. The statistical significance threshold for survival analysis was set at a p-value less than 0.05.
Evaluation of methylation and expression pattern of genes
After obtaining the methylation results, IGF2, SOCS1, MLH1, and CACNA1G were entered into the UALCAN database (http://ualcan.path.uab.edu) for further studies. The UALCAN database was used to examine the promotor methylation levels and expression of these genes with regard to clinicopathological features in CRC patients. This analysis utilized default parameters provided by the UALCAN platform, with statistical significance determined based on a p-value threshold of 0.05 [29].
In-vitro analysis
After conducting computer-based studies, we identified four genes including (IGF2, SOCS1, MLH1, and CACNA1G) with significantly decreased expression in the bioinformatics analysis, which was part of the Weissenberg panel. To confirm our findings, we further studied the expression and methylation status of these genes. Figure 1 indicates the workflow of our study. Between 2018 and 2021, fresh frozen tissue samples were obtained from various study groups at Taleghani Hospital, affiliated with Shahid Beheshti University of medical sciences in Tehran, Iran. These groups included invasive adenocarcinoma (n = 14), SSAP (n = 9), and control cases without malignancy (n = 34). Patients who had undergone radiotherapy or chemotherapy were excluded from the study. It is noteworthy that the study received approval from the Medical Ethical Committee of SBUMS and informed consent was obtained from all participants.
RNA and DNA extraction
Total RNA was extracted from the colonic tissues using the Trizol® kit (ThermoFisher Scientific, United States) according to the manufacturer’s protocol. 50 to 100 mg of the same tissues were used to isolate DNA using the QIAamp DNA mini kit (Qiagen, German), following the manufacturer’s instructions. The quantity and integrity of the extracted RNA and DNA were meticulously evaluated using Thermo Scientific NanoDrop 1000 and agarose gel electrophoresis, respectively, as critical quality assurance measures to validate sample integrity and data reliability. These quality control steps are essential for minimizing experimental variability and ensuring the validity of downstream analyses.
Real-time quantitative PCR
The PrimeScript cDNA Synthesis Kit (Takara, China) was used to synthesize cDNA following the manufacturer’s instructions. Real-time PCR was performed on an Applied Biosystems 7500 Real-Time PCR System using an SYBR Green Real-Time PCR Master Mix (Takara, China), adhering to the manufacturer’s recommendations to maintain assay standardization. The specific primers used for gene expression quantification are listed in Table 1. Thermal cycling conditions were as follows: an initial denaturation step at 95 °C for 30 s, followed by 40 cycles of denaturation at 95 °C for 5 s, annealing at 58 °C for 34 s, and extension at 72 °C for 34 s, with a final extension step at 72 °C for 15 s. The relative expression of the selected genes was determined by normalizing to β-actin via the 2−ΔΔCt method, incorporating robust data normalization and interpretation techniques for reliable results.
Bisulfite modification
Isolated DNA was modified by sodium bisulfite using an Epitect bisulfite kit (Qiagen, Germany) according to the manufacturer’s recommendations. For methylation-positive and negative controls, human HCT116 DKO N samples (ZRC175286 and ZRC174904, respectively) were obtained from ZYMO Research, USA, and utilized as controls to verify the effectiveness of the bisulfite modification process and to validate the specificity of the methylation-specific PCR assay. These controls play a crucial role in confirming the reliability and reproducibility of the methylation analysis, providing confidence in the obtained results.
Methylation-specific PCR
The bisulfite-modified DNA was subjected to a methylation-specific polymerase chain reaction (MSP) for four CIMP markers (IGF2, SOCS1, MLH1, and CACNA1G). Methylated (mMSP) and unmethylated (uMSP) primers were designed using Primer3 software (version 0.4.0), and the sequence details are provided in Table 1. PCR reactions were carried out in a total volume of 12.5 µL, comprising 1.25 µL 5XPCR buffer, 0.25 µL dNTP-mix (1 mmol/L of each), 0.25 µL of magnesium chloride (50 mmol/L), 0.75 mL of each primer (20 pmol/mL), 0.125 µL of HotStart Taq DNA polymerase (2.5 units/µL, Biolabs, USA), and 200 ng of bisulfite-modified DNA. The thermal cycling program consisted of an initial denaturation step at 95 °C for 10 min, followed by 40 cycles of denaturation at 95 °C for 15 s, annealing at 60 °C for 30 s, and extension at 72 °C for 15 s, with a final extension step at 72 °C for 10 min. These rigorous experimental conditions and controls were implemented to minimize variability and ensure the accuracy of the methylation-specific PCR results. The reactions were performed on the 7900HT Sequence Detection System version 2.3 (SDS2.3; Life Technologies, USA).
ROC curve
The overall diagnostic performance of methylation test was evaluated using a ROC curve generated by (http://www.graphpad.com/quickcalcs/ConfInterval1.cfm (accessed November 2015). The ROC curves plot the true positive rate (sensitivity) against the false positive rate (1-specificity) at different threshold values of a diagnostic test. The area under the ROC curve [30] is a measure of the diagnostic accuracy of the test, with a value of 1 indicating perfect discrimination between cases and controls, and a value of 0.5 indicating no discrimination beyond chance.
Statistical analysis
To identify DEGs, the multiple t-test was used and the Benjamini and Hochberg method was employed to adjust the q-values [31]. The difference in survival curves was compared using the log-rank test. The predictive performance of the genes was determined by calculating the area under the ROC curve [30]. The functional categories for enrichment analysis were downloaded from KEGG and the hypergeometric distribution model was used to test whether a set of observed genes in a functional term was significantly greater than what was expected by random chance.
Result
Gene ontology of cancer-specific genes
Regarding down-regulated genes, 8,351 and 1,769 genes were specifically down-regulated in SSAP and CRC tissues, respectively. Additionally, 906 genes were common in both types of tissues (Fig. 2, a). The GO of the down-regulated genes specific to CRC was examined and observed that these genes were predominantly linked to functions such as cell adhesion, negative regulation of cell proliferation, cell-cell adhesion, and drug response. Moreover, the most significantly enriched GO terms on cellular components (CC) for the down-regulated genes were the plasma membrane, cytoplasm, cytosol, membrane, and extracellular region. These genes were particularly enriched in molecular functions, including identical protein binding, calcium ion binding, actin binding, receptor binding, and heparin-binding (Fig. 2, b) As shown in Fig. 2, among the up-regulated genes, 10,998 were specifically expressed in SSAP, 1,434 genes were common in both samples and 1,222 genes were CRC-specific. Biological processes of positive regulation of cell proliferation, cell cycle, DNA replication, and mitotic cell cycle, and molecular functions such as protein binding, RNA binding, ATP binding, DNA binding, and chromatin binding were confirmed about up-regulated genes (Fig. 2, a).
Four highly down-regulated genes were among the Weissenberg panel
Among cancer-specific down-regulated genes, a subset of 5 genes, have been found that were particularly important. CACNA1G (Calcium Voltage-Gated Channel Subunit Alpha1 G) is a gene that encodes a subunit of a voltage-gated calcium channel that plays a role in regulating calcium influx into cells [32]. IGF2 (Insulin-like Growth Factor 2) is a gene that encodes a protein involved in cell growth and differentiation [33]. SOCS1 (Suppressor of Cytokine Signaling 1) is a gene that encodes a protein that negatively regulates cytokine signaling pathways [34]. The identification of these 4 important genes within the Weisenberger panel suggests that they may play a critical role in the molecular mechanisms underlying cancer development and could potentially serve as targets for future cancer therapies.
Hyper-methylated genes have interactions with diabetic and cancer-related proteins and pathways
CACNA1G, IGF2, MLH1, and SOCS1 proteins were analyzed using the STRING database and illustrated using Cytoscape software. The PPI network constructed for IGF2 showed that this gene could interact with more than 10 genes. Among them, IGF2R, GPC3, IGFBP1, IGFBP2, IGFBP3, IGFBP4, IGFBP5, and IGFBP6, have the highest connectivity with others (≥ 20 interactions), and they were identified as the hub genes for CRC. The PPI network constructed for SOCS1 showed that this protein is highly connected to GRB2, IRS1, JAK1, and IFNGR1. It has been also shown that CACNA1A is related to GNB1, GNAQ, RHOA, CALM1, CALM3, and CACNB2. MLH1 is connected with proteins such as MSH2, MBD4, BRCA1, and BLM (Fig. 3). The involvement and change of gene expression of CACNA1G, IGF2, MLH1, and SOCS1, in addition to causing CRC, can cause diseases such as type 2 diabetes, Fanconi’s anemia, and toxoplasmosis. Importantly, these genes play an important role in the production pathways of insulin, aldosterone, cortisol, and GnRH.
Hyper-methylation of CACNA1G, IGF2, MLH1, and SOCS1 is associated with a decrease in the survival of patients within 150 months
A statistical tool called Kaplan Meier plotter was used to perform an analysis of overall survival. The promoter methylation level, expression, and survival status of four genes were checked in the UALCAN database (Fig. 4). It was found that among patients with CRC, all of these genes were hypermethylated, which means their DNA had more methyl groups attached to it than normal, and their expression was significantly decreased. It should be noted that the samples used in the overall survival analysis, which were derived from the Kaplan Meier plotter, were different from those used in the analysis of DEGs. The level of expression of hub genes, which are genes that play a central role in a network of genes, was found to be associated with the overall survival rate of CRC patients. Specifically, a poorer overall survival rate was observed in patients with higher expression levels of all hub genes. This suggests that the expression levels of these 4 genes may be useful as a prognostic marker for CRC patients.
The change in the expression of Weisenberger panel genes initiates during the occurrence of cancer
In this study, it has been concluded that based on bioinformatics studies of these 4 genes, they can serve as suitable prognostic markers. To understand the onset of decreased expression of these four genes in SSAP or CRC, the expression status of these genes was investigated in samples. The result of real-time PCR showed that the expression level of IGF2, SOCS1, MLH1, and CACNA1G was not changed in SSAP compared to normal individuals. However, it was significantly decreased in CRC patients (Fig. 5).
Weissenberg panel genes are hyper-methylated in SSAP and CRC
The methylation pattern was determined for 4 genes from the DNA extracted from the tissue of people with polyps and cancerous lesions (Fig. 6). The promoter of IGF2, SOCS1, MLH1, and CACNA1G genes was found to be significantly decreased in SSAP samples compared to normal samples. Additionally, it was shown that the hypermethylation condition was more severe in CRC patients compared to SSAP cases and normal individuals. The results of the ROC curve analysis showed that the AUC for IGF2, SOCS1, MLH1, and CACNA1G genes were 0.77, 0.81, 0.74, and 0.71, respectively. This suggests that these genes have moderate to good diagnostic accuracy in discriminating between CRC patients and normal individuals (Fig. 7).
Discussion
CRC, as a major health issue in Iran, is a diagnostic challenge due to the lack of distinct symptoms. Namely, abdominal symptoms and gastrointestinal bleedings are frequently caused by non-cancerous diseases. Furthermore, the availability of resources for CRC diagnosis through the setting of colonoscopy is limited [35]. In addition, current predictive models for Suffer from poor accuracy as they Rely only on clinical symptoms. The diagnostic process for CRC is complex and involves various factors, such as the patient, the consulting physician, and the healthcare system. Understanding this process is crucial for identifying preventable factors and decreasing the impact of diagnostic delay on CRC prognosis [36]. Prior studies on CRC have revealed potential molecular and cellular mechanisms involved in the development, progression, and metastasis of this malignancy [37].
SSA polyps are precancerous lesions that can increase the likelihood of developing colon cancer in the future [38]. This study utilized various bioinformatics techniques to identify the important genes responsible for SSAP and CRC development. Recent studies suggest that bioinformatics is crucial in identifying pathways, survival, expression, and pathways associated with SSAP [39, 40]. The findings can be useful for developing precise and personalized treatments for the disease. Aberrant DNA methylation of CpG islands is widely seen in human colorectal tumors, but if this methylation occurs in promoter regions, it causes gene silencing and non-expression of an intended gene. The observed results stem from a meticulous comparison of microarray data between patients with SSAP and those with CRC against normal datasets. By employing GEO2R and the GEPIA database, we identified genes specifically expressed in CRC but not in SSAP, highlighting their potential involvement in CRC pathogenesis (Fig. 2). The ontology analysis of these genes unveiled their significance in CRC development and progression. Notably, key genes like IGF2, SOCS1, MLH1, and CACNA1G, consistent with the Weissenberg panel, emerged as down-regulated CRC-specific genes, corroborating earlier findings. To ascertain their utility as biomarkers, we assessed the expression and methylation status of these genes across SSAP, invasive adenocarcinoma, and normal tissue samples. Our findings revealed a significant decrease in gene expression levels in CRC samples compared to SSAP and normal tissue counterparts, aligning with previous studies by Komori et al. and Wentzensen et al. These results underscore the potential role of these genes in CRC pathogenesis and their viability as diagnostic and prognostic markers. In accordance with our results, in 2004, Komori et al. analyzed 8 CRC lesions and 8 normal tissue samples, they found that 1,062 genes were up-regulated and 1,004 genes were down-regulated in CRC tissues compared with normal tissue samples. Among the differentially expression genes, CACNA1G, IGF2, MLH1, and SOCS1 were identified as being significantly down-regulated in CRC tissue samples. The down-regulation of these genes suggests that they may play a role in the development and progression of CRC [41]. In research conducted by Wentzensen and colleagues, 10 samples of colorectal adenoma tissue and 10 samples of normal tissue were examined. The results demonstrated that 4,322 genes were up-regulated in colorectal adenoma tissue compared to normal tissue. On the other hand, 4,374 genes were down-regulated in colorectal adenoma tissue compared to normal tissue. Among the down-regulated genes, the researchers identified IGF2, SOCS1, MLH1, and CACNA1G as being significantly decreased in colorectal adenoma tissue samples [42]. In our study, in terms of methylation analysis, IGF2, SOCS1, MLH1, and CACNA1G were hyper-methylated in both SSAP and CRC. In a study by Mun-Kar et al., it has been found that CACNA1G and SOCS1 were hyper-methylated in giant colorectal polyps compared with normal colon tissues [43]. In another study, the researchers investigated the use of DNA methylation analysis as a diagnostic marker for CRC. The study analyzed the methylation status of regulatory regions of the IGF2 gene in CRC tissues and normal colon tissues using a methylation-specific PCR approach. The study found that the methylation status of IGF2 regulatory regions could be used as a diagnostic marker for CRC, with high sensitivity and specificity. These findings suggest that DNA methylation analysis can provide valuable information for the early detection and diagnosis of CRC [44]. Since hyper-methylation occurs in both SSAP and CRC samples, suitable markers for distinguishing between colon cancer may not be available. Our study suggests that for the diagnosis of colon cancer, it is better to examine the expression status rather than the methylation status of IGF2, SOCS1, MLH1, and CACNA1G genes. This is because the expression of these genes occurs specifically downregulated in cancer, not in SSA polyps. According to Fig. 5, the decreased expression of IGF2, SOCS1, MLH1, and CACNA1G is also observed in type 2 diabetes and Fanconi anemia. In a study, the researchers compared the expression of SOCS1 in insulin-resistant human subjects and controls. They found that the expression of SOCS1 was significantly decreased in insulin-resistant subjects, suggesting a potential role for SOCS1 in the development of insulin resistance and type 2 diabetes [45]. The mechanism behind the observed phenomenon could involve common regulatory pathways governing gene expression and DNA methylation in both SSAP and CRC. One potential mechanism is the dysregulation of epigenetic machinery, such as DNA methyltransferases, leading to aberrant DNA methylation patterns shared between SSAP and CRC. This dysregulation may occur due to mutations or alterations in upstream signaling pathways involved in epigenetic regulation, such as the Wnt signaling pathway, which is frequently dysregulated in CRC. Additionally, shared environmental factors or microenvironmental cues within the colonic epithelium may contribute to similar epigenetic changes in both SSAP and CRC.
The researchers also analyzed genetic variants in the CACNA1G gene in a large cohort of Chinese subjects with and without type 2 diabetes. They found that several genetic variants in the CACNA1G gene were significantly associated with an increased risk of type 2 diabetes, suggesting a potential role for CACNA1G in the development of the disease [30].
Conclusion
In conclusion, our study sheds light on the molecular landscape underlying CRC and SSAP, providing valuable insights into their pathogenesis and potential diagnostic and therapeutic implications. By leveraging bioinformatics techniques, we identified key genes implicated in CRC development, including IGF2, SOCS1, MLH1, and CACNA1G, which exhibited significant downregulation in CRC tissue samples compared to SSAP and normal tissues. These findings corroborate earlier studies and underscore the potential utility of these genes as diagnostic and prognostic biomarkers for CRC. Importantly, our results suggest that assessing the expression status of these genes may be more informative for CRC diagnosis than examining their methylation status, as hypermethylation was observed in both SSAP and CRC samples. The observed downregulation of these genes in CRC aligns with their involvement in other diseases like type 2 diabetes and Fanconi anemia, suggesting potential common molecular mechanisms underlying these conditions. Moving forward, further validation of these findings in larger cohorts and diverse populations is warranted to establish their clinical utility. Integration of these genes into existing diagnostic algorithms or prognostic models could enhance the accuracy of CRC detection and prognostication, ultimately improving patient outcomes. Moreover, exploring the therapeutic potential of targeting these genes or their associated pathways may pave the way for personalized treatment strategies for CRC patients. Future research endeavors should focus on elucidating the precise molecular mechanisms governing the dysregulation of these genes in CRC and exploring their therapeutic implications in preclinical and clinical settings. Additionally, investigating the interplay between genetic and environmental factors in modulating the expression of these genes could provide further insights into CRC pathogenesis and identify novel targets for intervention.
Data availability
The data presented in this study are available on request from the corresponding authors.
Abbreviations
- CRC:
-
Colorectal cancer
- CIMP:
-
CpG island methylator phenotype
- SSAP:
-
Serrated sessile adenomas/polyps
- CIN:
-
characteristics are chromosomal instability
- MSI:
-
Microsatellite instability
- DNMTs:
-
DNA methyltransferases
- NBI:
-
Narrow-band imaging
- PcG:
-
Polycomb group
- APC:
-
Adenomatous polyposis coli
- BRCA1:
-
Breast cancer 1
- CDH1:
-
Cadherin 1
- CDKN2A:
-
Cyclin-dependent kinase inhibitor 2 A
- MLH1:
-
MutL homolog 1
- PTEN:
-
Phosphatase and tensin homolog
- TP53:
-
Tumor protein p53
- VHL:
-
Von Hippel-Lindau tumor suppressor
- GEO:
-
Gene Expression Omnibus database
- GEPIA:
-
Gene Expression Profiling Interactive Analysis
- BP:
-
Biological processes
- MF:
-
Molecular function
- KEGG:
-
Kyoto Encyclopedia of Genes and Genomes
- DAVID:
-
Database for Annotation Visualization and Integrated Discovery
- CC:
-
Cellular components
- CACNA1G:
-
Calcium Voltage-Gated Channel Subunit Alpha1 G
- IGF2:
-
Insulin-like Growth Factor 2
- SOCS1:
-
Suppressor of Cytokine Signaling 1
References
Institute NC (2021) Colorectal cancer screening (PDQ®)–health professional version
Ateş Ö, Sivri B, Kılıçkap S (2017) Evaluation of risk factors for the recurrence of colorectal polyps and colorectal cancer. Turk J Med Sci 47(5):1370–1376
Rosty C et al (2013) Serrated polyps of the large intestine: current understanding of diagnosis, pathogenesis, and clinical management. J Gastroenterol 48(3):287–302
Nazemalhosseini Mojarad E et al (2013) The CpG island methylator phenotype (CIMP) in colorectal cancer. Gastroenterol Hepatol Bed Bench 6(3):120–128
Zhang X, Zhang W, Cao P (2021) Advances in CpG island methylator phenotype colorectal cancer therapies. Front Oncol 11:629390
Ogino S et al (2009) CpG island methylator phenotype, microsatellite instability, BRAF mutation and clinical outcome in colon cancer. Gut 58(1):90–96
Bae JM et al (2013) Prognostic implication of the CpG island methylator phenotype in colorectal cancers depends on tumour location. Br J Cancer 109(4):1004–1012
Ogino S, Goel A (2008) Molecular classification and correlates in colorectal cancer. J Mol Diagn 10(1):13–27
Phipps AI et al (2015) Association between molecular subtypes of colorectal cancer and patient survival. Gastroenterology 148(1):77–87e2
Samowitz WS et al (2005) Evaluation of a large, population-based sample supports a CpG island methylator phenotype in colon cancer. Gastroenterology 129(3):837–845
Brenner H, Hoffmeister M (2014) [Colorectal cancer screening: evidence and implementation]. Bundesgesundheitsblatt Gesundheitsforschung Gesundheitsschutz 57(3):302–306
Van Cutsem E et al (2016) ESMO consensus guidelines for the management of patients with metastatic colorectal cancer. Ann Oncol 27(8):1386–1422
Benson AB et al (2018) NCCN guidelines insights: Colon cancer, Version 2.2018. J Natl Compr Canc Netw 16(4):359–369
Rutter MD et al (2020) British society of gastroenterology/association of coloproctology of great Britain and Ireland/Public Health England post-polypectomy and post-colorectal cancer resection surveillance guidelines. Gut 69(2):201–223
Stintzing S et al (2019) Consensus molecular subgroups (CMS) of colorectal cancer (CRC) and first-line efficacy of FOLFIRI plus cetuximab or bevacizumab in the FIRE3 (AIO KRK-0306) trial. Ann Oncol 30(11):1796–1803
Bettington M et al (2013) The serrated pathway to colorectal carcinoma: current concepts and challenges. Histopathology 62(3):367–386
Snover DC (2011) Update on the serrated pathway to colorectal carcinoma. Hum Pathol 42(1):1–10
Hazewinkel Y et al (2013) Endoscopic features of sessile serrated adenomas: validation by international experts using high-resolution white-light endoscopy and narrow-band imaging. Gastrointest Endosc 77(6):916–924
Leggett B, Whitehall V (2010) Role of the serrated pathway in colorectal cancer pathogenesis. Gastroenterology 138(6):2088–2100
Hughes LA et al (2013) The CpG island methylator phenotype: what’s in a name? Cancer Res 73(19):5858–5868
Juo YY et al (2014) Prognostic value of CpG island methylator phenotype among colorectal cancer patients: a systematic review and meta-analysis. Ann Oncol 25(12):2314–2327
Weisenberger DJ et al (2006) CpG island methylator phenotype underlies sporadic microsatellite instability and is tightly associated with BRAF mutation in colorectal cancer. Nat Genet 38(7):787–793
Toyota M et al (1999) CpG island methylator phenotype in colorectal cancer. Proc Natl Acad Sci USA 96(15):8681–8686
Ogino S et al (2008) LINE-1 hypomethylation is inversely associated with microsatellite instability and CpG island methylator phenotype in colorectal cancer. Int J Cancer 122(12):2767–2773
Ogino S et al (2006) Distinct molecular features of colorectal carcinoma with signet ring cell component and colorectal carcinoma with mucinous component. Mod Pathol 19(1):59–68
Nosho K et al (2008) Comprehensive biostatistical analysis of CpG island methylator phenotype in colorectal cancer using a large population-based sample. PLoS ONE 3(11):e3698
Davis S, Meltzer PS (2007) GEOquery: a bridge between the Gene expression Omnibus (GEO) and BioConductor. Bioinformatics 23(14):1846–1847
Smyth G (2004) The limma package
Chandrashekar DS et al (2017) UALCAN: a portal for facilitating Tumor Subgroup Gene expression and survival analyses. Neoplasia 19(8):649–658
Ma RC et al (2013) Genome-wide association study in a Chinese population identifies a susceptibility locus for type 2 diabetes at 7q32 near PAX4. Diabetologia 56(6):1291 – 305
Serba DD et al (2015) Transcriptome Profiling of Rust Resistance in Switchgrass Using RNA-Seq Analysis The Plant Genome, 8(2): p. plantgenome2014.10.0075
Catterall WA et al (2005) International Union of Pharmacology. XLVIII. Nomenclature and structure-function relationships of voltage-gated calcium channels. Pharmacol Rev 57(4):411–425
Xu X et al (2022) The role of insulin-like growth factor 2 mRNA binding proteins in female reproductive pathophysiology. Reproductive Biology Endocrinol 20(1):89
Yasin EB et al (2022) Methylation of the suppressor of cytokine signaling 1 gene (SOCS1) in Philadelphia-negative myeloproliferative neoplasms. Eur Rev Med Pharmacol Sci 26(22):8451–8458
Rafiemanesh H et al (2016) Colorectal cancer in Iran: epidemiology and morphology trends. Excli j 15:738–744
Power E et al (2011) Assessing awareness of colorectal cancer symptoms: measure development and results from a population survey in the UK. BMC Cancer 11:366
Hu T et al (2016) Mechanisms of drug resistance in colon cancer and its therapeutic strategies. World J Gastroenterol 22(30):6876–6889
Orłowska J (2010) Serrated polyps of the colorectum: histological classification and clinical significance. Pol J Pathol 61(1):8–22
Liu X et al (2020) Identification of crucial genes and pathways associated with colorectal cancer by bioinformatics analysis. Oncol Lett 19(3):1881–1889
Zhang P et al (2021) Bioinformatics analysis of candidate genes and pathways related to hepatocellular carcinoma in China: a study based on public databases. Pathol Oncol Res 27:588532
Komori T et al (2004) Identification of differentially expressed genes involved in colorectal carcinogenesis using a cDNA microarray. J Exp Clin Cancer Res 23(3):521–527
Wentzensen N et al (2004) Identification of differentially expressed genes in colorectal adenoma compared to normal tissue by suppression subtractive hybridization. Int J Oncol 24(4):987–994
Ng JM-K, Yu J (2015) Promoter hypermethylation of tumour suppressor genes as potential biomarkers in colorectal cancer. Int J Mol Sci 16(2):2472–2496
Mäki-Nevala S et al (2021) Immunoprofiles and DNA methylation of inflammatory marker genes in ulcerative colitis-associated colorectal tumorigenesis. Biomolecules 11(10)
Chakraborty C (2006) Biochemical and molecular basis of insulin resistance. Curr Protein Pept Sci 7(2):113–121
Acknowledgements
The authors would like to thank all the staff of the Cancer Department in the Research Institute for Gastroenterology and Liver Diseases, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
Funding
The research has been supported by the Research Institute for Gastroenterology and Liver Diseases of the Shahid Beheshti University of Medical Sciences.
Author information
Authors and Affiliations
Contributions
“Conceptualization, SN.M and E.NM.; Methodology, N.Z and MY.T; Software, SN.M, N.F and E.M ; Validation, S.S and V.CH ; Formal Analysis, SN.M and E.M.; Investigation, SN.M and L.R.; Writing – Review & Editing, SN.M and N.F; Visualization, E.NM and P.KM ; Supervision, E.NM.“All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Ethical approval
The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Ethics No. IR.SBMU.RIGLD.REC. 11399.539 of the Research Institute for Gastroenterology and Liver Disease, Shahid Beheshti University of Medical Sciences, Tehran, Iran.
Consent to participate and publication
Written informed consent obtained from all participants for participation and publication of the information in this survey.
Competing interests
The authors declare no competing interests.
Conflict of interest
The authors state that there are no conflicts of interest regarding the publication of this article.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mirbahari, S.N., Fatemi, N., Savabkar, S. et al. Unmasking early colorectal cancer clues: in silico and in vitro investigation of downregulated IGF2, SOCS1, MLH1, and CACNA1G in SSA polyps. Mol Biol Rep 51, 764 (2024). https://doi.org/10.1007/s11033-024-09683-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11033-024-09683-3