Introduction

Colorectal cancer (CRC), ranking as the third most frequently diagnosed cancer worldwide with a notable mortality rate1, is influenced by a combination of germline mutations in high penetrance genes (60% cases) and lifestyle-related factors (40% cases)2,3. The individual susceptibility to CRC is intricately tied to genetic polymorphisms, wherein certain variations exhibit noteworthy ethnic disparities, reflecting diverse frequencies of mutant genotypes across populations4,5,6.

Numerous studies have identified specific genes and their single nucleotide polymorphisms (SNPs) associated with CRC in various ethnic groups. For instance, the FABP2 gene, encoding the fatty acid-binding protein 2 (FABP2), demonstrates a crucial role in the uptake, transport, and regulation of fatty acids, with polymorphisms in this gene linked to insulin resistance a well-established CRC risk factor in Han Chinese populations7. This emphasizes the intricate interplay between genetic variations, lifestyle factors, and CRC susceptibility, shedding light on the diverse molecular landscape of this complex disease.

The delicate equilibrium between xenobiotic absorption and elimination plays a crucial role in safeguarding against deoxyribonucleic acid (DNA) damage induced by chemical carcinogens. A well-established fact is that both exogenous (xenobiotics) and endogenous chemical carcinogens necessitate biotransformation into activated forms to manifest their carcinogenic potential8.

The Cytochrome P450 2E1 (CYP2E1) gene encodes an enzyme within the cytochrome P450 superfamily, pivotal for the detoxification and activation of various low molecular compounds. These encompass ethanol, acetone, and pharmaceuticals like acetaminophen, isoniazid, chlorzoxazone, and fluorinated anesthetics, as well as several procarcinogens such as benzene, N-nitrosodimethylamine, and styrene4. Beyond detoxification, these enzymes frequently facilitate the metabolic activation of procarcinogens into their ultimate carcinogenic forms9. Mutations in the CYP2E1 gene, responsible for encoding the CYP2E1 enzyme, can yield enzyme variants with varying activity levels-higher, lower, or absent. Consequently, tailoring therapy based on individual’s CYP2E1 genotype has the potential to enhance drug efficacy4.

Reportedly, a myriad of mutations (approximately 200) and aberrant expression in the TP53 gene have been implicated in various cancers10. Epidemiological evidence underscores that TP53 mutations are prevalent in around half of all colorectal cancers11,12. The TP53 gene encodes the p53 protein, a transcription factor pivotal for maintaining genomic integrity through surveillance of cell cycle progression and cell survival13. Renowned as the 'guardian of the genome,' the p53 protein is recognized for stimulating the expression of multiple pro-apoptotic signaling molecules, including FOXO1, FOXO3, and TRAIL, while concurrently orchestrating DNA repair and cell division regulation14.

Additionally, p53 is known to activate the murine double minute 2 (MDM2) protein at the transcriptional level. MDM2, in turn, acts as a crucial negative regulator of TP53 by intensifying proteasomal degradation, particularly when there is an excess of p53 protein15. This intricate relationship suggests that MDM2 overexpression may impede p53 function, allowing damaged cells to evade cell cycle checkpoint control and potentially evolve into carcinogenic entities16.

While the association of single nucleotide polymorphisms (SNPs) in TP53, MDM2, CYP2E1, and FABP2 with colorectal cancer (CRC) has been explored in various populations, there remains a critical void in the literature pertaining to such investigations in the Pakistani context. Initially classified as a low-risk zone for CRC, recent findings by Bhurgri et al.17 have documented a notable surge in CRC cases among individuals aged 50 and above in Pakistan. This increased risk is attributed to factors such as the elevated consumption of preserved foods, and animal products, smoking, excessive alcohol intake, and inflammatory bowel disease. Compounding this, diminished physical activity levels coupled with a significant rise in obesity prevalence further contribute to the heightened CRC risk within the Pakistani population18.

Despite recommendations advocating clinical and genotypic screening as an integral part of future health sector planning, particularly focusing on high-risk younger age groups, implementation of these suggestions has been lacking17. Recognizing this gap, the current study seeks to address whether the examined SNPs, including rs1799883 in FABP2, rs3813865 in CYP2E1, rs1042522 in TP53, and rs2279744 in MDM2, individually or in combination, are correlated with CRC in the Pakistani population. By doing so, we aim to provide crucial insights into the genetic determinants of CRC susceptibility specific to Pakistan, thereby contributing to the development of targeted preventive and therapeutic strategies for this population.

Materials and methods

Subjects and data collection

The cross-sectional case–control study was conducted at the Institute of Molecular Biology and Biotechnology from May 2020 to March 2021, encompassing clinically confirmed colorectal cancer (CRC) patients and healthy controls. Blood samples from CRC patients (N = 100) were procured from the Multan Institute of Nuclear Medicine and Radiotherapy (MINAR) and the Oncology ward of Nishter Medical University and Hospital, Multan, Pakistan. Patient diagnosis relied on carcinoembryonic antigen (CEA) levels and histological evidence obtained through endoscopic biopsy. Enrolled subjects, representing diverse cities in Southern Punjab, exhibited variations in ethnic origins, gender, and age. Controls (N = 100) were age-matched to cases and exhibited no colorectal cancer or systemic illnesses. Sample size estimation during random collection employed Solvin’s formula, calculated as follows:

$$ {\text{n }} = {\text{ N/}}\left( {{1} + {\text{N }}*{\text{ e}}^{{2}} } \right). $$

whereas: n = no. of samples, N = total population, and e = margin of error19.

After obtaining informed consent, a comprehensive questionnaire was administered to all participants to gather essential epidemiological data, covering aspects such as age, gender, marital status, family history, and habits related to smoking and exercise.

Blood collection and DNA extraction

A blood sample of 3–5 ml was collected from each participant and stored at − 4 °C in Ethylene Diamine Tetra Acetic acid (EDTA)-coated vials until further analysis. DNA extraction from the whole blood was performed using an inorganic DNA extraction protocol, as detailed by Arshad et al.19.

Amplification and genotyping of rs1799883 in FABP2

For genotyping the Ala54Thr SNP (rs1799883) in the FABP2 gene, a polymerase chain reaction-restriction fragment length polymorphism (PCR–RFLP) approach was employed. Oligonucleotide primers, as per Baier et al.20 were utilized for amplification: forward primer 5ʹ-ACAGGTGTTAATATAGTGAAAAG-3ʹ and reverse primer 5ʹ-TACCCTGAGTTCAGTTCCGTC-3. A reaction mixture of 20 µl was prepared that contained 13 mM Tris–HCl (pH 8.3), 65 mM KCl, 1.5 mM MgCl2, 300 µM of each dNTP,1U of DNA Polymerase (Thermo Scientific, United States), 0.5 µM of each primer and 5 µl of template DNA20. The amplification of DNA was performed using a thermal cycler (Gene Amp™ PCR system 2700, Applied Biosystems Inc., United Kingdom). The thermo-profile included an initial denaturation at 95 °C for 3 min, followed by 30 cycles of denaturation at 95 °C for 30 s, annealing at 55 °C for 30 s, and elongation at 72 °C for 45 s. The final extension was carried out for 3 min at 72 °C21. Subsequently, the amplified product underwent restriction using Hha1, a restriction enzyme isolated from Haemophilus haemolyticus bacteria (New England BioLabs, USA). This step was conducted in a 20 µl reaction mixture composed of 10 µl PCR product, 2 µl CutSmart buffer, and 7 µl double-distilled water, maintained at 37 °C for 1 h21. The resolution of the restriction products was achieved using a 2% agarose gel and visualized on an ultraviolet (UV) transilluminator (Biostep, Germany).

Amplification and genotyping of rs3813865 in CYP2E1

A T-ARMS–PCR was employed to amplify rs3813865 (G/C), a single nucleotide polymorphism (SNP) in the CYP2E1 gene, following the protocol outlined by Suhda et al.9. The primers utilized in this T-ARMS PCR were as follows: outer forward 5ʹ TGA TGT TGG TTG GGC ATC TA 3ʹ, outer reverse 5′ CCTCGA GGT GAG AAC TGA CA 3′, inner forward 5ʹ CTC ACC CCA CCA AAG CCT AC 3ʹ, inner reverse 5ʹ-CCA CAG ACT GAA ATT GAA CCC 3ʹ. A reaction mixture of 25 µl was prepared that contained 13 mM Tris–HCl (pH 8.3), 65 mM KCl, 2 mM MgCl2, 300 µM of each dNTP,1U of DNA Polymerase (Thermo Scientific, United States), 0.5 µM of each primer and 5 µl of template Subsequently, the amplification of rs3813865 in CYP2E1 was conducted under the following thermal conditions: initial denaturation for 5 min at 95 °C, followed by 30 cycles of denaturation for 30 s at 95 °C, annealing for 30 s at 56 °C, elongation for 30 s at 72 °C, and a final extension for 7 min at 72 °C9.

Amplification and genotyping of rs1042522 in TP53

A T-ARMS–PCR analysis was performed to investigate the rs1042522 (G/C) single nucleotide polymorphism (SNP) in the TP53 gene, utilizing the methodology outlined by Asadi et al.11. The primers employed in this T-ARMS PCR were as follows: outer forward 5ʹ TGCAGGGGGATACGGCCAGGCATTGAAGTC 3ʹ, outer reverse 5ʹ TGGGGGGCTGAGGACCTGGTCCTCT 3ʹ, inner forward 5ʹ GCTGCTGGTGCAGGGGCCAGGG 3ʹ, inner reverse 5ʹ CCAGAATGCCAGAGGCTGCTCCGCG 3ʹ. A reaction mixture of 25 µl was prepared that contained 13 mM Tris–HCl (pH 8.3), 65 mM KCl, 1.5 mM MgCl2, 300 µM of each dNTP,1U of DNA Polymerase (Thermo Scientific, United States), 0.5 µM of each primer and 5 µl of template DNA. For the amplification of rs1042522, the thermal profile included an initial denaturation for 10 min at 95 °C, followed by 25 cycles of denaturation at 95 °C for 45 s, annealing at 56 °C for 30 s, elongation at 72 °C for 45 s, and a final extension for 10 min at 72 °C11.

Amplification and genotyping of rs2279744 in MDM2

A T-ARMS–PCR analysis was carried out to investigate the rs2279744 (G/T) single nucleotide polymorphism (SNP) in the MDM2 gene, following the protocol outlined by Xiao et al.22. The primers utilized in this T-ARMS PCR were as follows: outer forward 5ʹ GGC AGTCGCCGCCAGGGAGGAGGGCGG 3ʹ, outer reverse 5ʹ TGCCCACTGAACCGGCCCAATCCCGCCCAG 3ʹ, inner forward 5ʹ GGGGGCCGGGGGCTGCGGGGCCGTTT 3ʹ, inner reverse 5ʹ ACCTGCGATCATCCGGACCTCCCGCGCTGC 3ʹ. A reaction mixture of 25 µl was prepared that contained 13 mM Tris–HCl (pH 8.3), 65 mM KCl, 1.5 mM MgCl2, 300 µM of each dNTP,1U of DNA Polymerase (Thermo Scientific, United States), 0.5 µM of each primer and 5 µl of template For the amplification of rs2279744, the thermal profile comprised an initial denaturation for 10 min at 95 °C, followed by 25 cycles of denaturation at 95 °C for 45 s, annealing at 52 °C for 30 s, elongation at 72 °C for 45 s, and a final extension for 10 min at 72 °C22.

Statistical analysis

The analysis of data was performed using Minitab version 18 (Minitab, USA). Genotypic frequencies were ascertained through direct counting. The chi-square test was employed to compare genotype and allelic frequencies between cases and controls, as well as to assess the correlation between colorectal cancer (CRC) and the investigated risk factors. All statistical tests were two-tailed, with a significance level set at P < 0.05 to establish statistical significance. Odds ratio and adjusted odds ratio were calculated where required.

Ethics approval and consent to participate

Ethical Research Committee of the Institute of Pure and Applied Biology at Bahauddin Zakariya University Multan (Pakistan) approved all the experimental procedures and protocols applied in this study via letter number IP&AB/Ethics/41/2020. All the subject handling procedures and experimental protocols were performed in accordance with the Declaration of Helsinki. Informed consent was obtained from all subjects involved in the study.

Results

Genotypic and allelic frequency at rs1799883 in FABP2 and their association with CRC

The PCR successfully amplified a 180 bp amplicon from FABP2. Subsequent restriction with Hha1 resulted in a distinct pattern: homozygous dominant individuals (GG) exhibited two bands (81 bp and 99 bp), heterozygous (GA) subjects displayed three bands (81 bp, 99 bp, and 180 bp), and the uncut 180 bp product indicated homozygous mutant individuals (AA) (Supplementary Fig. 1A). Data analysis demonstrated significant variation in both genotypes and allelic frequencies at rs1799883 in FABP2 when comparing the case and control groups (P = 0.05). Notably, cases exhibited a higher frequency of the homozygous dominant (GG) genotype, whereas the control group had a higher frequency of the heterozygous (GA) genotype. The odd ratio (OR) of 1.47 is indicating that cases have 1.47 fold higher risk than controls to develop CRC (Table 1).

Table 1 Distribution of genotypic and allelic frequencies of rs1799883 in FABP2, rs3813865 in CYP2E1, rs1042522 in TP53, and rs2279744 in MDM2 gene among cases and controls and their possible association with colorectal cancer.

Genotypic and allelic frequency at rs3813865 in CYP2E1 and their association with CRC

The Tetra-primer Amplification Refractory Mutation System–Polymerase Chain Reaction (T-ARMS–PCR) generated a 499 bp amplicon for the outer primers, 303 bp for homozygous wild type (GG), and 236 bp for homozygous mutant (CC), while the heterozygous (GC) configuration produced all three bands of 499, 303, and 236 bp (Supplementary Fig. 1B). Analysis of genotypes and allelic frequency distribution unveiled significant variation (P = 0.006) in genotypic frequencies at rs3813865 in CYP2E1 when comparing patients and controls. Notably, the frequency of heterozygous alleles (GC) was higher in CRC patients, whereas homozygous mutant alleles (CC) were more prevalent in the control group. The OR values are indicating that cases has 1.56 fold more risk than controls to develop CRC (Table 1).

Genotypic and allelic frequency at rs1042522 in TP53 and their association with CRC

The T-ARMS–PCR yielded a 493 bp amplicon for the outer primers, 247 bp for homozygous wild type (CC), and 200 bp for homozygous mutant (GG), while the heterozygous (GC) configuration produced all three bands of 493, 247, and 200 bp (Supplementary Fig. 1C). Genotypic and allelic frequency analysis uncovered significant variation (P = 0.05) in genotypic frequencies at rs1042522 in TP53 when comparing patients and controls. Remarkably, the frequency of homozygous mutant alleles (GG) was higher in CRC patients, whereas heterozygous alleles (GC) were more prevalent in the control group. OR analysis revealed that cases had 3.5 fold higher risk to develop CRC than controls (Table 1).

Genotypic and allelic frequency at rs2279744 in MDM2 and their association with CRC

The T-ARMS–PCR produced a 224 bp amplicon for the outer primers, 158 bp for homozygous wild type (GG), and 122 bp for homozygous mutant (TT), while the heterozygous (GT) configuration generated all three bands of 224, 158, and 122 bp (Supplementary Fig. 1D). Analysis of genotypic distribution and allelic frequencies indicated that none of the genotypes at rs2279744 in MDM2 were found to be associated with CRC (P = 1). OR analysis indicated that both case and controls had equal susceptibility to develop CRC (OR = 0.97) (Table 1).

Interactions between SNPs and their association with CRC

Analysis of genotype frequencies revealed that certain combinations of genotypes at two or more single nucleotide polymorphisms (SNPs) significantly increased the risk of developing colorectal cancer (CRC). Protective combinations, with significantly higher frequencies among controls, contrasted with higher-risk combinations more prevalent among cases (Table 2). Comparing the genotype combination of rs1799883 (FABP2) and rs3813865 (CYP2E1) revealed that individuals with the wild type (GG) at FABP2 and the heterozygous (GC) genotype at CYP2E1 had a significantly higher incidence of CRC development (P = 0.02) (Table 2). Similarly, comparing rs1042522 (TP53) and rs2279744 (MDM2) genotype combinations demonstrated that individuals with the wild type (GG) at TP53 and the mutant (TT) genotype at MDM2 had a significantly higher incidence of CRC development (P = 0.05) (Table 2). The most CRC-susceptible genotypes were observed when different genotype combinations of rs3813865 (CYP2E1) and rs1042522 (TP53) were examined. Specifically, all three genotypes of CYP2E1 at rs3813865 were found to be susceptible to CRC when present with either wild-type (GG) or heterozygous (GC) genotypes at rs1042522 in TP53 (P = 0.04) (Table 2). However, when all genotypes of the four studied SNPs were analyzed in combination, no specific combination types were found to be susceptible to developing CRC (P = 0.801) (Table 2).

Table 2 Genotype frequency distribution among cases and controls for various combinations at FABP2 rs1799883, CYP2E1 rs3813865, TP53 rs1042522, and MDM2 rs2279744 and their possible association with colorectal cancer.

Association of Risk Factors and CRC

Upon analyzing various clinical risk factors, significant differences were observed in marital status (P < 0.001), smoking (P < 0.001), family history (P = 0.002) and exercise habits (P = 0.01) when comparing controls to patients with CRC (Table 3). Additionally, there were no statistically significant differences in age and gender (P > 0.05). Odd ratio analysis indicated that married subjected (OR = 1), smokers (OR = 5.27), subjects having family history (OR = 2.64) and those do not have exercise habit (OR = 1) had higher susceptibility to develop CRC during present investigation (Table 3).

Table 3 Analysis of the clinical risk factors associated with colorectal cancer.

Discussion

The incidence of colorectal cancer (CRC) is on the rise in Pakistan, yet the country lacks a specific CRC control program or a national cancer registry. Due to limited health care facilities, especially in rural areas, diagnosis of CRC patients in Pakistan is often delayed and usually patients are diagnosed at an advanced stage of the disease23. Moreover, there is a dearth of data in the literature regarding the genetic determinants of CRC in the Pakistani population. A comprehensive literature review identified only one study reporting an association between the deletion of the Glutathione S transferase M1 gene and the risk of CRC development in individuals from Khyber Pakhtunkhwa, Pakistan24. Given this gap in knowledge, the present study was undertaken to investigate the association of genotypes at rs1799883 in FABP2, rs3813865 in CYP2E1, rs1042522 in TP53, and rs2279744 in MDM2, both individually and in various combinations, with the development of CRC in subjects enrolled from Punjab, Pakistan.

The FABP2 protein plays a crucial role in the absorption and intracellular transport of long-chain fatty acids. The rs1799883 single nucleotide polymorphism (SNP) in the FABP2 gene leads to a threonine-for-alanine substitution, altering the structure and function of FABP225. Individuals with this genetic variant exhibit increased fatty acid uptake in the intestinal lumen, as the variant protein binds to fatty acids with double the affinity compared to the wild-type protein26. Our analysis revealed an association between rs1799883 in the FABP2 gene and colorectal cancer (CRC), with cases exhibiting a higher frequency of the homozygous dominant (GG) genotype, while controls had a higher frequency of the heterozygous (GA) genotype (Table 1).

In contrast to our findings, studies by Andersen et al.27, Hu et al.7, and Kato et al.28 reported no association between any genotypes at rs1799883 in FABP2 and CRC development in Danish, Han Chinese populations, and residents of the Metropolitan Detroit Tri-County area in the US. This observed discrepancy in results emphasizes the potential role of ethnicity in CRC. Additionally, the genotyping technique employed for the studied SNP may contribute to the variation in results.

Susceptibility to colorectal cancer (CRC) varies among individuals due to differences in their metabolism and detoxification potential of gastrointestinal carcinogens. Environmental and genetic factors both play significant roles in CRC development24. Our results demonstrated a significantly higher frequency of heterozygous alleles (GC) in CRC patients compared to controls at rs3813865 in CYP2E1, indicating an association with CRC incidence (Table 1). However, the association between rs3813865 in CYP2E1 and CRC has been observed in a limited number of studies. Our findings align with Tang et al.4, who reported an association between genotypes at rs3813865 and CRC in Chinese populations.

In our investigation, we analyzed one of the most commonly studied SNPs in exon 4 of the TP53 gene, rs1042522 (CCC changes to CGC), resulting in the Pro72Arg amino acid change. This nucleotide alteration disrupts the Proline-rich region of the p53 protein, impacting its normal function in cell cycle regulation and DNA repair, ultimately contributing to cancer development12. Consistent with these findings, our study revealed a higher frequency of homozygous mutant alleles (GG) in CRC cases, while heterozygous alleles (GC) were more frequent in controls (Table 1). Our results align with previous studies reporting an association between rs1042522 in TP53 and CRC in subjects from Jammu and Kashmir29, Denmark30, China10, and Bangladesh31. Conversely, Asadi et al.11 and Polakova et al.32 reported no significant differences in allele prevalence at rs1042522 in the TP53 gene between patients and controls in Iranian Azari and Czech Republic populations, respectively.

The human MDM2 gene, located on chromosome 12q13–14, possesses two promoters, a constitutive and a p53-responsive intronic promoter33. A common polymorphism (rs2279744, T309G) in the MDM2 promoter has been shown to increase MDM2 mRNA and protein expression by altering Sp1-binding affinity, thereby inhibiting p5334. In our study, none of the genotypes at rs2279744 in MDM2 were found to be associated with colorectal cancer (CRC) (Table 1). Numerous studies have explored the association of rs2279744 in MDM2 with CRC, yielding controversial and inconclusive results. Similar to our findings, Talseth et al.35 found no association of any genotype at rs2279744 with CRC in subjects from Australian and Polish populations. Alhopuro et al.36 also reported no association between genotypes at rs2279744 and CRC patients of Finnish origin. Conversely, Yueh et al.37, Atabey et al.15, and Liu et al.38 reported a significant association of rs2279744 in MDM2 in Taiwanese, Turkish, and Chinese populations, respectively.

Results from studies examining individual SNPs are inconsistent, as each SNP alters the function of a single gene among many involved in carcinogenesis. Disease progression may result from the interaction of several proteins. Considering this, we performed SNP-SNP interaction analysis, exploring various possible SNP combinations from the four SNPs screened in our study to identify combinations most likely associated with CRC risk. Our results indicated that combinations of various SNPs at rs1799883 (FABP2) and rs3813865 (CYP2E1), rs1042522 (TP53), and rs2279744 (MDM2), as well as genotype combinations at rs3813865 (CYP2E1) and rs1042522 (TP53), were associated with a higher incidence of CRC. Our findings support those of Zhang et al.39, who reported that the combined effect of TP53 Arg72Pro (rs1042522) and MDM2 SNP309 (rs2279744) variant genotypes increased CRC risk in a Chinese population. Additionally, we identified two new SNP combinations associated with CRC development in our study, which should be further analyzed in other populations to confirm their association with CRC.

Analysis of the studied risk factors revealed that unmarried status, smokers, and subjects with exercise habits but no family history were at a higher risk of developing colorectal cancer (CRC). Numerous studies worldwide have reported risk factors associated with CRC across various ethnic groups. Bhattacharya et al.3 reported that individuals of Indian origin living in urban areas, those who were obese, smokers and those with a history of non-vegetarian dietary intake were more susceptible to CRC. Diet composition, especially one rich in N-nitroso compounds, has been documented as having a significant association with the development of gastrointestinal tract cancers40.

During the present study, it was also observed that males and patients older than 50 years were more susceptible to CRC than females and subjects younger than 50, although these differences did not reach statistical significance (Table 3). Our results align with Zubair et al.24, who reported that the age and sex of subjects were not associated with the development of CRC in individuals enrolled from Khyber Pakhtunkhwa in Pakistan. In contrast to our observations, Bhattacharya et al.3 reported a significantly increased risk of developing CRC with increasing age, with individuals older than 50 having a higher incidence of CRC in the Indian population3. Similar to our findings, Bhattacharya et al.3 also observed an increased risk of CRC in males compared to females, although the difference did not reach statistical significance.

The current study has several limitations that need addressing. First, we aimed to genotype as many patients and healthy individuals as possible. The larger the sample size, the more reliable the observations and results. However, we finally analyzed 100 CRC cases and 100 healthy controls. Secondly, the number of genes and SNPs related to CRC in our study was limited, whereas many other genes have been proven to be associated with CRC. There is a need to explore more genes and SNPs related to CRC to accurately assess susceptibility genes for CRC.

While our study provides valuable insights into the genetic factors associated with colorectal cancer (CRC) susceptibility in the Southern Punjab population, it is important to acknowledge certain limitations. The modest sample size of 100 CRC cases and 100 controls, although informative, suggests the potential for broader investigations to enhance statistical robustness and generalize findings more effectively. Our genetic focus on specific genes (FABP2, CYP2E1, TP53, and MDM2) and associated SNPs undoubtedly contributes to the understanding of CRC risk factors. However, there exists an opportunity to explore a wider array of genetic variations to uncover a more comprehensive picture. Geographically, our study concentrated on Southern Punjab, thereby offering insights specific to that region. While insightful, this regional focus may limit the broader applicability of findings. Future research should consider expanding to diverse regions in Pakistan to capture a more representative spectrum of CRC patterns. In terms of study design, the cross-sectional case–control approach suits genetic associations well. However, establishing causation necessitates longitudinal studies, an aspect for consideration in future research endeavors. Our study primarily delved into genetic factors, excluding detailed examinations of gene-environment interactions. Exploring these interactions could provide a more nuanced understanding of CRC risk factors. While we recognize these limitations, it is important to emphasize that our study forms a valuable foundation for understanding CRC susceptibility in the Southern Punjab population. The insights gained pave the way for future research that addresses these limitations and further refines our understanding of CRC in the Pakistani context.

Conclusion

In conclusion, our study sheds light on the intricate interplay of genetic factors in colorectal cancer (CRC) susceptibility within the Southern Punjab population. The investigation identified significant associations between specific single nucleotide polymorphisms (SNPs) in FABP2, CYP2E1, TP53, and MDM2 genes and the risk of CRC development. Notably, the observed variations in genotypic frequencies underscore the ethnic diversity in CRC susceptibility, emphasizing the need for region-specific considerations in genetic studies. The uniqueness of our study lies in its focused examination of distinct SNPs within key genes associated with CRC, providing a foundation for understanding the genetic landscape of CRC in Southern Punjab. These findings contribute to the evolving field of personalized medicine, paving the way for tailored screening and preventive strategies based on an individual's genetic profile. Moving forward, prospective studies with expanded sample sizes, the inclusion of additional genetic markers, and the exploration of gene-environment interactions could unravel more intricate details of CRC etiology in the Pakistani context. Additionally, our results underscore the importance of incorporating genetic screening into comprehensive health planning, offering potential avenues for targeted interventions and improved CRC management in the region.