Background

Congenital heart disease (CHD) is a type of structural abnormality present in the heart and great vessels since birth. It is a common congenital anomaly that affects approximately 8 out of 1000 live births globally[1]. CHD is a significant cause of birth defect-related mortality [2, 3]. The prevalence of CHD is increasing globally, with unmet diagnostic needs observed in Africa [4]. CHD has a multifactorial basis that involves genetic and environmental components [5, 6].

The NKX 2.5 transcription factor is an important protein that plays a vital role in the development of the heart. It is a highly conserved homeobox transcription factor and consists of two exons that encode 324 amino acids. It is situated on chromosome 5q34 [7, 8] and is expressed in the early cardiac mesoderm [9]. It is a homo-domain transcription factor that acts as a central regulator of cardiac development and is expressed in both the first and second heart fields, known as FHF and SHF, respectively [10, 11]. Various studies conducted on animal models have shown that NKX2.5 knockout mice are embryonically lethal, highlighting the critical role of this factor in cardiac development [12, 13].

Over the past decade, it has been the focus of extensive research, providing valuable molecular insights into the mechanisms underlying heart development and related disorders. The proper functioning of NKX2-5 is essential for normal cardiac development, and any abnormalities in this gene have been linked to congenital heart disorders in humans [14]. In fact, NKX2.5 mutations have been identified as key factors responsible for various forms of CHD, making this transcription factor a critical target for further study and potential therapeutic interventions [15].

People with CHD have been found to have over 40 heterozygous NKX2-5 germline mutations; these mutations are dispersed throughout the coding area and affect protein function [16]. Heterozygous mutation of human NKX2.5 has been associated with various congenital heart diseases such as atrial septal defect (ASD), ventricular septal defect, tetralogy of Fallot, and tricuspid valve abnormalities, including Ebstein's anomaly [17]. Furthermore, NKX2.5 mutations have been associated with atrial septal defects and hypoplastic left heart syndrome (ASD and HLHS) and have also been found in patients with tetralogy of Fallot, secundum ASD, truncus arteriosus, double-outlet right ventricle, TOF, and familial ASD [18,19,20,21,22,23]. Mutation of human NKX2.5 has been associated with various congenital heart diseases and can also cause atrioventricular (AV) conduction block in a patient with CHDs [17].

Patients with atrial septal defect (ASD) who have mutations in the NKX2-5 gene are more likely to develop conduction disturbances, which can eventually lead to sudden cardiac death. For such patients, it is highly recommended to consider implantation of an implantable cardioverter defibrillator that can effectively prevent sudden cardiac death [24]. Studies have reported that point mutations in the NKX2-5 gene can lead to the development of familial ASD (autosomal dominant form)[25]. Additionally, heterozygous mutations in the same gene have been linked to the occurrence of atrial septal defect (ASD) and/or atrioventricular (AV) conduction disturbance in certain families [26, 27]. The NKX2.5 gene is also associated with congenital heart diseases, with two specific single nucleotide polymorphisms (SNPs), rs2277923 and rs703752, having been identified in previous research [28,29,30,31,32]. However, the prevalence of these SNPs and their correlation with congenital heart diseases remain unknown in our current setting. The NKX 2.5 mutations and their relationship to CHD characteristics are summarized in Table 1.

Table 1 NKX2.5 mutations and phenotypes reported in previous studies

In our country, advanced diagnostic facilities have become more widely available and as a result, the prevalence of congenital heart diseases (CHDs) has increased. Although both genetic and environmental factors have been suggested as possible causes of this condition, the exact etiology is still unknown. This lack of understanding about the factors contributing to CHDs can impede risk control measures, genetic counseling, and the ability to predict disease outcomes. Therefore, it is crucial to gain a better understanding of the etiological factors underlying CHDs as this knowledge can enhance the clinical management of the condition by enabling accurate prognosis predictions.

In individuals with CHDs, identifying mutations in the NKX2-5 gene is of utmost clinical significance as it helps in predicting the risk of progressive conduction disease and sudden cardiac death. Such genetic information is crucial for making informed decisions regarding the use of pacemakers and implantable cardiac defibrillators. Additionally, understanding the genetic basis of CHDs is essential for providing appropriate genetic counseling and developing risk-reduction strategies. In this study, we sequenced exon 1 and exon 2 of NKX 2.5 in patients with sporadic congenital heart diseases to determine NKX2.5 transcription factor gene mutations and the risk of congenital heart diseases among infants.

Methods

A retrospective case–control study was conducted among infants at the Jakaya Kikwete Cardiac Institute (JKCI), Tanzania . We used a convenience sampling technique to obtain 70 cases with isolated congenital heart diseases and 70 health control subjects. Echocardiography was performed by a pediatric cardiologist.

Blood sample collection and DNA extraction

Blood samples were collected for the purpose of DNA extraction using DBS paper. The blood was collected by filling each well on the DBS paper with a single drop of blood, which was then left to dry in the open air for one hour. The dried DBS paper was then packed in specialized air-tight parcels for transfer and subsequent DNA extraction.

For genomic DNA extraction, the Quick-DNA Miniprep Kit protocol was followed. The extracted DNA was then quantified and its quality was assayed by the use of a computer-based Thermo Scientific NanoDrop™ 1000 Spectrophotometer. This apparatus is capable of measuring microvolumes of dilute and concentrated DNA samples of varying concentrations with high accuracy and reproducibility. The DNA samples were then diluted accordingly to equal concentrations of 10 ng/µl for the next step of analysis.

Primer design and PCR amplification of NKX2.5 gene (Exon 1 and Exon 2) for mutation screening

To identify the reference gene sequence of NKX2.5, the National Center for Biotechnology Information (NCBI) was used. Based on published NKX2-5 coding regions (exon 1 and exon 2) sequences, allele-specific oligonucleotide primers kit was designed using bioinformatics pipelines Primer 3 software. The primers were validated using the Program Basic Local Alignment Search Tool (BLAST) and were also checked for hairpin, homodimer, and heterodimer using Integrated DNA Technologies (IDT) oligo Analyzer software.

Due to alternative splicing NKX2.5 has five transcript variants. The variant considered for this study was Homo sapiens NK2 homeobox 5 (NKX2-5), transcript variant 1, mRNA (NM_004387.4). Exon 1 and Exon 2 of the NKX2.5 gene were amplified using the designed primers. The primers for exon 1 were Forward Primer 5’- CGCCCTTCTCAGTCAAAGAC-3’ (20 bp), GC content 55%, and Reverse primer 3’- AAAGGCAGACGCACACTTG-5’ (19 bp), GC content 52.6%. The primers for exon 2 were Forward Primer 5’- CCTCAACAGCTCCCTGACTC-3’ (20 bp), GC content 60% and Reverse primer 3’- CTCATTGCACGCTGCATAAT-5’ (20 bp), GC content 45%. The designed primers (forward and reverse) were manufactured by Microgen Europe B.V. Company in Amsterdam, Netherlands.

For PCR amplification, AccuPower® PCR PreMix from Bioneer (Bioneer Corporation, 8–11 Munpyeongseo-ro, Daedeok-gu, Daejeon 306–220, Republic of Korea) was used on GeneAmp® PCR System 9700 (Applied Biosystems). The PCR mixture consisted of 2 μl of extracted DNA, 1 μl of forward primer, 1 μl of reverse primer, and 16 μl of nuclease-free water in a micro-tube containing AccuPower® PCR PreMix concentrate making a total reaction volume of 20 μl. Cycling conditions consisted of initial denaturation at 95◦C for 5 min followed by 35 cycles of 95◦C for 30 s, 57 ◦C for 30 s, and 72 ◦C for 45 min. A final extension at 72 ◦C for 5 min was performed to complete the extension.

Gel electrophoresis of PCR products

For the analysis of PCR products, a gel electrophoresis technique was used. A 1.5% agarose gel was prepared by dissolving 1.5 g agarose in 100 ml of 1X Sodium borate buffer and heating it on a hot plate until it was completely dissolved. The gel was then stained with 4 μl of GelRed® Nucleic Acid Stain. In each well of the gel, 4 μl of every sample and 4 μl of the 100-bp DNA ladder were loaded. The first well was loaded with the DNA ladder to indicate the size of any fragments. The voltage was set to 100 V, and electrophoresis was allowed to run for 40 min. The DNA fragments were observed as gray bands against a black background on Bio-Rad’s Gel Doc™ EZ Imaging System. Figures 1 and 2 show the gel electrophoresis results for exon 1 and exon 2, respectively.

Fig. 1
figure 1

Gel electrophoresis exon 1

Fig. 2
figure 2

Gel electrophoresis exon 2

DNA sequencing

The amplicons, which are amplified segments of DNA, were sent to Macrogen Europe (Meibergdreef 57, 1105 BA, Amsterdam, the Netherlands) for sequencing. The sequencing was performed on samples from cases and controls, with a sample size of 70 for each (Case: Control; n = 70 + 70). To conduct the sequencing, gene-specific primers were used to amplify the DNA segments. The PCR products were then purified and sequenced directly using the BigDye Terminator Cycle Sequencing Kit (Applied Biosystems, Foster City, CA, USA). The sequencing was conducted using a genetic analyzer (ABI 3730xl System from Applied Biosystems). After the sequencing was completed, the raw sequence data were cleaned, edited, and assembled using the Geneious Prime software, which is version 2021.2.2. The software was used to obtain consensus sequences, which are sequences that represent the most common base at each position of the DNA segment. Finally, the sequences were aligned to a reference genome known as hg37, using ClustalW, which is a software used to align multiple DNA or protein sequences. The alignment allowed for the identification of any mutations present in the DNA segments (Figs. 3 and 4).

Fig. 3
figure 3

Multiple sequence Alignment exon 1

Fig. 4
figure 4

Multiple sequence Alignment exon 2

Single nucleotide polymorphisms and the subsequent changes in the amino acids were detected. The obtained nucleotide sequences were also subjected to pair-wise alignment using the heuristic sequence alignment approach, Basic Alignment Search Tool to determine the identity and similarity of the study nucleotide sequences and protein sequences by comparing them with other published sequences available in the GenBank database.

The possible effects of genetic variants (mutation pathogenicity analysis) were predicted whether they are pathogenic or tolerated using computational analysis methods such as SIFT (Sorting Intolerant from Tolerant) [43, 44]; PolyPhen-2 (Polymorphism Phenotyping, version 2)[45, 46] MutationTaster [47]. The protein secondary structure was predicted using a Swiss model [48, 49].

For the SIFT score, amino acids with probabilities of less than 0.05 are predicted to be deleterious, and a SIFT score of greater than 0.05 is considered to signify a tolerated variant. PolyPhen-2 results are presented with qualitative levels of benign, potentially detrimental, and probably damaging. PolyPhen-2 prediction results have a numerical score range of 0 to 1. PolyPhen-2 variations with scores equal to or greater than 0.5 are anticipated to be deleterious. For the Mutation Taster, we classified an alteration as either a disease-causing polymorphism, a polymorphism probably harmless, or a polymorphism known to be harmless.

Results: for exon 1:

A non-synonymous mutation g 316C > T; cDNA 316C > T Leucine 37 Arginine in a patient with the tetralogy of Fallot was predicted by mutation taster to be disease-causing and by using PolyPhen; the mutation was predicted to be possibly damaging with a score of 0.655, sensitivity of 0.79, and specificity of 0.84. Another novel synonymous mutation g 2295–2298; cDNA 755–758 delins AGGG: in a ventral septal defect was predicted by mutation taster to be disease-causing. Not found in ExAC (exome aggregation consortium) or 1000G (Fig. 5).

Fig. 5
figure 5

Polyphen multiple sequence alignment prediction: amino acid leucine at mutation position 37 is highly conserved in the NKX2.5 protein in different species

Another non-synonymous g 225G > T; g 245C > A; g 257C > A; g 410G > T; 429 T > C, resulted in leucine48 Methionine substitution in a patient with patent ductous arteriosus. Using SIFT, the substitution leucine48 Methionine is predicted to affect protein function with a score of 0.04 and median sequence conservation of 3.37, and using polyphen, the mutation was predicted to be probably damaging with a score of 0.963, sensitivity of 0.62, and specificity of 0.92(Fig. 6).

Fig. 6
figure 6

Polyphen multiple sequence alignment prediction: the amino acid leucine at the mutation site is highly conserved in the NKX2.5 protein in different species

We found synonymous mutations g 457 T > G; g 63 T > G in a patient with atrial septal defects resulting in K110Q, F92V, and L41R. Using the SIFT algorithm, substitution at position 110 from K to Q is predicted to be TOLERATED with a score of 0.42. Median sequence conservation: 3.18 Sequences represented at this position. Substitution at position 92 from F to V is predicted to be TOLERATED with a score of 0.38. Median sequence conservation: 3.18. Sequences represented at this position:9. Substitution at position 41 from L to R is predicted to be TOLERATED with a score of 0.15. Median sequence conservation: 3.41 Sequences represented at this position also found synonymous mutations, g 11G > C; g 19G > A; g 71 insertion T was found in a patient with ventral septal.

For exon 2: We found an alteration in 3UTR;cDNA 1282 T > C;g2822T > C which was predicted to be polymorphism in a patient with ventral septal defect. In another sample, we found a novel frameshift, disease-causing mutation; c95_95 del A; cDNA.369–369 delA;g 369-369delA which resulted in phenylalanine 295 leucine substitution in patients with atrial-ventricular septal defect which resulted in a truncated NKX2.5 protein (Figs. 7,8 and 9).

Fig. 7
figure 7

Polyphen multiple sequence alignment prediction: amino acid phenylalanine at mutation position 295 is highly conserved in the NKX2.5 protein in different species

Fig. 8
figure 8

PSIPRED.A wild-type NKX2.5 PROTEIN

Fig. 9
figure 9

PSIPED.A mutant NKX2.5 PROTEIN (c95_95 del A in exon 2)

Discussion

The early development of the vertebrate heart is a complex and coordinated process that involves the specification and differentiation of myocardial and endocardial cells in the anterior lateral mesoderm shortly after gastrulation. This culminates in the formation and rightward looping of the early heart tube, which is the precursor to the mature heart[50].This intricate process involves multiple cell types and a highly regulated machinery of genetic events involving transcriptional [51, 52]. One of the key transcriptional factors involved in cardiac development is NKX2-5, which is a homeodomain-containing transcription factor that plays a critical role in the development of heart [37]. It is indispensable for normal cardiac development, and mutations of the gene are associated with human congenital heart diseases (CHD)[14]. NKX2-5 mutations alter a section of the protein called the homeodomain, which interferes with the ability of NKX2-5 to bind to DNA or associate with other important cardiac proteins called cofactors [53]. Humans who are born with congenital heart disease carry mutations in the gene that encodes this protein [54].

In a recent study, researchers have identified a single nucleotide polymorphism (SNP) of the NKX2.5 gene, which is associated with congenital heart defects (CHD) [55]. The study detected five different variants, including the previously documented p.R25C variant, in eleven families. Interestingly, the p.R25C variant was found in seven patients from different families and one healthy individual [56]. Other studies have also identified various genetic variations in different CHD phenotypes. For instance, a novel heterozygous DNA sequence variant 1433A > G was discovered in a patient with tetralogy of Fallot (TOF), while another was found in a patient with a persistent left superior vena cava [57]. Another study identified several single nucleotide polymorphisms (SNPs) (rs2277923, rs3729753, rs703752, and rs202071628), and a novel heterozygous DNA sequence variant (1500G > C) in the 3'UTR region of the NKX2-5 gene, in two [2] VSD patients [32]. In our study, we identified a novel single nucleotide polymorphism (SNP) in 3UTR of NKX2.5; cDNA 1282 T > C; g2822T > C in a patient with ventral septal defect. These findings provide valuable insights into the genetic architecture of CHD and may have important implications for the diagnosis and treatment of these conditions.

Furthermore, various mutation analyses carried out in different research studies have discovered a new kind of heterozygous NKX2.5 mutation, known as p.Q181X, in a patient who suffers from ASD [38]. Additionally, another study has identified two different types of NKX2-5 missense mutations—T178M—in a family with ASD but without AV (atrioventricular) conduction block. Moreover, the same study also found a missense change, E21Q, in a patient with ASD who did not have any AV block [23]. In our study, however, we found non-synonymous mutations c 457 T > G and c 63 T > G in a patient with atrial septal defects without AV block, which resulted in K110Q,F92V and L41R amino acid changes that were tolerated and not disease-causing.

In a previous study, researchers identified several genetic mutations associated with different cardiac conditions. Among the mutations identified were missense nucleotide substitutions, deletions, and a premature termination of translation. One of the mutations identified was a missense mutation (R190L) observed in patients with atrial septal defect type II (ASD-II), and a frame-shift mutation (A255fsX38) observed in a patient with hypoplastic left heart syndrome (HLHS)[20]. In our study, we discovered a novel frame-shift mutation in exon 2; c95_95 del A; cDNA.369-369 delA; g 369-369delA which resulted in a substitution of phenylalanine to leucine (F295L) in a patient with atrial-ventricular septal defect (AVSD). This genetic mutation caused a truncated NKX2.5 protein, which is typically associated with AVSD.

A common genetic variation, known as single nucleotide polymorphism (SNP), c.63 A > G (E21E), and c.606 G > C (L202L), has been identified among individuals with congenital heart diseases in the Chinese population [58]. Additionally, a study conducted on CHD patients found four NKX2.5 mutations. Among those, two were novel genetic variations in the coding region of exon 1 (c. 95 A > T and c. 93 A > T), while the other two were previously reported as single nucleotide polymorphisms (SNPs)—rs72554028 (c. 2357 G > A) and rs3729753 (c. 606 G > C) in exon [59].

In our study, however, we found a novel non-synonymous mutation in exon 2, g 316C > T; cDNA 316C > T Leucine 37 Arginine in a patient with the tetralogy of Fallot, which was predicted by mutation taster to be disease-causing. We also found a novel non-synonymous mutation g 2295–2298; cDNA 755–758 delins AGGG: in a ventral septal defect was predicted by mutation taster to be disease-causing. Another sequence variation c 225G > T; c 245C > A; c 257C > A; c 410G > T; c429T > C, which resulted in leucine48 methionine substitution in a patient with patent ductous arteriosus, was detected in our study.

These variations have never been reported from other studies. These findings could play a crucial role in understanding the underlying genetic mechanisms of these conditions and may help in the development of new treatment options.

Conclusions

Our research has led us to discover specific genetic variants that are linked to various phenotypes of congenital heart diseases. Through our analysis, we have identified that mutations in the NKX2.5 gene could play a crucial role in the development of structural cardiac defects in humans. Our findings have significant implications in the clinical management of patients with these mutations. We advise that individuals with NKX2.5 mutations should undergo regular screening to detect any potential cardiac conduction abnormalities. Additionally, they should be evaluated for the possibility of implanted cardiac defibrillators and pacemakers to manage any cardiac complications that may arise.