Introduction

Congenital heart disease (CHD) is a functional and structural defect in the heart that arises during embryonic development. The clinical outcome and anatomic details of CHD are varied [1, 2]. Globally, more than 1 million fetuses with CHD resulted in a high disease burden [3]. Over the past few decades, the complex cyanotic CHD resulted in a higher mortality and morbidity rate in low-and middle-income countries (LMICs) [4]. The overall prevalence in Asian regions is higher due to milder lesions [5]. The clinical spectrum of cardiac malformations is broad that can range from solitary lesions such as ventricular septal defect (VSD) or atrial septal defect (ASD) to complex forms, including tetralogy of Fallot (TOF). Single nucleotide polymorphisms (SNPs) in more than 400 genes are identified to cause CHD in patients [6].

Congenital heart malformation is a multifactorial disease that involves a complex interaction between genetic and environmental factors [7]. Environmental factors are implicated in 2 to 10% of CHD, including maternal chronic disease conditions such as maternal diabetes, obesity, infection, or nutritional deficiencies [8, 9]. In 90% of cases, molecular mechanisms underlying disease pathogenesis are still not completely understood [10]. However, linkage disequilibrium (LD) analysis help in a better understanding of complex genetic diseases [11]. Genes involved in cardiac development fall into three functional categories: structural proteins, transcriptional factors, and signaling molecules [12]. The NKX2-5 gene is the earliest known marker that expresses both in the fetal and adult heart located at chromosome 5, the 5q34-q35 region. Approximately 105 polymorphisms have been reported in this homeobox gene, including missense SNPs, deletion mutations, synonymous variants, and insertion polymorphisms. The two most studied markers are rs2277923 SNP results in the substitution of T to C (glutamine to glutamine at position 21 in the protein), and rs3729753 (C to G) is a synonymous variant (leucine to leucine at position 202 in the protein sequence) [13]. Similarly, rs703752 is 3 prime UTR variant in which A replaces the reference C base, [14] while rs28936670 (G > A) is a missense variant that replaces the arginine (R) by cytosine (C) at position 25 [15]. Another selected gene in this study was the left-right determination factor (LEFTY) which is the negative regulator of the Nodal pathway and thus plays an essential role during left-right patterning and cardiac morphogenesis. The LEFTY1 gene is primarily expressed on the prospective floor plate, and the LEFTY2 gene is expressed predominantly in the left lateral plate mesoderm [16]. Thus, a missense variant rs360057 (T > G) (mutates aspartate to alanine at position 322) from LEFTY1 and rs2295418 (G > A) that replaces the proline by leucine from LEFTY2 were selected. To the best of our knowledge, this was the first comparative study from the Pakistani population, and there is no SNPs information available for this disease in this population. The purpose of these SNPs analysis in the Pakistani subjects was 1) the Pakistani population is a unique ethnic group, these genetic markers showed inconsistent results with CHD in different ethnic groups but have not been analyzed in Pakistani CHD patients. 2) Comparative evaluation between different phenotypes of CHD can provide us better insights into disease pathophysiology and genetic susceptibility to CHD. 3) Association of maternal chronic disease conditions with SNPs of the NKX2-5, LEFTY1, and LEFTY2 in children with congenital heart malformations can further enhance our understanding of disease pathogenesis that may provide us with novel direction for future CHD prevention.

Materials and methods

Recruitment of study population

This case-control study was approved by the ethical committee of Children’s Hospital and The Institute of Child Health, Lahore (2021-282-CHICH) (2) Institutional Review Board Committee of Punjab University (84/DFEMS). This study followed the principles of the Declaration of Helsinki, and written informed consent was taken from the parents of diseased and healthy subjects before data and blood sample collection. A total of 376 subjects (151 controls, 225 cases) were recruited from hospitals for the study from 2021 to 2022. The CHD group was composed of non-syndromic patients. The CHD diagnosis was made through echocardiography and physical examination by pediatric cardiologists. All the included controls were healthy subjects that were age and gender-matched to the case group. Exclusion criteria were patients diagnosed with syndromic CHD, infectious diseases, and acquired heart diseases. A questionnaire was designed to record all the relevant clinical information from patients and parents.

Variants selection

The current study used an extensive literature survey through various databases, including the National Center for Biotechnology Information (NCBI) and the 1000 genome browser. The selection criteria for SNPs were based on their potential role in disease susceptibility reported in different populations and on identifying the polymorphisms that have not been investigated in Pakistani CHD patients. LEFTY1 and LEFTY2 genes’ key role is to regulate human embryonic stem cells and mesodermal cell differentiation. The SNPs rs2295418 and rs360057 are present in the transforming growth factor beta (TGFβ) domain of the LEFTY protein hence the selected SNPs can affect the growth regulation mechanism of the heart. The homeobox NKX2-5 gene plays a crucial role in the proliferation and differentiation of cardiac cells during embryonic development. The four SNPs in the NKX2-5 gene were selected based on their significant haplotype associations or LD pattern in the various populations.

Genomic DNA extraction and quantification

The DNA was extracted from 200–500µl of anticoagulated peripheral blood using the salting-out method. The DNA was quantified manually through the agarose gel and by SepctraMax384 (Molecular Devices, CA, USA) and diluted to a final working concentration of 25ng/µl.

Multiplex amplification PCR and genotyping by using minisequencing

Multiplex polymerase chain reaction was performed to amplify the genomic regions containing the selected SNPs. Amplification primers and single base extension (SBE) probes were designed using primer3 (version 4) and Batch Primer 3 software (MA, USA), respectively. PCR reactions were performed in a total volume of 10µl containing the following working concentrations of reaction components: 25ng/µl DNA, 1.7X Taq buffer, 0.4mM DNTP’s mix, 3mM MgCl2, 2µl primer mix (Eurofins Genomics), and 2.0 Units of Taq polymerase (Thermo Scientific). The PCR program consisted of initial denaturation at 95°C for 5min, followed by 30 cycles of 95°C for 45s, 62°C for 45s, extension at 72°C for 45s, and a final extension at 72°C for 10min. The variants were genotyped by using ABI Prism SNaPshot Multiplex Kit (Applied Biosystems, Foster City, CA, USA). The multiplex PCR product was purified by using Exonuclease-I (2 units/µl) and FastAP (3 units/µl) (Thermo Scientific). Both enzymes and PCR product were incubated at 37°C for 1 hour followed by 80°C for 15min. The SNaPshot PCR was performed in a total volume of 5µl containing 2µl PCR product, 2µl SBE primer mix, and 1µl of nuclease-free water. The minisequencing PCR program consisted of 25 cycles at 96°C for 10s, 50°C for 5s, and 60°C for the 30s. The final purification was done by using 0.7 units of FastAP and incubating the samples at 37°C for 1 hour followed by 75°C for 15min. The final reaction mix containing 0.05µl of GeneScan-120 LIZ internal size standard (Applied Biosystems), 8.5µl Hi-Di Formamide, 1.45µl purified SNaPshot product was denatured at 95°C for 5min. Finally, the ABI-3130XL genetic analyzer was used for capillary electrophoresis and genotyping of different SNPs was analyzed by fragment length and fluorescent signals. GeneMapper ID-X (Applied Biosystem) software was used to analyze these selected variants.

Statistical analysis

The chi-square test was used to analyze the allelic and genotypic frequencies and the odds ratio (OR) was computed at a 95% confidence interval (CI). Hardy-Weinberg equilibrium (HWE) was evaluated by the chi-squared goodness of fit test. Logistic regression was used to analyze the association of polymorphism with variants. Haploview was used to construct the linkage disequilibrium map. SHEsis, SNPStats, and GraphPad prism was used for statistical and graphical analysis. Sample size estimation was based on disease prevalence. Power analysis was conducted using online calculators online power calculator OSSE (http://osse.bii.a-star.edu.sg/calculation2.php) and QUANTO program. The cut-off p-value for all statistically significant tests was < 0.05.

Results

Baseline characteristics study

The case and control subjects were age and gender-matched. The VSD (46.22%) and TOF (36.44%) were the most common types in our pediatric population. The maximum age limit in both studied groups was up to 15 years. The baseline parameters for each CHD type are given in Table 1.

Table 1 Baseline parameters of population under study

Role of selected polymorphisms in CHD

The studied SNPs were in HWE (p > 0.05). Genotypic and allelic distribution suggested the independent role of polymorphisms in the NKX2-5, LEFTY1, and LEFTY2 genes except for rs703752 (p = 0.53). The detailed results for all SNP’s frequency distribution are given in Table 2. In addition, genetic contrast models also supported the results in Table 1, as each variant significantly increased the risk of disease (Table 3).

Table 2 Genotype and allele frequency distribution between case and control group
Table 3 Genetic contrast models analysis between healthy and CHD subjects

Comparative assessment of polymorphisms between different phenotypes

The comparison of polymorphisms association between VSD and TOF group suggested similar significant findings except for rs703752. The comparative evaluation of this SNP suggested non-significant results in ventricular septal defects (OR = 1.13, p = 0.69), while results in the TOF group were significant (OR = 2.71, p = 0.0008). The frequency distribution is given in Table 4.

Table 4 Comparative assessment of variants between different phenotypes of CHD.

Association of maternal hypertension and polymorphisms of NKX2-5 and LEFTY1 and LEFTY2 in children with the congenital heart defect

Based on the above genetic association analysis, further association analysis was performed. And based on the prevalence, maternal hypertension, and diabetes were selected for association analysis with polymorphisms. The association of maternal hypertension was significant for only one SNP rs2295418 (OR = 16.41, p = 0.0032). However, the results for maternal diabetes were non-significant (p > 0.05) (Table 5). In addition, the selected SNP’s association with maternal hypertension and diabetes is also non-significant for VSD and TOF groups (p > 0.05).

Table 5 Association of maternal hypertension and maternal diabetes with SNPs in children

Linkage disequilibrium and haplotype analysis

The linkage disequilibrium map showed moderate LD between rs703752 and rs2277923 (D’=80) (Fig. 1). The haplotype frequency distribution was significant for CC (OR = 1.78, p = 0.000359) and wild haplotype alleles CT (OR = 0.533, p = 4.38 × 10− 5).

Fig. 1
figure 1

Linkage disequilibrium map

Discussion

Congenital heart defects are complex genetic malformations that involve interaction between different genes and the environment, particularly maternal factors [17, 18]. In the present study, six single nucleotide polymorphisms from three different genes were genotyped to find their independent association first time in Pakistani subjects. In addition, the association of maternal chronic disease conditions with variants in CHD children was further analyzed, which may provide us with novel and better prevention strategies. The Pakistani population represents a unique tool to find the contribution of genetic markers to complex disorders based on the restricted cultural, religious, and social settings.

The genotypic and allelic frequencies of selected markers suggested that compared to the healthy subjects, CHD had a higher minor allele frequency (MAF), except rs703752, for which results were not statistically significant. However, the comparative analysis between two clinically different phenotypes of CHD showed that rs703752 allelic and genotypic frequencies were statistically different in TOF (p < 0.01) as compared to ventricular septal defects (p > 0.05). Although, the remaining five SNP’s results were similar in VSD and TOF. Similarly, a significant association of two variants (rs28936670 and rs2277923) in the NKX2-5 gene was found in Egyptian CHD children [15]. Chen et al. performed a meta-analysis in the Chinese population to investigate the effect of the rs3729753 variant in CHD patients. The findings of their study were non-significant for rs3729753 [19]. In Greek patients, a significant association was observed for rs2277923 [20]. This study’s findings were in accordance to those reported by Cao et al. in 2015. Similar to this study, they reported association of rs2277923 and rs703752 was significant and non-significant, respectively in the Chinese Bai population. The LD map analysis was also the same as strong linkage disequilibrium was shown between rs703752 and rs2277923 [21]. The current study also investigates the association of maternal hypertension and diabetes with polymorphisms that showed an independent significant role in CHD patients. This study suggested a significant association between maternal hypertension and CHD in children with variant rsrs2295418 (OR = 16.41, p = 0.003), while a weak difference was observed for variant rs360057 and maternal diabetes (OR = 21.60, p = 0.08). Zhao et al. suggested a significant association between maternal diabetes and rs2277923 (OR = 17.72, p = 0.001) [22]. Another study reported that congenital heart defects in children were associated with maternal hypertension and diabetes [23]. However, no significant association was found between de novo variants in CHD children and maternal diabetes or obesity [24].

To the best of our knowledge, this was the first study that elucidates the association of maternal hypertension with variants in children. However, this study also had several limitations. The first sample size was not large enough due to social and psychological stigma among patients, parents, and their families. Thus current study has limitations for the stratification analysis of maternal risk factors, including the type of therapeutic strategies. The second three genes selected that are potentially involved in the pathogenesis of CHD were selected in this study. Third, information about maternal genotype and disease status should be included in future studies.

Conclusion

In conclusion, all variants showed an independent association with CHD except for one rs703752. However, detailed investigations between different clinical phenotypes suggested that this SNP was potentially involved in TOF patients as compared to VSD. Moreover, some variants showed a higher risk in diabetic mothers or maternal hypertension. This information can provide us with a new direction for CHD prevention so that high-risk mothers should be screened during pregnancy. Although to the best of our knowledge, this was the first study from a lower-income Pakistani population. However, there is a need to conduct more studies with a larger sample with other parameters analysis. Moreover, SNPs correlation analysis should include association analysis with maternal hypertension and diabetes that can provide more insights into disease pathogenesis and novel direction for future CHD prevention.