Introduction

Familial Hypertrophic Cardiomyopathy (FHC) is an autosomal dominant disorder affecting the cardiac sarcomere and is characterized by myocardial hypertrophy and myocyte disarray. FHC is estimated to affect one in every 500 individuals [1] and more importantly appears to be the most common cause of sudden cardiac death in otherwise healthy young individuals [2]. An important feature of FHC is its extensive clinical heterogeneity ranging from death within the first three decades of life to almost complete absence of significant clinical symptoms [3]. At least thirteen disease causing genes for FHC have been identified to date [4, 5]; nearly all coding for sarcomere proteins. This genetic heterogeneity has been proposed to be the main cause for the underlying clinical heterogeneity. The β-cardiac myosin heavy chain (MYH7) and Myosin binding protein C3 (MYBPC3) genes are estimated to account for about 60–70% of all FHC cases [6]. MYH7 was the first (and perhaps the most common) FHC gene to be identified, with more than 500 mutations reported till date. Mutations in MYH7 and in the gene coding for Troponin T2 (TnnT2; the third major FHC gene) appear to often result in severe symptoms including death at a young age [7, 8]. MYBPC3 mutations, however, usually result in delayed onset of symptoms [9]. In addition, the presence of more than one mutation in the same (compound heterozygosity) or different (double heterozygosity) genes has been documented in a significant proportion of FHC patients [10]. The insertion/deletion (I/D) polymorphism of the gene coding for the angiotensin-1 converting enzyme (ace) has often been linked to severe hypertrophy among FHC patients [11], though other studies have also revealed no significant association [12]. Given the wide allelic heterogeneity and rarity of recurrent mutations, it is important to determine the mutation spectrum in any given population to facilitate studies on genotype–phenotype correlation. We earlier commenced analysis of disease causing mutations in Indian FHC patients [13, 14]. In the current study, we have analyzed 55 FHC patients from India and the results indicate that mutations in MYH7 and MYBPC3 may not be as frequent as reported in other populations.

Materials and methods

Patients and samples

Eighty patients, all from the State of Andhra Pradesh in India, were included in the study. Fifty-five presented at one of the two collaborating hospitals viz. the Usha Mullapudi Cardiac Centre, Hyderabad and Care hospital, Hyderabad. Remaining 25 patients were from remote areas and presented at associated clinics of the collaborating hospitals. The study was approved by the ethics committees of the respective hospitals as per the Helsinki declaration of 1975. FHC diagnosis was based mainly on echocardiography, ECG, and family history as reported earlier [14]. One hundred age and gender matched normal individuals were recruited from the local population for the study. Blood samples were collected from each patient, family members, and normal asymptomatic individuals following informed consent. Unfortunately, samples collected for 25 patients (from remote areas) were insufficient for complete genetic analysis of the two genes.

Genetic analysis

We performed mutation screening in 55 patient samples, though ace polymorphism analysis was completed for all 80 samples. Screening for mutations in MYH7 and MYBPC3 genes was performed as described earlier using Polymerase Chain Reaction-Deoxyribose Nucleic Acid (PCR-DNA) sequencing [14]. The sequence of primers for both genes and corresponding annealing temperatures for PCR are given in Table S1. All mutations were validated by bi-directional DNA sequencing and each novel mutation was confirmed to be absent in at least 100 normal individuals. Analysis of the ace I/D polymorphism was carried out as described earlier [13, 14], on all 80 patient samples.

Sequence and structure analysis

Multiple sequence alignments (MSAs) for several orthologues of the human MYBPC3 and MYH7 proteins (Figs. S1 and S2, respectively), identified by PSI-BLAST searches against the non-redundant database, were obtained by using ClustalW (http://www.ebi.ac.uk/clustalw). The MSAs were used for calculating the position-specific Gribskov score using the Prophecy program available as part of the EMBOSS package [15]. X-ray or NMR structures for the human MYH7 are not yet available. Since the human MYH7 (N-terminal amino acids 14–842) shows significant sequence identity (63%) with Scallop myosin S1, whose crystal structure with bound ADP is available (PDB ID: 1QVI), we modeled the human MYH7 with bound ADP by employing comparative modeling procedures using the scallop myosin structure as the template. Sequence alignment of human MYH7 and scallop myosin was carried out using ClustalW. A comparative model was built using Modeller 9v5 [16] (http://salilab.org/modeller); the model with the best PROCHECK [17] G-score (overall G-score = −0.05) and VERIFY-3D [18] profile was subjected to energy minimization using the GROMACS [19] program package (http://www.gromacs.org) adopting the GROMOS96 43a1 force field parameters and steepest descent method.

Human MYBPC3 comprises of seven Immunoglobulin (Ig)-like (C1–C5, C8, and C10) and three Fibronectin type III (C6, C7, and C9) globular domains [20] but its three dimensional structure (either NMR or X-ray) is not yet available. However, NMR/X-ray structures are available for some individual domains. Since all missense and nonsense mutations identified in this study mapped to the C5 or C10 domains (except one which was a truncation mutation and mapped to the C9 domain), we used the available NMR structure of the C5 domain and a homology model of the C10 domain for structural analysis of the mutations. The C10 domain homology model was built using the Ig domain of Neuroplastin (PDB ID: 2wv3) as the template, identified using FUGUE server (http://tardis.nibio.go.jp/fugue/prfsearch.html) and homology modeling was carried out by Modeller 9v5. The model so obtained was validated using PROCHECK (Overall G-score = −0.14) and Verify-3D.

Results

Eighty patients were diagnosed with FHC in the two hospitals during 2006–2008 and included 63 male and 17 female patients. The low proportion of female patients could be a reflection of socio-economic factors and may not necessarily be a reflection of gender-specific difference in incidence rates. The IVS diameter among patients ranged from 1.3 to 2.7 cm (mean = 1.97 cm). Since MYH7 and MYBPC3 genes have been reported to be the two most common genes causing FHC [6, 7], we proceeded to screen the two genes in 55 FHC patients. We screened the coding region and splice junctions of exons 3–23 of MYH7 since these code for the globular head and neck region of the protein that harbor majority of mutations [21]; while the complete coding region and all splice junctions of the MYBPC3 gene were screened. The MYH7 exon 6 was excluded since it is only 28 bp long and no mutations have been discovered in this exon. Disease-causing mutations were identified in either of the two genes in only nineteen patients (35%) out of which two each harbored double heterozygous and compound heterozygous mutations, respectively. Excluding the patients that harbored compound/double heterozygous mutations, mean age of patients harboring MYBCPC3 mutations (41.7 years) was lower than those harboring MYH7 mutations (47.4 years). The details of mutations and corresponding clinical features for each of the nineteen patients are given in Table 1. Electropherograms for all novel mutations and novel single nucleotide polymorphisms (SNPs) are shown in Fig. 1 and known mutations are shown in Figs. S3A and S3B.

Table 1 Clinical and genetic analysis of FHC patients harboring MYBPC3/MYH7 mutations
Fig. 1
figure 1

Novel mutations and SNPs detected in MYBPC3 and MYH7 among Indian FHC patients. a c.456delC; b c.2128G>A (p.E710K); c c.3641G>A (p.W1214X); d c.3656T>C (p.L1219P); e c.1790+7G>A (IVS17+7G>A) (SNP); f c.4052delA (SNP), all from MYBPC3, and g c.965C>T (p.S322F) from MYH7. For each, electropherogram showing the mutation is on the left and the one showing the normal sequence is on the right. Each mutant nucleotide is indicated by an arrow; in a and e, the deleted base is indicated in the normal sequence. The MYBPC3 electropherograms for the c.456delC, c.3656T>C, and c.4052delA mutations were obtained from cloned PCR products harboring the heterozygous mutations

MYBPC3 mutations

In the current study, MYBPC3 exhibited a higher mutation frequency than MYH7; 12 patients harbored MYBPC3 mutations while only five harbored MYH7 mutations (Table 1). Two patients exhibited double heterozygosity harboring a mutation each in MYH7 and MYBPC3 genes (Table 1). A total of nine MYBPC3 mutations were identified, including four novel mutations viz. c.456delC, c.2128G>A (p.E710K), c.3641G>A (p.W1214X), and c.3656T>C (p.L1219P) (Fig. 1; Table 1). Three mutations viz. c.456delC, c.3656T>C, and c.3628-24_48del25 were confirmed by cloning the corresponding PCR products in the PCR2.1 Topo vector (Invitrogen, USA). The youngest patient in our study was a 3-year old male child who harbored the MYBPC3 c.3372C>A (p.C1124X) mutation; the maternal grandmother of the proband suffered sudden cardiac death. This mutation has been reported earlier [22], but clinical symptoms were not described. Interestingly, the mutation was also detected in the mother and an elder sibling of the proband but both were asymptomatic, though all three harbored the ace I/D genotype (data not shown). The MYBPC3 c.3628-24_48del25 mutation, a common genetic lesion associated with FHC in the Indian population [23], was detected in five patients. Among these, one exhibited compound heterozygosity harboring the additional novel MYBPC3 c.3656T>C mutation (Table 1). A second patient exhibited compound heterozygosity harboring two novel MYBPC3 mutations namely c.2128G>A and c.456delC (Table 1). We identified two novel SNPs in MYBPC3 viz. c.1790+7G>A (IVS17+7G>A) (detected in two patients and in three of 100 normal samples screened; Fig. 1) and c.4052delA, located 227 nucleotides downstream of the stop codon in the 3′ UTR (detected in three patients and in five of 94 normal samples screened; Fig. 1).

MYH7 mutations

MYH7 mutations were detected in only seven patients out of whom two exhibited double heterozygosity (Table 1). Only one novel MYH7 mutation was identified viz. c.965C>T (p.S322F) (Fig. 1; Table 1). Both off springs of the proband (a son aged 21 years and a daughter aged 18 years) harbored the p.S322F mutation but symptoms were absent (data not shown), perhaps indicating late onset. The p.M659I was reported earlier but clinical symptoms were not described [7]. The p.R787H mutation in patient 16 and 17 (Table 1) was reported by us previously [14] and are included in this report, since two additional patients belonging to independent families (patient no. 18 and 19 in Table 1) were detected with the same mutation. Family members of patient 18 and 19 consented for genetic analysis. However, we did not detect any additional member harboring the p.R787H mutation in the respective families (out of 16 members for patient 18 and 26 members for patient 19) indicating the possibility of a de novo origin in these two patients (data not shown).

Status of angiotensin-1 converting enzyme gene I/D polymorphism

The ace gene has been suggested to be an important candidate modifier gene for FHC, based on several earlier studies [24]. We determined the distribution of the two ace alleles in the local population of Andhra Pradesh. Among 91 normal samples (age and gender distribution was comparable to that in patients), 31 harbored the I/I genotype (34%), 51 the I/D genotype (56%), and 9 the D/D genotype (10%). Among 80 symptomatic patient samples screened, 33 harbored the I/I genotype (41%), 35 the I/D genotype (44%), and 12 the D/D genotype (15%). Therefore, the D allele frequency in symptomatic patients (0.37) was not significantly different from that in normals (0.38). The status of ace polymorphism in each of the 19 patients in whom mutation was identified is shown in Table 1.

Sequence and structure analysis

In order to understand the mechanism underlying the pathogenic consequences of missense mutations in the MYH7 and MYBPC3 proteins, sequence and structure analyses of the mutant MYH7 and MYBPC3 proteins were carried out.

MYBPC3 mutations

The MYBPC3 missense mutations detected in this study localize to highly conserved regions (Fig. S1) reflected in the high Gribskov scores for the wild-type residues (Table 2). Each mutant residue, however, exhibited comparatively lower scores (Table 2) indicating non-preference at the respective positions. In addition, we evaluated the effect of each missense and nonsense mutation on the protein by using a recently developed web-server called Hansa (hansa.cdfd.org.in:8080; Acharya and Nagarajaram, communicated). Hansa hosts a recently developed support vector machine based tool for classification of missense mutations into “Disease” and “Neutral” categories. Hansa extracts a set of features associated with missense mutations which include position-specific information, local protein structure information, and intrinsic amino acid properties to classify the missense mutations. Benchmark studies have shown that Hansa outperforms other widely used methods (Acharya and Nagarajaram, communicated). All mutations were predicted as “Disease” (Table 2). MYBPC3 comprises of seven Ig-like (C1–C5, C8, and C10) and three fibronectin type III domains (C6, C7, and C9) [20]. Yeast two-hybrid studies have revealed that domains C5 and C8 as well as C7 and C10 form interacting pairs facilitating trimerization of MYBPC3 protein molecules. Based on this, a model has been suggested wherein three MYBPC3 protein molecules bind serially to form a collar-like structure around the myosin thick filament such that the domains from adjacent MYBPC3 molecules take part in the inter-domain interactions [25]. From this model, it has become clear that the C1124X and W1214X mutant proteins would be devoid of the C10 domain, and therefore may abolish inter-domain interactions thereby compromising myosin binding. The L1219 residue is predicted to reside on the surface of the C10 domain, is exposed, and is likely to participate in C7–C10 interaction. The mutation to Proline at this position is expected to bring about a conformational change at the C7–C10 interface, and hence may not be conducive for inter-domain interactions. Residues E710 and G758 map to the C5 domain whose NMR structure is available [PDB ID: 1GXE; chain A]. E710 is in the conformationally flexible region of the C5 domain and may perhaps be involved in inter-domain interaction with the C10 domain of another MYBPC3 molecule by means of a stabilizing salt-bridge or hydrogen bond interaction. These interactions will be precluded by the mutation to Lysine. The G758 residue occupies the i+3 position of the β turn in one of the β-hairpins in C5 domain. Replacement with Aspartic Acid is expected to destabilize the whole β sheet thereby affecting the intermolecular packing.

Table 2 Analysis of effect of amino acid substitutions in MYH7 and MYBPC3

MYH7 mutations

The MSAs of human MYH7 with myosin from other species revealed extensive sequence conservation of each amino acid residue exhibiting mutation (Fig. S2) reflected by the corresponding Gribskov scores (Table 2). The Gribskov scores for the mutant amino acid residues were significantly lower (Table 2) indicating that the mutations were not favored at their respective positions. In addition, barring the E710K mutation, all missense mutations were predicted as “Disease” by the Hansa server. The impact of each mutation on protein structure was deduced using a homology model of MYH7 which was built as described in the “Materials and methods” section. MYH7 exhibits actin-dependent ATPase activity and therefore possesses an ADP/ATP binding site characterized by the presence of the Walker motif. MYH7 S322 is located in a loop covering the ADP/ATP binding site such that the S322 side chain hovers around the binding site, and is within 6 Å from the ADP/ATP indicating that S322 may be a part of the ADP/ATP binding region (Fig. 2a). Our model strongly suggests that the S322 polar side chain (i.e., the OH group) may interact with ADP/ATP and from this perspective the mutated residue (F) becomes unsuitable at this position as it precludes the above said interaction. The other two mutated residues viz. M659 and R663 are located in the actin binding region and are part of the actin–myosin interface (Fig. 2b). It is likely that the two mutations may result in conformational change at the actin binding region, and hence affect actin binding.

Fig. 2
figure 2

a Structure analysis of the MYH7 p.S322F mutation. The nucleotide binding region (NBR) with bound ADP (shown as ball and stick in the centre) is highlighted in the model derived as described in the “Materials and methods” section. The S322 residue (shown as ball and stick at the top) is part of a loop which lies above the NBR. The NBR comprises of I113, Y114, N125-Y130, Y133, I153, G177-V185, N186, R189, E227, N231, D238, N237, N239, S240, S241, S321, D460, I461, G463, and C671 which lie within a distance of 5 Å from ADP. b Structure analysis of the MYH7 p.M659I and p.R663H mutations. Figure shows part of the predicted structure of the MYH7 globular head depicting a part of the actin binding region. The two residues M659 and R663, shown in ball and stick representation, are part of the actin binding region

Discussion

This is perhaps the first analysis of mutation frequency of the two most common genes causing FHC viz. MYH7 and MYBPC3 among Indian patients. A previous study by Tanjore et al. [26] evaluated 95 patient samples but screened only a few selected MYBPC3 exons. Another study included 69 FHC patients but was restricted to genotype–phenotype correlation of only six MYH7 patient mutations [27]. A third study was conducted to determine mutations causing FHC and dilated cardiomyopathy [28] using SSCP, and therefore mutation detection frequency was very low. In addition, the recurrent 25 bp MYBPC3 deletion has earlier been reported from India [23, 29]. A few other studies (including from our own laboratory) were either restricted to very few patients or to a few specific mutations.

In this study, our main objective was to determine whether MYH7 and MYBPC3 were important FHC genes among Indian patients. Surprisingly, mutations in either of the two genes were detected in only 35% of the patients screened indicating perhaps the importance of other disease-causing genes. A recent study from Italy also revealed similar results [30]. Of note, frequency of MYBPC3 mutations was significantly higher than MYH7 among Indian patients (even if the common MYBPC3 c.3628-24_48del25 mutation was excluded from the comparison). In all, we detected nine mutations in MYBPC3 and only four in MYH7 among the nineteen patients (Table 1). Although we did not screen the rod region of MYH7 gene in the current study, only a few mutations are expected to occur in this region, if any. In addition, mutations in the rod region are more likely to cause dilated cardiomyopathy [31]. A recent study from France [32] and an earlier study from eastern Finland [33] also revealed a higher frequency of MYBPC3 mutations compared to MYH7. It would be important to extend this study to a larger sample size and to include patients from other ethnicities within the Indian population to confirm the observation. We are also aware that this study is specific to symptomatic patients, and therefore excludes patients exhibiting sudden cardiac death as well as those that are asymptomatic/pre-symptomatic; the former are usually caused by mutations in MYH7 or TnnT2. Recurrent mutations have been suggested to be rare in FHC; however, out of thirteen mutations identified in this study, five were detected in more than one patient belonging to independent families. The MYBPC3 c.3628-24_48del25 mutation has been reported to occur at a high frequency in the Indian population [23]. It was also suggested that the mutation, in heterozygous form, resulted in incomplete penetrance and low expressivity [29]. In the current study, this mutation was detected in a heterozygous form in five out of 55 patients screened (9%) and all exhibited clinical symptoms (Table 1); one of the five patients also harbored an additional novel MYBPC3 p.L1219P mutation, thereby exhibiting compound heterozygosity. However, we cannot rule out the possibility of presence of mutation in other candidate FHC genes in rest of the four patients. We screened one hundred normal asymptomatic individuals and detected the c.3628-24_48del25 mutation at a frequency of 6%; the p.L1219P mutation was, however not detected.

Five novel mutations were identified in the study; four in MYBPC3 and one in MYH7 (Table 1). For each, absence in at least one hundred normal individuals (200 alleles) was confirmed. The MYBPC3 c.456delC mutation generates a premature termination codon (PTC) located 33 nucleotides upstream of the subsequent exon–exon junction, and therefore is not expected to trigger nonsense mediated decay (NMD) [34]. The resultant truncated protein is, however, expected to be devoid of a major portion of the protein, and therefore will be inactive. The W1214X mutation would result in a truncated protein devoid of the C10 domain which will interfere with trimerization and myosin binding. Among the other two novel MYBPC3 mutations, the p.L1219P mutation is predicted to be a “disease” mutation by the Hansa server; the amino acid substitution results in a drastic reduction in the Gribskov score; and the mutation is expected to affect trimerization of the protein (Table 2). The p.E710K is not predicted to be a “disease” mutation although there is a significant reduction in the Gribskov score for the substituted amino acid (Table 2) and the mutant protein may also be compromised for trimerization. The novel MYH7 mutation p.S322F also results in drastic reduction in Gribskov score for the mutant residue (Table 2) and the mutant protein is expected to be defective in binding ATP/ADP. Taken together, these data provide sufficient evidence in support of the five novel mutations, although it is difficult to associate a novel mutation with FHC without carrying out functional studies, given the existence of several disease causing genes.

All four MYH7 mutations detected in this study were missense mutations. Majority of MYH7 mutations causing hypertrophic cardiomyopathy have been suggested to manifest their effect through a “poison polypeptide” phenomenon [34], thus resulting in a dominant mode of inheritance [23]. In contrast, since most heterozygous MYBPC3 mutations are predicted to trigger NMD leading to reduced mRNA levels, the associated milder symptoms are perhaps because of haploinsufficiency [35]. Of the nine MYBPC3 mutations detected in this study, three occur at consensus splice sequences and are therefore, expected to affect splicing (c.821+1G>A, c.2905+1G>A, and c.3628-24_48del25); three result in a truncated protein (p.W1214X, p.C1124X, and c.456delC); and three are missense mutations (p.G758D, p.E710K, and p.L1219P) which appear to affect key residues in the Ig-like or fibronectin domains and are expected to compromise inter-molecular interactions. Each of the three MYBPC3 truncation mutations result in PTCs located more than 50 nucleotides upstream of the respective subsequent exon–exon junctions and therefore, the respective mutated transcripts may be degraded through NMD [34]. The three mutations affecting splicing can also be predicted to result in generation of PTC because of a change in reading frame as is the case with most splice mutations. Therefore, six out of nine MYBPC3 mutations are expected to cause haploinsufficiency because of loss of transcript through NMD. In addition, we analyzed 417 FHC mutations from the Cardiogenomics database (http://www.cardiogenomics.med.harvard.edu) and from published literature; among 207 MYH7 mutations, 199 were missense mutations (96%), while for MYBPC3 only 87 out of 210 (42%) were missense mutations. Therefore, 58% of MYBPC3 mutations may not result in a “poison polypeptide” providing a possible explanation for the milder symptoms usually associated with this gene. In our study, however, the median age of patients harboring MYBPC3 mutations was lower than those harboring MYH7 mutations; however, no definite conclusions can be drawn given the small numbers analyzed.

The absence of clinical symptoms in family members of the proband (the youngest patient in our study) who harbored the MYBPC3 p.C1124X mutation and also shared the identical I/D ace genotype, indicates a possible role of other modifying genes. The youngest FHC patient reported to date appears to be a 2 years old who harbored an MYH7 mutation [6]. The MYBPC3 p.G758D mutation detected in two patients (Table 1) was reported earlier from the Chinese population; however, details of clinical symptoms were not reported [36].

The novel MYH7 p.S322F mutation is likely to affect ADP/ATP binding, whereas the p.M659I and p.R663H mutations may affect actin–myosin interaction. The p.R787H mutation was detected in four patients including two from our previous study [13]; however, we could not detect a founder effect for this mutation (data not shown). The MYBPC3 missense mutations appear to cause perturbations in the Ig-like domains and are expected to destabilize inter-molecular MYBPC3 interactions thereby compromising binding to the Myosin heavy chain.

Out of 19 patients in whom mutations were identified four patients harbored two mutations each; two exhibited compound heterozygosity, while the other two exhibited double heterozygosity (Table 1), thus yielding a double mutation frequency of about 20%, similar to previous estimates [37]. Given the significant proportion of patients harboring more than one mutation, it may be important to completely screen several FHC genes in each patient, as suggested earlier [4]. Interestingly, the four patients harboring two mutations did not appear to exhibit more severe symptoms as compared to patients harboring a single mutation (Table 1), contrary to previous reports [38].

The present study did not reveal a significant association between the ace D/D genotype and extent of hypertrophy in the patients, similar to our previous results [13, 14], nor was there a significant difference in the D allele frequency in patients and normals. Previous reports have yielded inconsistent association between the D allele frequency and symptomatic FHC patients with varying results for different populations [11, 12]. An earlier study of 118 FHC patients from the northern part of India [39] revealed a significant association of the D allele with patients as compared to controls. Of note, the authors used only one set of primers for genotyping which might increase chances of mistyping of ID genotype as DD [40]. It is also possible that the different ethnicities of patients in the two studies may be the cause for the observed differences in the association.

In conclusion, our analysis of 55 FHC patients has revealed a lower proportion of mutation in the MYH7/MYBPC3 genes than what is reported for other populations. It may therefore be important to screen additional sarcomere coding genes to determine their role in FHC among Indian patients. Sequence and structure analysis of MYH7 missense mutations has revealed that they probably affect actin or nucleotide binding, whereas MYBPC3 mutations either result in the loss of protein expression or may affect protein trimerization ability which in turn may inhibit myosin binding. Our results and analysis of published MYBPC3 mutations also indicate that the milder phenotype usually associated with several MYBPC3 mutations could result from a lack of the dominant “poison polypeptide” phenomenon because of the loss of the mutated gene transcript through NMD.