Introduction

The price gap between targeted panel sequencing and genome-wide approaches such as whole-exome and whole-genome sequencing (WES and WGS, respectively) is narrowing as a consequence of the decreasing costs of next-generation sequencing (NGS) [1, 2]. Although the exact cost of each approach depends on several variables—including the desired sequencing depth and target coverage, the platform used and, for panels, the number of targeted genes—the per-sample cost of WES is increasingly comparable with that of targeted panels, and the cost figure for WGS is steadily below USD 2000 since 2017 (Fig. 1).

Fig. 1
figure 1

Indicative price per sample (excluding consumables and service costs) for whole-genome sequencing (green line), whole-exome sequencing (purple) and targeted gene panels (red) over the last decade, alongside price fold difference between the three approaches (inset). The price of targeted gene panels is assumed to have remained constant given the higher price per sequenced base in the early years was balanced by the smaller number of genes targeted by panels. Price figures have been obtained from the web, scientific literature and personal communications [3, 4]

So far, the most prevalent approach to genetic testing in cardiac conditions has been that of targeted gene panels comprising most or all the genes implicated in the disease in question, with this choice driven by the convenient combination of relatively low prices and excellent variant detection sensitivity [3]. Extended panels targeting genes implicated in multiple conditions, such as the TruSight Cardio panel by Illumina [4], provide the additional advantage of a standardized approach on all patients affected by an inherited cardiovascular disease. As cost is becoming a less important discriminator in choosing the sequencing strategy in diagnostic laboratories, the routine usage of WES (and WGS) for diagnostic purposes on all patients, irrespective of the disease in question, is a realistic scenario for the near future. In addition, the cost of such approaches would be one-off, without the need of new panel-based genetic tests in case new genes are associated with the disease later on or a different disorder is suspected.

Such adoption of WES as a “universal panel” would enable laboratories to benefit from increased standardization, to re-analyse genotype-negative cases over time as new gene-disease associations emerge and to implement gene-discovery approaches de facto constituting research activities but directly translating also into potential diagnoses that would often be missed with targeted gene panels. In addition, it would spare those (often paediatric) patients affected with complex disorders of unclear genetic origin the necessity to undergo multiple tests before reaching a diagnosis.

First-tier adoption of WGS has also been reported to improve the diagnostic pathway compared with targeted panels through higher diagnostic yield and reduced time-to-diagnosis for similar reasons, though with the additional advantage of detecting also copy number variants (CNV), gross chromosomal abnormalities and deep intronic variants [5,6,7]. In hypertrophic cardiomyopathy (HCM), WGS has been observed to increase the yield by 20% compared with standard genetic testing using targeted panels, with approximately half of such increment due to deep intronic variants [8]. However, while WGS possesses the intrinsic technical advantage of not relying on an upstream exon capture method compared with any type of targeted sequencing (including WES), routine usage of WGS implies the need for computational infrastructures suited to store and analyse terabytes of data.

In any case, the technology enabling sequencing of DNA has evolved at a much faster pace than the ability to correctly interpret genetic variants, and the “narrative potential” of genome-wide sequencing data represents a high risk for erroneous (and potentially harmful) interpretation of variants as pathogenic [9]. The potential extent of such mistakes is highlighted by the fact that every individual exome contains ~ 54 variants previously (mis)classified as causal for a rare disease [10], and additional false positive interpretations of variants’ pathogenicity are probable if the analysis is not restricted to genes with a robustly validated association with disease.

In this review, we analyse advantages and perils associated with the routine adoption of a genome-wide sequencing approaches as first-tier genetic test, which will likely occur in many diagnostic laboratories in the short term, and how this will influence the interplay between diagnostics and research. While this review broadly focuses on cardiomyopathies, the issues discussed here are relevant to all Mendelian diseases (especially those with a partially unexplained genetic basis) in both the diagnostics and research contexts.

Genome-Wide Diagnostic Sequencing and the Potential for Gene Discovery

One advantage in the adoption of WES/WGS as a routine approach for clinical genetic testing is the possibility to use it as a tool for gene discovery when no (potentially) relevant variants (pathogenic/likely pathogenic/of uncertain significance) are detected in the known disease genes. While only genes with a proven role in disease should be routinely screened for diagnostic purposes (to minimize the probability of reporting false positive findings), the availability of sequencing data from all other genes enables research on potentially disease-causing alleles beyond validated disease genes. This can be done by means of co-segregation analysis in the patient’s family (if the pedigree is sufficiently large/informative) and/or by means of case-control burden testing (if a sufficiently large cohort is available) (Fig. 2).

Fig. 2
figure 2

Interplay between the diagnostic and research contexts relative to clinical genetic testing. Periodic re-analysis of variant evidence (pink loop) characterizes especially variants of uncertain significance (VUS), which can be upgraded to (likely) pathogenic or downgraded to benign as new evidence emerges, but can affect also variants previously classified as pathogenic or benign (dashed pink line) due to new evidence or, for example, updated interpretation guidelines. Publication of new robust evidence for genetic association with disease generally causes an upgrade in classification of variants previously believed to be benign to either VUS or (likely) pathogenic, constituting a new diagnostic finding

However, rigour and caution are necessary, especially in light of recent findings that demonstrated the need for strict and robust approaches in demonstrating new genetic associations with disease. The release of the Exome Aggregation Consortium (ExAC) database in October 2014—comprising publicly accessible WES data of > 60,000 individuals from various populations and common disease sequencing studies, now embedded in the larger Genome Aggregation Database, counting > 200,000 WES/WGS samples)—revealed how rare variants are collectively more common than previously estimated [10, 11]. Subsequently, gene-based rare variant association analyses demonstrating an actual lack of rare variation enrichment in cardiomyopathy patients compared with ExAC for many genes implicated in disease [12, 13]. More recently, the ClinGen consortium (established with the specific aim of assessing the clinical relevance of genes and genetic variants in a range of diseases [14]) classified 22 of the 33 genes (67%) evaluated for HCM as with limited or de facto non-existent evidence in support of their role in disease [15]. Furthermore, ClinGen curation efforts published thus far suggest how cardiovascular diseases are characterized by particularly high proportions of dubious gene-disease associations (Fig. 3), further underscoring the importance of stringent approaches to genetic association analyses in cardiomyopathies.

Fig. 3
figure 3

Proportions of genes implicated in 10 different diseases/groups of diseases classified in terms of the strength of evidence in support of their association with disease by ClinGen. The three cardiovascular conditions curated so far (panel on the left) are characterized by the lowest proportions of definitively validated gene-disease associations, all below 25%. In the case of HCM, genes classified as “definitive” by Clingen are MYH7, MYBPC3, TNNT2, TNNI3, MYL2, MYL3, ACTC1 and TPM1. Of the other seven genes listed in the main text for which the role in HCM is supported by robust segregation evidence, only CSRP3 and JPH2 are included in the figure in the “moderate evidence” group. The others have been either classified by Clingen as “definitive” for disease entities different from (but related to) HCM (ACTN2 and PLN for “intrinsic cardiomyopathy”, FLNC and ALPK3 for “syndromic conditions where isolated HCM may be seen”) or have been associated with HCM after the Clingen curation effort for HCM was published (FHOD3). Symbols between brackets (# and §) indicate conditions part of the same published curation effort by ClinGen

Taking HCM—the only cardiomyopathy curated by ClinGen so far—as an example (Fig. 3, left hand-side), the 8 sarcomeric gene-disease associations classified as definitive by ClinGen (MYBPC3, MYH7, TNNI3, TNNT2, MYL3, MYL2, ACTC1 and TPM1, reported to explain 30–50% of HCM cases [13, 16, 17]) were all originally based on strong segregation evidence in large pedigrees alone (18–96 individuals each) [18,19,20,21,22,23,24] and published in the decade following the first association study (MYH7) in 1990. Since 2000, seven additional genetic associations (CSRP3, PLN, ACTN2, FLNC, ALPK3, JPH2 and FHOD3) [25,26,27,28,29,30] with robust evidence of variant co-segregation with disease, alone or combined with other types of evidence, were proposed in the space of 19 years. However, these genes are estimated to explain at most 1% of HCM cases, with some of them being extremely rarely mutated. Accordingly, the increase in diagnostic yield given by the screening of 51 extra genes beyond those with a fully proven role in disease is, at most, negligible [31]. Taking this into account, on one hand, the existence of unknown Mendelian HCM genes is plausible (hence the utility of WES/WGS as gene discovery tools), but on the other hand, it is unlikely that such genes will be major contributors to the disease burden. In dilated cardiomyopathy (DCM), the main disease gene—TTN, explaining up to 20% of disease cases—has been discovered less than one decade ago [32], and it may be more likely that major genetic contributors to disease are still to be identified.

In addition to segregation analysis of large family pedigrees, gene-centric case-control association analysis is another effective strategy to gather statistical evidence in support of association with disease for dominant disease genes. In some cases, such the associations of FLNC and FHOD3 with HCM, family segregation evidence was combined with cohort-based case-control analyses, but in absence of sufficiently informative pedigrees, case-control association analysis on large cohorts can be a valid alternative. This is demonstrated for example by the case of PLN in HCM, for which evidence of variant segregation in large families is not available, but multiple cohort-based approaches provided statistical evidence in support of the association with disease [12, 13, 17, 33], ultimately classified as definitive by ClinGen [15].

Case-control comparisons across all genes through the utilization of WES/WGS is potentially a powerful approach to identify novel dominant disease genes. However, it can pose issues around multiple testing correction, given that adjustment for the testing of ~ 20,000 genes would require very large sample sizes to detect significant associations for rarely mutated disease genes. Gene prioritization approaches, such as the one proposed by Roca et al. [34] (based on scores measuring gene-specific (in)tolerance to variation) or by Zaidi et al. [35] (based on the pre-selection of genes highly expressed in the heart), can be applied ahead of variant filtration to exclude from the effective analysis those genes unlikely to play any role in cardiovascular disease for example, especially if cohorts are of moderate size. Large-scale efforts to create unprecedentedly large cohorts of rare disease patients without identified causative variants are underway to address these limitations in sample size and power. For example, Genomics England [36] have sequenced 100,000 genomes from rare disease and cancer patients, with plans to extend this to 5 million in the next few years.

In contrast to dominant inheritance, recessive inheritance enables analysis on smaller pedigrees, as stricter filtering based on genotypes and variant segregation in the family can be applied. Examples are recessive early-onset HCM caused by variants in ALPK3 and the recently published association between recessive DCM and the PPCS gene, demonstrated using 3 and 2 pedigrees of 2–10 sequenced individuals, respectively [28, 37]. There are also rare examples of recessive disease caused by specific variants in the established (and usually dominant) disease genes such as the protein-truncating variant p.Arg69Alafs*8 in TNNI3, observed to cause DCM only when homozygous [38]. In the case of de novo dominant variant occurrence, case-parents trios are per se sufficient to identify the pathogenic variant, given the expectation of only 1–2 de novo coding alleles per exome [34, 39]. This trio-based approach often translates into the identification of variants causing early-onset disease, as in the case of the p.His33Asn variant in ILK in arrhythmogenic right-ventricular cardiomyopathy (ARVC) (14 years old proband). Notably, WES has been reported to achieve a diagnostic yield of 50% in 18 parent-offspring pedigrees where early-onset DCM was observed [40], and overall, recessive cardiomyopathy and de novo dominant variant occurrence represent the cases in which routine usage of WES/WGS will likely be most productive in terms of Mendelian cardiomyopathy gene discovery.

Dominant or recessive inheritance models imply that causative variant(s), characterized by a large effect size, are sufficient for phenotype manifestation. However, cardiomyopathies are characterized by variable disease expressivity, with disease often presenting with different severity in different individuals also when caused by the same pathogenic variant. One hypothesis to explain this variability is the presence of genetic modifiers, i.e. variants that are not pathogenic in isolation but able to exert an effect on the phenotype in presence of a primary pathogenic allele, worsening or ameliorating disease severity.

For this reason, large or multiple families that share a common pathogenic Mendelian variant, and are characterized by variable penetrance and disease severity among variant carriers, represent a rare and valuable resource to study genetic modifiers of the disease phenotype, given the reduced phenotype variability associated with the primary disease variant. Such families are usually encountered in areas with genetic founder effects, such as the Netherlands [41,42,43,44], Italy [45] and South Africa [46]. Mouton et al. reported variants in the sarcomeric MYBPH gene to modify the degree of hypertrophy in HCM using 27 families carrying pathogenic alleles in MYH7 and TNNT2 [47]. Similar effects have been reported also for variants in non-sarcomeric genes, such as the regulator of cardiac development XIRP2, shown to exacerbate the effects of DCM caused by a variant in TNNT2 [48]. Research on modifier alleles is complex and generally requires particular datasets, and it is still preliminary even within validated disease genes, where some modifier alleles modulating the risk of adverse events and the disease severity have been identified for example in the main genes associated with Long QT syndrome (KCNQ1) and Brugada syndrome (SCN5A) [49, 50].

Some studies estimated the overall prevalence of carriers of multiple Mendelian (potentially) pathogenic variants (hence able to cause disease in isolation and not modifiers per se) and described a more severe cardiomyopathy phenotype in such patients [51,52,53,54] proposing an additive effect of Mendelian disease variants on disease risk and severity.

Our limited current knowledge on variable disease expressivity and penetrance in cardiomyopathies advocates for research efforts on validated disease genes (included in targeted gene panels) in the first instance, but adoption of WES/WGS for diagnostic purposes in centres where clinical evaluation is also performed would enable research to be also expanded to novel genes. This would represent a key development toward a deeper understanding of disease variability in the future, with the potential for improved risk prediction in cardiomyopathy patients and their families.

However, it is increasingly considered unlikely that the answer to the > 50% cardiomyopathy patients in whom the genetic cause of disease still remains elusive lies in Mendelian models of disease. Gene discovery efforts remain important as demonstrated by the novel gene-disease associations discovered in the last years such as FLNC, recently associated with both HCM and DCM [27, 55], but their impact on the yield of genetic testing will likely remain limited. In this respect, gene discovery aimed at finding modifier genes could have a greater impact on the effectiveness of the clinical care offered to patients through augmented understanding of disease variability, translating into more precise prognostication.

Genome-Wide Sequencing and Non-Mendelian Inheritance Models

As far as the high proportion of genotype-negative cardiomyopathy is concerned, innovative approaches to study non-Mendelian models of disease are much needed, as these more complex modes of inheritance are likely holding important answers. HCM in patients with no overt family history and no identifiable genetic causes has been shown to progress with a particularly favourable clinical course and has been defined as “non-familial”, with a hypothetically different and more complex underlying genetic aetiology [56]. Reports of cardiomyopathy caused by the co-occurrence of multiple variants and concurrent proof of each allele not being pathogenic in isolation are still very sparse, even for variants in validated disease genes. A rare example is that of p.Asn83His in TNNT2 and p.Asp955Asn in MYH7, both currently classified as variants of uncertain significance (VUS) but observed to cause severe DCM in a consanguineous family only when co-inherited [57].

Evidence of the contribution to cardiomyopathies of more common variants with lower effect sizes dates back to about a decade ago, with the published association of a common intronic variant in MYBPC3 with HCM in South Asians [58]. Subsequently, moderately-sized case-control genome-wide association studies (GWAS) have been performed both in HCM and DCM [59, 60] and detected significant associations for variants in FHOD3 and BAG3, respectively, with rare variants in both genes associated with the respective disease later on. Other GWAS studies have also highlighted how common variants influence cardiomyopathy-related phenotypes such as left-ventricular hypertrophy [61] and heart failure development [62, 63]. One area of research trying to address complex inheritance models and directly related to GWAS is that of polygenic risk scores (PRS), quantifying disease susceptibility based on large numbers of common variants of individually small effect sizes. Usually, PRS are developed using large cohorts of patients included in GWAS and genotyped with SNP arrays, as done in atherosclerosis or coronary artery disease [64, 65]. Routine adoption of WGS in the diagnostic setting could help to define patients’ family members disease risk (once PRS have been developed) and also further validate the role of PRS both in potentially non-Mendelian disease and in quantifying the effect of modifiers if in presence of a Mendelian pathogenic variant. However, such application in the diagnostic setting is still highly speculative for cardiomyopathies.

The contribution of environmental factors to cardiomyopathy has started to be surveyed in relation to the presence of rare genetic variants using large-scale cohorts only recently, for example, in the context of DCM and exposure to chemotherapy, alcohol consumption and pregnancy, revealing an interplay between genetic and environmental exposures [66,67,68]. However, larger studies are needed to elucidate such interplay between non-genetic factors and common variants (rather than rare ones) in determining non-Mendelian cardiomyopathy.

Overall, in the case of research on non-Mendelian models of disease, the main issues are neither related to a restricted potential for discovery through the usage of targeted panels nor to the lack of particular datasets, but rather to the paucity of sufficiently large and well characterized cohorts and bespoke strategies to investigate these inheritance models. In conclusion, the contribution of routine genome-wide sequencing for diagnostic purposes is unlikely to contribute to discovery or identification of non-Mendelian disease in cardiomyopathies at present but could indirectly support the validation of PRS through the creation of increasingly large sequenced cohorts.

Complex Phenotypes: Where Genome-Wide Sequencing Means Genotype-First Diagnostics

Cardiomyopathies can be observed as isolated conditions or in conjunction with extra-cardiac features if occurring as one of the manifestations of a syndromic disease. While several syndromes, such as Friedrich ataxia and Barth syndrome, always present with extra-cardiac involvement, for others (like Fabry disease and Danon disease) cardiomyopathy can be the only manifestation.

HCM is characterized by the existence of metabolic genocopies such as Fabry disease (GLA gene), Danon disease (LAMP2) and PRKAG2-cardiomyopathy (PRKAG2) that can mimic its phenotype, sometimes in absence of extra-cardiac manifestations. In such cases, making a reliable diagnostic differentiation based on imaging techniques alone can be very difficult for the cardiologist [69]. Potentially pathogenic variants in the genes associated with these conditions are observed in 1.5–5% of patients referred for HCM genetic testing [13, 17, 70], and routine inclusion of these genes in targeted panels for HCM genetic testing enables a “genotype-first” diagnostic approach that is directly informative on the underlying molecular cause of disease and can inform the radically different clinical management that these patients require compared with patients affected by “classical” HCM.

While genes associated with HCM genocopies are today routinely included in targeted panels in use on HCM patients, many other genes associated with complex syndromic conditions characterized by cardiomyopathy are grouped in different targeted panels used for testing pre-specified disease categories. In some cases, this can translate into long, costly and distressing “diagnostic odysseys”, with patients undergoing multiple genetic tests and clinical evaluations before reaching a definitive diagnosis [71, 72]. In such cases, the usage of WES as first-tier diagnostic tool can translate into a decrease in medical interventions and incremental savings estimated in the range USD 1727–4140 per diagnosis [73,74,75], with earlier reports also anticipating significant time and resource optimization in case of adoption of WES at first genetic test appointment [76, 77].

Irrespective of the disease in question, such a comprehensive approach to diagnostic sequencing at first evaluation, will require the downstream adoption of specific strategies to minimize the risks of erroneous variant interpretation and false positive findings, as proposed for example in the context of paediatric DCM [78]. In addition, savings brought by the application of a single genetic test to all patients could be used to fund multi-disciplinary teams evaluating case selection and aiding correct variant interpretation, with an estimated cost of GBP ~ 400 (USD 500) per case [79]. In other words, if the advantages of genome-wide sequencing as a tool for genotype-first diagnostics will be handled with the necessary caution to avoid “over-interpretation” of the estimated ~ 25,000 genetic variants detected in each individual, first-tier WES (or WGS, in the future) will bring undeniable benefits, especially to a subset of patients.

Exome vs Genome Sequencing: 2% VS 100%

The rationale for utilizing WES is that ~ 2% of the genome— the portion comprising protein-coding regions of all genes—harbours ~ 85% of genetic variants with large effects on disease-related traits [80]. Therefore, WES represents a powerful strategy to comprehensively evaluate protein-altering variation in the genome at lower costs and less computationally intensive labour compared with WGS.

However, WES suffers from two main limitations. First, like targeted gene panels, it relies on an exon-capture step upstream of the sequencing. Different approaches to exon capture have been developed and enhanced, but no method currently exhibits perfect uniformity of target coverage [81, 82]. Second, implicit in the rationale of WES is the fact that non-coding regions of the genome harbour an estimated 15% variants with large effect sizes on disease traits, and any pathogenic variant in such regions would be missed [80].

The former drawback translates into some exons or genes being only sub-optimally covered by WES, with GC content significantly and inversely associated with gene coverage and potentially creating bias toward variant identification in a subset of better-covered genes [83]. In this respect, multiple studies have demonstrated WGS—especially if not requiring DNA enrichment—to be more powerful than WES in detecting exome variants, including (but not limited to) copy number variants (CNV) expanding beyond protein-coding regions and providing hitherto unprecedented exome coverage [84,85,86].

WGS virtually enables detection of all types of variants occurring in a genome (with some caveats e.g. lower depth of coverage implies lower costs but also lower variant detection sensitivity), but the production, processing and analysis of huge amounts of data currently renders its routine adoption in many small diagnostic facilities unrealistic. In spite of the drop of sequencing prices over the last decade, estimates of sequencing costs per individual sample often exclude consumables costs, which are particularly high for WGS [87]. However, costs of hardware and computing power have also dropped quickly over the years, making the acquisition of appropriate sequencing and informatics infrastructure increasingly feasible for large diagnostic laboratories.

The main diagnostic advantage in choosing WGS lies in the detection of (potentially) causative variants not readily detectable by WES–CNVs and variants in the non-coding regions of the genome. In the context of cardiomyopathies, there have been limited studies on such variants so far, with analysis hampered by access to sequencing datasets and the still limited variant interpretation capabilities. As far as CNVs are concerned, large insertions or deletions have been reported in ~ 1% of HCM cases in several cohort-based studies, with such estimates including CNVs affecting non-validated disease genes [88,89,90], suggesting a minor contribution of CNVs to the disease burden in HCM. The contribution of these variants may be higher in other conditions such as ARVC and DCM, with reported detection rates of 4–7% in two recently published studies [91, 92], consistent with the more prominent role for loss of function variants in these conditions. Of note, a diagnostic pipeline including both routine usage of WES and detection of CNV has been recently proposed for DCM but implies additional costs for a SNP array specifically used for CNV detection [78]. Usage of WES would however prevent screening of non-coding regions of the genome. Studies on such variants in cardiomyopathies are still limited in number but suggest that deep-intronic variation may contribute substantially to the disease burden. For example, WGS identified pathogenic variants in 9% of a selected subset of patients (with HCM/left-ventricular hypertrophy/surgical interventions/family history of HCM) in which the genetic cause was elusive [8] and deep-intronic variants in MYBPC3 alone are estimated to explain 6.5% of HCM cases [93].

Of note, our ability to interpret these types of variants is still limited and hampered by the lack of public catalogues of clinically classified non-coding variants, although efforts to characterize the effect of non-coding variants at the genome-wide level are being carried out [94]. Currently, definitive interpretation of deep-intronic or regulatory variation requires RNA as well as DNA sequencing to investigate the effects of such variants on splicing patterns or gene expression levels [8, 93, 95], with access to myocardial tissue rarely available for routine genetic testing. However, in the near future, WGS will plausibly be employed by a larger number of laboratories thanks to the combination of lower costs and more accessible informatics infrastructure. While research on variants on non-coding regions is still in the early stages, WGS will enable much needed research on such variant classes which, along with improved CNV detection through the optimal coverage of WGS, promises a substantial impact on the yield of genetic tests in cardiomyopathy.

Not All that Glitters Is Gold: Dangers of Genome-Wide Diagnostic Sequencing

As mentioned above, in the last 5 years, new population genetics resources such as ExAC contributed to raise awareness in the scientific and clinical communities about the dangers of a permissive approach to genetic associations and variant interpretation. Resources of fundamental importance to clinical geneticists such as ClinVar [96] are partially contaminated from inconsistent variant interpretations, although enhancements in variant classification frameworks—such as the release of the current variant interpretation guidelines [97]—and efforts to identify variants that have been misclassified or are characterized by low penetrance aid more accurate and concordant results [98, 99].

The expansion of clinical genetic sequencing facilitates a deeper understanding of genetic variation and its role in health and disease, enabling in turn enhanced variant detection and interpretation in the diagnostic setting. However, routine adoption of genome-wide sequencing assays for clinical genetic testing in cardiomyopathies—as well as in other conditions—will also pose some challenges (Fig. 4).

Fig. 4
figure 4

Summary of the advantages and the disadvantages/dangers associated with the routine adoption of genome-wide sequencing approaches for genetic testing in cardiomyopathies. WGS = whole-genome sequencing

It is key that researchers, physicians and geneticists remain aware of the dangers that false positive variant interpretation (i.e. a variant incorrectly classified as (likely) pathogenic) can have on patients and—often especially—on their family members. Labelling a healthy individual as intrinsically affected can have serious clinical, psychological and financial consequences, for example, by unnecessary implants of defibrillators, life-long anxiety and insurance discrimination [100,101,102]. Uncertain test results—irrespective of the classification as “VUS” being correct or not—also bring negative consequences such as concern and poor understanding of the implications [103].

Both WES and WGS imply the detection of extremely high numbers of variants compared with targeted gene panels, many of which may be plausible candidates for disease causation as characterized by rarity in population databases and therein classifiable as VUS. Data sharing efforts and strategies based on the intersection of public exome/genome variation databases with private cohorts will doubtlessly help in limiting the number of candidate pathogenic variants, for example through the creation of “non-pathogenic variants blacklists” [104].

Rigorous variant interpretation frameworks applied by diagnostic laboratories and recommended by regulating bodies are of little utility if not paired with accurate gene selection for the disease in question. The main risk of not applying both strategies lies, in the best-case scenario, in a substantial increase in the number of uncertain genetic findings, as shown on a smaller scale with the adoption of extended targeted gene panels compared with those including only validated disease genes [105]. As reflected by the complex gene classification scheme adopted by ClinGen [15], the degree of robustness of a given gene-disease association is not easy to define and it may be difficult to establish, in the clinical genetics setting, which genes with moderate evidence of association to include in diagnostic panels. With the routine adoption of WES or WGS for diagnostic sequencing, guidelines defining which genes to include in virtual panel testing are warranted.

Of note, restricting the testing to a limited subset of genes can also pose ethical dilemmas, as findings of potential medical value, unrelated to the disease being tested for, can occur in other genes. For this reason, the American College of Medical Genetics and Genomics has issued dedicated guidelines, identifying 59 genes in which clinically relevant variants should always be reported to the patients [106]. Naturally, when WES/WGS sequencing will be routinely used for diagnostics, incidental findings will be encountered at much higher frequencies, likely posing further challenges in establishing the optimal trade-off between the risks of “over-reporting” variants in absence of the related phenotype and often characterized by low penetrance, and not informing about the increased susceptibility to a specific, potentially life-threatening condition.

On the research side, approaches that investigate Mendelian protein-altering genetic causes of disease beyond validated disease genes are necessary, although those investigating non-coding variants and more complex inheritance models are plausibly holding the answer to a greater proportion of the genotype-negative cardiomyopathy patients. In this respect, genome-wide sequencing assays can be used in analysing families where the observed disease segregation is incompatible with Mendelian inheritance. This has been done in several studies that, using WES, proposed the cause of disease to lie in the combined effect of multiple variants segregating in the pedigree [53, 57, 107,108,109]. Such analyses are largely needed to elucidate potential oligo-genic inheritance in cardiomyopathies and could also serve the creation of catalogues of variants with intermediate effect sizes that in combination (perhaps also with non-genetic factors) can cause disease, as currently done for single pathogenic alleles e.g. in ClinVar, besides potentially representing validation datasets for PRS development. However, while investigations on these poorly explored disease models are absolutely warranted, they pose difficult challenges related to discerning variants that contribute to the disease onset from those that are instead benign bystanders or that contribute only in some cases due, for example, to low penetrance. Of note, only some of these studies include robust complementary evidence demonstrating that any subset of the proposed pathogenic combination of variants cannot cause disease in absence of the other allele(s). In some instances, WES-based studies on families hypothesise a decisive contributory role in reaching disease onset for variants in genes that are far from validated for the disease in question, such as TTN in HCM, or report very high diagnostic yields including many such genes and hypothesising oligo-genic inheritance [107, 110].

Despite cardiovascular genetics professionals willing to pay approximately USD 150 for every 1% increase in diagnostic yield, the lower number of VUS was a significant determinant for the uptake of a specific test over the others, and cardiovascular geneticists are currently still reported to have an overall strong preference for targeted panel testing over WES or WGS [111]. This reflects the fact that, irrespective of the disease in question, the mode of inheritance and the potential proportion of cases explained by the hypothesised genetic cause, it is fundamental that clinical genetics keeps following an “innocent-until-proven-guilty” philosophy, also to protect geneticists from having to cope with unmanageable amounts of uninterpretable variants.

This implies that what remains hypothetical in research—pending definitive validation, which always constitutes research progress—should not be incorporated into diagnostic practice and that the relevant bodies issue standardized guidelines concerning the diagnostic workflow in relation to genes’ and variants’ association with disease. Carefully addressing ethical aspects related to the potential discovery of pathogenic variants or risk alleles for conditions unrelated to the one being tested will also be a priority, given that incidental findings will most likely be encountered increasingly often.