“MENDELIAN CODE” OF COMMON DISEASES

Common diseases are a major public health problem around the world, therefore, the formation of new strategies for the prevention and treatment of widespread pathologies remains the focus of attention of researchers. The hereditary component of such diseases is complex, so genetic research in this area is still relevant today, although approaches to its study have changed significantly.

In recent years, the “Mendelian code,” according to which there is a connection between Mendelian and common diseases hypothesis has attracted considerable interest. Many of the common variants contributing to the formation of complexly inherited diseases are localized in the genes causal for monogenic pathologies [1]. The phenotypic effects of mutations leading to syndromic or monogenic pathology are extremely strong, while polymorphisms bring weak effects, modified by both environmental factors and genetic background. Therefore, polymorphisms in the genes of Mendelian diseases may be significant for common diseases in which the same biochemical pathways are involved. This idea has been confirmed and developed in a number of studies. For example, it has been shown that more than 23% of genes in which mutations lead to highly penetrant Mendelian diseases are also associated with common diseases, and the OR values ​​for high risk variants of such genes are higher than for high risk variants of genes that do not lead to monogenic diseases, but are associated with common diseases [2]. It is known that carrying out genome-wide association studies (GWAS) makes it possible to identify substitutions in close proximity to genes of Mendelian diseases, and the phenotypic manifestations of complexly inherited pathologies partially overlap with manifestations of monogenic diseases [3].

A look through the prism of the “Mendelian code” hypothesis can give a second wind to the candidate approach in the study of the genetic component of common diseases. Our attention was drawn to a group of genes for proteins of DNA repair systems. Disturbance of DNA repair processes leads to a number of monogenic and oncological diseases [4, 5]. The involvement of gene polymorphism of DNA repair systems in the development of pathological conditions of various etiologies is currently being actively studied, but many questions remain. Most attention is focused on identifying the role of genes of DNA repair systems in the development of oncopathology. It is known that mutations in some genes of proteins of various DNA repair systems cause the development of a number of hereditary oncological diseases. Polymorphismsin the genes of this system are associated with oncopathology, sensitivity to chemicals, and radiation. The study of these phenomena most often includes the involvement of genetic polymorphism of these gene systems [5]. At the same time, the participation of polymorphism of genes of the repair systems in the formation of a predisposition to a wide range of complexly inherited pathologies, such as diseases of the cardiovascular system [69], mental [10, 11], metabolic [12], immunological [1315], and other disorders has been studied (including by GWAS).

Hundreds of proteins are involved in DNA repair, according to GenOntology, the products of 511 genes are involved in DNA repair in humans (GO: 0006281, “Homo sapiens,” “experimental evidence,” and “UniProt” filters) [16]. In general, the proteins of the repair systems form multiprotein complexes that allow DNA damage to be corrected, and can also form complexes outside their own repair systems [17, 18]. In this review, the participation of polymorphisms of genes of the BRCA1-associated genome surveillance complex (BASC) in the formation of a predisposition to common diseases is considered. The BASC contains tumor suppressors and DNA damage repair proteins, namely, BRCA1, MSH2, MSH6, MLH1, PMS2, RF-C, PCNA, ATM, BLM, MRE11, NBN, and RAD50. The BASC is a dynamic system that changes its composition both throughout the cell cycle and within subcellular domains. The proteins of this complex (Fig. 1) participate as components in various repair systems and are also capable of forming small stable subcomplexes independent of BRCA1 [19].

Fig. 1.
figure 1

BASC. The network was built using the on-line resource STRING v: 11.0; the lines connecting proteins in the network represent experimentally proven interactions, co-localization, and co-expression. In addition to the BASC, these proteins are included in protein complexes (all protein complexes are underlined) such as MRN (MRE11, RAD50, and NBN), MRX (MRE11 and RAD50), RF-C (RFC1, RFC2, RFC3, RFC4, and RFC5), and MMR (MLH1, PMS2, MSH2, MSH6, PCNA, and RF-C). Along with BRCA1, ATM, and BLM, the protein complexes MRN and MRX are involved in homologous recombination processes, and MRX and ATM are involved in non-homologous end joining processes. The RF-C heteropentamer and PCNA are involved in DNA replication and excisional repair. The MMR complex performs mismatch repair.

BASC PROTEINS AND DNA DOUBLE-STRAND BREAK REPAIR

Immediately after the DNA double-strand break, the nearby histone H2AX quickly becomes phosphorylated (by ATM kinase or other kinases of this family), forming gH2AX [20]. Phosphorylation of histone is necessary for coordinating the assembly of the repair complex, as well as for additional recruitment of the necessary enzymes [2127]. Nibrin (NBN) in the MRN complex (including the MRE11, RAD50, and NBN proteins) binds to gH2AX and ATM, which promotes the accumulation of DNA repair factors in the chromatin surrounding damage, thereby activating the repair of DNA double-strand breaks [2830]. The MRN complex is the only sensor for ATM activation in telomere dysfunction [31]. It plays a central role in the activation of ATM kinase at sites of DNA double-strand breaks [32]Fig. 1.

The ATM protein kinase, a member of the P13/P14 family , plays an important role in the signaling pathways for double-strand breaks in the DNA of higher eukaryotes. ATM indirectly controls the presence of DNA double-strand breaks through induced changes in the chromatin structure [33]. The ATM protein can be found both in the nucleus and in the cytoplasm, including in peroxisomes, vesicles, and mitochondria. The ATM kinase has hundreds of targets in a number of signaling pathways involved in the maintenance of cellular and redox homeostasis and regulation of mitochondrial functions [3436]. It is assumed that in response to the appearance of DNA double-strand breaks, ATM is activated by two independent pathways, one involving TP53BP1, and the other NBN [37]. In addition, ATM can be activated in mitochondria in response to oxidative stress regardless of the cellular response to DNA damage, DNA single-strand breaks, and changes in chromatin structure [3841]. Mutations in the ATM gene lead to ataxia-telangiectasia (Table 1), which, in addition to the main neurodegenerative disorders, is characterized by vascular changes and frequent respiratory infections. Associations of ATM gene variants with longevity [42, 43], schizophrenia [10], coronary atherosclerosis, diabetes mellitus [6, 44], insulin resistance [45], and response to metformin therapy in type 2 diabetes mellitus [4648] have been described.

Table 1.   Monogenic diseases caused by mutations in the genes of the BASC proteins

The MRN complex, consisting of the MRE11, NBN, and RAD50 proteins, is involved not only in the repair of DNA double-strand breaks, but is also associated with telomere formation and verification of DNA damage [49]. This complex is involved in signaling pathways that mediate the development of innate immunity [50]. Mutations in the NBN gene (NBS1) lead to Nijmegen chromosomal breakage syndrome, and mutations in the RAD50 gene lead to a disease similar to Nijmegen syndrome (Nijmegen breakage syndrome-like disorder) (Table 1). Moreover, polymorphisms of the NBN gene are associated with autoimmune diseases [13], metabolic disorders [12], and aging processes [51]. Variants of the RAD50 gene are associated with bronchial asthma [14, 52, 53] and cardiovascular pathology [54]. MRE11 gene mutations have been described in ATLD1 (Ataxia-telangiectasia-like disorder 1 (Table 1)) [56, 57], and the association of variants of this gene with myocardial infarction [7] and immune aging of T-cells [58] has also been described.

For effective signal transmission through MRN and ATM, a number of other factors are required, including TP53BP1 and BRCA1, the presence of which in the complex determines the pathway along which the repair of DNA double-strand breaks will go, either homologous recombination (HR), mediated by BRCA1, or non-homologous end joining (NHEJ) mediated by TP53BP1 [59, 60]. BRCA1 is a nuclear phosphoprotein that plays an important role in maintaining genomic stability in general, and as a tumor suppressor. The functions of BRCA1 are extremely diverse: it interacts with the components of the histone deacetylase complex, causing this enzyme to participate in the processes of transcription, DNA repair, and recombination [61], and participates in the assembly of the mitotic spindle [62]. By binding to RAD50, BRCA1 blocks the exonuclease activity of the MRN complex [63, 64]. Of all the genes of the BASC proteins, BRCA1 is the most actively studied - mutations in this gene are responsible for about 40% of hereditary breast cancer cases and more than 80% of cases of hereditary breast cancer with ovarian cancer (Table 1) [4, 5]. Polymorphisms of BRCA1 are associated with different types of inflammatory response [5, 65].

One of the components of the homologous recombination system, the product of the BLM gene, interacts with topoisomerase 3α and two helicases of the same subfamily as BLM (DExH-box-containing DNA and RNA helicases). BLM possesses DNA-dependent ATPase and DNA-helicase activity; it is involved in the processes of recombination, repair, replication, segregation of sister chromatids in mitosis, resolving the Holliday structure and untwisting G‑quadruplexes, recombination-mediated telomere elongation, and regulation of gene expression [66–69]. Mutations in this gene cause the development of Bloom syndrome (Table 1), which is characterized by an excessively high frequency of recombination of homologous chromosomes and sister chromatids [70, 71]. By influencing cell viability and apoptosis, BLM is involved in cataract progression [72]. In a mouse model, BLM has been shown to ensure the proper development and functioning of B-lymphocytes of various subtypes [73]. The involvement of BLM in the pathogenesis of colorectal and prostate cancer has been shown [74, 75].

It should be noted that there is an association of polymorphisms in the NBN, MRE11, RAD50, ATM, BRCA1, and BLM genes with various oncological diseases [5, 67, 7685].

BASC PROTEINS AND MISMATCH REPAIR

Another complex associated with BASC is that of proteins of the very complex system of postreplicative DNA mismatch repair (MMR). The main role of this complex (which includes at least 20 proteins) is to eliminate errors associated with DNA replication (in addition to such mismatches as G/T, G/G, and A/C, the following are also targets for this system: O6-methylguanine, coupled with C or T, cisplatin-induced CpG interchain crosslinks, UV-induced photoproducts, purine adducts of benzpyrene, aminofluorene derivatives, 8‑oxyguanine, insertions, and deletions) [86]. In addition, MMR inhibits recombination between non-identical sequences and affects many processes associated with DNA metabolism, including signaling of DNA damage, expansion of trinucleotide repeats, switching of synthesis of immunoglobulin classes, and somatic hypermutability [87]. MMR appears to play a dual role in the response to DNA damage (direct mismatch repair and signaling of DNA damage) and is also involved in the activation of apoptosis induction [88]. In vivo experiments have shown that MMR mediates the cellular response to telomere dysfunction by weakening the induction of the p21 protein [89].

Mismatches are recognized and bound by MutS-subunits, which are heteroduplexes of three types: the MutSα complex, formed by the MSH2 and MSH6 proteins, recognizes mostly single mismatches and small insertions/deletions; the MutSβ complex (MSH2/MSH3), links extended insertions/deletions; and the MutSγ complex (MSH4/MSH5), is involved in the meiotic recombination process [9093]. MutS binds the MutL-subunit, and is also represented by several heterodimeric forms: MutLα, MutLβ, and MutLγ. The MLH1 and PMS2 proteins form the MutLα subunit. Physical interactions of MutLα with DNA polymerase III, which bind to the repair site, are shown. MLH1 can also form heterodimers with MLH3, forming the MutLγ subunit involved in the meiotic process. PMS2 and MLH3 have weak endonuclease activity, which is critical for the functioning of the MutL subunit. PMS2 cannot cut methylated DNA, therefore it is likely that only de novo disorders can be corrected in this way, but this has not yet been proven in human cells [17, 18, 93, 94]. MLH1 in combination with PMS1 (MutLβ) or PMS2 suppresses homologous recombination, especially when it comes to short homologous DNA stretches. In addition, the human MutL complex protects against the formation of genomic rearrangements by participating in processes that are not directly related to mismatch repair [95]. Thus, PMS2 is involved in spermatogenesis [96]. Mutations in the MLH1, MLH3, PMS2, PMS1, MSH2, and MSH6 genes can be the cause of some hereditary cancers (Table 1), such as Lynch syndrome (hereditary non-polyposis colorectal cancer) and Turcot syndrome (mismatch repair syndromic cancer, brain tumor polyposis syndrome) [4, 97101]. Mutations in MLH1 and MSH2 can cause Muir-Torre syndrome (MRTES) and in MLH1, MLH3, MSH2, MSH3, and MSH6—endometrial cancer [97, 102105]. Associations of polymorphisms of these genes have also been shown with other oncological diseases [5, 106111]. In addition, associations of polymorphisms of the MLH1 and PMS2 genes with lifespan have been revealed [112, 113]. Mutations in the MSH5 gene have been described, leading to premature extinction of ovarian function [114, 115], as well as the association of polymorphisms with IGAD1 (IgA deficiency) [116]. The MSH4 gene is associated with male infertility [117]. In a mouse model, it was shown that MSH2 deficiency leads to a genome-wide increase in the degree of histone H3 methylation [118].

Another protein complex, replicative factor C (RF-C), which has ATPase activity in the presence of DNA and proliferating cell nuclear antigen (PCNA), plays an important role in mismatch repair. RF-C is a heteropentamer, the subunits of which are encoded by the RFC1, RFC2, RFC3, RFC4, and RFC5 genes [119]. RF-C interacts with the 5'-end of the DNA sequence and then binds PCNA, mediating its binding to DNA. PCNA is required for the assembly of the replication complex; it stabilizes the complex of template DNA and DNA polymerase (delta or epsilon), which provides processive DNA synthesis [120]. PCNA can also bind a number of proteins (ligase 1, methyltransferase, flap endonuclease 1, and others), attracting them, if necessary, to the replicative complex; it is involved in maintaining the viability of proliferating cells [121123]. The endonuclease activity of MutLα is activated by PCNA [124]. RF-C is involved in the processes of replication, excisional nucleotide repair, mismatch repair, and maintenance of telomere stability [125127]. Polymorphism of the PCNA, RFC1, RFC2, RFC3, RFC4, and RFC5 genes is actively studied in connection with oncopathologies [5, 119, 128130]. Expansion of the pentanucleotide repeats in the RFC1 gene leads to autosomal recessive late age cerebellar ataxia (Table 1) [131]. An association of the polymorphisms in the RFC1 gene with immunological reactions was revealed [132]. The RFC2 gene is located in the deleted region in patients with Williams-Beuren syndrome (Table 1) [133, 134]. Mutations in the PCNA gene lead to the development of ATLD2 (Table 1), a disease similar to ataxia-telangiectasia [135]. The PCNA level is actively used as a protein marker of proliferating cells; therefore, a change in its level in various pathologies and in model systems has been noted in many works (see [5] and references therein). The involvement of PCNA in the pathogenesis of Parkinson’s disease has been shown [136, 137].

GENES OF BASC PROTEINS AND COMMON DISEASES

At present, there is no sufficiently convincing evidence of the participation of most genes of proteins of this functional class in the development of common diseases, although a lot of data have already been accumulated in this area. A. Ciccia and S.J. Elledge, investigating 40 syndromes caused by mutations in more than 80 genes, concluded that defects in DNA repair primarily affect the homeostasis of the nervous, immune, and reproductive systems; they can also lead to premature aging or a predisposition to cancer [34]. Indeed, from the data presented by these authors, it can be seen that pathology of the nervous system is observed in 62% of cases of syndromes caused by mutations in the genes of DNA repair proteins, an increased predisposition to oncopathology in 37%, disorders in the functioning of the immune system in 35%, and certain signs of premature aging in 25% of patients [34]. In the available sources, most information concerns the involvement of genes of DNA repair proteins in the development of oncopathologies (see, for example, DisGenet, and also [4, 5, 67, 7685, 97101, 106111, 119, 128130]). Therefore for this review, other aspects are more interesting to consider.

Mitochondrial Dysfunction

Chromosomal rearrangements and genome instability are markers of impaired DNA repair processes. However, a number of signs common to some syndromes associated with impaired DNA repair (ataxia-telangiectasia, Bloom syndrome, and Nijmegen syndrome), such as premature aging, growth retardation, insulin resistance, endocrine disorders, and immunodeficiency, can be caused by defects in antioxidant protection and, as a consequence, accumulation of oxidative DNA damage [138]. These processes are associated not only with the direct accumulation of mutations in the nuclei of somatic cells (correct DNA repair is especially important for neurons), but also with the accumulation of damage in mtDNA. Since mtDNA encodes important subunits of the mitochondrial respiratory chain, defects often lead to disruption of oxidative phosphorylation processes [34, 139]. Mitochondrial dysfunction can be the cause of a number of pathological conditions affecting primarily tissues with an active metabolism, including the central nervous system, skeletal muscles, heart, and number of other tissues, causing neurodegeneration, cardiovascular and metabolic diseases, aging, cancer, and other disorders [139, 140]. Disturbances in the ultrastructure and functions of mitochondria, and increased production of mitochondrial ROS are characteristic of ataxia-telangiectasia and ATLD, Bloom syndrome, Nijmegen syndrome, Cockayne syndrome, and pigmented xeroderma [36, 138].

Most ROS are generated during normal cellular metabolism, mainly in mitochondria, peroxisomes, and the endoplasmic reticulum [141]. The immediate physical proximity to the sources of ROS and the absence of histones leads to an increase in the rate of mutagenesis in mitochondria, which is 10–20 times higher than in the nuclear genome [140, 142]. The most stable oxidative damage to DNA, 8-oxyguanine, can form mismatches with adenine during subsequent replication, and these mismatches occur more often than “correct” pairs with cytosine [143]. Thus, a “mutator” phenotype is formed, i.e. DNA damage causing further mutations [144, 145]. Oxidative DNA damage is corrected primarily by BER (base excision repair); in mitochondria errors are also corrected primarily by BER. However, this is far from the only mechanism. It has now been proven that NER (nucleotide excision repair), MMR, and DNA double-strand break repair systems also actively function in mitochondria, both based on homologous recombination and non-homologous end joining, probably based on microhomology mediated end joining, MMEJ. Despite the fact that the details of the functioning of these systems in mitochondria and in the nucleus may differ [140, 145147], the participation of proteins of the BASC in the repair processes in mitochondria is expected.

Several proteins of the BASC are found in mitochondria and are required for normal functioning. It has been shown in cellular and animal models that the loss of ATM kinase activity leads to a rapid change in mitochondrial homeostasis. In fact, this testifies to the important role of ATM in the functioning of mitochondria, independent of damage in both nuclear and mtDNA. It is assumed that an increase in the level of oxidative stress in mice with knockout of the ATM gene is associated with an increase in the level of ROS in mitochondria, possibly due to decreased activity of complex I [148]. When the MMR complex (MLH1 or MSH2) is deficient in the cell, the activity of complex I also decreases [124, 145], and the loss of MLH1 leads, in addition, to a significant decrease in the number of mtDNA copies. When assembling the MMR complex on DNA mismatches, MLH1 and ATM interact, which is necessary for the development of a further cellular response to DNA damage [149]. Mitochondrial dysfunction caused by a deficiency of proteins of the mismatch-repair system can either be mediated precisely by a violation of the MLH1/ATM interaction [145], or be an independent event with a still unknown implementation mechanism. The involvement of the MRN complex and the BLM gene product in the functioning of the MMEJ has been shown, however, evidence for the presence and functional significance in mitochondria has so far been obtained only for MRE11 and RAD50 [140, 150, 151]. The role of BRCA1 in maintaining the stability of both nuclear and mitochondrial genomes is considered universal [152]. The localization of other proteins of the BASC in mitochondria has not yet been established.

Thus, there are convincing data on the role of a number of genes of DNA repair systems in the development of mitochondrial dysfunction and response to oxidative stress. The participation of the ATM gene product in these processes has been studied most fully [153155]. The involvement in the development of oxidative stress can explain the association of the ATM gene with the pathogenesis of a limited spectrum of diseases, in particular, cardiovascular or neurodegenerative diseases. In our opinion, a more universal mechanism by which proteins of genes of DNA repair systems can contribute to the development of common diseases of various etiologies is the modification of the immune response and inflammation.

Immune Response

The presence of concomitant immunological disorders in patients with monogenic diseases caused by mutations in the genes of DNA repair systems (along with increased radiosensitivity and a tendency to develop oncopathology) is well known. Thus, in patients with ataxia-telangiectasia (mutations in the ATM gene), an increased frequency of bacterial respiratory tract infections is observed from infancy due to a disturbance of the assembly of genes of immunoglobulins and T-cell receptors (TCR). Similar disorders were found in patients with Nijmegen (mutations in the NBN gene) and Bloom (mutations in the BLM gene) syndromes [156158].

The formation of antigen-recognizing sites of immunoglobulins and TCRs during the maturation of T- and B-lymphocytes occurs as a result of V(D)J-recombination (rearrangement), which is somatic non-homologous recombination. Mutations in the genes of some proteins that carry out rearrangement lead to the development of severe combined immunodeficiency disorders (SCID) caused by disorders in the maturation and/or functioning of lymphocytes, such as, for example, Omenn syndrome. Disorder of rearrangement is not the only pathogenetic mechanism of SCID development; in addition, accumulation of toxic metabolites, impaired cytokine signaling, thymic abnormalities, and decreased survival of lymphopoietic progenitor cells have been described. However, it should be emphasized that increased radiosensitivity was observed only in patients with SCID, in whom the disease is caused by mutations in the genes of proteins involved in V(D)J-recombination [159161]. Among the proteins of the BASC, ATM [162, 163], MRE11 [164], NBN [164–166], and RAD50 [164] are involved in such rearrangements.

At later stages of lymphocyte maturation, the formation of a further variety of antibodies is associated with two processes: somatic hypermutation (SHM) and immunoglobulin class switching (CSR) [167]. Violation of these processes leads to the development of rare primary immunodeficiencies, characterized by the absence of the production of the switched isotype (IgG, IgA or IgE) [168, 169]. The involvement of enzymes of the mismatch repair system in immunoglobulin class switching was described in a mouse model as early as 1999 [170]. Later, the involvement of the proteins of the MRN complex in this process was shown in humans [171]. It is known that CSR is carried out with the involvement of the MRN complex proteins, MRE11, RAD50, and NBN [171, 172], as well as MSH2, MLH1 [170], PMS2 [169, 170], ATM [173], and BRCA1 [174]. All proteins of the MMR system are also involved in somatic hypermutation processes [175, 176]. There are known mutations in the genes of the MMR system (MLH1 and MSH2) and the MRN complex (NBN and RAD50), leading to the development of IgAD and CVID antibody deficiency syndromes, as well as polymorphisms in the genes of various DNA repair systems associated with disorders of the process of immunoglobulin class switching [55].

As a result of the analysis of more than a million case histories, Blaire et al. compiled a list of more than 3000 highly correlated pairs of Mendelian disease/common diseases [1]. A query using the keyword “Immunodeficiency” yielded a list of two monogenic (Immunodeficiency with Increased IgM and SCID) and 50 common diseases. Moreover, in about a third of these diseases, inflammation is the leading or concomitant pathogenetic mechanism (for example, viral infection, rheumatoid arthritis, Crohn’s disease, psoriasis, bronchitis, acne, acute myocardial infarction, cataracts, osteoarthritis, and others).

“Mendelian Code”

A list of 3276 genes has been published, in which three groups are distinguished, genes for “complex and Mendelian diseases (CM),” “complex but not Mendelian (CNM)”, and “Mendelian but not complex (MNC)” [2]. Of the 16 genes of the BASC, eight are presented in this list, and only one of them, the ATM gene, belongs to the “CM” type, as it leads to ataxia-telangiectasia and is associated with rheumatoid arthritis. Two genes are categorized as “CNM” and are associated with asthma/atopic dermatitis (RAD50) and Alzheimer’s disease (RFC3). Five more genes are classified as MNC—MLH1, PMS2, BRCA1, BLM, and MSH2 [2]. Indeed, mutations leading to Mendelian pathology have been described in each of these five genes (Table 1). The questions remains whether these genes are truly not associated with complexly inherited diseases, or if such studies have simply not been carried out.

The development of monogenic diseases is associated with 13 out of 16 proteins of the BASC (Table 1). Mutations in the genes of seven of them lead either to the development of oncopathology (BRCA1, MLH1, PMS2, MSH2, and MSH6), or to an increased susceptibility to cancer (ATM and NBN). Mutations in eight genes are associated with neurological disorders (ATM, NBN, RAD50, MRE11, BRCA1 (with Fanconi anemia), BLM, RFC1, and PCNA), and nine, with immunological disorders as a part of the syndrome (immunodeficiency states—ATM, NBN, MRE11, and BLM; T-lymphomas in Turkot syndrome—MLH1, MSH2, MSH6, and PMS2) or as a result of other mutations in the same genes (NBN, RAD50, MLH1, and MSH2) [4]. Accordingly, it can be assumed that these genes are involved in the development of common diseases, which may be based on impaired immune processes, oxidative stress, and/or mitochondrial dysfunction.

Based on the hypothesis of the involvement of polymorphisms in the genes of proteins of DNA repair systems in the development of common diseases, we conducted a pilot study of coronary artery disease (CAD) and bronchial asthma (BA) and studied the variability of several genes of the BASC proteins (ATM, MLH1, PMS2, NBN, and MRE11) in these two pathologies. A number of associations with pathogenetically significant signs for the development of coronary artery disease (CAD) were revealed: body mass index and the presence of peripheral atherosclerosis (NBN and MLH1), levels of low-density lipoproteins (PMS2), dyslipidemia and myocardial mass index in patients with coronary artery disease (MRE11), and echocardiographic parameters (MLH1) [177179]. In addition, associations of the ATM and MLH1 genes with BA have been revealed, it has been shown that the impact of environmental factors (smoking and parasitosis) significantly modifies the manifestation of associations [178, 180].

In addition, using the data presented in PheWeb, we analyzed the association of 16 genes of the BASC proteins with a wide range of phenotypes [181, 182]. PheWeb is a publicly available repository of phenome-wide association studies (PheWAS) of approximately 28 million SNPs with 1403 binary phenotypic traits. This analysis was carried out by the UK Biobank on 408 961 samples of Caucasian DNA using a specially adapted statistical approach [181, 182]. Table 2 shows associations with the highest levels of significance for each gene (a total of 71 pathological phenotypes of various etiologies; grouping of phenotypes and their names in the text are given according to PheWeb). Studies in the PheWAS format only began to be carried out in recent years due to a significant decrease in the cost of genotyping, but have not yet become widespread. Nevertheless, the PheWeb Internet resource has perhaps the most data.

Table 2.   Clinical phenotypes associated with BASC protein genes*

The largest number of phenotypes associated with genes of the BASC proteins (Table 2) are related to disorders of the cardiovascular system, such as myocardial infarction, angina pectoris, various variants of coronary artery disease (associations of these phenotypes with the BLM gene are shown), coronary and cerebral atherosclerosis (BLM and MRE11), peripheral vascular disease (RFC4), hemorrhage (MSH6), and various types of hypertension (BLM, PCNA, and RAD50) (Table 2). Various sources also describe associations with both cardiovascular pathology in general and individual endophenotypes of the ATM, MRE11, and RAD50 genes [69, 54, 177179]. There are also associations with a large group of phenotypes classified as dermatological disorders (Table 2) including alopecia (association with the NBN gene), diseases of the hair and hair follicles (MLH1), nails (MSH6), sebaceous glands (MLH1), disorders of the skin and subcutaneous tissue (BRCA1), and diffuse connective tissue diseases (RFC3 and MSH2). Most of these signs are found in progeria and/or physiological aging processes. It is noteworthy that when there are mutations in the ATM, BML, and NBN genes, premature aging is included in the complex of syndromic signs [138]. Associations of the NBN gene with aging processes [51] and associations of ATM, MLH1, and PMS2 with life expectancy [42, 43, 112, 113] are known. Other dermatological disorders (according to PheWeb, Table 2) include all variants of dermatitis, both of unspecified etiology and atopic/contact (RFC1), while asthma (RAD50) is classified as a respiratory system disease along with hemoptysis (PMS2), nasal polyps (RAD50), and other respiratory symptoms (PMS2) (Table 2). BA is associated with the RAD50 [14, 52, 53], ATM, and MLH1 genes [178, 180]. Both atopic dermatitis and atopic BA belong to allergic diseases (we mentioned them in the context of immunological pathologies). A number of associations of the genes under consideration with other immunological disorders are known including autoimmune diseases [13, 15], IgA deficiency [116], immune senescence of T-cells [58], etc. According to PheWeb, the BRCA1, MSH6, RFC1, and NBN genes are associated with some mental disorders (paranoid disorders, alcoholism, phobias, etc.) (Table 2), in particular the ATM gene is associated with schizophrenia [10]. The involvement of ATM kinase in the regulation of insulin response [35, 36] logically explains the associations of this gene with insulin resistance [6, 4448], diabetes mellitus, and patient response to metformin treatment. According to PheWeb data (Table 2), there are associations of other genes of DNA repair proteins with metabolic disorders (hypothyroidism, MLH1), lipid metabolism disorders, and overweight (BLM and RFC2).

Thus, analysis of the results of associative studies of genes for DNA repair proteins (including information from PheWeb (Table 2)) allows us to make the following conclusions. A huge number of publications have been devoted to the role of genes of DNA repair proteins in the pathogenesis of neoplasia and carcinogenesis. In addition, associations of the genes discussed in this review with disorders of the cardiovascular system, digestive system, urogenital system, musculoskeletal system, hematopoietic system, sensory organs, endocrine disorders, metabolic disorders, mental disorders, neurological disorders, infectious diseases, respiratory diseases, dermatological diseases, traumatic injuries, and the body’s reactions to injuries and poisoning have been revealed. Accordingly, the considered genes have a wider scope than was classically suggested. All this indicates that genes of this functional class should be involved in the study of the genetic component of common diseases. The clearly pronounced pleiotropic effects of these genes explains their likelihood of participation in the development of various bodily states, both in health and disease, and, therefore, the association of markers of these genes with many diseases.