INTRODUCTION

Genomic imprinting is an epigenetic mechanism that allows a gene to be expressed in a parent-of-origin specific manner without altering the DNA sequence. The differential expression of an imprinted gene depends on its parental origin, i.e., diploid cells that contain two parental copies of all genes will express only one parental copy of an imprinted gene and silence the other. Being genetically imprinted, only the maternal or only the paternal genome is unable to ensure normal embryonic development, since paternal epigenetic imprints determine parent-of-origin expression of genes fundamental to placental development, and maternal epigenetic imprints contribute to the early development of embryonic structures [1, 2]. Genomic imprinting has been identified not only in marsupial and placental mammals, but also in flowering plants and some groups of insects, which indicates the evolutionary conservation of genomic imprinting by convergent evolution [3].

Allele-specific DNA methylation is the main mechanism of genomic imprinting, which introduces parental allele-specific epigenetic marks, leading to their monoallelic expression. It is the most studied epigenetic modification and has a major role in imprinting processes. All known imprinted genes contain differentially methylated regions (DMRs) on the two parent chromosomes, and these differences are essential for their monoallelic expression [4]. DMRs are commonly regulatory imprinting centers (ICs). The functional activity of imprinted genes depends not only on methylation, but is also significantly associated with chromatin structure [5, 6]. Almost all differentially methylated imprinted regions share overlapping DNA regions transcribed into non-coding and antisense RNAs with regulatory function [710].

The main mechanisms leading to aberrant function of imprinted genes and regions include: (1) gene mutations; (2) structural rearrangement of chromosomes, mainly deletion and duplication; (3) uniparental disomy (UPD) on certain chromosomes or their regions; (4) imprinting anomalies due to epimutations [11], and (5) functional anomalies of the epitranscriptome due to malfunction of non-coding RNAs [10].

At present, approximately 150 imprinted genes with tissue-specific monoallelic expression have been identified in humans, but the number of imprinted genes is likely more than 200 [4, 12, 13]. Herein, approximately 17 syndromes are associated with abnormal genomic imprinting. Multi-locus imprinting disturbances (MLID) are of particular interest. MLID are non-syndrome conditions associated with multi-locus methylation anomalies of imprinted regions and genes. Apparently, a system of reciprocal control operates in the epitranscriptome between non-coding RNAs at imprinted loci, which in some cases explains the clinical overlap between imprinting disorders [9, 14]. In addition, complete or segmental chromosomal maternal or paternal UPD is often detected in patients with non-syndrome pathology [15, 16]. In a range of cases, phenotypic overlap is associated with MLID caused by mutations in genes that establish or maintain specific allele methylation in regulatory regions subject to imprinting [17, 18].

Almost every human chromosome contains genes or regions that are differentially methylated and expressed monoallelically in a tissue and/or during a particular period of ontogenesis. It is known that gene methylation in gametes, placenta, and at the earliest stages of embryogenesis significantly differs from methylation and expression in the tissues of an adult organism. This review focuses on differentially methylated and monoallelically expressed imprinted genes and chromosomal regions, disorders in which (both structural and functional) cause pathological conditions called syndromes or imprinting diseases.

IMPRINTED CHROMOSOMAL REGIONS AND GENES, WHOSE PATHOLOGY LEADS TO IMPRINTING DISEASES

Imprinted Chromosomal Region 6q24.2 and Transient Neonatal Diabetes Mellitus

Transient neonatal diabetes mellitus (OMIM #601410) is a rare type of diabetes that occurs in infants and has a range of specific clinical features (Table 1) [19]. Transient neonatal diabetes mellitus type 1 is caused by aberrant expression of the PLAGL1 imprinted gene on chromosome 6q24.2. Several sporadic cases of transient neonatal diabetes mellitus with paternal UPD of chromosome 6 suggest either a double dose of a paternally expressed imprinted gene, or deficiency of a maternally expressed one. In a small number of familial cases, the transmission is always paternal and is associated with duplications in 6q24 [20]. The smallest size of duplications identified by molecular exploration of such families was 500 kb [21].

Table 1.   Imprinted regions of human chromosomes, main clinical features, and molecular pathology that leads to imprinting diseases

The PLAGL1 gene regulates transcription and is a tumor suppressor gene [22]. The PLAGL1 protein has a zinc finger motif and induces transcription of the PACAP1-R (ADCYAP1R1) gene, which encodes pituitary adenylate cyclase-activating polypeptide, induces insulin secretion, and regulates pancreatic β-cell function. Overexpression of PLAGL1 impaired β-cell function [23]. PLAGL1 includes 12 exons and is expressed from four different promoters, resulting in the formation of four isoforms of the protein. Transcripts from promo-ters 2, 3, and 4 are produced via biallelic expression, only the PLAGL1 coding transcript that overlaps DMR and the imprinting center (IC) is paternally expressed from promoter 1 (Р1) [24]. In addition, PLAGL1 regulates a range of imprinted and non-imprinted genes, including IGF2, H19, SLC2A4, CDKN1C, and PPARγ1, involved in cell growth and metabolism [25].

HYMAI is another gene in the PLAGL1 region that encodes a long non-coding RNA (lncRNA). The HYMAI gene consists of one exon and has a transcription start site that overlaps the PLAGL1 gene transcript in IC/P1. Hence, the IC/P1 methylation status regulates both the expression of HYMAI and certain PLAGL1 transcripts. DMR/IC, which is methylated on the maternal chromosome and unmethylated on the paternal chromosome, regulates both imprinted genes, i.e., PLAGL1 and HYMAI, which are abundantly expressed in the pancreas [26].

The paternal IC/P1 is commonly hypomethylated, and the maternal IC/P1 is hypermethylated; hence, the HYMAI and PLAGL1 transcripts regulated by IC/P1 are expressed from the paternal allele. Consequently, transient neonatal diabetes mellitus type 1 results from paternal UPD on chromosome 6 in 35–40% of cases, 6q24 duplication—in 35–40% of cases, and aberrant IC methylation (epimutation) of the maternal allele in 6q24 in 20–28% of cases. The loss of methylation at the maternal allele in patients with this type of diabetes leads to an elevated body mass index [19].

Another cause of transient neonatal diabetes mellitus is associated with mutations in the ZFP57 gene that is located on chromosome 6p22.1 and encodes a transcription factor involved in maintaining correct methylation at imprinted loci; these mutations are rare and have been described in 14 cases of transient neonatal diabetes mellitus [27]. We discuss the functions and role of ZFP57 in more detail in the Multi-Locus Imprinting Disturbances (MLID) Affecting Methylation in Regulatory Regions section.

No more than 20 cases of maternal UPD on chromosome 6 have been reported. These UPDs were not accompanied by any characteristic pathological features, except for intrauterine growth restriction [28].

Imprinted PEG13–KCNK9 Locus in Chromosome Region 8q24.3 and Birk–Barel Syndrome

The PEG13KCNK9 imprinted cluster that harbors two imprinted brain-specific genes, PEG13 and KCNK9, has been mapped to chromosome region 8q24.3. The PEG13 gene located in an intron within the TRAPPC9 non-imprinted gene is paternally expressed to form a long non-coding PEG13 transcript that arises from a maternally methylated DMR. This DMR binds a CTCF-cohesin complex that acts as a methylation-sensitive enhancer-blocker on the unmethylated paternal allele [29]. The KCNK9/TASK3 gene is expressed from the maternal allele and encodes a member of the two pore-domain potassium channel subfamily. KCNK9 potassium channels are found throughout the body, and are especially abundant in the brain, where they play a role in the migration of cortical pyramidal neurons [30]. The reciprocal expression of KCNK9/TASK3 genes is regulated by maternal DMR located within the PEG13 transcript. In addition, chromatin loops with classic histone modification (H3K4me1, H3K27ac) for the enhancer regions were also found between the enhancer domain located in intron 17 of the TRAPPC9 gene and the KCNK9 and PEG13 promoter regions in brain tissues, indicating mutual regulation of the allelic expression of the PEG13 and KCNK9 transcripts [29, 31].

Birk–Barel syndrome (OMIM #612292) is characterized by multiple features of dysembriogenesis, delayed physical development and mental retardation (Table 1) [32]. This disease is usually caused by a maternal specific missense mutation (c.770G>A or 770G>С, p.Gly236Arg) [33]. A novel variant c.710C > A: p.Ala237Asp has also been described in the maternal copy of the KCNK9 gene [34]. In addition, complete loss of the maternal allele of the gene has been reported that arises due to complex structural rearrangement of chromosome 8 involving the imprinted region [35].

Molecular Organization of Imprinted Chromosome Region 11p15.5 and the Genes Associated with Beckwith–Wiedemann, Russell–Silver, and IMAGe Syndromes

The 11р15.5 chromosomal region contains a cluster of imprinted genes that appear intermittently with non-imprinted loci. Several imprinted genes and their regulatory regions make a major contribution to the phenotypes of Beckwith–Wiedemann and Russell–Silver syndromes.

The CDKN1C/Р57KIP2 gene encodes a cyclin-dependent kinase inhibitor of the CIP family factors. The gene is a negative regulator of cell proliferation, it is expressed from the maternal chromosome, and its overexpression may promote cell cycle arrest at the G1 border of cell cycle [36, 37].

The KCNQ1 gene encodes a potassium channel. It is expressed in almost all tissues from the maternal allele and has biallelic expression only in the heart muscle. The KCNQ1 human gene is known to have four transcript isoforms. The first form is transcribed only from the maternal allele (in most tissues), the second is transcribed biallelically in the heart muscle, and the third and fourth isoforms are not translated [38].

The KCNQ1OT1/LIT1 gene is localized in intron 10 of the KCNQ1 gene. The paternally expressed lncRNA emerges from an intron of the KCNQ1 gene in the antisense direction [39]. This lncRNA may interact with chromatin enriched with trimethylated lysine residues at positions 9 and 27 (H3K9 and H3K27), as well as with H3K9- and H3K27-specific histone methyltransferases G9a, EHMT2 and the polycomb-2 repressor complex. KCNQ1OT1 may cause specific transcription silencing of neighboring genes by recruiting chromatin remodeling complexes and maintain this state for several cell divisions [40, 41].

The IGF2 gene encodes fetal growth factor, which is a mitogen for almost all cell types, but specifically modulates muscle cell growth and proliferation. IGF2 contains 9 exons, the coding region is confined to only exons 7–9. IGF2 has five promoter regions (P1, P0, P2, P3, and P4) located in exons 1, 2, 4, 5, and 6, respectively, and four CpG-islands, the final CpG island maps to coding exon 9. In both embryonic and mature tissues, it is expressed only from the paternal chromosome [42].

The H19 gene is expressed from the maternal chromosome and encodes lncRNA, which modulates growth suppression, placental growth retardation, cell cycle regulation, carcinogenesis, and metastasis [43]. The study of gene methylation during embryogenesis revealed that H19 biallelic expression is confined to the placenta until 10 weeks of gestation, after which it becomes exclusively maternal and does not affect allele-specificity or levels of IGF2 expression [44].

Structural and Functional Organization of Imprinting Centers IC1 and IC2 in Chromosome Region 11р15.5

The H19 and IGF2 genes are tightly linked, and their expression is regulated by a common enhancer located in the 3'-region of H19 [45]. The common enhancer regulates the transcription of H19 lncRNA, intragenic microRNA miR-675 on the maternal chromosome, and IGF2 and intragenic miR-483 on the paternal chromosome. The IC1 domain, also known as the H19IGF2 intergenic DMR, contains tandem repeats and binds CTCF, POU5F1, and SOX2 transcription factors, which maintain the maternal allele in an unmethylated state, which silences IGF2 expression. At the same time, ZFP57 maintains the methylated state of the IC1 paternal allele and, hence, is not able to bind CTCF and does not prevent the possible activation of IGF2 by an enhancer on the paternal chromosome [4, 45]. The CTCF protein blocks communication between enhancers and the promoter regions of genes, it is necessary for the normal functioning of epigenetic marks [46]. In addition, IC1 demonstrates different states of chromatin on the parent chromosomes: the presence of the repressive marks of histone 3 and 4 (H3K9me2, H3K9me3 and H4K20me3) on the methylated allele, and the activated state of chromatin with di- and tri-methylated histone 3 lysine residue 4 (H3K4me2 and H3K4me3) associated with the non-methylated maternal allele [47]. The secondary DMRs: H19 promoter, IGF2 DMR0, and IGF2 DMR2, are methylated on the paternal chromosome [4, 42]. IGF2 DMR0 is methylated on the paternal chromosome and has a silencer function, it downregulates the expression of IGF2 on the maternal chromosome in mesodermal tissues, except for muscles. Maternal deletion is associated with a lack of the IGF2 silencing effect: the gene is expressed biallelically, with loss of imprinting. When a IGF2 deletion is transmitted paternally, the paternal IGF2 allele is transcribed. The region that contains miR-483 in IGF2 includes another regulatory element that has a silencer function—DMR2. Its deletion on the maternal chromosome causes biallelic expression of IGF2, robustly in the muscles, especially in the muscles of the tongue, which leads to macroglossia. Deletion of DMR2 on the paternal chromosome does not perturb IGF2 expression [48].

A second DMR within the IC2 region is located in intron 10 of the KCNQ1 gene prior to the transcriptional start site for KCNQ1OT1/LIT1, a paternally expressed antisense transcript [49]. It was shown that KCNQ1OT1 lncRNA represses the expression of coding genes in cis in this region, i.e., it downregulates expression of the KCNQ1 and Р57KIP genes, which are expressed from the maternal allele [50]. Methylation of IC2 and silencing of the KCNQ1OT1 promoter are maintained through interaction with ZFP57 on the maternal chromosome [4]. Epimutation results in aberrant expression of KCNQ1OT1 from the maternal chromosome, which leads to biallelic silencing of KCNQ1 and Р57KIP and a pathological phenotype [51]. Hence, the imprinted region of chromosome 11р15.5 contains two ICs located at a distance of about 600 kb. Their dysfunction leads to Beckwith–Wiedemann or Russell–Silver syndromes.

Beckwith–Wiedemann syndrome (OMIM #130650) is a common disease with a characteristic phenotype and a prevalence of 1 : 10 000–15 000 live births (Table 1) [51, 52].

Molecular-genetic and epigenetic anomalies may be detected in 80–85% of patients with Beckwith–Wiedemann syndrome. Approximately 50% of the cases are caused by loss of methylation (LOM) at the maternal IC2 allele (IC2-LOM), leading to reduced expression of CDKN1C and KCNQ1. IC2-LOM is usually a sporadic primary epigenetic defect; however, rare familial cases carrying genetic mutations causing secondary hypomethylation have been described [51, 53, 54]. Recently, a growing proportion of patients with IC2-LOM that have impaired methylation at other imprinted loci has been noticed, which is associated with the appearance of additional phenotypic features [55] (see the Multi-Locus Imprinting Disturbances (MLID) Affecting Methylation in Regulatory Regions section).

Mosaic segmental paternal UPD of chromosome 11 accounts for 20% of cases and leads to altered expression at both gene clusters: IC2-LOM and gain of methylation at IC1 (IC1-GOM), which inactivates Н19 on the maternal chromosome. Complete UPD at chromosome 11 is quite frequent and associated with higher cancer risk [54, 56].

IC1-GOM results in biallelic expression of the IGF2 gene, which is normally expressed by the paternal allele only, and a lack of H19 gene expression from the maternal allele. IC1-GOM is found in 5–10% of cases and in a subset of patients is caused by microdeletions encompassing the OCT4/SOX2 binding site localized inside IC1, leading to a maternally transmitted Beckwith–Wiedemann phenotype [51, 53, 54]. Mice with H19 deletion display epigenetic lesions at the IGF2 gene that lead to overgrowth and exomphalos [57, 58].

Maternal CDKN1C loss-of-function mutations responsible for the maternal transmission of Beckwith–Wiedemann syndrome account for 5–10% of all cases of this disease. Mutations of this gene are found in almost 40% of familial cases and in less than 5% of sporadic cases [53, 59].

Approximately 1% of Beckwith–Wiedemann cases are caused by chromosomal rearrangements (duplications, translocations, inversions, deletions) involving the 11p15.5 chromosomal region, which leads to secondary IC2-LOM and IC1-GOM [60]. Approximately 15% of individuals with a clinical diagnosis of Beckwith–Wiedemann syndrome have no detectable molecular defect when investigated using commonly employed diagnostic molecular techniques. However, low somatic mosaicism of the above mentioned defects is increasingly found by using novel sequencing techniques and analyzing tissues other than blood (buccal epithelium, skin fibroblasts) [53].

Molecular subtypes of Beckwith–Wiedemann syndrome are characterized by a gradient in cancer development probability and display different histotypes allowing differentiation of tumor surveillance protocols according to the epigenotype. This facilitates the early detection of the relevant associated tumors [51]. Biallelic expression of IGF2 is often detected in nephroblastoma, rhabdomyosarcoma, adrenal cortical cancer and other embryonal tumors characteristic of Beckwith–Wiedemann syndrome. Overexpression of this gene in transgenic mice causes overgrowth, macroglossia, organomegaly, and tumors [57, 61]. Indeed, it was found that 20% of tumors show UPD and aberrant IC1 and IC2 methylation. Isolated IC1-GOM Н19 was observed in 7% of patients, and isolated IC2-LOM KCNQ1OT1 was reported in 55% of cases. In the latter group of patients, no tumors were detected, whereas individuals with IC1-GOM or paternal UPD had neoplasms characteristic of the syndrome in 33% of cases [54].

Russell–Silver syndrome (OMIM #180860) is a relatively common inherited condition. Its prevalence in various populations is 1 : 75 000–1 : 100 000 live births. The main phenotypic features of Russell–Silver syndrome include postnatal growth failure, relative macrocephaly at birth, signs of dysmorphogenesis, among others (Table 1) [62, 63].

Since Russell–Silver syndrome displays an array of dysembriogenic features, and delayed mental and physical development, chromosomal analysis was the first stage in identification of its causes. Indeed, several cases were linked to various nonspecific chromosomal anomalies: translocations and inversions, partial trisomies of the long arm of chromosome 1q42, deletion of 8(q11–q12), 13(q22–q32), 18р, and others [64]. Several cases of ring chromosome 15 with deletion of region 15q26.3 encompassing the IGF1R gene, have been reported. Isolated deletions of this gene lead to severe growth delay, triangular face, clinodactyly, micrognathia, microcephaly and mental retardation, but no mutations of the gene were found in Russell–Silver syndrome [65]. In patients with a phenotype similar to Russell–Silver syndrome, balanced de novo translocations were detected corresponding to a breakpoint in chromosome 17(q24–q25). This region contains a cluster of growth hormone genes; their deletions were found in patients without overt syndromic disorders [64, 66].

It had not been expected that alterations in this region may cause another syndrome until two maternal microduplications of chromosomal region 11p15.5 with contrasting phenotypes were described. In the first case, an inverted duplication of the 11p15 maternal chromosome with a size of 1.2 Mb led to Russell–Silver syndrome. The duplication spanned the entire cluster of 11p15.5 imprinted genes, and CpG hypermethylation was observed in the IC2 region. In the second case, a maternally inherited inverted duplication with a size of 160 kb included only IC2 and 20 kb of the 5′-KCNQ1OT1 region, which led to the Beckwith–Wiedemann phenotype in five people over two generations [67].

A nuanced molecular analysis revealed hypomethylation at the paternal allele of the Н19 gene in a range of patients due to epimutation, i.e., the gene switches to biallelic expression. This occurs due to the fact that IC1 is not methylated at the paternal chromosome (found in 40–60% of patients) and binds the CTCF protein, which leads to biallelic silencing of IGF2 [53, 62]. For example, it was shown that DMR in the 5'-region of IGF2 on the paternal chromosome is also hypomethylated in these patients. Since Beckwith–Wiedemann syndrome and Russell–Silver syndrome are imprinting disorders with mirror opposite alterations at the genomic loci in 11p15.5, IC2 (KCNQ1OT1) contributed to the Russell–Silver syndrome phenotype, but insignificantly. Several cases of de novo IC2 methylation on the paternal chromosome have been described [53, 57, 68].

In a range of patients with Russell–Silver syndrome, aberrant Н19 methylation occurs with a frequency of 0–35%. This means that Н19 demethylation on the paternal chromosome occurs only in some cells [69]. The mosaic etiology of imprinting anomalies shows that the lesions arise during the first rounds of cell division, indicating that most cases of the disease are sporadic.

The molecular cause is detected in approximately 60% of patients with a clinical diagnosis of Russell–Silver syndrome. Maternal UPD on chromosome 7 is the second most common etiological factor, which accounts for 5–10% of cases [70, 71]. A systematic analysis of differential methylation at chromosome 7 in maternal and paternal UPD showed that 65% of DMRs on the maternal chromosome are hypomethylated [72]. The short arm region on chromosome 7 (р11.2–р13) contains an imprinted GRB10 gene encoding a cytoplasmic adapter protein that acts as a negative regulator of tyrosine kinase receptor signaling. Paternal expression of this gene is observed in the brain and spinal cord, maternal—in skeletal muscles, and in all other tissues the gene is expressed biallelically. No mutations or aberrant methylation were identified in the regulatory region of the GRB10 gene in patients with Russell–Silver syndrome [73]. The genes in the imprinted loci on the long arm of chromosome 7 (SGCE/PEG10 and PEG/MEST) also did not have any structural and functional disturbances that could cause the development of Russell–Silver syndrome [74]. Therefore, no gene has been conclusively implicated in Russell–Silver syndrome [62, 75]. In addition, paternal UPD for chromosome 7 has no apparent phenotypic effect [72].

CDKN1C mutations of maternal origin have been described in patients with Russell–Silver syndrome in several families [7678] and paternal inactivating IGF2 mutations in other cases [79, 80]. In addition, variants of two non-imprinted genes were identified in Russell–Silver syndrome: HMGA2 variants were described for three families and three sporadic cases [81], and two families and one sporadic case with PLAG1 alterations have been reported [53, 70, 82].

The PLAG1 transcription factor contains seven canonical zinc finger domains for DNA binding and a C-end enriched with serine residues that can activate transcription. PLAG1 binds the P3 IGF2 promoter, thereby increasing its expression, which is necessary for normal embryonic growth. Silencing of the gene may lead to prenatal and postnatal growth retardation and delayed development [83].

Furthermore, multi-locus methylation imprinting disturbances (MLID) in imprinted regions of other chromosomes are possible in Russell–Silver syndrome which leads to a partial phenotypic overlap. For example, when structural and functional disorders of chromosome 11р15 and maternal UPD on chromosome 7 are not detected, maternal UPD on chromosomes 6, 16, 20, as well as the methylation status of two DMR on chromosome 14 need to be investigated [53, 84].

IMAGe syndrome (OMIM #614732) is characterized by intrauterine growth restriction, and dysembriogenic features (Table 1). It is caused by missense mutations in the CDKN1C gene, which inhibits cell-cycle progression [85]. All known single-nucleotide variants associated with IMAGe syndrome are located in a highly conserved hotspot in the PCNA-binding domain of CDKN1C between codons 272–279 and disrupt the binding of PCNA (proliferating cell nuclear antigen). The mutations most often detected are p.Asp274Asn, p.Lys278Glu or p.Arg279Leu. These mutations may increase protein stability and enhance its function, thereby preventing growth [86, 87]. The mechanism has not been established, but it may be associated with decreased degradation of CDKN1C, allowing prolonged cell cycle repression and delayed S-phase progression. CDKN1C is expressed only from the maternal allele, hence only maternal transmission of the mutation results in IMAGe syndrome [88].

Imprinted Region of the RB1 Gene in Chromosome Locus 13q14.1

It has long been known that the tumor suppressor gene would be silenced in various types of tumors via methylation within the promoter regions. One or both RB1 alleles are functionally inactivated frequently in the tumor [89], and inactivation of the gene due to germline epimutation is a rare event. Mosaic methylation at CpG 106 encompassing the promoter region of RB1 has been described in a patient with retinoblastoma, indicating that epimutation occurred in the maternal allele at an early stage of embryonic development [90].

A more nuanced molecular-genetic analysis of the promoter region and CpG islands of the RB1 gene revealed a range of features. The RB1 gene contains an imprinted site (1.2 kb)—CpG 85, a CpG island within the second intron of the gene, which shows parent-of-origin specific methylation—it is methylated on the maternal chromosome and unmethylated on the paternal chromosome. In addition, RB1 contains two more CpG islands: CpG 106 is associated with the promoter and exon 1, it is unmethylated and provides biallelic expression of the main transcript encoding the RB protein; and CpG 42 is located in intron 2 of the gene, it is methylated on both chromosomes and has no regulatory activity [91].

It was found that CpG 85 is part of a 5'-truncated processed pseudogene derived from the KIAA0649 protein-coding gene on chromosome 9 that integrated into RB1 in reverse orientation. CpG 85 acts as a promoter for an alternative RB1 transcript, which is expressed only from the unmethylated paternal chromosome [91]. Despite the expected higher overall expression level of mRNA transcripts from the paternal allele (compared to the maternal allele), the level of expression from the paternal allele is twice or three times lower due to transcriptional interference by combined expression of regular and alternative transcripts. Transcription interference is a mechanism whereby transcription of one gene suppresses the transcription of another gene. For example, the transcription complex of the alternative transcript of the RB1 gene, which binds to the unmethylated CpG 85 island on the paternal chromosome, acts as a block for the transcription complex of the regular transcript on the same allele, which decreases the expression on the paternal chromosome [91, 92]. This indicates an epigenetic parent-of-origin-specific regulation of RB1 gene expression.

In inherited retinoblastoma (ОMIM #180200) (Table 1), which accounts for 40% of retinoblastoma tumors, a germline mutation on one allele of the RB1 gene causes predisposition for retinoblastoma and its familial transmission. The tumor is initiated by a somatic mutation on the other allele of the RB1 gene in a retinal cell, which occurs in utero or early in childhood. In some families with retinoblastoma with two or more carriers of the same germline mutation, the mutations cause a milder phenotype with reduced disease penetrance, where some carriers of the germline mutation do not develop retinoblastoma, or with variable expressivity, and the same mutation in different family members manifests as unilateral or bilateral disease. The phenotypic manifestation of inherited retinoblastoma depends on the functional type of germline mutation in the RB1 gene [93]. In turn, the molecular mechanisms underlying the variable phenotypic expression of the same mutation in different members of the same family are explained by the parent-of-origin impact of the RB1 mutation. Herein, paternally inherited low-penetrance germline RB1 mutations more frequently lead to retinoblastoma and cause a more severe form than maternally inherited mutations. It is believed that when the mutation is inherited from the maternal side, pRB retains sufficient tumor suppressor activity to prevent retinoblastoma development. In contrast, when the mutation is paternally transmitted, the low residual activity would mimic a null mutation and subsequently lead to retinoblastoma [94, 95].

Imprinted Chromosome Region 14q32.2 and Uniparental Disomy at Chromosome 14

The 14q32.2 chromosome region contains a cluster of imprinted genes: some are expressed from the paternal chromosome—DLK1, RTL1 and DIO3, and others are expressed from the maternal chromosome—lncRNA genes MEG3/GTL2, MEG8, RTL1as, snoRNAs, and miRNAs [96]. DLK1, preadipocyte factor 1, or fetal antigen 1, encodes an EGF-like membrane-bound protein involved in Notch signaling and regulation of preadipocyte differentiation. It is expressed in neuroendocrine tissues, especially in the adrenal cortex [97]. A range of structural alterations of the gene were detected in patients with central precocious puberty (Table 1). At the same time, the metabolic phenotype, including an increased incidence of obesity, early glucose intolerance, type 2 diabetes mellitus, and hyperlipidemia, was more common in patients with DLK1 mutations than in patients with mutations in other genes [98].

The retrotransposon-like RTL1 gene is expressed in the placenta at the late fetal stage. RTL1 is responsible for fetal muscle defects characteristic of Kagami–Ogata and Temple syndromes. It also plays an important role in the central nervous system, since its irregular expression leads to impaired innervation of motor neurons to skeletal muscles as well as malfunction of the hippocampus-amygdala complex [99]. The strong expression of DIO3 in the developing and adult brain inactivates thyroid hormones and plays an important role in preventing neurological abnormalities caused by inappropriate levels of these hormones. Altered exposure to thyroid hormones during development may lead to some phenotypic features associated with chromosome 14 UPD [100]. MEG3 encodes lncRNA, which orchestrates a wide range of different cellular processes that require epigenetic regulation of genes and interaction with key signaling proteins. Its intronic DMR contains two CTCF-binding domains and thus has regulatory functions [101]. It is possible that CTCF binding upregulates all genes expressed from the maternal chromosome, which represents a single transcriptional unit and represses all paternal genes by impairing histone acetylation [102]. The functions of RTL1as and MEG8, other than regulatory roles, are unknown. Parent-of-origin patterns of expression are regulated by the germline intergenic DMRs (MEG3/DLK1 DMR), and DMR that is acquired after fertilization (MEG3-DMR). Both DMRs are normally methylated on the paternal chromosome [102, 103]. Loss of DLK1 expression in conjunction with RTL1 and DIO3, due to maternal UPD, leads to Temple syndrome [104].

Temple syndrome (OMIM #616222) has overlapping clinical features with Prader–Willi and Russell–Silver syndromes (Table 1) [9, 62].

The mechanisms that impair hemizygosity of the imprinted genes in the 14q32 region and the clinical phenotypic anomalies include: (1) maternal UPD on chromosome 14 (72–78%); (2) isolated loss of methylation (epimutation) in the MEG3-DMR region (12–20%); and (3) paternal origin of the 14q32 deletion (10%) [104, 105].

Kagami–Ogata syndrome (OMIM #608149) is an imprinting disorder with a range of dysmorphic features and developmental defects (Table 1). The pathognomonic feature is a bell-shaped thorax with coat-hanger configuration of the ribs. Some non-specific phenotypic features are similar to the manifestations of Beckwith–Wiedemann syndrome [106].

The syndrome may be caused by three different molecular events: (1) paternal UPD at chromosome 14 (65% of cases); (2) maternal microdeletions on chromosome 14q32.2 (20%); (3) hypermethylation of MEG3-DMR on the maternal chromosome (15%) [107]. While paternal UPD 14 and MEG3 hypermethylation occur sporadically, microdeletions may lead to a maternal inheritance. It has been shown that deletions of the imprinted region do not necessarily include DMR, hence a normal methylation pattern does not exclude the possibility of the syndrome [108].

Hypomethylation of the DLK1/MEG3 domain decreases the expression of imprinted genes such as IGF2, SNURF, and IPW, as well as several other non-imprinted genes involved in growth stimulation. Alterations in expression may reflect, directly or indirectly, the participation of MEG3 and MEG8 lncRNAs in this process; lncRNAs regulate gene expression both in cis, as well as in trans by recruiting chromatin modifiers [7]. Overexpression or inactivation of MEG3 and MEG8 expression in normal fibroblast cultures may be associated with impaired regulation of imprinted genes in the 11p15.5 and (15q11–q13) regions. For example, it was found that: (1) MEG3 overexpression is associated with lower levels of IPW transcripts; (2) overexpression of MEG8 is associated with a lower level of the SNURF transcript; and (3) concomitant overexpression of MEG3 and MEG8 was associated with lower levels of IGF2 transcripts. This indicates that MEG3 and MEG8 can regulate, in trans, the expression of other imprinted genes [9]. It has been found that IPW overexpression can lead to the downregulation of MEG3 expression [109]. Hence, it is likely that there is a reciprocal control system in the epitranscriptome between lncRNAs within imprinted regions, and this, in turn, may contribute to the overlapping phenotypes of the Prader–Willi, Beckwith–Wiedemann, and Russell–Silver syndromes [14].

The molecular organization of the imprinted chromosomal region 15(q11.2–q13) and the genes linked to Prader–Willi, Angelman, Schaaf–Young syndromes, and central precocious puberty. Chromosome region 15(q11.2–q13) includes one of the extended imprinted regions containing a cluster of imprinted genes. The structural and functional defects in these genes cause the well-known Prader–Willi (OMIM #176270) and Angelman syndromes (OMIM #105830) as well as the rare Schaaf–Yang syndrome (OMIM #615547) and central precocious puberty type 2 (OMIM #615346). Phenotypic features, disease rates, and variants of molecular pathology are shown in Table 1.

The critical region of chromosome 15(q11.2–q13) is the most susceptible region to deletions (5–7 Mb) in the human genome. It may be divided into four fragments: (1) the proximal region containing non-imprinted genes (TUBGCP5, CYPFIP1, NIPA2 and NIPA1); (2) an imprinted region containing genes expressed monoallelically only from the paternal chromosome: protein-coding genes (MKRN3/ZNF127, MAGEL2, NECDIN, NPAP1/C15orf2, SNURF-SNRPN), genes encoding untranslated RNAs (MKRN3-AS/ZNF127-AS, IPW, UBE3A-ATS), and a cluster of genes encoding snoRNAs (SNORD116: 29 copies, SNORD115: 48 copies, SNORD64, SNORD107, SNORD108, SNORD109A and SNORD109B); (3) a region with monoallelic expression of the UBE3A protein-coding gene from the maternal chromosome and biallelic expression of ATP10C; (4) the distal region contains biallelically expressed genes—OCA2 responsible for albinism type 2, the HERC2 gene and three gamma-aminobutyric acid (GABA) receptor genes [110112].

Deletions arise due to non-homologous recombination between domains of highly homologous low-copy repeats at break points (BP) BP1–BP5. ВР1 and ВР2 are localized proximally to MKRN3, and ВР3, 4 and 5 are located in the telomeric area of the imprinted region. There are two classes of deletions: class 1 deletions constitute 40% and they are located in ВР1–ВР3; class 2 deletions are present in 50% of deletion cases and extend from BP2 to BP3. In rare cases (less than 10%), the position of a deletion may not coincide with regular break points, but may be much smaller or extend beyond ВР4 and ВР5 [113117]. The distal cluster of break points has been mapped telomeric to the HERC2 gene. Such deletions include a cluster of GABA receptor genes—GABRB3, GABRA5 and GABRG3. The most frequent break points BP1, BP2 and BP3 are flanked by low-copy repeats originating from duplicated fragments of the HERC2 gene. The original copy of the HERC2 gene is located in ВР3. ВР4 and ВР5 also contain low-copy repeats, but they do not have homology with HERC2. Deletions are caused by intra- or interchromosomal crossover, occur de novo, due to the formation of a large (4–5 Mb) loop, and only in rare cases are they caused by structural rearrangement [114, 118]. In addition, in a large number of cases, the region 15(q11.2–q13) was involved not only in deletions, but also in inverted duplications (tetrasomy), duplications (trisomy), unbalanced (monosomy) and balanced translocations, as well as inversions.

The SNURF/SNRPN gene, the SNORD non-coding gene cluster, and the lncRNA UBE3A-ATS are central to the imprinted region and are expressed from the non-methylated paternal allele. This locus is sophisticated complex genetic locus SNHG14 (Small Nucleolar RNA Host Gene 14) that spans 465–600 kb and contains at least 148 exons subject to alternative splicing [119, 120]. SNURF-SNRPN is a bicistronic gene encoding two different proteins. Its 5'-region contains a CpG island, including the promoter, the first exon and the PWS-IC intron-regulatory region, with a length of 4.3 kb. The SNRPN minimal promoter region includes 71 bp of 5′-UTR and the first 51 bp of SNURF-SNRPN exon 1. This region represents an essential part of IC [121, 122] and is crucial for the regulation of the entire imprinted region [123]. Imprinting is associated with parental allele-specific methylation of CpG residues, and the methylation imprints are established during or after gametogenesis and maintained during embryogenesis. The PWS-IC region is unmethylated on the paternal chromosome and methylated on the maternal one [124].

The main function of PWS-IC is to upregulate the transcription of paternal allele genes, including MKRN3, MAGEL2, NECDIN, NPAP1, SNURF-SNRPN, IPW and UBE3A-ATS. The expression of UBE3A-ATS downregulates UBE3A on the paternal allele by means of transcriptional interference [125].

IC is a bipartite imprinting center, i.e., it contains PWS-IC and AS-IC [126]. AS-IC is located 35 kb more proximal, consists of 880 bp, and contains an alternative 5'-non-coding SNRPN exon, which is expressed only in oocytes [127]. The oocyte-specific transcription leads to methylation and transcriptional inactivation of the maternal allele, but not the paternal allele. AS-IC is crucial for silencing imprinted genes on the maternal allele. The lack of UBE3A-ATS expression on the maternal allele is necessary for normal UBE3A expression [128, 129]. The structural pathology of AS-IC erases methylation imprints on the maternal promoter of SNRPN and upregulates the imprinted genes on the maternal allele [130].

The PWS-IC functions in omatic tissue to activate paternally expressed genes during gametogenesis and guarantee paternal gene expression. The inheritance of a structural pathology or epimutation in PWS-IC leads to Prader–Willi syndrome, biallelic expression of UBE3A and, consequently, silences UBE3A-ATS on the paternal chromosome. During oogenesis, the AS-IC functions to negatively regulate the PWS-IC and suppresses the processes modulated by PWS-IC during spermatogenesis. Hence, a deletion or epimutation in the maternal AS-IC allele interferes with silencing of the paternal genes on the maternal chromosome, including UBE3A-ATS, which leads to biallelic UBE3A gene inactivation [131133].

Mutations or epimutations affecting the function of PWS-IC alter the methylation profile of secondary DMR genes NDN and MKRN3 and lead to loss of allelic expression from the entire imprinted region, indicating that PWS-IC is the main regulator of imprinting within this region in somatic tissues [134].

The first three exons of the SNURF-SNRPN transcript give rise to the small nuclear SNURF protein of unknown function [135], exons 4–10 correspond to the SNRPN portion and encode the SmN spliceosomal protein necessary for the formation of spliceosomes, which are responsible for alternative splicing of various mRNAs [136]. Six snoRNA genes extend more distally and are expressed as a single long transcript; they are regulated by SNURF-SNRPN expression and do not encode proteins [120, 137, 138]. These genes contain five one-copy snoRNA genes (SNORD64, SNORD107, SNORD108, SNORD109A and SNORD109B) and two genes, SNORD115 and SNORD116, which include clusters of repeat sequences that encode C/D-box snoRNAs. These snoRNAs are located in the introns of the SNHG14 “maternal” locus, which encodes an untranslated brain-specific transcript expressed from the paternal chromosome [139]. Ribonucleoprotein complexes between C/D-box snoRNAs guide 2′-O-methylation at specific rRNA sites in association with fibrillarin [140].

In humans, SNORD115 is only expressed in neurons and participates in RNA editing of the 5HTR2C receptor [141]. SNORD116 is located in the nucleolus and may participate in splicing, RNA modifications, and post-transcriptional regulation, although its exact role in the genome still remains to be determined [112, 142]. Isolated deletions of SNORD116 in both humans and mice lead to Prader–Willi syndrome [110, 112, 143]. The snoRNAs processed from the SNORD116 and SNORD115 introns, referred to as 116HG and 115HG, are localized in the form of RNA clouds in the regions of their own transcription in the nucleus of neurons and regulate the functioning of many genes [112, 144, 145].

UBE3A-ATS is part of the lncRNA SNHG14 and is expressed from the SNRPN promoter on the paternal chromosome [119]. SNHG14 can be divided into two functional units based on tissue-specific transcription patterns in humans [119]. The proximal part of the SNHG14/SNRPN transcript includes two mRNAs encoding the SNURF and SNRPN proteins; two lncRNAs with snoRNAs at their 5'-ends and polyadenylated at 3'-ends, i.e., SPA1 and SPA2 that are involved in protein binding and alternative mRNA splicing [146, 147]; the non-coding “maternal” gene of several C/D-box snoRNAs (SNORD109A, SNORD107, SNORD108 and SNORD116) [119], and IPW encoding polyadenylated lncRNAs [148] represent exons in the proximal part of SNHG14, which is transcribed in all tissues [149, 150].

It has been shown that SPA2 and IPW lncRNAs regulate the expression of other genes. For example, lncRNA IPW controls the DLK1-DIO3 imprinted region and expression of the MEG3 imprinted gene on chromosome 14. Some clinical features of Prader–Willi syndrome may be caused by aberrant expression of maternal alleles within the DLK1-DIO3 region [109].

The transcription from the distal part of SNHG14, which includes another “maternal” non-coding gene containing snoRNA (SNORD115 and SNORD109B), and the non-coding UBE3A-ATS, is restricted almost exclusively to the brain [119, 120, 151]. The region separating the expressed proximal portion of the SNHG14 from the repressed distal portion includes a stretch of weak polyadenylation sites and conserved sequences within the last IPW exon and a cluster of CTCF binding sites in and around the exon of SNHG14 annotated as PWAR1/PAR1. Although both elements contribute to boundary function, IPW plays a larger role and is required to completely stop transcription in nonneuronal cells [152]. This confirms that UBE3A-ATS silences the paternal UBE3A allele by means of transcriptional interference.

The imprinted genes in the proximal region (MKRN3, MAGEL2, NECDIN, PWRN1, and NPAP1/C15orf2) are methylated on the maternal chromosome and expressed from the paternal chromosome.

The MKRN3 gene includes one exon. It is expressed in all tissues and encodes a zinc finger containing protein (for more information, see the Central Precocious Puberty Type 2 section).

The MAGEL2 gene is also a single-exon gene. It encodes MAGEL2, a member of the MAGE-protein family, whose mutations cause a phenotype similar to Prader–Willi syndrome (for more information, see the Schaaf–Young Syndrome section). MAGE proteins interact with E3-ubiquitin ligases containing the RING-zinc finger motif, with ubiquitin-specific proteases (deubiquitinases) and affect the ubiquitination of substrate proteins [153, 154].

The NDN gene contains one exon. It encodes the NECDIN protein that also belongs to the MAGE family and participates in neuronal differentiation and maturation, and it suppresses the growth of almost all postmitotic neurons in the brain [155].

The PWRN1 gene (Prader–Willi Region Non-Protein Coding RNA 1) encodes lncRNA with biallelic expression in the testes and kidneys and monoallelic expression in the brain. PWRN1 represents an alternative 5′-part of SNURF-SNRPN. It is assumed that PWRN1 may function to maintain the open chromatin conformation on the paternal allele and provide access to transcription factors [156].

The NPAP1/C15orf2 gene encodes a nuclear pore complex associated protein; it is part of the nuclear pore complex that mediates the transport of macromolecules between the nucleus and the cytoplasm and regulates gene expression, mRNA biogenesis, and the cell cycle. The gene contains one exon, is biallelically expressed in adult testis, and monoallelically expressed in the fetal brain, including the hypothalamus, which is related to several endocrine features of Prader–Willi syndrome [157, 158].

The only imprinted gene in the distal part of the imprinted region is UBE3A. The UBE3A gene is monoallelically expressed from the maternal allele in the brain, while the paternal allele is imprinted. The paternal UBE3A allele is silenced by the neuron-specific antisense transcript UBE3A-ATS on the paternal allele [119, 159].

The UBE3A gene, consisting of 16 exons, encodes E6-AP-ubiquitin-protein-ligase [160]. This gene is biallelically expressed in all tissues, and in several brain structures UBE3A is active only on the maternal chromosome [161]. In mice, the Ube3a gene is monoallelically expressed in the Purkinje cells of the cerebellar cortex, hippocampal neurons, and in the olfactory bulb [162]. The silencing of the maternal allele for UBE3A leads to Angelman syndrome.

The gene encoding ATPase phospholipid transporting 10A-ATP10A/ATP10C is located 200 kb more distally to the UBE3A gene. Initially, it was assumed that it is mainly expressed from the maternal allele in fibroblasts and various brain structures and functions to maintain contacts between cell membranes and communicate signals to the central nervous system, but it was revealed that this gene is not imprinted [163]. It is likely that the lack of gene expression in the brain in patients with Angelman syndrome caused by deletions may lead to more severe symptoms related to severe autism.

The pathogenesis of epileptic seizures in Angelman syndrome is related to a cluster of GABA-receptor genes that are subject to frequent deletions. One such gene, GABRB3, encodes the GABA receptor β3 subunit [164]. Experimental mice with a homozygous deletion of the Gabrb3 gene display memory disorders, learning disability, convulsive seizures, and motor disorders similar to the symptoms of Angelman syndrome. In animals heterozygous for this deletion, neurological disorders are less expressed [165]. Consequently, in the case of an extended deletion, the clinical manifestations of Angelman syndrome may be due to GABRB3 haploinsufficiency [166].

The OCA2 gene encoding melanosomal transmembrane protein is also subject to deletions in Angelman and Prader–Willi syndromes. The loss of both alleles of this gene leads to albinism, and heterozygous carriers of the mutation demonstrate a slight decrease in pigmentation. Indeed, skin hypopigmentation in Prader–Willi and Angelman syndromes occurs only due to extended deletions of the critical area, but is absent in patients with UPD, IC mutations, or point mutations in the UBE3A gene [118, 167].

Molecular Pathology that Causes Prader–Willi and Angelman Syndromes

The most common cause of Prader–Willi and Angelman syndromes is an extended deletion in the 15(q11.2–q13) critical region. This deletion occurs in 65–75% of patients, and in the population its prevalence is 0.67–1.0 per 10 000 live births. Deletion in the critical region on paternal chromosome 15 causes Prader–Willi syndrome, whereas Angelman syndrome is caused by deletion of the same region on its maternal homologue [168170]. Several patients diagnosed with Prader–Willi syndrome had 400 kb microdeletions of the SNORD116 cluster [110, 171]. The smallest microdeletion is restricted to the SNORD116 cluster, a portion of IPW, and the SPA2 transcript. These microdeletions show that a region between SNRPN and UBE3A, the SNORD116 cluster in particular, may be critical to the key clinical presentations of Prader–Willi syndrome [172]. Since this imprinted region has a large homology with the corresponding mouse region, several models with SNORD116 deletion have been created. Mice with the deletion demonstrated phenotypic features characteristic of Prader–Willi syndrome, such as difficulties in motor learning, poor memory, hyperphagia, growth delay, and elevated anxiety [110].

UPD is the second most common cause of Prader–Willi and Angelman syndromes. Maternal UPD accounts for 20–30% of Prader–Willi cases. Heterodisomy occurs in most patients with Prader–Willi syndrome and UPD due to nondisjunction of maternal chromosomes during the first meiotic division. Isodisomy due to nondisjunction of maternal chromosomes in the second meiotic division or segmental disomy because of crossing-over errors occur less frequently [170, 173]. This may be explained by the higher survival rate from zygotes with chromosome 15 trisomy and the early death of monosomic zygotes. Paternal UPD accounts for 3–7% of Angelman syndrome cases; UPD in this syndrome is commonly manifested by isodisomy due to the nondisjunction of paternal chromosomes during the second meiotic division and occurs due to corrections of monosomy to disomy or, more likely, as a result of postzygotic events [111, 173, 174]. Since zygotes carrying mono- or trisomy on chromosome 15 are not viable, the doubling of a single chromosome in monosomy and loss of an extra chromosome in trisomy may be regarded as an emergency measure, after which further development of the embryo becomes possible. The loss of an extra chromosome in a trisomic germ cell appears a more probable event than chromosomal doubling in a monosomal cell at an early stage of development, which explains the different frequency of UPD in Prader–Willi and Angelman syndromes.

The imprinting pathology in Prader–Willi and Angelman syndromes accounts for no more than 1 and 3%, respectively, and may involve a difficult-to-determine structural pathology in the IC or epigenetic alterations affecting methylation and gene expression in the entire imprinted region [175]. In both cases, patients inherit one copy of chromosome 15 from each parent. Loss of methylation at PWS-IC on the maternal chromosome leads to biallelic expression of SNHG14 and silencing of the maternal UBE3A. The imprinting pathology in 10–15% of cases is associated with a small deletion affecting the AS-IC, which regulates the establishment and maintenance of imprinting on the maternal chromosome [176]. More than 85% of the imprinting pathology in Angelman syndrome is caused by epimutation without an underlying change in the DNA sequence. In such cases, the maternal chromosome carrying an epimutation, and an aberrant paternal epigenotype/imprint (modifications of DNA and histone proteins that mark parental alleles and provide a monoallelic expression of imprinted genes) may be inherited from a maternal grandmother or maternal grandfather. The imprinting pathology is caused by an error in the establishment/switching of epigenetic imprints during gametogenesis or results from failure to maintain epigenetic imprint during the early stages of embryo [131]. Patients with Angelman syndrome quite frequently have somatic mosaicism including epimutation, which indicates that mosaic mutations may be acquired after fertilization during early embryogenesis [177].

The imprinting pathology in Prader–Willi syndrome is represented by abnormal methylation of PWS-IC, which downregulates the expression of imprinted genes on the paternal chromosome; 10–15% of cases include PWS-IC microdeletions, both inherited and arising during spermatogenesis or after fertilization. Most cases are associated with epimutations, i.e., accidental errors that occur during the establishment of paternal imprint or due to disruption of the maternal imprinting switch to paternal imprint during spermatogenesis or imprint erasure; if it occurs in early embryogenesis, then somatic mosaicism will result [131, 175].

Mutations in the UBE3A gene occur in 10–15% of cases of Angelman syndrome and represent mutations of the maternal allele with a premature stop during translation. Most of the known mutations alter the HECT-ligase domain. About 29% of mutations are inherited maternally and 71% occur de novo. Frameshift mutations and nonsense mutations dominate. Apparently, missense mutations do not significantly alter the protein function, and the phenotype of such patients differs from the phenotype characteristic of Angelman syndrome. Mutations in the UBE3A gene on the paternal chromosome are not phenotypically manifested [169, 178181]. Atypical UBE3A deletions or rare types of structural rearrangements that alter the critical area have been reported in only a small number of patients with Angelman syndrome [182, 183].

Deficient expression of the maternal copy of the UBE3A gene in Purkinje cells may explain ataxia and tremor in Angelman syndrome, while epileptic seizures and learning inability may be associated with the lack of expression of this gene in hippocampal neurons [162].

Schaaf–Young syndrome (OMIM #615547), which is similar to Prader–Willi syndrome (Table 1), is caused by nonsense and missense mutations in the MAGEL2 gene, which is imprinted on the maternal chromosome and is expressed from the paternal chromosome [184, 185]. The phenotypic expressions range from severe fetal akinesia to mild akinesia, and include mental retardation and finger contractures [186].

MAGEL2 regulates ubiquitination, which is necessary for protein decay [187, 188]. Surprisingly, mutations that results in a shortened protein of the MAGEL2 gene cause Schaaf–Young syndrome, while deletion of the entire gene leads only to an insignificant clinical phenotype or to a complete lack of symptoms. Since MAGEL2 contains only one exon, mutations that result in a truncated protein may be explained by dominant-negative effects. It is likely that deletion of the entire paternal copy of the gene, including the promoter, may partly reverse the expression of the maternal methylated allele [189, 190].

Another disease is central precocious puberty type 2 (OMIM #615346), also known as gonadotropin-dependent precocious puberty (Table 1) [191, 192].

Loss-of-function mutations in the MKRN3 gene are detected most frequently in this disease. The MKRN3 gene encodes a 507-amino acid protein with a RING (C3HC4) zinc finger motif and multiple C3H zinc finger motifs, which predict MKRN3 RNA binding function. The gene contains one exon; it is expressed ubiquitously to yield a 3-kb transcript. The entire coding sequence and 5'-CpG island overlap a second gene, ZNF127AS, which is transcribed from the antisense strand with a different transcript size and pattern of expression. The antisense ZNF127AS RNA of unknown function probably regulates MKRN3 expression. Allele-specific analysis showed that the MKRN3 gene is expressed only from the paternal allele, and the maternal allele is imprinted [193]. Therefore, all affected patients with familial Schaaf–Young syndrome inherit MKRN3 mutations from their fathers [194].

Multiple mutations have been identified in the coding region of the MKNR3 gene, including deletions, nonsense, missense mutations, and frameshift loss-of-function mutations [195, 196]. In addition, some researchers reported mutations in the promoter region and at the transcription start site that alter normal MKNR3 gene expression [197].

MKRN3 can reduce pulsatile GnRH secretion and regulate puberty initiation. The level of MKRN3 decreases during puberty and negatively correlates with gonadotropin secretion in prepubertal girls. Its level in the blood decreases during puberty in healthy boys, but the exact mechanism of its action remains to be determined [198].

Another cause of central precocious puberty type 2 is mutations in the DLK1 gene that resides in the imprinted region of chromosome 14q32.2. Only the paternal allele of the DLK1 gene is expressed (similar to the MKRN3 gene), while the maternal allele is imprinted (for more information, see the Imprinted Chromosome Region 14q32.2 section). It is assumed that the KCNK9 imprinted gene (paternal allele), whose polymorphic variants were described in association with the early age of menarche, may participate in the pathogenesis of central precocious puberty type 2 [199].

Finally, we mention duplication of the imprinted region on chromosome 15q(11.2–q13.1) (OMIM #608636). Carriers of maternal duplication at the imprinted area have hypotonia and delayed motor and physical development and mental retardation, seizures, autism, and other behavioral anomalies, such as psychoses. Furthermore, the grade of these symptoms varies significantly even in individuals with the same genotype. Patients with an isodicentric chromosome of maternal origin (tetrasomy of the imprinted region) are commonly affected more seriously than patients with interstitial duplication. Such pathology of paternal origin is not phenotypically manifested [200203].

Imprinted Chromosome Locus 20q13.2 and a Range of Diseases Associated with Defects in the GNAS gene

GNAS is a complex imprinting locus resulting in maternally, paternally, or biallelically expressed transcripts in differentially imprinted tissues. Gsα A (the stimulatory G protein alpha-subunit) encoded by exons 1–13 has no DMR and is biallelically expressed in most tissues except for the proximal renal tubules, neonatal brown adipose tissue, thyroid gland, gonads, paraventricular nucleus of the hypothalamus, and pituitary gland, where it is expressed from the maternal allele, but the promoter of the paternal allele is not methylated [204, 205].

The three alternative first exons of this locus (А/B, XLas and NESP55) are spliced with exons 2–13 to form different transcripts. DMRs are located in close proximity to alternative exons, which leads to the expression of NESP55 only from the maternal chromosome (its DMR is methylated on the paternal chromosome), whereas XLas, exon А/В, and NESP55АS/GNAS-AS1 antisense transcript are expressed from the paternal chromosome, since the maternal DMRs are methylated [205207].

XLas, a longer version of Gsα, is synthesized mainly in neuroendocrine tissues and the nervous system; truncated neural transcripts of Gsα and XLas, referred to as GsαN1 and XLN1, are also known, and prematurely terminate prior to exon 4. Both proteins differ only by the N-terminal region. NESP55, another product of the GNAS gene, is a chromogranin-like neuroendocrine secretory protein. Similar to XLas, the alternative exon is spliced with exons 2–13, but NESP55 has no homology with the Gsα protein, since it contains a termination codon. The other two transcripts, A/B and AS1, which have their own exons, do not overlap with any of the other exons, and they are expressed in all tissues from the paternal chromosome, but are not translated [206208].

The GNAS locus has two different IC regions. The first is located within the STX16 gene and controls the establishment of differential imprinting at the alternative GNAS A/B promoter, while the second, encompassing the antisense GNAS-AS1 transcript in exons 3–4, controls the establishment of imprinting over the entire GNAS locus [209].

The STX16 gene encoding syntaxin 16, which mediates intracellular interactions, is mapped 220 kb centromeric of the GNAS locus. Evidently, this gene cannot be involved in the pathogenesis of the disease, but all patients with maternal STX16 deletions demonstrate A/B loss of methylation, which impairs a cis-acting element regulating the imprinting of the maternal GNAS allele [205]. In most cases, two variants of deletions are detected: a 3-kb deletion with the loss of exons 4–6 or a 4.4-kb deletion with the loss of exons 2–4. The smallest region of deletion overlap contains a CpG-site that is not subject to differential methylation. STX16 deletions are not restricted only to this region, large deletions and deletions of the entire gene have been reported [210, 211].

In several cases of GNAS locus imprinting pathology, maternal deletions in exons 3 and 4, or 40 and 33-bp microdeletions in introns 4 and 3 of GNAS-AS1 were detected, which led to a loss of methylation in four DMRs and the appearance of biallelic DMR methylation of the NESP55 exon. These deletions lead to complete loss of the maternal imprint in the GNAS locus, resulting in biallelic expression of XL, A/B and antisense transcript, thus pointing to another regulatory element within antisense exons 3 and 4 necessary for full methylation of the maternal GNAS allele [207, 212, 213].

Molecular-genetic and epigenetic alterations in the GNAS locus cause a heterogeneous group of rare endocrine disorders called pseudohypoparathyroidism. It is mainly characterized by renal resistance to parathyroid hormone, which stimulates hypocalcemia, increased phosphate levels, and an increased secretion of parathyroid hormone. In addition to the increased level of parathyroid hormone, resistance to other hormones, such as thyrotropic hormone, whose activities are mediated by Gsα-subunit receptors, has also been described. Depending on the molecular anomalies, pseudohypoparathyroidism includes other endocrine disorders associated with resistance to a range of hormones and some non-endocrine features. The estimated prevalence of pseudohypoparathyroidism is 1.1 per 100 000. The occurrence of clinical heterogeneous phenotypes is related to structural and functional changes in the GNAS gene [209].

Pseudohypoparathyroidism type 1A (OMIM #103580) caused by loss-of-function mutations in the maternal allele of the GNAS gene has characteristic clinical features (Table 1) [206, 214].

Loss of Gsα function on the paternal allele causes pseudopseudohypoparathyroidism (OMIM #612463). The renal tubular cells express mainly the maternal GNAS allele, hence a paternally inherited mutation leads to a normal response of the kidney to the parathyroid hormone (Table 1) [215]. Paternal loss-of-function mutations may cause progressive osseous heteroplasia (OMIM #166350) (Table 1) [216]. In pseudopseudohypoparathyroidism and progressive osseous heteroplasia, Gsα expression in red blood cells is reduced twofold, although GNAS is normally biallelically expressed. The phenotype of Albright hereditary osteodystrophy can be caused by Gsα haploinsufficiency in tissues where GNAS is expressed from both alleles.

In contrast, pseudohypoparathyroidism type 1B (OMIM #603233) is characterized by isolated renal resistance to parathyroid hormone, and in some cases, resistance to thyrotropic hormone. In such patients, Albright hereditary osteodystrophy is rarely detected (Table 1) [209, 215, 217].

All patients with pseudohypoparathyroidism type 1B have at least loss of methylation at GNAS A/B DMR on the maternal allele, which leads to biallelic expression of the А/В-transcript and decreased expression of the GNAS-Gsα transcript in tissues subject to imprinting; hormone resistance occurs due to loss of methylation on the maternal allele [204]. The majority of sporadic pseudohypoparathyroidism type 1B involves disturbances of imprinting at both А/В exon DMR and all DMRs of the GNAS locus. In the absence of deletions within STX16 or NESP55 DMR and paternal UPD, molecular disorders in other distant regulatory regions may be suggested that still need to be determined. While mutations in GNAS are commonly detected in patients with pseudohypoparathyroidism type 1A, such mutations have not yet been detected in patients with pseudohypoparathyroidism type 1B. Since Gsα expression is disrupted in the disease, the correct methylation of exon А/В on the maternal chromosome located closely to the Gsα promoter is necessary for the expression of this protein, at least in the proximal renal tubules [209]. Approximately 20% of pseudohypoparathyroidism type 1B cases are inherited and caused by deletions within IC, while the other 80% are sporadic and are linked to methylation anomalies over the entire GNAS locus [211, 215].

Complete or segmented paternal UPD on chromosome 20 can reach 24% and leads to sporadic pseudohypoparathyroidism type 1B [218].

Maternal UPD on chromosome 20 (Table 1) commonly results from trisomy reduction to disomy, which occurred upon chromosomal nondisjunction in the second meiotic division, and causes Mulchandani–Bhoj–Conlin syndrome (ОМIМ #617352) [219, 220].

Multi-Locus Imprinting Disturbances (MLID) Affecting Methylation in Regulatory Regions

New multi-locus imprinting disturbances (MLID) caused by maternal hypomethylation of various imprinted loci have been in discussion since 2006 [221]. In a family (closely-related marriage), two daughters had phenotypic features of transient neonatal diabetes mellitus with some symptoms of Beckwith–Wiedemann syndrome. The investigation of methylation status at the imprinted regions showed that loss of methylation occurred not only in the imprinted PLAGL1 (6q24) region, but also within KCNQ1OT1 (11p15.5), GRB10 (7p11.2-р12), PEG3 (19q13), PEG1/MEST (7q32) and NESP55AS (20q13). It was hypothesized that the family either carried an autosomal recessive defect that disturbed methylation in children, or establishment of imprints in oocytes was disrupted [222].

The investigation of patients with transient neonatal diabetes mellitus and MLID revealed mutations in the ZFP57 gene [223]. Fourteen families with missense and nonsense mutations in ZFP57 have been described. All patients had very similar methylation patterns at imprinted DMRs: complete hypomethylation at DMR PLAGL1 and combinations of mosaic hypomethylation at GRB10, PEG3, MEST, NAP1L5 and GNAS [27].

ZFP57 encodes a transcription repressor that contains the Krüppel associated box (KRAB) domain encoded by exons 4 and 5 and seven zinc finger motifs in exon 6. The main function of the protein is to maintain DNA methylation in germline DMRs by binding a methylated hexanucleotide TGCmetCGC motif [224]. ZFP57 forms a complex with the KAP1 corepressor protein (KRAB-associated protein-1). KAP1 acts as a scaffold protein for the formation of an inactivating complex comprising histone lysine methyltransferase (SETDB1), the nucleosome remodeling and deacetylation complex (NuRD), heterochromatin protein 1 (HP-1), DNMT1 and UHRF1, which are necessary for DNA methylation maintenance. Proteins containing zinc finger motifs and the KRAB-domain act as transcription repressors by inducing KAP1 heterochromatin and DNA methylation in early embryonic cells [225]. Therefore, this protein complex plays a significant role in the regulation and maintenance of DNA methylation in various imprinted DMRs [226]. Consequently, heterozygous ZFP57 mutations, leading to loss or appearance of a defective protein, perturb methylation at various ICs, resulting in loss of imprinting [27, 227, 228].

ZFP57 binds methylated ICs in the preimplantation embryo, protects these ICs from demethylation and conserves their parental identity. The ZFP57 binding site was found in 17 out of 31 imprinted DMRs, and mutations in it most frequently impair methylation at PEG3, PLAGL1, INPPF5, NAP1L5 and GRB10 [228].

The expression profile of another member of the zinc finger motif protein family, ZNF445, its resistance to loss-of-function mutations, and its ability to bind to KAP1 and form heterochromatin in IC regions indicates its important role in methylation maintenance at the early stages of embryo development. Knockdown of the gene impairs KAP1 binding and H3K9me3 methylation, and, gene expression increases and loss of methylation in IC, including Н19, is detected. All this confirms that ZNF445, similar to ZFP57, can bind to IC, recruit KAP1 and trigger H3K9me3 methylation [229]. At the same time, these two genes are not expressed simultaneously: ZNF445 is expressed first and then ZFP57, and they recruit KAP1 one after another. ZNF445 binds to 13 imprinted DMRs: DIRAS3, ZDBF2, MEST, PEG 13, H19, KCNQ1OT1, MEG3-DLK1, MEG3, NET, GNAS-NESP55, GNAS-AS1, GNAS-XL, and SNU13. These two proteins work in tune to preserve imprints (methylation) on DMRs during early embryonic development [228].

Another gene, ZNF202, binds only four imprinted DMRs (FAM50B, PLAGL1, KCNQ1OT1, and L3MBTL1), but it is thought that it performs a similar function [228].

ZFP42, a zinc finger motif protein, is a marker of stem cells and is abundantly expressed in the preimplantation embryo. It protects the normally unmethylated alleles at imprinted DMR from methylation, Peg3 and Gnas in particular [230]. One patient with Russell–Silver syndrome and MLID with loss of methylation at H19/IGF2 (11p15.5) and MEST (7q32.2) was found to carry a paternally inherited mutation of this gene [55].

Mutations in the maternal genes encoding proteins of the subcortical oocyte complex may cause reproductive disturbances at the epigenetic level [231]. The subcortical oocyte complex plays an essential role at the early stage of embryogenesis and contains at least seven proteins (NLRP2, NLRP5, NLRP7, PADI6, KHDC3L, TLE6, and OOEP). These proteins are expressed only by the maternal genome in oocytes and at the early stages of embryo development, and they are then inactivated when the embryonic genome begins to function independently [231]. The NLRP2, NLRP5, NLRP7 genes encoding a small subfamily of cytoplasmic proteins containing pirin domains are actively expressed in growing oocytes and are necessary for oocyte maturation, regulation of methylation at the early stages of embryogenesis, and maintenance of ploidy in the early embryo [231]. Pathogenic variants of these proteins were found in mothers of children with MLID. These women had reproductive abnormalities (miscarriage and molar pregnancy). For example, the NLRP2 maternal mutation was found in two children with Beckwith–Wiedemann syndrome and MLID. The NLRP5 mutations led to loss of methylation at both maternal and paternal imprinted DMRs, resulting in MLID. NLRP7 is involved in the establishment of oocyte-specific methylation, and its mutations lead to recurrent molar pregnancy with extended loss of the maternal methylated imprint, while the paternal methylated DMRs are not altered [17, 232]. Mutations in the NLRP2, NLRP5, NLRP7 genes lead to Beckwith–Wiedemann syndrome, Russell–Silver syndrome, and transient neonatal diabetes mellitus; OOEP gene mutation cause transient neonatal diabetes mellitus, UHRF1 is associated with Russell–Silver syndrome, and ZAR1—Beckwith–Wiedemann syndrome [17]. Mutations in the PADI6 gene cause loss of methylation at imprinted DMRs: KCNQ1OT1, PLAGL1, GRB10, MEST, H19/IGF2, GNAS-AS1, GNAS-XL, MEG3, SNURF and may cause Russell–Silver, Beckwith–Wiedemann, and Temple syndromes [17, 233].

Other genes of the subcortical oocyte complex, i.e., KHDC3L, TLE6, OOEP, UHRF1, and ZAR1, also contribute (not significantly) to MLID [17, 232, 233].

Molecular-genetic studies have made it possible to detect MLID in some patients with imprinting diseases caused by epimutations. The frequency of MLID contribution to the phenotypic manifestations of imprinting diseases varies significantly in different studies. For example, transient neonatal diabetes mellitus with loss of PLAGL1-DMR methylation due to MLID is estimated to be 30–70%; Russell–Silver syndrome with loss of methylation at H19-DMR is caused by MLID in 7–30% of cases; Beckwith–Wiedemann syndrome with loss of methylation at KCNQ1OT1 or biallelic methylation at H19-DMR due to MLID may constitute 20–50%. MLID accounts for 6.3–12.5% of pseudohypoparathyroidism type 1B cases caused by loss of methylation in GNAS-A/B DMR. MLID-linked methylation disorders occur very rarely in Prader–Willi and Angelman syndromes [4, 17, 18, 234].

Multi-locus DNA methylation disturbances in imprinted DMRs of the genome have already been detected in a significant number of patients with imprinting disorders. Patients with MLID demonstrate a specific classical phenotype of a certain imprinting disease, but some of them develop a complex of symptoms characteristic of different syndromes caused by imprinting anomalies, as shown above. In addition, MLID are often mosaic, since the pathology occurs in a limited number of cells at the earliest stages of embryo development. Therefore, it is very difficult to determine specific correlations between epi-/genotype and the phenotypic manifestations in patients with MLID.

CONCLUSIONS

Over the past decade, significant progress has been achieved in the study of epigenetic regulation of gene expression. Investigation of the structural organization, specific allelic methylation and a certain allelic structure of chromatin in regulatory regions, cis- and trans-interaction of lncRNA with primary/germline (PLAGL1, H19/IGF2, PEG13, IGF2-DMR0, KCNQ1OT1, RB1, MEG3/DLK1, SNURF, GNAS-AS1, GNAS-XL, GNAS A/B) and secondary/somatic (IGF2-DMR2, DLK1, GTL2, MEG8, MAGEL2, NDN, SNRPN, SNORD116, GNAS-NESP) regulatory regions (DMRs) at various stages of ontogenesis allows us to understand how the epigenome functions in normal and pathological conditions. Disturbances of concerted interactions between these structural and functional components of the genome may cause pathological conditions, such as imprinting diseases, non-syndromic forms of mental retardation and delayed physical development, and multi-factor diseases, including oncological, autoimmune, and others.

Genomic imprinting plays a key role in a range of ontogenetic stages. It is controlled by a complex of genes that are monoallelically expressed in certain tissues or cell types. Therefore, the role of imprinted genes in development processes represents the most investigated area of epigenetics. Modern methods of molecular genetic analysis allow us to determine complete and segmental chromosomal UPD, where imprinted genes can be located [235237], to conduct a whole-genome analysis of methylation [238, 239] or to study epitranscriptome and monoallelic expression, including in individual cells [15, 240]. These investigations reveal an increasing quantity of imprinted genes and DMRs. The next stage will clarify the role of these genes in ontogenesis and the contribution of methylation pathology to phenotypic manifestations. It is reasonable to assume that the cause of a number of syndromic conditions and non-specific forms of intrauterine/postnatal delayed development, where a molecular defect is still unknown, are epigenetic disorders represented by abnormal DNA methylation, impaired chromatin structure, and changes in lncRNA expression. In addition, one needs to take into account structural disorders of the genes involved in the establishment and maintenance of monoallelic gene expression both in germline cells and at the early stages of embryo development. Mutations in such genes may lead to MLID, and this, in turn, will require the elaboration of diagnostic protocols to determine molecular genetic pathology and prevent phenotypic disturbances in the offspring. Modern achievements in molecular biology and genetics inspire optimism that in the next decade, regularities of the functioning and interaction between the genome and epigenome will allow not only conduction of high-tech molecular genetic diagnostics of diseases based on epigenetic regulation disorders, but also development of effective ways to treat such diseases.