INTRODUCTION

Repeated DNA sequences constitute a significant part of the eukaryotic genome, causing the C-value paradox phenomenon: when the number of transcribed sequences of structural and regulatory genes does not match the amount of DNA in the haploid set [1]. In the human genome, the share of duplicate elements accounts for more than two-thirds of the total DNA [2]. One example of this type of DNA is satellite DNA (satDNA), whose monomers form long arrays of tandem repeats ranging from hundreds or thousands of copies to several million base pairs (bp) in the genomes. The share of satDNA in the human genome is at least 10% [3].

Human satDNA is represented by various classes of tandem repeats that differ both in the length of monomers and in their enrichment in AT and CG base pairs. For example, α-satDNA and satDNA I are the AT-rich fraction of the genome, while β-satDNA is predominantly CG-rich [4]. Classical satellites II and III include the AT and CG pairs of DNA bases [5]. SatDNAs I, II, and III have a repeating sequence length of 5–20 bp, while α-, β-, and γ-satDNAs type are 171, 68, and 220 bp, respectively [5, 6].

SatDNA has a specific location in the human karyotype. For example, centromeric regions consist of α-satDNA tandem repeats with a monomer length of about 171 bp, which are grouped into higher order repeats (HORs) [5, 6]. Separate monomers of satDNA in the composition of a HORs of nonhomologous chromosomes may differ, but within the centromere of a single chromosome, they have a very high level of homology, which is 95–98% [7]. They form extended areas in regions of constitutive heterochromatin. The length of high order repeats of α-satDNA can vary from 0.5 to 5 million bp, and the size of the centromeric unit of satDNA of individual chromosomes of the set may differ in various individuals by an order of magnitude [5].

The pericentromeric regions of chromosomes flanking the centromeres are characterized by lower levels of homogeneity of high order repeating elements. Their α-satDNA monomers are interspersed with other sequences and repeating elements [8]. In addition to the mobile elements of the genome from the LINE and SINE families, the pericentromeric regions of individual chromosomes are characterized by the presence of classical satellites in them, as well as β- and γ-satDNA [1, 6, 9].

Chromatin of the pericentromeric regions of most cell types has typical features of constitutive heterochromatin with a high degree of DNA methylation and trimethylation of histone H3 for lysine in position 9 [10], as well as interaction with nonhistone protein HP-1 (heterochromatin protein 1), a marker of repressed chromatin [11].

Centromeric chromatin has a unique epigenetic status. In addition to the presence of proteins characteristic of kinetochore formation, such as CENP-A, the DNA of the centromere region itself is hypomethy-lated [12]. In nucleosomes containing CENP-A, histone H4 is monomethylated for lysine in position 4 [13], and in nucleosomes carrying histone H3, it is additionally dimethylated for lysines in positions 9 and 27 [14]. In addition, in the human karyotype, α‑satDNA of centromeres of all chromosomes except for chromosome Y contains the CENP-B box: a motif of 17 base pairs that binds the CENP-B protein and participates in nonrandom phasing of nucleosomes in centromere regions [9, 15].

SatDNA is involved in the occurrence of structural rearrangements both in meiosis during the formation of hereditary information of gametes and in the transfer among a number of somatic cells of the body [5, 16]. Structural adjustments in the human karyotype can be divided into categories: balanced and unbalanced. In balanced rearrangements, in contrast to unbalanced ones, no loss or acquisition of genetic material is observed [17]. Although satDNA consists of tandem repeats and does not contain genes whose structure could be changed as a result of structural rearrangements, it has important structural and functional characteristics, the disturbance of which may have a phenotypic manifestation.

Thus, satDNA participates in the formation of kinetochore and proper segregation of chromosomes [9], encodes regulatory molecules [11, 18], determines the topology of the nucleus [19], secures fixed positioning of centromeres on chromosomes [20], and participates in the evolution of karyotypes [21, 22]. In addition, the movement of a chromosomal locus as a result of rearrangements into the satDNA region can lead to a change in the activity of genes located in that locus because of the “position effect” [23]. Disturbed topology at the chromatin stacking level in the nucleus can also affect the differential activity of genes by changing the position relative to regulatory elements [24]. In addition, owing to the homology of the nucleotide sequences of satDNA of a number of chromosomes, synapsis and segregation of nonhomologous chromosomes may become disturbed, and also nonreciprocal exchanges in meiotic division may arise [25].

MECHANISMS CAUSING STRUCTURAL CHROMOSOME REARRANGEMENTS WITH PARTICIPATION OF SATELLITE DNA

At present, it is known that physical chromosome breaks often occur precisely in satDNA of centromeric and pericentromeric chromosome regions [15]. This is partly due to the nature of its forming units: repeating DNA sequences help slow down or stop the replication fork, which can cause the formation of double-strand breaks in these areas [26]. It was shown that normally up to 40 double-strand breaks occur in cells during replication [27], which are restored by homologous recombination during synthetic and postsynthetic periods of the cell cycle [28]. Such recombination may jeopardize the stability of the genome, since it “allows” genetic exchange between homologous repetitive sequences distributed across the genome, which will provoke structural chromosomal rearrangements [29].

For example, one of the most common unbalanced translocations without phenotypic manifestation (cytogenetically visible copy number variations) is a rearrangement between satDNA III of the distal heterochromatin band of Y chromosome and short arms of acrocentric chromosomes, mainly chromosomes 15 and 22 [6, 30]. Owing to the homology of the satDNA III sequence constituting these regions, the sexual bivalent is associated with the short arms of acrocentric chromosomes and moves from the periphery to the central part of the nucleus at the pachytene stage of the prophase of the first division of meiosis, which disturbs the synapsis and further chromosome segregation [6, 25, 31]. Such an association may result in double-strand breaks of satDNA with the subsequent formation of a derived autosome containing the satDNA material of Y chromosome.

Another example of rearrangements involving satDNA are balanced Robertsonian translocations, which involve long arms of nonhomologous or (less commonly) homologous acrocentric chromosomes, with double-strand breaks and their recovery in centromeric or pericentromeric regions [21, 29, 32, 33].

Double-strand breaks, leading to abnormal (transverse) centromere separation, can simultaneously cause reduplication of one of the chromosome arms and the formation of an isochromosome [29, 32]. Recoupling of double-stranded breaks in the pericentromeric regions of sister chromatids (isochromatid break) by the U-type can lead to the formation of a dicentric isochromosome [29, 32].

At the same time, it was found that the very blocks of highly repetitive satDNA regions arise owing to recombination and repair of double-strand breaks [34]. Moreover, the features of the organization of pericentromeric regions of different chromosomes indicate that the formation of heterochromatic blocks is predominated by interchromosomal exchanges [35]. Slowing the replication fork can also lead to slipping of the polymerase and rereplication of individual DNA sections [36]. Normally, there are several parallel systems in cells that prevent polymerase slipping; however, such events may happen in the case of spontaneous disruptions of the cell cycle [37]. This leads to an increase in the copy number of satDNA monomers, which may not have a phenotypic manifestation and not be subjected to elimination.

An increase in the copy number of the repeating α‑satDNA elements contributes to stronger binding of the centromere to the spindle apparatus during meiosis [38]. An important role in maintaining the stability of α-satDNA belongs to the CENP-A protein, which not only is responsible for the proper formation of the kinetochore and the attachment of spindle apparatus microtubules [39] but also suppresses recombination in centromere regions of satDNA in human proliferating cells [20]. This makes it possible to distance hot recombination points from centromeres and prevent unwanted exchanges during cell divisions [20, 40].

At the same time, one of the factors that increase instability in the centromere regions of chromosomes is the peculiarity of the spatial organization of DNA of these regions, which is characterized by the formation of complex secondary and tertiary packing and the presence of regions with noncanonical DNA helix and hairpins [26, 41]. In addition to higher torsion stress, which provokes active introduction of double-strand breaks followed by the need to repair them, melting the DNA duplex in such areas requires a large amount of energy, which hinders the passage of the replication fork and causes it to stop and accumulate replication stress factors in centromere regions [26, 42, 43]. Thus, on one hand, repetitive satDNA blocks in the pericentromeric chromosome regions stabilize centromeres and, on the other hand, increase genome instability and contribute to the evolution of karyotypes [21, 22].

An increase in satDNA arrays requires maintaining their epigenetic status, while its disruption leads to chromosome fragility [44, 45]. Thus, disturbed methylation of the pericentromeric satellite II on human chromosomes 1 and 16 and satellite III of chromosome 9 [46] leads to the fragility of these chromosomes and the development of a ICF syndrome (Immunodeficiency, Centromeric Instability, Facial Anomalies), which is also characterized by severe mental retardation [47]. Cytogenetically, the ICF syndrome is manifested by decondensation of the pericentromeric heterochromatin regions of these chromosomes, associations in the centromeric regions on metaphase chromosomes of lymphocytes with the formation of radial structures, and by deletions with breaks in the pericentromeric satDNA [48]. It has been found that the development of this syndrome can be triggered by mutations in different genes, but all of them are directly or indirectly associated with maintaining the level of DNA methylation [4951]. The exact mechanism leading to chromosome fragility in the areas of satDNA in the case of disturbed methylation has not been fully studied. However, indirectly, it can be caused by dysregulation of transcription in these regions or activation of mobile elements present in pericentromeric chromatin blocks, which will destabilize the centromeres [52, 53]. In addition, the enrichment in mobile elements with the monomers being located in the opposite direction can also provoke chromosomal rearrangements [54].

TRANSCRIPTIONAL ACTIVITY OF SATELLITE DNA IN THE PRESENCE OF STRUCTURAL REARRANGEMENTS IN KARYOTYPE

On the basis of the fact that the centromeric and pericentromeric regions of chromosomes lack protein-coding genes, it was historically believed that these regions are transcriptionally inert [15]. However, at present, satDNA transcription is shown for many organisms and cell lines [3, 5557].

A large number of chromosomal rearrangements detected in karyotypes of hematological and solid tumors [16, 58], balanced and unbalanced rearrangements without phenotypic manifestation [6, 17, 59], and chromosome variants [6] involve the satDNA of centromeric and pericentromeric chromosome regions, which may disturb the transcriptional status of satDNA of these chromosomal regions [60].

Thus, the activation of transcription of satDNA II and α-satDNA is observed in a number of tumors, where overexpression of satellite II is specific namely to tumor cells [61]. It should be noted that the sequence of satDNA II of chromosome 1 (region 1q12) is one of the frequent break points in rearrangements in hematological tumors [16]. In addition, such rearrangements can promote the effect of heterochromatinization of the genes that should normally be transcribed or, conversely, promote the transcription of the genes that as a result of the rearrangement are distant from constitutive heterochromatin [58].

In general, it should be noted that aberrant transcription of classical satellites I, II, and III, as well as α-satDNA, is observed in many types of tumors [60, 62, 63] regardless of the involvement of satDNA regions in the rearrangement, which, apparently, is related to global genetic and epigenetic disorders of the genome of these cells [60, 64].

Thus, mutations or knockouts of the tumor suppressor genes KDM2A and BRCA1 lead to disruption of epigenetic modifications of histones in the centromeric and pericentromeric chromosome regions and their interaction with the nonhistone protein HP-1, which contributes to aberrant activation of satDNA transcription and leads to instability of the genome [60, 64].

In addition, it is assumed that the α-satDNA transcription hyperactivation of centromere repeats may lead to a decrease in the content or delocalization of the CENP-A protein [15, 20], which, in turn, may contribute to an increase in the number of chromosomal rearrangements and aneuploidy because of disturbed attachment of the spindle apparatus.

Hypomethylation of satDNA and especially the satellite II is also one of the characteristic epigenetic features of centromeric and pericentromeric chromosome regions in tumor cells [63]. However, it is known that satDNA transcription occurs regardless of the methylation status [65]. For example, unlike most tumors, no aberrant transcription of satDNA II was found in lymphocytes from patients with the ICF syndrome [66], in which this satellite is also hypomethylated.

A special place among chromosomal rearrangements belongs to the above-mentioned translocations between satDNA III of Y chromosome and autosomes, which have the population frequency of 1 : 2000 [59]. The carriers of such rearrangements are phenotypically normal but may have reproductive problems associated with impaired synaptonemal complex formation and chromosome segregation, as well as a possible risk of developing genomic imprinting diseases [6, 17, 25, 30]. It has been suggested that women inheriting, together with the father’s gamete, a derivative autosome containing satDNA III translocated from Y chromosome may be associated with higher risk of ovarian malignancy [67, 68]. Since heterochromatic regions do not contain genes, the pathological effect of such a rearrangement is not entirely clear. It is an open question whether this may be due to the transcription of satDNA, which is activated in many tumors, or its aberrant level. However, the latter is supported by the detection of satDNA transcription in the pericentromeric regions of a number of chromosomes and satDNA III of the Yq12 band, in particular, in the developing tissues of human testes occurring during normal differentiation [62, 69].

Understanding all aspects of satDNA transcription, namely, its strand- and stage specificity, a set of transcribed satDNA classes, changes in their transcription level, and the role of the formed noncoding RNAs, is also of interest from the point of view of the significance of polymorphic variants of heterochromatin blocks, the most frequent of which is a structural rearrangement of chromosome 9, a pericentromeric heterochromatin inversion [6]. Multiple studies do not provide clear answers to questions about the association of polymorphic variants of chromosomes, including changes in the copy number and location of satDNA of heterochromatic blocks, with disturbed reproductive function and clinical or phenotypic manifestations [32]. Despite data on the transcriptional activity of satDNA in human cells [55, 56], including embryonic and extraembryonic tissues [3, 70, 71], the transcriptional status or change in the level of satDNA transcription in the group of carriers of polymorphic variants remains unexplored.

CONCLUSIONS

Structural rearrangements are changes in the chromosome structure, which lead to the disturbed position of loci on chromosomes with or without loss of genetic material. A huge number of combinations of chromosomes involved in the formation of a particular type of rearrangement, as well as break points on the short or long arms of these chromosomes, determine the uniqueness of each chromosome rearrangement [17].

In view of the structural and functional significance of satDNA, structural rearrangements whose formation involves regions of constitutive heterochromatin of chromosomes deserve special attention. Understanding the mechanisms causing structural chromosomal rearrangements involving satDNA has been hampered for a long time by the lack of information about their organization and functioning. Despite the fact that in 2003 the Human Genome international project annotated the completion of the assembly of the reference human genome [72], it actually did not include more than 10% of the human genome [15]. Missing sequences in the assembly were mainly tandem repeats, including regions of nucleolar organizers, as well as satDNA of constitutive heterochromatin [73]. The use of new methods of sequencing and bioinformatics analysis made it possible to recreate the linear structure of repetitive elements for some chromosomes, for example, satDNA of the centromere region of the human Y chromosome [5, 74, 75]. SatDNA transcription is currently a proven fact for both normal and tumor human cells. A change in the transcriptional status of satDNA, and satDNA II, in particular, is a prognostic sign of a number of tumors [60, 64].

The data of current studies indicate the functional significance of satDNA, which normally plays an important role in maintaining the structural and functional integrity of the karyotype. The development of the genome and transcriptome research methods opens the way to further study of the organization and functioning of human satDNA and also contributes to the revision of the long-standing view about this part of the genome as “junk DNA.”