Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

8.1 Satellite DNAs in Insects

Satellite DNAs (satDNAs) are the major DNA component of eukaryotic heterochromatin. These noncoding sequences constitute a considerable part of the genomic DNA in many insect species, which can reach over half of the genomic content (Davis and Wyatt 1989). Composed of tandemly reiterated arrays, usually millions of base pairs long, satDNA is located mainly in the pericentromeric and telomeric regions of chromosomes and are the major building elements of functional centromeres in many eukaryotes including insects (Ugarković 2009a).

The complexity of satDNA sequences varies between insect species. Some are very simple, composed of short repeats oriented in a head-to-tail fashion, such as ten Drosophila melanogaster satellites that are between 5 and 10 bp long (Lohe and Roberts 1988). The nucleotide sequences of ten simple repeats in D. melanogaster conforms to a formula (AAN)m(AN)n where N is any nucleotide. In addition to simple sequence monomers, D. melanogaster and other related Drosophila species share also a complex satellite of 359 bp, known as 1.688 satellite according to CsCl buoyant density. All D. melanogaster satellites are AT rich and, together, they make approximately 20% of the genome. Satellite monomers in insect can also exceed 1,000 bp, as revealed in some beetle species (Pons 2004). However, most of insect satDNAs fall into two size classes: one ranging from 140 to 190 bp and the other in the range of 300–400 bp (reviewed in Palomeque and Lorite (2008)). The total length of the satellite tandem arrays varies from less than 100 bp to over 100 Mb between species. Some satellites are restricted to particular chromosomes such as Drosophila 359 bp satellite that is located on X chromosome, while most of the insect satellites are spread on all chromosomes of particular species. Satellites present on the same chromosome can be organized into separate arrays or can exhibit interspersed type of organization characteristic for beetle species Tribolium madens (Durajlija Žinić et al. 2000). In many insect species, satDNAs encompass both centromeric and pericentromeric regions. However, satellite sequences in centromeric and pericentromeric regions are almost indistinguishable, even in the best studied Drosophila centromere (Sun et al. 1997). Moreover, due to the technical difficulties associated with the sequencing and assembling of highly repetitive regions, the detailed structure and organization of heterochromatic and centromeric regions for many insect species is not known.

satDNAs sometimes possess an additional level of organization, called higher order repeats (HORs), which are best described for α satellite of primates and are characteristic for human centromeres (Schueler et al. 2001). For example, the predominant form of α satellite on human chromosome 17 is a 2.7 kbp HOR unit consisting of 16 α satellite monomers (Waye and Willard 1986). In insects, however, HORs are not found very often and are usually in the form of complex dimers and trimers (Palomeque and Lorite 2008).

It has been demonstrated that related insect species share a set of common satDNAs, differentially amplified between species, so that changes in the number of copies produce a species-specific profile of satDNAs (Meštrović et al. 1998; Bruvo-Mađarić et al. 2007). According to the “library” hypothesis (Fry and Salser 1977), some of the satDNAs originally contained in the common ancestor could be amplified during speciation. In each of the descendants, usually one satDNA would exist as a major satellite sequence while the others would remain as low-copy-number satellites. It is proposed that satellite sequences residing within a library exhibit certain structural characteristics that enable them to confer a centromeric role and therefore could represent a source for the evolution of new centromere (Ugarković 2008, 2009b).

Satellite sequences are known to evolve fast, changing their nucleotide sequence and copy number by following a pattern known as concerted evolution. Changes among repeats create new variants, which are spread and homogenized within the genome by a variety of DNA turnover mechanisms, such as unequal crossing over, gene conversion, replication slippage, and rolling circle replication (Charlesworth et al. 1994). Variants are subsequently fixed within the reproductive group of organisms in a stochastic process known as molecular drive (Dover 2002). Taken together, recombinational mechanisms and molecular drive seem to be the major cause of high turnover of satDNA sequences, resulting in the significant sequence divergence of satDNAs and changes in copy number, even between closely related species (Ugarković and Plohl 2002).

8.2 Functional Elements Within satDNAs

Due to the homogenizing effects of concerted evolution, satDNAs usually display low internal sequence variability. However, comparison of monomer sequences of a given satDNA reveals that some monomer regions are more conserved while others show higher mutation rates (Borstnik et al. 1994; Romanova et al. 1996; Mravinac et al. 2004). Such a nonuniform rate of evolution along the sequence indicates the presence of selective pressure on satDNAs. It can be proposed that selection affects pericentromeric satellite repeats whose transcripts play a role in heterochromatin establishment through RNA interference mechanism (Volpe et al. 2002). In fission yeast, Schizosaccharomyces pombe, analysis of siRNAs involved in heterochromatin formation showed that they derive preferentially from the most conserved regions of repeats (Djupedal et al. 2009). This indicates that conservation is more probably due to functional constraints than to frequent events of homologous recombination causing sequence identity. Therefore, conserved regions found in other satDNAs could be functional in the sense that they represent a preferential source of siRNAs that recruit protein complexes responsible to heterochromatin formation.

Due to possible functional constraints on satDNAs, it is not surprising that some characteristics of satDNAs are shared between many eukaryotic organisms (Ugarković 2005). Probably the most common feature of satDNA is its intrinsic curvature. Satellite repeats are generally AT rich, and the periodical distribution of AT tracts causes DNA to bend into a super-helical tertiary structure (Fitzgerald et al. 1994). This sequence-dependent property is thought to be responsible for the tight packing of DNA and proteins in heterochromatin (Ugarković et al. 1992; Fitzgerald et al. 1994). Conserved CENP-B box-like motifs have been identified within satDNA of mammals and insects (Kipling and Warburton 1997; Lorite et al. 2002a; Mravinac et al. 2004). The CENP-B box is a 17 bp motif in human α-satDNA and a binding site for centromere protein B (CENP-B) (Masumoto et al. 1989) whose homologs have been found in many eukaryotes. Not every repeat of α satellite contains a functional CENP-B box, but they appear at regular intervals in human centromeres and seem to be essential for centromeric chromatin assembly (Ohzeki et al. 2002).

Given their relatively simple sequence and the lack of any significant open-reading frame, previously reported transcription of satDNA has been ascribed to read-through from upstream genes and transposable elements (Diaz et al. 1981; Wu et al. 1986; Gaubatz and Cutler 1990). However, promoter elements and transcription start sites as well as binding motifs for transcription factors have been mapped within some satellites. Putative internal promoters have been reported in the wasp Diadromus pulchellus (Renault et al. 1999) where motifs cognate to RNA Pol II and III are present within the satellite monomer sequence. In schistosome satDNA, which encodes an active ribozyme, a functional RNA Pol III promoter is present (Ferbeyre et al. 1998). The sequence of highly conserved satellite 2 found in distant families of salamanders shares structural and functional properties with the typical vertebrate small nuclear RNA (snRNA) promoter (Coats et al. 1994).

The Drosophila GAGA transcription factor that binds GA/CT-rich elements in promoters of many Drosophila genes and activates transcription by opening chromatin structure was found associated with heterochromatin throughout the cell cycle. It is proposed that GAGA factor directly interacts with a GA/CT-rich subset of satDNA repeats and modifies heterochromatin structure (Raff et al. 1994). Human satellite III has a binding motif for the heat-shock transcription factor 1, which drives its RNA Pol II-dependent transcription in stress conditions (Metz et al. 2004; Chap. 5 in this book). γ-satDNA, the abundant pericentromeric sequence of all murine chromosomes, contains conserved binding sites for ubiquitous transcription factor Yin Yang 1 (YY1) (Shestakova et al. 2004). YY1 belongs to the Polycomb group of proteins involved in gene regulation during development. It has been found to be associated with γ-satDNA in proliferating cells, but the association strongly diminishes during transition to the quiescent state (G0). It has been proposed that the interaction of YY1 with γ-satDNA could lead to the targeting of proteins required for heterochromatization or to the silencing of euchromatic genes by bringing them in close proximity to pericentromeric heterochromatin.

Even though some time ago satDNAs were considered useless evolutionary remnants, the functional significance of their sequences is becoming ever more clear. The existence of conserved motifs and structural properties as well as emerging evidence on their widespread transcriptional activity prompt us to reexamine these highly abundant eukaryotic sequences.

8.3 Preserved satDNA Repeats in Insects

Beetles of the family Tenebrionidae (Coleoptera ) represent convenient models for studying satDNAs. Located in the (peri)centromeric heterochromatin of virtually all chromosomes, satDNAs comprise up to half of the genome size in most of the coleopteran species analyzed so far (Ugarković and Plohl 2002). They are characterized by low intraspecific variability of the basic repeating units. However, their sequences differ dramatically between species, with some exceptions such as satDNAs from the genus Pimelia (Pons et al. 1997).

Members of the genus Palorus share a “library” of satDNAs – a collection of common satellite sequences, differentially amplified in each species. Besides the species-specific major satellite, each species contains several low-copy-number satellites, which account for approximately 0.05% of the genome and are dispersed over the whole heterochromatic block, interrupting large arrays of the major satellite. Shared satellites are extremely conserved with respect to sequence, monomer length, and tandem repeat organization (Meštrović et al. 1998). Satellites from some Palorus species are widely distributed even beyond the level of the genus, such as PRAT and PSUB, major satellites that comprise 40 and 20% of the respective genome size of P. ratzeburgii and P. subdepressus (Ugarković et al. 1992; Plohl et al. 1998). These two satellites share no sequence homology but have similar repeat lengths of 142 bp (PRAT) and 144 bp (PSUB). In both sequences, a motif homologous to the human CENP-B box has been identified (Mravinac et al. 2005). PRAT and PSUB are found in a low number of copies in species belonging to the related genera Tribolium, Tenebrio, Pimelia, and Latheticus, subfamily Pimelinae, and family Chrysomelidae (Mravinac et al. 2002, 2005). Despite the fact that some of these species diverged from the Palorus group up to 60 Myr ago, PRAT and PSUB sequences remained virtually unchanged, showing no species diagnostic mutations and retaining the nonrandom pattern of variability along the sequence. It is assumed that the extreme preservation of these two satellites over a long period is related to their functional significance (Mravinac et al. 2005; Ugarković 2005).

Ancient satDNAs up to 80 Myr old have also been reported in some species of fish (De la Herrán et al. 2001) and whales (Arnason et al. 1984), and their remarkable preservation is thought to be related to the low mutation rates generally observed in aquatic environments (De la Herrán et al. 2001). However, functional constraints have been implicated in the preservation of salamander satellite 2 sequence – its promoter activity and the self-cleavage ability of its transcripts have remained conserved for 200 Myr (Green et al. 1993).

8.4 Transcription of Insects satDNAs

Insects represent one of the most diverse groups of animals, accounting for more than half of all known living organisms. Despite this, they are poorly represented as model organisms for the study of satDNA function. With the present study, satDNA transcription has been described in species from only four orders: Hymenoptera, Orthoptera, Diptera, and Coleoptera (Table 8.1).

Table 8.1 Transcription of satDNAs in insects

8.4.1 Hymenoptera

Transcription of satDNAs seems to be a general phenomenon in Hymenopteran suborder Apocrita, which includes ants, wasps, and bees (Renault et al. 1999; Lorite et al. 2002b). Satellite expression rates vary significantly between sexes, and female sex-specific transcripts are detected. Also, the amount of satellite transcripts differs among the queen, the worker, and the male. The presence of specific transcription factors might influence different rates of satellite expression in males and females, which suggests that transcription is regulated in trans rather than by elements inside the satellite sequence itself. The wasp Diprion pini has a higher satDNA expression in females, despite lower satDNA sequence content compared to the male genome, indicating that transcription is not constitutive (Rouleux-Bonnin et al. 1996). Gender-specific satDNA transcription in hymenopteran species could be related to sexual differentiation at the chromatin level and it is proposed that long multimeric transcripts probably have a structural role (Rouleux-Bonnin et al. 2004).

In addition to gender-specific expression, satDNAs seem to be differentially transcribed during Hymenopteran insect development, displaying higher expression in earlier stages. In the wasp Diadromus pulchellus, single-stranded, nonpolyadenylated transcripts heterogeneous in size but with discrete bands of 1.9 kb and 0.6 kb were detected. satDNA seems to be differentially expressed during developments since the transcripts were found to be more abundant in embryos and larvae than adults (Renault et al. 1999). In addition, putative promoters and transcription initiation site were mapped within the monomer sequence. In the bumble bee Bombus terrestris, multimeric transcripts arise from one strand preferentially in embryos opposed to both strands in imagos (Rouleux-Bonnin et al. 2004). It seems therefore that satDNA transcription interferes with development as well as sexual and caste-differentiation in this insect order.

Stage-specific transcription associated with differentiation has also been observed in systems other than insects. The most abundant mouse γ satDNA is differentially expressed in cells of the developing central nervous system as well as in adult liver and testis (Rudert et al. 1995; Chap. 5 in this book). In chicken and zebrafish, transcription of alphoid repeat sequences displays a specific temporal and spatial expression pattern during embryogenesis (Li and Kirby 2003).

8.4.2 Diptera

In Diptera, transcription of satDNA has been reported in primary spermatocytes of D. melanogaster and D. hydei. (Trapitz et al. 1988; Bonaccorsi et al. 1990). Transcripts of simple satellite sequence AAGAC that are highly heterogeneous in size, ranging from less than 1 kb to 10s of kb, have been found on Y chromosome loops. They do not appear to migrate to cytoplasm and are degraded during the first meiotic prophase. It has been proposed that the transcripts probably act as fertility factors by providing a structural framework for accumulation of Y-encoded proteins involved in sperm differentiation.

In addition to the transcripts of simple sequence satellite AAGAC, transcripts of 1.688 D. melanogaster satellite with complex repeat unit of 359 bp were found in germinal tissues. Transcription proceeds from both DNA strands and is under the control of RNA interference machinery (Usakin et al. 2007). It is revealed that RNAi is necessary to maintain the silenced state of centromeric and pericentromeric 1.688 repeats located specifically on X chromosome. The heterochromatic locus on X chromosome that contains 1.688 satellite is responsible for hybrid female lethality in crosses between D. simulans females and D. melanogaster males (Ferree and Barbash 2009). It induces mitotic failure in early embryos due to the inability of 1.688 (359 bp) satellite block to form a proper heterochromatin state. Both Drosophila species share common satellites that differ in amount and location between the species, but 1.688 (359 bp) satellite is 50 times more abundant in D. melanogaster relative to D. simulans. It is proposed that hybrid female lethality occurs owing to the absence of the 1.688 satellite-derived small RNAs in the maternal cytoplasm that are required for heterochromatin establishment on 1.688 satellite array.

Transcription of complex telomeric repeats that are characteristic for chromosomal ends in Diptera was demonstrated in Chironomus thummi (Martínez-Guitarte et al. 2008). Transcripts are heterogeneous in length and correspond to multimers of the repeat. Moreover, transcription of telomeric repeats is not constitutive and is activated under conditions of environmental stress, such as heat shock.

8.4.3 Orthoptera

satDNAs were studied in the cave cricket genus Dolichopoda (Rhaphidophoridae). Three different satellite families were characterized in this species, among which pDo500 satellite is present in all species of the genus. The 500-bp satDNA family is actively expressed in the form of long multimeric transcripts, although the monomeric transcripts are also detected (Rojas et al. 2000). The transcripts act as ribozymes , as they have the ability to adopt hammerhead-like secondary structures and self-cleave in vitro. It is also possible that hammerhead sequences from the pDo500 satellite can trans-cleave host transcripts in the cells of Dolichopoda. The physiological role of these ribozymes is unknown, but it can be proposed that they may affect certain regulatory mechanisms in the cell. High sequence conservation of their corresponding satellites and active transcription suggests that they are under selective pressure (Rojas et al. 2000).

The hammerhead ribozyme structures associated with transcribed satDNA sequences have also been found in salamanders (Epstein and Gall 1987) and schistostomes (Ferbeyre et al. 1998). All hammerhead ribozymes detected in animal satDNA so far have been shown to self-cleave in cis long multimeric satellite transcripts into monomers.

8.4.4 Coleoptera

Within order Coleoptera, expression of satDNAs was investigated in species of genera Palorus and Tribolium that belong to family Tenebrionidae. Species of the two genera are characterized by the presence of large blocks of pericentromeric heterochromatin on all chromosomes and molecular analyses suggested that these blocks are composed almost exclusively of satDNAs that comprise up to 40% of the whole genome and encompass the regions of functional centromere (Ugarković et al. 1996; Durajlija Žinić et al. 2000).

In beetles Palorus ratzeburgii, Palorus subdepressus, and Tribolium castaneum, the major satDNAs called PRAT, PSUB, and TCAST, respectively, are continuously expressed during larval, pupal, and imago stages. The transcripts are of heterogeneous size, ranging from 0.5 kb to more than 5 kb, and originate from both strands of satDNA, albeit with a difference in expression between the two strands (Pezer and Ugarković 2009). Most of the transcripts are detected in the nucleus and are not polyadenylated. Although transcription from both DNA strands could potentially activate the RNA interference (RNAi) pathway, no processing of long PRAT and PSUB transcripts into small interfering RNAs (siRNA) was detected. However, small RNAs cognate to the major satellite TCAST (Ugarković et al. 1996) have been detected in the red flour beetle T. castaneum, (unpublished results). Small RNAs are more abundant in embryos than in later developmental stages, ranging in size between 21 and 26 nt with a predominant size of 24 nt. According to their size, these RNAs could be assigned to small interfering RNAs (siRNAs 21–23 nt) and piwiRNAs (piRNAs, 24–26 nt). piRNAs are characterized as the long class of siRNAs that bind to the Piwi clade of Argonaute proteins (Hamilton et al. 2002; Aravin et al. 2003). It is proposed that both types of small RNAs function as guide molecules during heterochromatin formation. The piwiRNAs, also known as repeat-associated RNAs ranging 23–26 nt in size, are most abundant in testes and early embryos, which may be related to dramatic changes in heterochromatin structure that occur in these stages. In addition to siRNAs and piRNAs, some components of the RNAi machinery have been identified in the sequenced genome of T. castaneum, such as Dicer and Argonaute protein families but not the RNA-dependent RNA polymerase (RdRP) gene (Tomoyasu et al. 2008). RdRP transcribes single-stranded RNA from an RNA template and is important for the production of siRNA as well as the amplification of the RNAi effect in fungi, protists, nematodes, and plants. However, it seems to be lacking in insects and vertebrates.

Multiple transcription initiation and termination sites as well as putative RNA Pol II promoter elements were mapped within PRAT and PSUB sequences. Overlapping promoter-like sequences on both DNA strands and the close position of transcription initiation sites suggest bidirectional activity of putative promoters. Presence of 5′ cap structure on portion of PRAT transcripts and susceptibility of transcription to inhibition of Pol II further confirm role of RNA Pol II in transcription of satDNAs. In addition, motifs similar to A and B boxes, associated with RNA Pol III transcription, are located in PSUB and PRAT satellites. Thus, Pol III or some other polymerase(s) might be responsible for the production of the main nonpolyadenylated fraction of transcripts. Involvement of different RNA polymerases in the production of siRNAs was demonstrated in plants. Noncoding transcripts generated by RNA Pol II in Arabidopsis act as a scaffold for the recruitment of two other polymerases, Pol IV and Pol V, which seem to be important for the production of siRNAs (Zheng et al. 2009). The interplay of all three polymerases is required for siRNA-mediated transcriptional gene silencing in Arabidopsis.

The heterogeneous transcript size of satDNAs in beetles could be explained by the multiple transcription initiation and termination sites. Read-through transcription from the nearby gene promoters and transposable elements cannot be excluded either, although there is a strong indication that satDNAs are transcribed as autonomous transcription units from own promoters that reside within the satellite sequences (Pezer and Ugarković 2008, 2009). However, a smaller portion of PRAT and PSUB satDNA transcripts is polyadenylated and is found in the cytoplasm. Cytoplasmic localization and the presence of a polyA tail have been reported before for the satellite transcripts of various species. Polyadenylated transcripts of the G + C-rich satDNA of the Bermuda land crab are present in the cytoplasm of different tissues (Varadaraj and Skinner 1994). Satellite 2 is an abundant tandemly repeated sequence distributed in clusters throughout the genome of the newt Notophthalamus viridescens and is transcribed on lampbrush chromosomes. However, stable, strand-specific transcripts homologous to satellite 2 are present in the cytoplasm in a variety of different tissues (Epstein et al. 1986). Satellite III DNA is transcribed in response to stress in human cells, generating heterogeneous-sized RNAs that contain a polyA tail but remain in the nucleus (Valgardsdottir et al. 2005; see Chap. 5 in this book). In addition, many eukaryotic long ncRNAs that have regulatory roles are always polyadenylated (Amaral and Mattick 2008). For instance, the polyA tail is part of the mature Xist RNA, which mediates X chromosome inactivation in dosage compensation (Lucchesi et al. 2005; see Chap. 3 in this book)

In conclusion, expression of satDNAs in beetles is developmentally regulated and proceeds in the form of long, stable, nonpolyadenylated transcripts that remain mostly in the nucleus where they probably play a structural role in the organization of pericentromeric heterochromatin. A small portion of transcripts is exported to the cytoplasm where they perform an unknown role. In addition, long transcripts are processed into small RNAs, 21–26 nt long, that are proposed to function as guide molecules during heterochromatin formation.

8.5 Heterochromatin Formation in Drosophila: Role of Heterochromatic Transcripts

D. melanogaster heterochromatin is prominent in pericentromeric regions and is mostly comprised of satDNA and transposon elements (TE). As in fission yeast Schizosaccharomyce pombe, it is associated with histone H3 methylation on lysine 9 (H3K9) by the histone methylase Su(var)3-9 that enables recruitment of heterochromatin protein HP1 necessary to maintain and spread heterochromatic state (Ebert et al. 2006). It has been speculated for a long time whether an endogenous siRNA pathway, similar to those in S. pombe, is involved in the formation of heterochromatin in Drosophila. Small RNA molecules related to several types of repetitive DNA have been isolated from D. melanogaster (Aravin et al. 2003). These repeat-associated RNAs, 23–26 nt in size, are most abundant in testes and early embryos, which may be related to the regulation of transposon activity and the dramatic changes in heterochromatin structure that occur in these stages.

Examination and analysis of small RNA libraries obtained from different developmental stages of fly revealed the presence of TE-derived small RNAs in all stages: in early embryos, most of them correspond to 25 nt long piwiRNAs. They are formed in gonads from long transcripts of TEs and induce silencing of TEs through a feedback regulatory mechanism involving the Piwi subfamily of Argonaute proteins (Brennecke et al. 2007). In other developmental phases, 25 nt piRNAs are partially replaced by a population of 21 nt long RNAs that also derive from long TE transcripts. Due to the limitation of method of high throughput deep sequencing that is restricted to nontandemly repeated DNA, small RNAs that derive from satDNA were not systematically examined. However, siRNA deriving from 1.688 satellite have a size range between 19 and 28 nt and were detected in early embryos as well as in larvae (Aravin et al. 2003). It has been shown that a nuclear pool of TE-derived 21 nt long siRNAs is involved in heterochromatin formation in somatic cells of Drosophila and that components of the RNAi pathway participate in heterochromatin process (Fagegaltier et al. 2009). This implicates similarity between mechanisms of heterochromatin formation in S. pombe and Drosophila and points to the role of pericentromeric transcripts, either satDNA or transposon-derived, in heterochromatin formation. The possible mechanism by which repeat-derived siRNAs could promote heterochromatin formation in Drosophila is by tethering complementary nascent transcript of satDNAs and transposons and guiding chromatin modifiers, such as histone methylase Su(var)3-9, that induce H3K9 methylation. Identification of proteins that tether siRNAs to chromatin in Drosophila and other animals needs, however, to be elucidated.

In Drosophila, distinct heterochromatic loci are the source of primary piRNAs, which target a large number of transposons that are active in the germline and induce degradation of their transcripts. The mechanism of silencing is not well explained but includes Piwi proteins loaded with piRNAs that target and cleave RNA molecules. It is also proposed that piRNAs could promote chromatin modifications and, recently, a role for Rhino protein, one of the HP1-like proteins, in piRNA generation has been established (Klattenhoff et al. 2009). Rhino protein in germ cells of Drosophila replaces HP1 and seems to promote the expression of piRNAs and transposon silencing. It is suggested that piRNAs associated with Piwi protein target Rhino to transposon clusters and promote the production of additional piRNAs from the cluster. In this way, a link between the two major transposon defense pathways involving heterochromatin and RNA silencing mechanism exists (Klattenhoff et al. 2009). This could also represent another mechanism of heterochromatin formation specific for germline. It can be also proposed that numerous satDNA-derived piRNAs present in germ cells could also contribute to heterochromatin establishment and maintenance by similar mechanism that might include Piwi protein and HP1 germline analog Rhino.

8.6 Possible Regulatory Role of satDNAs and Their Transcripts

Although satellite repeats show remarkable restriction in their distribution along chromosomes to pericentromeric and subtelomeric heterochromatin, there are several exceptions involving minor amounts of satellite sequences present in euchromatin. Such examples of limited localization of satellite sequences in euchromatin involve a simple and a complex satellite. Eight tandem repeats of D. melanogaster satellite AATAC are found in front of the s38 chorion gene on X chromosome (Spradling et al. 1987). The 359-bp repeats of the 1.688 satellite, located predominantly in pericentromeric heterochromatin of X chromosome, are also found in other positions of the same chromosome (Tartof et al. 1984). In beetle T. castaneum 360 bp repeats of abundant centromeric and pericentromeric satellite TCAST are found dispersed in the vicinity of genes on all chromosomes (unpublished results). The discovery of short satellite segments interspersed among the genes in euchromatic portion of genomes suggest possible regulatory role of these sequences, since they are often source of regulatory elements such as promoters and/or transcription factors binding sites (Ugarković 2005, Fig. 8.1). Recently, a regulatory role of 32 bp satellite repeats located in the intron of the major histocompatibility complex gene (MHIIβ) of fish Salvelinus fontinalis, on MHIIβ gene expression was demonstrated (Croisetiere et al. 2010). The level of gene expression depends on temperature being higher at lower temperatures as well as on the length of satellite repeats: a longer satellite array induces reduced expression. Although the mechanism of cis-acting satellite gene regulation is not clear, there is evidence that temperature-sensitive satellites could play an important role in the gene regulation of the adaptive immune response.

Fig. 8.1
figure 1

Role of satDNAs and corresponding transcripts in the regulation of genes located in heterochromatin and euchromatin. Transcripts of tandemly repeated satellite repeats, located in (peri)centromeric regions, play a role in heterochromatin formation as well as in the regulation of the genes located in heterochromatin. Transcripts of satDNA repeats dispersed within the euchromatin could play a role in the regulation of the neighboring genes. Transcription of satellite repeats is temperature sensitive, and the role of transcripts in the environmental stress response is proposed

Influence of satDNAs and their transcripts on gene regulation could not refer only to genes located in euchromatin but also on heterochromatic genes (Fig. 8.1). It is known that important developmental genes are located in heterochromatin, as revealed for D. melanogaster, (Pimpinelli et al. 1985) and that the proximity of heterochromatin is an important regulatory requirement for their function (Dimitri et al. 2009). Heterochromatin is also involved in gene silencing, and this process is developmentally programmed in Drosophila and mammals (Lu et al. 1998). Heterochromatin formation in D. melanogaster is influenced by transcripts of satDNA elements and transposons present in heterochromatin (see Sect. 8.5 in this Chapter). On the other hand, insect development is very sensitive to changes in the environment, particularly temperature. With a lowering temperature, the length of the development period is prolonged, and at a critical temperature, development ceases altogether. It has been shown that in beetle T. castaneum, expression of satDNA is temperature sensitive during embryogenesis, being significantly decreased at low temperatures where development is stopped (unpublished results). It can be proposed that decrease of satDNA expression affects heterochromatin formation during embryogenesis and in this way influences activity of heterochromatin-localized developmental genes. Temperature-sensitive expression of heterochromatic satDNAs also indicates their involvement in the signaling mechanism responsible for insect development , differentiation, and stress response (Fig. 8.1).

8.7 Conclusion

satDNAs are major heterochromatin constituents in many insect species and are found to be transcribed during all developmental stages. Transcripts are heterogeneous in size ranging from long multimers to small interfering RNAs. Their role in heterochromatin establishment and regulation is proposed although the detailed molecular mechanism and proteins involved are not elucidated yet. The satDNA transcription is not constitutive but associated with development and differentiation and is actively regulated by environmental factors such as temperature. It is proposed that satDNAs play a role in regulation of genes positioned within heterochromatin as well as those located in the vicinity of satellite elements in euchromatin. Mechanism of gene regulation is not explained but could be related to the presence of active regulatory elements within satDNAs such as promoters and transcription factor binding sites as well as corresponding satellite transcripts. Further studies are needed in order to explain the complex role of satDNAs and their transcripts in the signaling mechanism responsible for insect development, differentiation, and stress response.