Keywords

1 Introduction

Genomic circular DNAs are frequent macromolecules among simple organisms, from small prokaryotic plasmids to the larger genomes of many bacteriophages or viruses, bacteria, archaea and plastids. On the other hand, circular RNAs have been regarded as very rare nucleic acids in biology till very recently. Now we know that numerous life beings express stable circRNAs [1], and among them, it is noteworthy the recent discovery of a myriad of splicing-derived circRNAs in eukaryotes [2,3,4] with diverse functions in regulation of splicing [5] and transcription [6], small RNAs biology [7], RNA-mediated inheritance and epigenetics [8] and some others, as described in this book. However, it has been recently reported that eukaryotes have a second natural pathway that allows the expression in vivo of abundant circular RNAs [9, 10]. This alternative mechanism does not require any classical spliceosome reaction but the involvement of small self-cleaving RNAs or ribozymes called hammerhead ribozymes (HHRs) [11].

The finding of catalytic RNAs or ribozymes more than 30 years ago [12, 13] propelled the revolution in the RNA field and started with the uninterrupted discovery of the many different roles and capabilities of this macromolecule in biology. Moreover, the ground-breaking discovery of ribozymes strongly supported the hypothesis of the prebiotic RNA world [14], where RNAs carried out both informative (RNA genomes) and catalytic (ribozymes) roles. Somehow, it is thought that these primal RNA molecules would have evolved to present organisms based in DNA and proteins as the genetic material and catalytic machines, respectively [15,16,17]. Proofs supporting this hypothesis are the existence of RNA genomes among the simplest organisms (such as RNA viruses and viroids), as well as catalytic and regulatory ribo-functions among all living beings, where RNA itself is the final molecule in charge of the activity. A remarkable example of a catalytic RNA would be the central machine of life, the ribosome [18], which is the universal ribozyme that catalyses the peptide bond formation during protein synthesis in all known living entities. This fact allows to connect DNA and proteins through a catalytic RNA, which offers a solution to the chicken or the egg (or more precisely, the DNA or the protein) causality dilemma. Other key ribozymes and regulatory RNAs considered as ancient relics of the prebiotic RNA world would be the autocatalytic introns [12] and small ribozymes [19], the RNase P [13], the spliceosome [20], the riboswitches [21], most non-coding RNAs (such as those small RNA guides found in the CRISPR [22] and the RNAi [23] pathways) or even the circRNAs described in this book, which altogether confirm the extraordinary potential of any RNA molecule present in a living organism.

2 The Family of Small Self-Cleaving RNAs and the Singular Case of the Hammerhead Ribozyme

Among the simplest ribozymes so far described, it can be highlighted the enigmatic group of small (50–200 nt) self-cleaving RNAs, which all catalyse a sequence-specific intramolecular reaction of transesterification. This reaction starts by a nucleophilic attack of the 2′ oxygen to the adjacent 3′ phosphate, resulting in cleavage of the phosphodiester bond to form two RNA products with a 5′-hydroxyl and a 2′,3′-cyclic phosphate ends each (Fig. 5.1a). The family of small self-cleaving ribozymes is composed so far by nine different classes: hammerhead (HHR) [24, 25], hairpin (HPR) [26], human hepatitis-δ (HDV) [27], Varkud satellite (VS) [28], GlmS [29], twister [30], twister sister, hatchet and pistol [31] ribozymes. The HHR was the first discovered and one of the best known members of the family of small self-cleaving ribozymes. It is composed of a conserved catalytic core of 15 nucleotides surrounded by three double helixes (I to III), which adopt a γ-shaped fold where helix I interacts with helix II through tertiary interactions required for efficient in vivo activity (Fig. 5.1b) [32,33,34]. There are three possible circularly permuted topologies for the HHR, named type I, type II or type III, depending on the open-ended helix (Fig. 5.1c). The HHR were first found encoded in the small circRNA genomes of a group of infectious subviral agents of plants, such as viral RNA satellites and viroids [24, 25], where it catalyses a self-cleavage transesterification reaction required for the rolling-circle replication of these pathogens. Surprisingly, few other examples of HHR motifs were also found encoded in the genomes of some unrelated eukaryotes such as newts, trematodes or even some mammals, among others [35,36,37,38,39]. In 2010, different labs reported the widespread occurrence of HHR motifs in prokaryotic and eukaryotic genomes [40,41,42,43], including our own genome [44], which confirmed that the HHR was a ubiquitous catalytic RNA motif in all life kingdoms [45, 46]. Interestingly, the occurrence of genomic HHR motifs along the tree of life seems to follow a kind of structural or functional compartmentalization. This way, anyone of the three topologies of the HHR motif (types I, II and III, Fig. 5.1c) can be frequently detected in the genomes of prokaryotes and bacteriophages. However, metazoan genomes mostly show type-I HHR motifs, whereas plant genomes, as well as their subviral agents, almost exclusively show the presence of type-III HHR motifs.

Fig. 5.1
figure 1

RNA self-cleavage by the hammerhead ribozyme (a) Mechanism of internal transesterification in the RNA. The cleavage reaction starts with an attack of the 2′ hydroxyl to the 3′ phosphate, followed by a bipyramidal transition state. The cleavage products are a 2′,3′-cyclic phosphate at the 5′ RNA product and a 5′-hydroxyl at the 3′ RNA product. (b) Classic two (left)- and three (right)-dimensional diagrams of the hammerhead ribozyme motif. Black boxes indicate the highly conserved nucleotides (in white letters) at the catalytic core. (c) Representation of the three possible hammerhead ribozyme topologies (types I, II and III). Dotted and continuous lines refer to non-canonical and Watson-Crick base pairs, respectively. The three topologies have been reported in the genomes of bacteriophages and prokaryotes. Type-I hammerheads are mostly found in metazoan genomes, whereas typical type-III motifs are found in the plants and their infectious circRNAs (viroidal RNAs). N stands for any nucleotide, whereas R stands for purines (A or G), Y for pyrimidines (U or C) and H for either A, U or C

Other small self-cleaving RNA motifs have been also found widespread in DNA genomes, such as HDV [47] or twister ribozymes [30], which confirms that small catalytic RNAs would be more frequent than previously thought. Although the precise biological roles of all these genomic self-cleaving ribozymes are still under study, a direct connection with retrotransposons and other mobile genetic elements has been reported for most of them [10, 48,49,50,51].

3 Hammerhead Ribozymes in Plant Genomes Promote circRNA Expression: The Retrozymes

Two examples of type-III HHR motifs were originally reported in the genome of A. thaliana [35]. Numerous copies of this ribozyme were also detected in the genomes of diverse flowering plants [40]. In many instances, these HHRs have been found as tandem repeats of several motifs (usually two or three) separated by a few hundred base pairs. These observations have been recently extended in our lab, and we have reported the occurrence of hundreds of type-III HHRs in more than 40 plant species [10]. Comparative genomics revealed that sequences flanked by tandem HHR motifs sized from 600 to 1000 bp with almost no identity. However, these genomic repetitive elements show a similar topology: they are delimited by 4 bp target site duplications (TSDs), whereas HHRs are embedded in direct long terminal repeats (LTRs) of ~350 bp. LTRs delimit a central region (~300–700 bp), which begins with the primer binding site (PBS, corresponding to the tRNAMet) and finishes with the polypurine tract (PPT) sequences characteristic of LTR retrotransposons [52] (Fig. 5.2a). Altogether, these elements were classified as a new family of nonautonomous retrotransposons with hammerhead ribozymes (so-called retrozymes) similar to other nonautonomous retroelements of plants like TRIMs [53] and SMARTs [54]. Most likely, autonomous retrotransposons of the Ty3-Gypsy family would mobilize the small retrozymes based on the sequence similarities (PBS and 5′ and 3′ LTR ends) between both types of retroelements.

Fig. 5.2
figure 2

Genomic plant retrozymes (a) Schematic representation (top) of a full genomic retrozyme element of plants. Target side duplications (TSDs) delimiting the retrozyme are shown in grey boxes. Long terminal repeats (LTRs) are shown in black boxes. The positions of the primer binding site (PBS), the polypurine tract (PPT), the hammerhead ribozymes (HHR) and the typical sizes encompassed by the ribozymes are indicated. The resulting self-cleaved retrozyme RNA after transcription (middle) and circularization (bottom) is indicated. (b) An example of a northern blot analysis of RNA extracts (~30 μg each) from physic nut (Jatropha curcas) leaves, young seedlings and seeds. Samples were run on a 5% denaturing PAGE and were detected using a digoxigenin-labelled J. curcas retrozyme fragment as a probe, which revealed the presence of both circular and linear RNA forms in each plant tissue as indicated at the right. Ethidium bromide staining of the 5S rRNA is shown at the bottom as a loading control

Northern blot analysis and RT-PCR experiments of diverse somatic and reproductive tissues from several plant species, such as physic nut, strawberry, eucalyptus or citrus plants, revealed the presence of high levels of circular and linear RNAs (up to 1 ng per μg of total RNA) of the precise size encompassed by the HHR motifs (Fig. 5.2b), which strongly indicates an RNA self-processing activity by the ribozymes during in vivo transcription followed by RNA circularization. Although sequence identity between retrozymes from non-related plant species is very low, secondary structure predictions for these circRNAs show similar architecture and high stability (Fig. 5.3a). These structured circRNAs with type-III hammerhead ribozymes highly resemble those infectious circRNAs of plants, such as viral satellite RNAs and viroids (Fig. 5.3b and c), which indicates a clear evolutionary relationship between all of them [9].

Fig. 5.3
figure 3

Minimum free energy secondary structure predictions for (a) a retrozyme circRNA of Jatropha curcas (Entry KX273075.1), (b) the Nepovirus satellite RNA sTRSV (Entry M14879.1) and (c) the viroid CChMVd (Entry AJ878085.1). HHR sequences are shown in purple (positive polarity) and green (negative polarity). The corresponding structure of the HHRs motifs are shown under each circRNA structure, and dotted lines indicate predicted tertiary interactions between HHR loops based on previous models [60, 61]. Self-cleavage sites are indicated with arrowheads. Kissing-loop interactions described for CChMVd [62] are shown. Numbering for each circRNA starts at the self-cleavage site

4 Tandem Copies of Hammerhead Ribozymes in Metazoan Genomes

Previous bioinformatic searches in metazoan genomes have also revealed the widespread occurrence of the HHR motif in animals [10, 40, 45]. As observed in plants, these ribozymes are usually found in close tandem copies, suggesting that genomic retrozymes in animals may express similar circRNA molecules with HHRs, which would also accumulate in metazoan transcriptomes. However, several differences can be highlighted between plant and animal retrozymes. On the one hand, none of the characteristic sequences of plant retrozymes, such as LTRs, PBS or PPT, are present in their animal counterparts. Moreover, whereas plant retrozymes only show a few HHR motifs (usually, just two copies per retrozyme) of the type III, ribozymes in metazoan retrozymes occur as many copies (dozens to even hundreds) of type-I HHR motifs (Fig. 5.4). These type-I HHRs not only show a characteristic set of tertiary interactions but a very short or even no helix III at all, which indicates that these ribozymes may require in many instances the adoption of dimeric HHR conformations to self-cleave efficiently [55, 24]. The minimal type-I HHRs of metazoan retrozymes highly resemble those described in the pseudo-LTRs of the autonomous Penelope-like retroelements (PLEs) of metazoans and other eukaryotes [48], which somehow links these two families of retrotransposons. On the other hand, animal retrozymes are composed of smaller minimal repeats in tandem (150–300 bp), indicating that the expected animal circRNAs are also smaller than those described in plants. These repeats, however, are frequently, but not always, flanked by TSDs as well, although these are slightly larger (8–12 bp) than those found in plant retrozymes (4 bp) [9, 10].

Fig. 5.4
figure 4

Metazoan retrozymes

Schematic representation (top) of a typical genomic retrozyme element present in metazoan genomes.

Target side duplications (TSDs) delimiting the retrozyme are shown in grey boxes. Tandem repeats of around 300 bp are indicated with arrows. Typical type-I HHRs are shown. Minimum free energy secondary structure prediction of three examples of circRNAs derived from metazoan retrozymes (rotifers, corals and arthropods) are shown (bottom). HHR sequences in the circRNAs are shown in purple letters, and the self-cleavage sites are indicated with arrowheads. Numbering starts after the HHR self-cleavage site

Recent analysis done in our lab with diverse retrozyme-containing metazoans has confirmed that, as suspected, these organisms accumulate abundant circRNAs in most of the analysed tissues, in a similar way as described for plants (De la Peña and Cervera, to be published). Altogether, the resulting landscape offered by genomic HHRs (either type I or III) in eukaryotes indicates that close tandem copies of this ribozyme allows the expression of small circRNAs (100–1000 nt). Although the presence of other small self-cleaving ribozymes in eukaryotic genomes have been described, such as HDV and twister ribozymes [30, 47], the characteristic occurrence in close tandem repeats seems to be exclusively restricted to the case of the hammerhead ribozyme.

5 A Proposed Mechanism for the Expression and Spreading of Eukaryotic circRNAs with Hammerhead Ribozymes

Retrozymes are a new and atypical group of nonautonomous eukaryotic retroelements with self-cleaving hammerhead ribozymes. In plants, genomic retrozymes resemble other small nonautonomous LTR retrotransposons such as TRIMs [53] and SMARTs [54]. As nonautonomous retrotransposons, retrozymes do not show protein-coding regions but self-cleaving HHR motifs in their LTRs, which, most likely, are responsible of the accumulation in vivo of circular and linear RNAs of the precise size encompassed by the HHRs. Regarding the life cycle of retrozymes, the most plausible model would start with the transcription of the genomic retrozyme. Similar retroelements, such as TRIMs or autonomous LTR retrotransposons, are known to be generally transcribed by RNA Pol II, although examples of RNA Pol III-transcribed retrotransposons have been also reported [56]. Plant and metazoan retrozymes do not seem to contain any recognizable promoter, and in consequence, a feasible hypothesis could be that retrozymes may undergo Pol-driven (either I, II or III) read-through transcription depending on tissues and/or their genomic location. In any case, nascent RNA transcripts would follow co-transcriptional self-processing by tandem self-cleaving HHRs, producing linear RNAs with 5′-OH and 2′,3′-cyclic-phosphate ends. Whereas the step of self-cleavage is expected to occur with high efficiency for plant retrozymes carrying type-III HHRs, in the case of metazoan retrozymes, self-cleavage frequently requires the adoption of a dimeric conformation of minimal type-I HHRs, which is expected to be slightly less efficient than the monomeric version [55]. As summarized in Fig. 5.5, covalent circularization of the resulting self-cleaved RNAs through either the HHR itself or a host RNA ligase factor [57] would finish in stable circRNAs. As the most plausible model, these circRNAs are the final template for retrotranscription, whereas linear retrozyme RNAs would be intermediaries and/or by-products of the circRNAs. In the case of plants, circRNAs derived from LTR-like retrozymes could be primed by any cellular tRNAMet through their PBS motifs. Then, retrotranscriptases encoded by Ty3-Gypsy LTR retrotransposons would produce cDNAs of different lengths, thanks to the circular nature of the RNA template. In the case of metazoan retrozymes, retrotranscriptases encoded by non-LTR retrotransposon (such as PLEs or LINEs) would be responsible of carrying out this latter step of cDNA synthesis. Finally, the resulting cDNAs would be integrated in new genomic locations through the machinery of the autonomous retrotransposons (Fig. 5.5). A last question to be addressed is related to the very high levels of circRNAs with HHRs detected in most organisms analysed. Genomic retrozymes are frequently found as many copies (from dozens to thousands of repeats) within a given genome, which suggests that even low transcription activity would result in abundant levels of circRNAs. However, most of the obtained data indicates that only a few retrozyme copies would be transcriptionally active [10], which suggests that the higher stability of these structured circRNAs with ribozymes compared with linear RNAs would be the reason of their high levels of accumulation in vivo. Moreover, the presence of a high sequence heterogeneity observed for a population of circRNAs in a given organism, together with the presence of retrozyme RNAs of the negative polarity, also suggests the intriguing possibility of replication of the circRNAs through endogenous polymerases.

Fig. 5.5
figure 5

Model for the life cycle of retrozymes

A full genomic retrozyme containing at least two HHRs in tandem is transcribed (top), and the resulting RNA would self-process through the HHRs to give a linear RNA with 5′-OH and 2′,3′-cyclic phosphate ends. The linear RNA would be circularized through an RNA ligase activity, and the resulting circRNA(+) could be recognized for either endogenous RNA polymerases (replication cycles), other cell factors (new biological roles), or retrotranscriptases encoded by autonomous retrotransposons. In the latter case, the resulting cDNAs from retrotranscription of a circular RNA template would have different lengths depending on the processivity of the retrotranscriptase. In a final step, the machinery of the retrotransposon would integrate the retrozyme DNAs at new genomic loci

6 Functional and Biotechnological Applications of circRNAs with Ribozymes

Retrotransposons, and mobile genetic elements in general, constitute a major fraction of nuclear genomes of most eukaryotes. Historically, these genomic sequences have been regarded as junk DNA, but now we know that retroelements are major drivers of genome evolution with a role in shaping the genomes that they inhabit. In this regard, genomic retrozymes and their associated circRNAs would have similar evolutionary impact as any other retroelement. However, the atypically high accumulation levels of RNA circles encoded by genomic retrozymes in the transcriptomes of most eukaryotic tissues, either somatic or reproductive, suggest that other biological roles can be possible. In this regard, several genic circRNAs have been found to play a role as microRNA sponges [7], and a comparable role for retrozyme circRNAs would be feasible. Moreover, the highly structured circRNAs derived from genomic retrozymes are suitable to be recognized and processed by the RNAi machinery of the cell, and, consequently, these abundant circRNAs with ribozymes would be potential templates for the production of miRNA/siRNAs with specific regulatory roles in the biology of the organisms where they are expressed. At the same time, we already know many examples of co-option or domestication of the transposable elements by their hosts as adaptations to diverse problems [58]. Usually, these domestications are performed with transposon-derived proteins, but also small ribozymes such as 3′ UTR HHRs [39] and intronic HHRs [44] or HDV ribozymes [59] seem to be examples of retroelement domestication. Consequently, circRNAs with HHRs in some organisms could have been specifically co-opted to play precise functions, a possibility that should be studied in the future.

Regarding the biotechnological applications of tandem small ribozymes in the expression of circRNAs, it has to be pointed out that the mechanism of backsplicing described for the synthesis of most genic circRNAs seems to be a complex pathway, which still requires deeper study in order to fully understand and use for practical applications. In this regard, our current knowledge about small ribozymes allow us to design much easier approaches for the expression of circRNAs, which, moreover, could reach higher accumulation levels as observed for most eukaryotic retrozymes so far analysed. Moreover, in vitro synthesis of circRNAs through self-cleaving ribozymes may also offer a straightforward approach for the production and study of specific genic circRNAs from eukaryotes.

7 Conclusions and Future Prospects

Genomic retrozymes are a new family of eukaryotic retrotransposons, which spread through circRNAs with hammerhead ribozymes among plant and animal genomes [10]. In plants, retrozymes seem to be mostly restricted to eudicots, although the presence of putative retrozymes with tandem HHR copies were also detected in some monocots, primitive land plants (like the spikemoss Selaginella moellendorffii) and algae (such as Chlamydomonas reinhardtii) [10, 40]. However, most of these HHRs in the genomes of primitive plants are type-I motifs, which are more related to those found in metazoan than in angiosperm genomes. This observation would indicate that retrozymes in flowering plants may have a different origin, either due to a de novo origin in angiosperms by chance or through horizontal transfer from other organisms containing type-III HHRs such as bacteria [40,41,42]. In any case, all these data suggest that genome-encoded circRNAs with HHRs would be more frequent molecules in eukaryotic transcriptomes than previously thought. Moreover, circRNAs with type-III HHRs in plants allow to propose an evolutionary path for the origin of the small infectious circRNAs with HHRs of plants (viroids and viral satellite RNAs), which may come by chance from the abundant reservoirs of circRNAs present in plant transcriptomes [9]. In contrast, metazoan retrozymes with type-I HHRs seem to indicate that this HHR topology would be more efficient for circRNA expression in animals. Future in vivo experiments will help us to better understand this new tool in the knowledge of the biology and biotechnology of eukaryotic circRNAs.