Abstract
While studies of the evolutionary histories of protein families are commonplace, little is known on noncoding RNAs beyond microRNAs and some snoRNAs. Here we investigate in detail the evolutionary history of the nine spliceosomal snRNA families (U1, U2, U4, U5, U6, U11, U12, U4atac, and U6atac) across the completely or partially sequenced genomes of metazoan animals. Representatives of the five major spliceosomal snRNAs were found in all genomes. None of the minor splicesomal snRNAs were detected in nematodes or in the shotgun traces of Oikopleura dioica, while in all other animal genomes at most one of them is missing. Although snRNAs are present in multiple copies in most genomes, distinguishable paralogue groups are not stable over long evolutionary times, although they appear independently in several clades. In general, animal snRNA secondary structures are highly conserved, albeit, in particular, U11 and U12 in insects exhibit dramatic variations. An analysis of genomic context of snRNAs reveals that they behave like mobile elements, exhibiting very little syntenic conservation.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
In most eukaryote lineages, introns are spliced out of protein-coding mRNAs by the spliceosome, a huge RNP complex consisting of about 200 proteins and five small noncoding RNAs (Nilsen 2003). These snRNAs exert crucial catalytic functions in the process (Valadkhan 2005, 2007; Valadkhan et al. 2007) in three distinct splicing machineries. The major spliceosome, containing the snRNAs U1, U2, U4, U5, and U6, is the dominant form in metazoans, plants, and fungi, and removes introns with GT-AG (as well as rarely AT-AC and GC-AG) boundaries. Another class of “noncanonical” introns with AT-AC (and rarely GT-AG (Sheth et al. 2006)) boundaries is excised by the minor spliceosome (Patel and Steitz 2003), which contains the snRNAs U11, U12, U4atac, U5, and U6atac. Just as the major spliceosome, the minor spliceosome is present across most eukaryotic lineages and traces back to an origin very early in the eukaryote evolution (Collins and Penny 2005; López et al. 2008; Lorkovíc et al. 2005; Russell et al. 2006). Recently it was found that the minor spliceosome can also act outside the nucleus and controls cell proliferation (König et al. 2007). Functional and structural differences of two spliceosomes are reviewed in Will and Lührmann (2005). In the third type of splicing, SL-trans-splicing, a “mini-exon” derived from the noncoding spliced-leader RNA (SL) is attached to each protein-coding exon. The corresponding spliceosomal complex requires the snRNAs U2, U4, U5, and U6, as well as an SL RNA (Hastings 2005). Due to the high sequence variation of the short SL RNAs and the patchy phylogenetic distribution of SL-trans-splicing, the evolutionary origin(s) of this mechanism, which is active at least in chordates, nematodes, cnidarians, euglenozoa, and kinetoplastids, is(are) still unclear.
Previous studies on the evolutionary origin of the spliceosomes have been performed based predominantly on homology of the most important spliceosomal proteins. Thus relatively little detail is known on the evolution of the snRNA sequences themselves beyond the homology of nine families of snRNAs across all eukaryotes studies so far (Collins and Penny 2005; Collins et al. 2004; Lorkovíc et al. 2005; Russell et al. 2006; Schneider et al. 2004; Shukla and Padgett 1999). This may come as a surprise since it has been known for more than a decade that at least all of the snRNAs of the major spliceosome appear in multiple copies and that these paralogues are differentially regulated in at least some species (see, Bhathal et al. 1995; Lo and Mount 1990; Morales et al. 1997; Sontheimer and Steitz 1992 ; Stefanovic et al. 1991). Very recently, however, some of these variants have been studied in more detail (see, Chen et al. 2005; Hinas et al. 2006; Kyriakopoulou et al. 2006; Pereira-Simon et al. 2004; Sierra-Montes et al. 2005; Smail et al. 2006 and references therein). The only systematic study that we are aware of is the recent comprehensive analysis of 11 insect genomes (Mount et al. 2007) which reported that phylogenetic gene trees of insect snRNAs do not provide clear support for discernible paralogue groups of U1 and/or U5 snRNAs that would correspond to the variants with tissue-specific expression patterns. Instead, the analysis supports a concerted mode of evolution and/or extreme purifying selection, a scenario previously described for snRNA evolution (Liao 1999; Liao and Weiner 1995; Nei and Rooney 2005).
In this contribution we extend the detailed analysis of the nine spliceosomal snRNAs to metazoan animals. In particular, in mammals, the analysis is complicated by a high copy number of snRNAs of the major spliceosome and an associated large number of pseudogenes (Denison et al. 1981). We focus here on four questions: (1) Is there evidence for discernible paralogue groups of snRNAs in some clades? A dominating mode of concerted evolution does not necessarily prevent this, as demonstrated by the existence of two highly diverged copies of both LSU and SSU rRNA in Chaetognatha (Papillon et al. 2006; Telford and Holland 1997), which is probably associated with a duplication of the entire rDNA cluster. (2) Are there clades with deviant snRNA structures? The prime example for a highly divergent snRNA is the U11 in a subset of the insects (Schneider et al. 2004). (3) Are there interpretable trends in the copy number of snRNAs across metazoa? (4) How mobile are snRNA genes relative to the “background” of protein coding genes? In other words, to what extent are some or all of the snRNA genes offsprings of a locus that remains stably linked to its context over large timescales.
Materials and Methods
Sequence Data
Known snRNA sequences were retrieved from GenBank (Benson et al. 2007) and Rfam (Griffiths-Jones et al. 2005) and, in some cases, extracted directly from the literature. Genomic DNA sequences were downloaded from the Web sites of Ensembl, the Joint Genome Institute, the Sanger Institute, WormBase, the Genome Sequencing Center, UCSC, CAF1, Broad Institute, BGI, and the NCBI trace archive. For some species, we also performed nonexhaustive searches in the NCBI Trace Archive using Megablast. Details on the dataset are given in the electronic supplement.Footnote 1
Overall, the published experimental evidence on metazoan snRNAs is very unevenly distributed. For example, a large and phylogenetically diverse set of U2 snRNA sequences is reported in Giribet et al. (2001), while most other snRNAs have been reported for a few model organisms only. A recent experimental screen for snRNAs in Takifugu rubripes (Myslinksi et al. 2004) resulted in copies of eight snRNAs families. U4atac was missing, but a plausible candidate can easily be found by Blast. Only a few sequences of minor spliceosomal snRNAs have been reported so far, mostly in a few model mammals (Tarn et al. 1995) and in drosophilids (Mount et al. 2007; Schneider et al. 2004).
Homology Search
In a first automatic step we used a local installation of NCBI blast (v.2.2.10) with default parameters and E < 10−6 to find candidate sequences in closely related genomes. If successful, the results of this search were aligned to the query sequence using clustalw (v.1.83). After a manual inspection using clustalx, the consensus sequence of the alignment was again used as a Blast query with the same E-value cutoff.
If this automatic search was not successful, the best Blast hit(s) was(were) retrieved and aligned to a set of known snRNAs from related species. Candidate sequences were retained only when a visual inspection left no doubt that they were true homologues. This manual analysis step included a check whether the phylogenetic position of the candidate sequence in a neighbor-joining tree was plausible, taking into account that the sequences are short and some parts of the alignments are of low quality.
In cases where no snRNA homologues were found as described above, we searched the genome again with the much less stringent cutoff of E < 0.1 (or even higher in a few cases) and extracted all short hits together with 200-nt flanking sequence. We used Sean Eddy’s rnabob with a manually constructed structure model to extract a structure-based match within the selected regions and attempted to align the candidate sequences manually to a structure-annotated alignment of snRNAs in the emacs editor using the ralee mode (Griffiths-Jones 2005).
Finally, the resulting alignments of snRNAs were used to derive search patterns for RNA motif (Macke et al. 2001) and erpin (Gautheret and Lambert 2001). To this end, the consensus structure of the alignment was computed using RNAalifold (Hofacker et al. 2002) and converted into a form suitable as input for the two search programs.
Structure Models
Structure annotated sequence alignments were manually modified in the emacs text editor using the ralee mode (Griffiths-Jones 2005) to improve local sequence-structure features based on secondary structure predictions for the individual sequences obtained from RNAfold (Hofacker et al. 1994). Consensus structures were then computed using RNAalifold (Hofacker et al. 2002). The structure models are compiled in the online supplementary material.
Upstream Region Analysis
With MEME (v.3.5.0) we discovered motifs upstream of the sequences for analysis of regulators and other possible dependencies. They were manually compared with previously published sequence elements. We visually compared the MEME patterns with the upstream elements in related species from the following literature sources: general motifs (Hernandez 2001), human (Bark et al. 1986; Domitrovich and Kunkel 2003; Kunkel and Pederson 1988; Tarn et al. 1995), chicken (Bhathal et al. 1995; Korf and Stumph 1986), insects (Mount et al. 2007), Bombyx mori (Sierra-Montes et al. 2005), Strongylocentrotus purpuratus (Stefanovic and Marzluff 1992), and Caenorhabditis elegans (Thomas et al. 1990).
Phylogenetic Analysis
Since the snRNA sequences are short and, in addition, there are several highly variable regions, we use split decomposition (Bandelt and Dress 1992) and the neighbor net (Bryant and Moulton 2004) algorithm (as implemented as part of the SplitsTree4 package (Huson and Bryant 2006)) to construct phylogenetic networks rather than phylogenetic trees. The advantage of these method is that they are very conservative and that the reconstructed networks provide an easy-to-grasp representation of the considerable noise in the sequence data.
Synteny Information
In order to assess whether snRNA genes are mobile in the genome, we determined their flanking protein-coding genes. We used the ensembl compara annotation (Flicek et al. 2008) to retrieve homologous proteins in other genomes and compared whether these homologues also have adjacent snRNAs. For consistency, this analysis is performed based on ensembl (release 46) (Hubbard et al. 2005) using the data integration platform BioFuice (Kirsten and Rahm 2006). More precisely, for each human snRNA G we examined the relation of the left homologous L H (G) and right homologous R H (G) of flanking protein coding genes L(G) and R(G) on both sides of G. We only considered annotations in L H (G) and R H (G), respectively, if the sequence distance between G H and L H (G) and R H (G) was not more than twice (five times for mammals) the distance between G and L(G) and R(G).
Results
Homology Search
Table 1 summarizes the results of the sequence homology search detailed under Materials and Methods. Only sequences that passed all filtering steps and structure checks are reported as “homologues” in the following. We found that, with few exceptions, Blast-based homology search strategies are in general sufficient to find homologues of all nine spliceosomal snRNAs in most metazoan genomes. The procedure is hard to automatize, however, since in many cases the initial Blast hits have poor E-values, while a multiple sequence alignment then leaves little doubt that a true homologue has been found. This is true, in particular, for searches bridging large evolutinary distances, especially when the search extends beyond bilateria.
With very few exceptions we found multiple copies of all five major spliceosomal RNAs that exhibited the typical snRNA-like promoter elements and were hence mostly likely functional copies of the genes. The snRNA copy numbers varied substantially between different clades. The genus Caenorhabditis, for example, was set apart from other nematodes by a two- to threefold increase in the number of major spliceosomal snRNAs. In contrast, the snRNAs of the minor spliceosome were in most cases single-copy genes.
Many genomes, most notably mammalian genomes, contained a sizable number of major snRNA pseudogenes. Table 1 therefore lists only candidates that have plausible snRNA-like promoter structure, that fit the secondary structures of snRNAs in related species, and that exhibit strong sequence similarity in the unpaired regions of the molecule. These are rather restrictive criteria. In the online supplementary material, we therefore provide a corresponding table that is based only on sequence homology.
It is surprisingly difficult to compare the present snRNA survey with previous reports on vertebrate snRNAs. The main reason for discrepancies in the count of snRNAs is that distinguishing functional snRNAs from pseudogenes is still an unsolved problem. In this contribution, we use a very stringent criterion by insisting on a recognizable promoter structure. In some cases, however, it is known that snRNAs have internal promoters only (Tichelaar et al. 1998). These cases constitute false negatives in Table 1. On the other hand, much of the published literature considers sequence similarity to the known functional genes as the only criterion, thus most likely leading to the inclusion of a substantial fraction of pseudogenes. For instance The Chimpanzee Sequencing Analysis Consortium (2005) counts 16 U1, 6 U2, and 44 U6 snRNAs in the human genome (compared to our 8, 3, and 7, respectively), while Domitrovich and Kunkel (2003) reports 5–9 U6 snRNA genes, consistent with our list. Similarly, only a fraction of the major spliceosomal snRNAs reported for the chicken genome in (Consortium 2004) passes our promoter analysis.
For drosophilids, on the other hand, our analysis is almost identical to the results in Mount et al. (2007), Table 1, and the data reported in Sierra-Montes et al. (2005). Furthermore, we come close to the results of a comparative genomics screen for noncoding RNAs in C. elegans (Missal et al. 2006), which reported 12 U1, 19 U2, 5 U4, 13 U5, and 23 U6, i.e., only a few more candidates than our present purely homology-based approach. A comparative screen of the two Ciona species for evolutionary conserved structured RNAs (Missal et al. 2005) missed a small number of snRNA genes that we indentified as most likely functional ones.
In a few species we failed to identify individual major spliceosomal snRNAs (e.g., A. pisum U4, H. bacteriophora U4, and S. mediterannea U2). Minor spliceosomal snRNAs are more often missing. In those cases where only some of the major or minor snRNAs remain undetected, the missing family member most likely escaped our detection procedure for one of several reasons.
-
1.
In the case of unassembled incomplete genomes for which only shotgun reads were searched, the snRNA may be located in the not yet sequenced fraction of the genome or it might not be completely contained within at least one single shotgun read.
-
2.
The snRNA in question may be highly derived in sequence. For instance, the U11 snRNA in drosophilids (Schneider et al. 2004) cannot be found by a simple Blast search starting from noninsect sequences. It can be found, however, by the combination of very unspecific Blast and subsequent structure search as described under Homology Search, above.
-
3.
In some cases we list a “0” in Table 1 even though there is recognizable sequence homology in the genome. In these cases we were not able to identify the snRNA-like promoter elements and/or the secondary structure did not fit the expectations. These cases are marked in Table 1.
-
4.
It is conceivable that some species had lost a particular snRNA and replaced it by corresponding snRNA from the other spliceosome. The observation that U4 may function in both the major and the minor spliceosomes (Shukla and Padgett 2004) shows that such a replacement mechnism might indeed be evolutionarily feasible.
In our data set, we most frequently were unable to find a U4atac homologue. We cannot know, of course, whether we missed these cases due to poor sequence conservation or due to loss of the gene. For instance, we did not recover a plausible U4atac candidate for the hemichordate Saccoglossus kowalevskii despite the fact that the U4atac sequence of the sea urchin Strongylocentrotus purpuratus was easily retrieved.
Surprisingly, we found neither a canonical U6 nor a canonical U6atac in Drosophila willistoni. A highly derived U6 homologue has no recognizable snRNA-like promoter structure and exhibits substantial deviations from the consensus structure (see Secondary Structures, below). Interestingly, it is aligned with the functional U6 RNAs of the other 11 drosophilids in the genome-wide “12-Fly” Pecan alignment,Footnote 2 which respects syntenic conservation. This strongly suggests that D. willistoni indeed has a highly derived U6 snRNA. According to known annotation the sequence is not located in an intron. The absence of external promoter elements has also been observed for one of the human U6 snRNAs (Tichelaar et al. 1998), hence the prediction is not at all implausible. Similarly, the U4atac candidate from Daphnia pulex deviates substantially from other arthropod sequences. It is possible that in some or all of these cases the snRNA is present in the genome but is not contained in the currently available genomic sequence data. This is most likely the case for the missing minor spliceosomal snRNAs of Ixodes scapularis, Pediculus humanus, or Drosophilia willistoni.
In some cases, however, we failed to identify all four minor spliceosomal snRNAs. Consistent with previous work (Patel and Steitz 2003) we found no convincing homologues of the minor spliceosomal snRNAs U11, U12, U4atac, and U6atac in any of the nematode genomes, suggesting that the minor spliceosome was lost early in the nematode lineage. Nevertheless, we find some Blast hits for minor spliceosomal snRNAs in some nematode genomes.
Our analysis furthermore suggests the possible loss of the minor spliceosome in Oikopleura dioica, while a complete complement of minor spliceosomal snRNAs was found in the genus Ciona. It is unclear, however, whether this is an artifact due to limiations of available shotgun traces.
Our survey provides evidence that most metazoan clades for which genomic sequences are available have retained the minor spliceosome. For many groups, such as Annelida and Cnidaria, we are not aware of earlier references to the existence of minor spliceosome.
Specific Upstream Elements
The classical snRNA-specific PSE and TATA elements that have been described in detail for several vertebrates (Domitrovich and Kunkel 2003; Hernandez 2001) are highly conserved. This appears to be an exception rather than the rule, however: the snRNA upstream elements are highly diverse across metazoa. Our analysis agrees with the recent observation that in drosophilids there is a rapid turnover in the upstream sequences. Even though the PSE is fairly well conserved within drosophilids, it already differs substantially between the major insect groups (Mount et al. 2007). Similarly, within the nematodes conservation of upstream elements is limited to the genus level. In general, the PSEs of U11, U12, and U4atac is much less conserved than their counterparts in major spliceosomal snRNA genes. For the purpose of this study, the relatively well-conserved elements were used to discriminate functional snRNAs from likely pseudogenes. We concentrated on PSE and TATA elements for this purpose because other snRNA-associated upstream elements, such as SPH, OCT, CAAT-box, GC-box, -35-element, and Inr, are even less well conserved:
A GC-box was identified in Caenorhabditis at a noncanonical position (about −68 nt). These elements are different for each single snNRA class: U1 GGACGG (44/52 sites), U2 TGGCCG (38/60 sites), and U5 CGGCCG (39/46 sites). However, also among a single snRNA this element varies a lot: insects have a U1 GC-box GCGCTG at about −75 nt (15/39 sites). About half of the U6 sequences of basal deuterostomes show the CAAT-box motif TGCCAAGAA at the known position of −70 nt. Interestingly, we found related motifs in the upstream region of drosophilids U11 (GACCAATAT; −33 nt) and other insects U5 snRNA (TTCCAATCA; −28 nt). The Octamer motif (OCT; ATTTGCAC) was found in six of seven sequences of basal deuterostomes at the known position of −54 nt upstream of U6atac. However, in 12 of 14 drosophilid sequences, the closely related motif ATTTGCTT was found at position −33 nt. About 35 nt upstream of U11 and U12 snRNAs of teleosts we found the motifs GTGACA and TGCACA, respectively. The Inr element of U1 snRNA was found in each species. For teleost fish and drosophilids we found a complete set of this element for all snRNAs. However, the Inr shows substantial sequence variations both between different genes in the same species and between homologous genes in different species. We refer to the online supplementary material for further details and lists of identified sequence elements.
Clusters of snRNA Genes
In Mammalia, we observe linkage of tandem copies of U2 snRNAs (see also Liao et al. 1997; Pavelitz et al. 1999), while there are no clusters of distinct snRNAs. In Drosophila, there are surprisingly constant patterns of snRNA clusters: (a) U2-U5 clusters are observed four to six times per genome, (b) there are one or two U1-U2 clusters, and (c) three to nine tandem copies of snRNAs. Two species deviated therefrom. In D. ananassae, we find no U2-U5 cluster but, instead, 7 U1-U2, 1 U4-U5 cluster, and 4 other tandem copies, while D. willistoni lacks the U4-U5 cluster but contains 10 U2-U5 pairs and 6 tandem copies. Teleost fish also have a common pattern: there are one or two U1-U2 pairs and two to six tandem copies. In general, however, snRNA do not appear in clusters throughout metazoan genomes.
In several species, linkage of snRNAs with 5S rRNA has been observed (Cross and Rebordinos 2005; Ebel et al. 1999; Liao 1999; Liao and Weiner 1995; Manchado et al. 2006; Pelliccia et al. 2001). We found only one further example of this type: in Daphnia pulex 5S and U5 snRNAs are separated by only 308 bp.
Phylogenetic Analysis and Paralogues
Like ribosomal RNAs, spliceosomal RNAs are subject to concerted evolution (Gonzalez and Sylvester 2001; Hillis and Dixon 1991; Schlötterer and Tautz 1994), i.e., one observes that paralogous sequences in the same species are more similar than orthologous sequences of different species. Multiple molecular mechanisms may account for this phenomenon: gene conversion, repeated unequal crossover, and gene amplification (frequent duplications and losses within family); see Liao (1999) for a review. In some cases, however, paralogues can escape from the concerted evolution mechanisms as exemplified by the two paralogue groups of SSU rRNA in Chaetogatha (Papillon et al. 2006).
Distinguishable snRNA paralogues that are often differentially expressed have previously been reported for a diverse collection of major spliceosmal snRNAs including U1 snRNAs in insects (Lo and Mount 1990; Pereira-Simon et al. 2004; Sierra-Montes et al. 2005), Xenopus (Dahlberg and Lund 1988), and human (Kyriakopoulou et al. 2006), U2 snRNAs in Dictyostelium (Hinas et al. 2006), sea urchin (Stefanovic et al. 1991), and silk moth (Sierra-Montes et al. 2005), U5 snRNAs in human (Sontheimer and Steitz 1992), sea urchin (Morales et al. 1997), and drosophilids (Chen et al. 2005), and U6 snRNAs in silk moth (Smail et al. 2006) and human (Domitrovich and Kunkel 2003; Tichelaar et al. 1998).
A phylogenetic analysis of the individual snRNA families, nevertheless, does not show widely separated paralogue groups that are stable throughout larger clades. Figure 1, for example, shows that the U5 variants described in Chen et al. (2005) do not form clear paralogue groups beyond the closest relatives of Drosophila melanogaster. On the other hand, there is some evidence for distinguishable paralogues outside the melanogaster subgroup. The situation is much clearer for the drosophilid U4 snRNAs, where three paralogue groups can be distinguished (see Fig. 2). One group is well separated from the other two and internally rather diverse. The other two groups are very clearly distinguishable for the melanogaster and obscura group (see Drosophila 12 Genomes Consortium 2007). For D. virilis, D. mojavensis, D. grimshawi, and D. willistoni we have two nearly identical copies instead of two different groups of genes.
Table 2 summarizes the presence of recognizable paralogue groups within major animal groups. Within the genus Caenorhabitis we find evidence for the formation of U5 paralogue groups in C. remanei, C. brenneri, and C. briggsae, to the exclusion of C. elegans and C. japonica. Evidence for paralogue groups of U1 snRNA in drosophilids remains ambiguous due to the small sequence differences.
In teleost fish, we find clearly recognizable paralogue groups for U2, U4, and U5 snRNAs. Surprisingly, the medaka Oryzias latipes has only a single group of closely related sequences, despite the fact that for U4, the split of the paralogues appear to predate the last common ancestor of zebrafish and fugu (Fig. 3).
Neither the two rounds of genome duplications at the root of the vertebrates nor the teleost-specific genome duplication has led to recognizable paralogue groups of snRNAs. In particular, minor snRNA genes are single-copy genes in teleosts.
Secondary Structures
The spliceosomal snRNAs have evolutionarily well-conserved secondary structures (Shukla and Padgett 1999). These structures received substantial interest in the past, as exemplified by the following nonexhaustive list of references covering a diverse set of animal species: Homo sapiens U1 (Mount and Steitz 1981), U2 (Hausner et al. 1990), U4 (Krol et al. 1981), U5 (Branlant et al. 1983; Sontheimer and Steitz 1992), U6 (Hausner et al. 1990), U11 (Montzka and Steitz 1988; Russell et al. 2006; Tarn et al. 1995), U12 (Montzka and Steitz 1988; Russell et al. 2006; Tarn et al. 1995), and U4atac (Shukla et al. 2002); Rattus norvegicus U1 (Krol et al. 1981), U4 (Krol et al. 1981), and U5 (Krol et al. 1981); Gallus gallus U4 (Krol et al. 1981) and U5 (Branlant et al. 1983); Xenopus laevis U1 (Forbes et al. 1984) and U2 (Mattaj and Zeller 1983); Caenorhabditis elegans U1, U2, U5, and U4/U6 (Thomas et al. 1990); Drosophila melanogaster U1 (Mount and Steitz 1981; Myslinski et al. 1984), U2 (Myslinski et al. 1984), U4 (Myslinski et al. 1984), U5 (Myslinski et al. 1984), U4atac/U6atac, and U6atac/U12 (Otake et al. 2002); Bombyx mori U1 (Sierra-Montes et al. 2003) and U2 (Sierra-Montes et al. 2002); Asselus aquaticus U1 (Barzotti et al. 2003); and Ascaris lumbricoides U1, U2, U5, and U4/U6 (Shambaugh et al. 1994). Large changes in snRNA structures over evolutionary time were recently reported for hemiascomycetous yeasts (Mitrovich and Guthrie 2007). The comprehensive survey of snRNA sequences throughout metazoa set the stage for a comparably detailed analysis of metazoan snRNA structures. In order to assess structural variations, we constructed structure annotated sequence alignments of all snRNA families. The complete set of alignments and consensus structure models is provided (in Stockholm format) as part of the online supplementary material.
In general we find that snRNA sequences vary more in paired regions than in the loops. The sequence variations almost exclusively comprises compensatory mutations that leave the secondary structures intact. As an example, Fig. 4 shows the structures of the U12 snRNA of Xenopus tropicalis and Capitella capitata. The sequences have few paired nucleotides in common.
Structural variations are typically limited. In Fig. 5 we use the U1 snRNAs as a typical example for the evolutionary variation of snRNAs across the metazoa. Overall the structures are extremely well conserved, with small variations in the length of the individual stems. With several notable exceptions this is true for all metazoan snRNAs.
As reported previously (Chen et al. 2005), the second stem of U5 snRNA shows some variations. More interestingly, the minor spliceosomal snRNAs tend to be derived in insects. This has been reported previously, in particular, for U11 in drosophilids (Mount et al. 2007; Schneider et al. 2004). We found substantial structural variations also for drosophilid U12 snRNAs: there are massive insertions in and after stem III, while stems I and II show mispairings. Furthermore, stem II of U6atac is completely deleted in all examined insects. Details are compiled in the online supplementary material.
Most surprisingly, Acyrthosiphon pisum exhibits highly derived structures for all four minor spliceosomal snRNAs (see Fig. 6).
The U2 snRNA of Schmidtea mediterannea does fit well to the structural alignment of the other U2 snRNAs. In Schistosoma mansoni we found a canonical U12 snRNA, while the sequences of the candidates for minor spliceosomal snRNAs do not fit well to the consensus secondary structure models. Details are given in the online supplementary material.
Syntenic Conservation
In order to assess the conservation of the genomic positions of the snRNAs we retrieved the protein coding genes adjacent to the 31 human snRNAs (8 U1, 3 U2, 2 U4, 5 U5, 7 U6, 1 U11, 1 U12, 3 U4atac, and 1 U6atac) and compared the position of their homologues in 14 vertebrate genomes (teleosts, frog, chicken, platypus, opossum, rodents, cow, dog, and chimp) with the 234 snRNA genes that were found in these genomes. We found syntenic conservation of snRNA and flanking genes in only 36 cases, of which 20 belong to the human-chimp comparison. Only 9 of the 31 human snRNA preserve synteny with adjacent genes in the mouse genome, while 22,680 annotated human genes give rise to 21,480 adjacent pairs that have adjacent homologues in the mouse. Furthermore, only a single pair is conserved between human and opossum, and no syntenic conservation can be traced back further in evolutionary history, while large syntenic blocks are conserved across chordata (Putnam et al. 2008). Including the pseudogenes increases the numbers of conserved pairs to 499 of 1609. Again, most of these (453) are human/chimp pairs. The data clearly show that snRNA locations are not syntenically conserved, i.e., snRNA behave like mobile elements in their genomic context.
Pseudogenes
As mentioned above, snRNAs are frequently the founders of families of pseudogenes. This is a property that they share with most other small RNA classes such as 7SL RNA, Y RNA, and tRNAs. Such families of pseudogenes are easily recognized as a by-product of Blast-based homology searches as a large set of hits with intermediate E-values. Figure 7 summarizes these data; more details are provided in the online supplementary material.
Spliceosomal snRNA pseudogenes families are very unevenly distributed across distinct phylogenetic groups and have clearly arisen in independent bursts multiple times across animal evolution. Within deuterostomes, almost all sequenced genomes, with the notable exception of teleosts and chicken, contain at least one large family of snRNA-derived pseudogenes.
The genus Caenorhabditis shows no pseudogenes, whereas other nematods show nearly such a high number of pseudogenes as primates. Annelids, molluscs, and platyhelminthes behave similarly. The Trichoplax adhaerens genome, on the other hand, contains a single copy of each of the nine spliceosomal snRNAs.
Discussion
We have reported here on a comprehensive computational survey of spliceosomal snRNA in all currently available metazoan genomes. We thus provide a comparable and nearly complete collection of animal snRNA sequences. The dense taxon sampling allowed us to verify homology of candidate sequences. Both the major and the minor spliceosome are present in almost all metazoan clades, nematodes (and possibly Oikopleura) being the only notable exception. For many of the metazoan families we report here the first evidence on their spliceosomal RNAs.
Using restrictive filtering of the candidates by both secondary structure and canonical promoter structure left us with a high-quality data set that was then used to construct secondary structure models. This is useful, in particular, for the snRNAs of the minor spliceosome, for which very few sequences are reported in databases; indeed, Rfam 7.0 (Griffiths-Jones et al. 2005) lists only the U11 and U12 families, with a meager set of seed sequences from few model organisms. The sequence and secondary structure data compiled in this study provide a substantially improved database and set the stage for systematic searches of even more distant homologues.
The analysis of the genomic distribution of snRNAs reveals that discernible paralogues are not uncommon within genera or families. However, no dramatically different paralogues have been found. Spliceosomal snRNAs are prone to spawning large pseudogene families, which arose independently in many species. They behave like mobile genetic elements in that they barely appear in syntenic positions as measured by their flanking genes. While in some genomes snRNAs appear in tandem and/or associated with 5S rRNA genes, these clusters are not conserved over longer evolutionary timescales. Taken together, the data are consistent with a dominating duplication-deletion mechanism of concerted evolution for the genomic evolution and proliferation of snRNA. This behavior of snRNAs is similar, in particular, to tRNAs, albeit the copy number of snRNAs is typically much smaller. Recent studies have demonstrated that snoRNAs behave like mobile genetic elements that spread via retroposition (Schmitz et al. 2008; Weber 2006). Their mode of expression from spliced-out introns, however, restricts the functional copies predominantly to introns of the same host gene, with only occasional translocations to different carriers (see Bompfünewerer et al. 2005). Spliceosomal RNAs, in contrast, appear to freely spread across the genome when they appear as multicopy genes.
References
Bandelt HJ, Dress AWM (1992) A canonical decomposition theory for metrics on a finite set. Adv Math 92:47
Bark C, Weller P, Zabielski J, Pettersson U (1986) Genes for human U4 small nuclear RNA. Gene 50:333–344
Barzotti R, Pelliccia F, Rocchi A (2003) Identification and characterization of U1 small nuclear RNA genes from two crustacean isopod species. Chromosome Res 11:365–373
Benson DA, Karsch-Mizrachi I, Lipman DJ, Ostell J, Wheeler DL (2007) GenBank. Nucleic Acids Res 35:D21–D25
Bhathal HS, Zamrod Z, Tobaru T, Stumph WE (1995) Identification of proximal sequence element nucleotides contributing to the differential expression of variant U4 small nuclear RNA genes. J Biol Chem 270:27,629–27,633
Bompfünewerer AF, Flamm C, Fried C, Fritzsch G, Hofacker IL, Lehmann J, Missal K, Mosig A, Müller B, Prohaska SJ, Stadler BMR, Stadler PF, Tanzer A, Washietl S, Witwer C (2005) Evolutionary patterns of non-coding RNAs. Th Biosci 123:301–369
Branlant C, Krol A, Lazar E, Haendler B, Jacob M, GalegoDias L, Pousada C (1983) High evolutionary conservation of the secondary structure and of certain nucleotide sequences of U5 RNA. Nucleic Acids Res 11:8359–8367
Bryant D, Moulton V (2004) Neighbor-net: an agglomerative method for the construction of phylogenetic networks. Mol Biol Evol 21:255–265
Chen L, Lullo DJ, Ma E, Celniker SE, Rio DC, Doudna JA (2005) Identification and analysis of U5 snRNA variants in Drosophila. RNA 11:1473–1477
Collins L, Penny D (2005) Complex spliceosomal organization ancestral to extant eukaryotes. Mol Biol Evol 22:1053–1066
Collins LJ, Macke TJ, Penny D (2004) Searching for ncRNAs in eukaryotic genomes: maximizing biological input with RNA motif. J Integ Bioinf 1:2004–08–04. Available at: http://journalimbiode/indexphp?paperid56
Consortium International Chicken Genome Sequencing (2004) Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature 432:695–716
Cross I, Rebordinos L (2005) 5S rDNA and U2 snRNA are linked in the genome of Crassostrea angulata and Crassostrea gigas oysters: Does the (ct)n(ga)n microsatellite stabilize this novel linkage of large tandem arrays? Genome 48:1116–1119
Dahlberg JE, Lund E (1988) The genes and transcription of the major small nuclear RNAs. In: Birnstiel ML (ed) Structure and function of major and minor small nuclear ribonucleoprotein particles. Springer-Verlag, Berlin, pp 38–70
Denison RA, Van Arsdell SW, Bernstein LB, Weiner AM (1981) Abundant pseudogenes for small nuclear RNAs are dispersed in the human genome. Proc Natl Acad Sci USA 78:810–814
Domitrovich AM, Kunkel GR (2003) Multiple, dispersed human U6 small nuclear RNA genes with varied transcriptional efficiencies. Nucleic Acids Res 31:2344–2352
Drosophila 12 Genomes Consortium (2007) Evolution of genes and genomes on the Drosophila phylogeny. Nature 450:203–218
Ebel C, Frantz C, Paulus F, Imbault P (1999) Trans-splicing and cis-splicing in the colourless euglenoid, Entosiphon sulcatum. Curr Genet 35:542–550
Flicek P, Aken BL, Beal K, Ballester B, Caccamo M, Chen Y, Clarke L, Coates G, Cunningham F, Cutts T, Down T, Dyer SC, Eyre T, Fitzgerald S, Fernandez-Banet J, Gräf S, Haider S, Hammond M, Holland R, Howe KL, Howe K, Johnson N, Jenkinson A, Käahäari A, Keefe D, Kokocinski F, Kulesha E, Lawson D, Longden I, Megy K, Meidl P, Overduin B, Parker A, Pritchard B, Prlic A, Rice S, Rios D, Schuster M, Sealy I, Slater G, Smedley D, Spudich G, Trevanion S, Vilella AJ, Vogel J, White S, Wood M, Birney E, Cox T, Curwen V, Durbin R, Fernandez-Suarez XM, Herrero J, Hubbard TJP, Kasprzyk A, Proctor G, Smith J, Ureta-Vidal A, Searle S (2008) Ensembl 2008. Nucleic Acids Res 36:D707–D714
Forbes DJ, Kirschner MW, Caput D, Dahlberg JE, Lund E (1984) Differential expression of multiple U1 small nuclear RNAs in oocytes and embryos of Xenopus laevisi. Cell 38:681–689
Gautheret D, Lambert A (2001) Direct RNA motif definition and identification from multiple sequence alignments using secondary structure profiles. J Mol Biol 313:1003–1011
Giribet G, Edgecombe GD, Wheeler WC (2001) Arthropod phylogeny based on eight molecular loci and morphology. Nature 413:157–161
Gonzalez IL, Sylvester JE (2001) Human rDNA: evolutionary patterns within the genes and tandem arrays derived from multiple chromosomes. Genomics 73:255–263
Griffiths-Jones S (2005) RALEE—RNA alignment editor in Emacs. Bioinformatics 21:257–259
Griffiths-Jones S, Moxon S, Marshall M, Khanna A, Eddy SR, Bateman A (2005) Rfam: annotating non-coding RNAs in complete genomes. Nucleic Acids Res 33:D121–D124
Hastings KE (2005) SL trans-splicing: Easy come or easy go? Trends Genet 21:240–247
Hausner TP, Giglio LM, Weiner AM (1990) Evidence for basepairing between mammalian U2 and U6 small nuclear ribonucleoprotein particles. Genes Dev 4:2146–2156
Hernandez N (2001) Small nuclear RNA genes: a model system to study fundamental mechanisms of transcription. J Biol Chem 276:26,733–26,736
Hillis DM, Dixon MT (1991) Ribosomal DNA: molecular evolution and phylogenetic inference. Q Rev Biol 66:411–453
Hinas A, Larsson P, Avesson L, Kirsebom LA, Virtanen A, Söderbom F (2006) Identification of the major spliceosomal RNAs in Dictyostelium discoideum reveals developmentally regulated U2 variants and polyadenylated snRNAs. Eukaryot Cell 5:924–934
Hofacker IL, Fontana W, Stadler PF, Bonhoeffer LS, Tacker M, Schuster P (1994) Fast folding and comparison of RNA secondary structures. Monatsh Chem 125:167–188
Hofacker IL, Fekete M, Stadler PF (2002) Secondary structure prediction for aligned RNA sequences. J Mol Biol 319:1059–1066
Hubbard T, Andrews D, Caccamo M, Cameron G, Chen Y, Clamp M, Clarke L, Coates G, Cox T, Cunningham F, Curwen V, Cutts T, Down T, Durbin R, Fernandez-Suarez XM, Gilbert J, Hammond M, Herrero J, Hotz H, Howe K, Iyer V, Jekosch K, Kahari A, Kasprzyk A, Keefe D, Keenan S, Kokocinsci F, London D, Longden I, McVicker G, Melsopp C, Meidl P, Potter S, Proctor G, Rae M, Rios D, Schuster M, Searle S, Severin J, Slater G, Smedley D, Smith J, Spooner W, Stabenau A, Stalker J, Storey R, Trevanion S, Ureta-Vidal A, Vogel J, White S, Woodwark C, Birney E (2005) Ensembl 2005. Nucleic Acids Res 33:D447–D453
Huson DH, Bryant D (2006) Application of phylogenetic networks in evolutionary studies. Mol Biol Evol 23:254–267
Kirsten T, Rahm E (2006) BioFuice: mapping-based data intergation in bioinformatics. In: Leser U, Naumann F, Eckman B (eds) Proceedings of the 3rd International Workshop on Data Integration in the Life Sciences (DILS), vol 4075. Springer Verlag, Berlin, pp 124–135
König H, Matter N, Bader R, Thiele W, Müller F (2007) Splicing segregation: the minor spliceosome acts outside the nucleus and controls cell proliferation. Cell 131:718–729
Korf GM, Stumph WE (1986) Chicken U2 and U1 RNA genes are found in very different genomic environments but have similar promoter structures. Biochemistry 25:2041–2047
Krol A, Branlant C, Lazar E, Gallinaro H, Jacob M (1981) Primary and secondary structures of chicken, rat and man nuclear U4 RNAs. Homologies with U1 and U5 RNAs. Nucleic Acids Res 9:2699–2716
Kunkel GR, Pederson T (1988) Upstream elements required for efficient transcription of a human U6 RNA gene resemble those of U1 and U2 genes even though a different polymerase is used. Genes Dev 2:196–204
Kyriakopoulou C, Larsson P, Liu L, Schuster J, Söderbom F, Kirsebom LA, Virtanen A (2006) U1-like snRNAs lacking complementarity to canonical 5′ splice sites. RNA 12:1603–1611
Liao D (1999) Concerted evolution: molecular mechanism and biological implications. Am J Hum Genet 64:24–30
Liao D, Weiner AM (1995) Concerted evolution of the tandemly repeated genes encoding primate U2 small nuclear RNA (the RNU2 locus) does not prevent rapid diversification of the (CT)n(GA)n microsatellite embedded within the U2 repeat unit. Genomics 30:583–593
Liao D, Pavelitz T, Kidd JR, Kidd KK, Weiner AM (1997) Concerted evolution of the tandemly repeated genes encoding human U2 snRNA (the RNU2 locus) involves rapid intrachromosomal homogenization and rare interchromosomal gene conversion. EMBO J 16:588–598
Lo PC, Mount SM (1990) Drosophila melanogaster genes for U1 snRNA variants and their expression during development. Nucleic Acids Res 18:6971–6979
López MD, Alm Rosenblad M, Samuelsson T (2008) Computational screen for spliceosomal RNA genes aids in defining the phylogenetic distribution of major and minor spliceosomal components. Nucleic Acids Res 36:3001–3010
Lorkovíc ZJ, Lehner R, Forstner C, Barta A (2005) Evolutionary conservation of minor U12-type spliceosome between plants and humans. RNA 11:1095–1107
Macke TJ, Ecker DJ, Gutell RR, Gautheret D, Case DA, Sampath R (2001) RNAMotif, an RNA secondary structure definition and search algorithm. Nucleic Acids Res 29(22):4724–4735
Manchado M, Zuasti E, Cross I, Merlo A, Infante C, Rebordinos L (2006) Molecular characterization and chromosomal mapping of the 5S rRNA gene in Solea senegalensis: a new linkage to the U1, U2, and U5 small nuclear RNA genes. Genome 49:79–86
Mattaj IW, Zeller R (1983) Xenopus laevis U2 snRNA genes: tandemly repeated transcription units sharing 5′ and 3′ flanking homology with other RNA polymerase II transcribed genes. EMBO J 2:1883–1891
Missal K, Rose D, Stadler PF (2005) Non-coding RNAs in Ciona intestinalis. Bioinformatics 21(S2):i77–i78
Missal K, Zhu X, Rose D, Deng W, Skogerbø G, Chen R, Stadler PF (2006) Prediction of structured non-coding RNAs in the genome of the nematode Caenorhabitis elegans. J Exp Zool Mol Dev Evol 306B:379–392
Mitrovich QM, Guthrie C (2007) Evolution of small nuclear RNAs in S cerevisiae, C albicans, and other hemiascomycetous yeasts. RNA 13:2066–2080
Montzka KA, Steitz JA (1988) Additional low-abundance human small nuclear ribonucleoproteins: U11, U12, etc. Proc Natl Acad Sci USA 85:8885–8889
Morales J, Borrero M, Sumerel J (1997) Identification of developmentally regulated sea urchin U5 snRNA genes. DNA Seq 7:243–259
Mount SM, Steitz JA (1981) Sequence of U1 RNA from Drosophila melanogaster: implications for U1 secondary structure and possible involvement in splicing. Nucleic Acids Res 9:6351–6368
Mount SM, Gotea V, Lin CF, Hernandez K, Makałowski W (2007) Spliceosomal small nuclear RNA genes in 11 insect genomes. RNA 13:5–14
Myslinksi E, Krol A, Carbon P (2004) Characterization of snRNA and snRNA-type genes in the pufferfish Fugu rubripes. Gene 330:149–158
Myslinski E, Branlant C, Wieben ED, Pederson T (1984) The small nuclear RNAs of Drosophila. J Mol Biol 180:927–945
Nei M, Rooney AP (2005) Concerted and birth-and-death evolution of multigene families. Annu Rev Genet 39:121–152
Nilsen TW (2003) The spliceosome: The most complex macromolecular machine in the cell? Bioessays 25:1147–1149
Otake LR, Scamborova P, Hashimoto C, Steitz JA (2002) The divergent U12-type spliceosome is required for pre-mRNA splicing and is essential for development in Drosophila. Mol Cell 9:439–446
Papillon D, Perez Y, Caubit X, Le Parco Y (2006) Systematics of chaetognatha under the light of molecular data, using duplicated ribosomal 18S DNA sequences. Mol Phylogenet Evol 38:621–634
Patel AA, Steitz JA (2003) Splicing double: insights from the second spliceosome. Nat Rev Mol Cell Biol 4:960–970
Pavelitz T, Liao D, Weiner AM (1999) Concerted evolution of the tandem array encoding primate U2 snRNA (the RNU2 locus) is accompanied by dramatic remodeling of the junctions with flanking chromosomal sequences. EMBO J 18:3783–3792
Pelliccia F, Barzotti R, Bucciarelli E, Rocchi A (2001) 5S ribosomal and U1 small nuclear RNA genes: a new linkage type in the genome of a crustacean that has three different tandemly repeated units containing 5S ribosomal DNA sequences. Genome 44:331–335
Pereira-Simon S, Sierra-Montes JM, Ayesh K, Martinez L, Socorro A, Herrera RJ (2004) Variants of U1 small nuclear RNA assemble into spliceosomal complexes. Insect Mol Biol 13:189–194
Putnam NH, Butts T, Ferrier DEK, Furlong RF, Hellsten UK, Takeshi Robinson-Rechavi M, Shoguchi E, Terry A, Yu JK, Benito-Gutíerrez E, Dubchak I, Garcia-Fernàndez J, Gibson-Brown JJ, Grigoriev IV, Horton AC, de Jong PJ, Jurka J, Kapitonov VV, Kohara Y, Kuroki Y, Lindquist E, Lucas S, Osoegawa K, Pennacchio LA, Salamov AA, Satou Y, Sauka-Spengler T, Schmutz J, Shin-I T, Toyoda A, Bronner-Fraser M, Fujiyama A, Holland LZ, Holland PWH, Satoh N, Rokhsar DS (2008) The amphioxus genome and the evolution of the chordate karyotype. Nature 453:1064–1071
Russell AG, Charette JM, Spencer DF, Gray MW (2006a) An early evolutionary origin for the minor spliceosome. Nature 443:863–866
Russell AG, Charette JM, Spencer DF, Gray MW (2006b) An early evolutionary origin for the minor spliceosome. Nature 443:863–866
Schlötterer C, Tautz D (1994) Chromosomal homogeneity of Drosophila ribosomal DNA arrays suggests intrachromosomal exchanges drive concerted evolution. Curr Biol 4:777–783
Schmitz J, Zemann A, Churakov G, Kuhl H, Grützner F, Reinhardt R, Brosius J (2008) Retroposed SNOfall—a mammalianwide comparison of platypus snornas. Genome Res 18:1005–1010
Schneider C, Will CL, Brosius J, Frilander M, Lührmann R (2004) Identification of an evolutionarily divergent U11 small nuclear ribonucleoprotein paricle in Drosophila. Proc Natl Acad Sci USA 101(26):9584–9589
Shambaugh JD, Hannon GE, Nilsen TW (1994) The spliceosomal U small nuclear RNAs of Ascaris lumbricoides. Mol Biochem Parasitol 64:349–352
Sheth N, Roca X, Hastings ML, Roeder T, Krainer AR, Sachidanandam R (2006) Comprehensive splice-site analysis using comparative genomics. Nucleic Acids Res 34:3955–3967
Shukla GC, Padgett RA (1999) Conservation of functional features of U6atac and U12 snRNAs between vertebrates and higher plants. RNA 5:525–538
Shukla GC, Padgett RA (2004) U4 small nuclear RNA can function in both the major and minor spliceosomes. Proc Natl Acad Sci USA 101:93–98
Shukla GC, Cole AJ, Dietrich RC, Padgett RA (2002) Domains of human U4atac snRNA required for U12-dependent splicing in vivo. Nucleic Acids Res 30:4650–4657
Sierra-Montes JM, Freund AV, Ruiz LM, Szmulewicz MN, Rowold DJ, Herrera RJ (2002) Multiple forms of U2 snRNA coexist in the silk moth Bombyx mori. Insect Mol Biol 11:105–114
Sierra-Montes JM, Pereira-Simon S, Freund AV, Ruiz LM, Szmulewicz MN, Herrera RJ (2003) A diversity of U1 small nuclear RNAs in the silk moth Bombyx mori. Insect Biochem Mol Biol 33:29–39
Sierra-Montes JM, Pereira-Simon S, Smail SS, Herrera RJ (2005) The silk moth Bombyx mori U1 and U2 snRNA variants are differentially expressed. Gene 352:127–136
Smail SS, Ayesh K, Sierra-Montes JM, Herrera RJ (2006) U6 snRNA variants isolated from the posterior silk gland of the silk moth Bombyx mori. Insect Biochem Mol Biol 36:454–465
Sontheimer EJ, Steitz JA (1992) Three novel functional variants of human U5 small nuclear RNA. Mol Cell Biol 12:734–746
Stefanovic B, Marzluff WF (1992) Characterization of two developmentally regulated sea urchin U2 small nuclear RNA promoters: a common required TATA sequence and independent proximal and distal elements. Mol Cell Biol 12:650–660
Stefanovic B, Li JM, Sakallah S, Marzluff WF (1991) Isolation and characterization of developmentally regulated sea urchin U2 snRNA genes. Dev Biol 148:284–294
Tarn WY, Yario TA, Steitz JA (1995) U12 snRNAs in vertebrates: evolutionary conservation of 5′ sequences implicated in splicing of pre-mRNAs containing a minor class of introns. RNA 1:644–656
Telford MJ, Holland PWH (1997) Evolution of 28S ribosomal DNA in chaetognaths: duplicate genes and molecular phylogeny. J Mol Evol 44:135–144
The Chimpanzee Sequencing Analysis Consortium (2005) Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437:69–87
Thomas J, Lea K, Zucker-Aprison E, Blumenthal T (1990) The spliceosomal snRNAs of Caenorhabditis elegans. Nucleic Acids Res 18:2633–2642
Tichelaar JW, Wieben ED, Reddy R, Vrabel A, Camacho P (1998) In vivo expression of a variant human U6 RNA from a unique, internal promoter. Biochemistry 37:12, 943–12,951
Valadkhan S (2005) snRNAs as the catalysts of pre-mRNA splicing. Curr Opin Chem Biol 9:603–608
Valadkhan S (2007) The spliceosome: caught in a web of shifting interactions. Curr Opin Struct Biol 17:310–315
Valadkhan S, Mohammadi A, Wachtel C, Manley JL (2007) Protein-free spliceosomal snRNAs catalyze a reaction that resembles the first step of splicing. RNA 13:2300–2311
Weber MJ (2006) Mammalian small nucleolar RNAs are mobile genetic elements. PLoS Genet 2:e205
Will CL, Lührmann R (2005) Splicing of a rare class of introns by the U12-dependent spliceosome. Biol Chem 386:713–724
Acknowledgments
This work was supported in part by the Graduierten-Kolleg Wissensrepräsentation and by the Bioinformatics Initiative of the Deutsche Forschungs-Gemeinschaft (DFG). Special thanks go to Petra Pregel and Jens Steuck for making the work much easier.
Author information
Authors and Affiliations
Corresponding author
Electronic Supplementary Material
Rights and permissions
About this article
Cite this article
Marz, M., Kirsten, T. & Stadler, P.F. Evolution of Spliceosomal snRNA Genes in Metazoan Animals. J Mol Evol 67, 594–607 (2008). https://doi.org/10.1007/s00239-008-9149-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-008-9149-6