Abstract
In archaea, pseudouridine (Ψ) synthase Pus10 modifies uridine (U) to Ψ at positions 54 and 55 of tRNA. In contrast, Pus10 is not found in bacteria, where modifications at those two positions are carried out by TrmA (U54 to m5U54) and TruB (U55 to Ψ55). Many eukaryotes have an apparent redundancy; their genomes contain orthologs of archaeal Pus10 and bacterial TrmA and TruB. Although eukaryal Pus10 genes share a conserved catalytic domain with archaeal Pus10 genes, their biological roles are not clear for the two reasons. First, experimental evidence suggests that human Pus10 participates in apoptosis induced by the tumor necrosis factor-related apoptosis-inducing ligand. Whether the function of human Pus10 is in place or in addition to of Ψ synthesis in tRNA is unknown. Second, Pus10 is found in earlier evolutionary branches of fungi (such as chytrid Batrachochytrium) but is absent in all dikaryon fungi surveyed (Ascomycetes and Basidiomycetes). We did a comprehensive analysis of sequenced genomes and found that orthologs of Pus10, TrmA, and TruB were present in all the animals, plants, and protozoa surveyed. This indicates that the common eukaryotic ancestor possesses all the three genes. Next, we examined 116 archaeal and eukaryotic Pus10 protein sequences to find that Pus10 existed as a single copy gene in all the surveyed genomes despite ancestral whole genome duplications had occurred. This indicates a possible deleterious gene dosage effect. Our results suggest that functional redundancy result in gene loss or neofunctionalization in different evolutionary lineages.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
During protein synthesis, transfer RNA (tRNA) requires modifications at specific nucleotides to ensure the correct incorporation of amino acids to a growing peptide chain. A superfamily of pseudouridine synthases families can isomerize uridine (U) to pseudouridine (Ψ) at multiple positions in RNA post-transcriptionally (Kaya and Ofengand 2003). In this superfamily, members of TruA, TruB, and TruD families modify tRNA; members of the RluA and RsuA families primarily involve in uridine modification of rRNA; and members of Pus10 can modify tRNA at position U54 to Ψ (Gurha and Gupta 2008).
Both TruB (of bacterial origin) and Pus10 (of archaeal origin) can modify U to Ψ at tRNA position 55 in vitro (Blaby et al. 2011; Gurha and Gupta 2008; Roovers et al. 2006). In bacteria, TrmA methylates tRNA at position U54 to m5U54 or ribo-T54 (ribothymidine) (Ny and Björk 1980). These three enzymes—TruB, TrmA, and Pus10—all have eukaryal orthologs and thus, have potential overlapped roles in modification of uridines at position 54 (by TrmA and Pus10) and 55 (by TruB and Pus10) of tRNAs. Indeed, when crystal structures of Pus family members were compared, overlapping structural similarities of an aspartate residue as the catalytic amino acid in the active pocket were revealed and confirmed. The structure is preserved in all the eukaryal orthologs studied (Foster et al. 2000; Hoang 2004; Hoang et al. 2006; McCleverty et al. 2007; Pan et al. 2003; Sivaraman et al. 2002). Moreover, mutational analysis of Pus family members identified additional four amino acids and structural features—forefinger loop (FFL) and thumb loop—to be important for substrate recognition and binding (Chan and Huang 2009; Conrad et al. 1999; Gurha and Gupta 2008; Hamilton et al. 2005; Hur and Stroud 2007; Joardar et al. 2013; Kamalampeta et al. 2013; Spedaliere et al. 2000).
In this paper, the molecular evolution of Pus10 and resolution of functional redundancy between orthologs of Pus10, TruB, and TrmA in different eukaryotic lineages are described. Lineage-specific alterations in key Pus10 sequence features are shown: length of the FFL and substitution of amino acids in the catalytic region and thumb loop. Different evolutionary hypotheses are proposed to explain how neofunctionalization, subfunctionalization, and gene loss have given different lineage-specific outcomes.
Materials and Methods
Collection of Datasets
For the identification of Pus10-like proteins, only the species with whole-sequenced and annotated genomes were retrieved from the National Center for Biotechnology Information (NCBI), the DOE Joint Genome Initiative (JGI), the Solanaceae Genomic Network (SGN), and the Ensembl genome browser (version 62) (Bombarely et al. 2011; Hubbard et al. 2009; McGinnis and Madden 2004). Sequence alignments were performed using the basic local alignment tool Protein BLAST (BLASTP; using default settings) in NCBI. The full length Pus10 sequences from Homo sapiens (NP_653310, PDB: 2V9K), Arabidopsis thaliana (NP_173466), Methanocaldococcus jannaschii (NP_247004), and Pyrococcus furiosus (NP_578868.1) were used as a query. Similarity Sequences with BLASTP score of 40 or better and with > 25% coverage were initially retained. However, those that did not harbor the essential Asp (aspartic acid) in the conserved core catalytic sequence GREDXD (Joardar et al. 2013) or were found to be significantly better aligned to another known Pus protein (e.g., Cbf5, Pus4) were excluded from later analysis. These were likely not functional orthologs of Pus10. Accession numbers for all the proteins obtained are given in Supplementary Table 1.
Protein Alignment and Phylogenetic Analysis
Orthologous Pus10 protein sequences of 116 species were collected from Eukarya and Archaea. A weakly aligned Pus10 homolog was found for Nanoarchaeum but was excluded from the downstream phylogenetic analysis due to extreme divergence of the sequence and absence of key Pus10 sequence features. The N-terminal region of Pus10 showed high divergence specific to each taxonomic domain and kingdom, and resulted in very low or no bootstrap supported values. Thus, protein alignment and phylogenetic analyses were conducted using only the conserved C-terminal catalytic domain (e.g., Gly286–Asp528 in human Pus10). In this study, amino acid positioning of Pus10 refers to the corresponding sequence position in relation to M. jannaschii. Isolation of the C-terminal domain was done by multiple sequence alignment using ClustalX (version 2.0.12) of whole protein sequences followed by removing the N-terminus region (e.g., residues 1–286 in human Pus10) (Larkin et al. 2007; McCleverty et al. 2007). Maximum likelihood analyses of all taxa with respect to the C-terminal of orthologous Pus10 were calculated with the Whelan and Goldman (WAG + I + G) empirical substitution model (Whelan and Goldman 2001). The web server RAxML Black box (http://phylobench.vital-it.ch/raxml-bb/) was used for maximum likelihood analyses (Stamatakis 2006). Newick files were uploaded into FigTree (version 1.3.1) (http://tree.bio.ed.ac.uk/software/figtree) and the following settings were altered. Line weight: 2x, cladogram was chosen as displayable transform version, while node labels were chosen to display bootstrap values. Further features were selected: node bars and scale bar (altered to line weight of two). Tree branches were colored to emphasize different kingdoms and the tree graphic was exported as a jpeg format.
Annotation of Phylogenetic Trees
The molecular phylogeny of the C-terminal Pus10 was annotated using the Interactive Tree Of Life (version 2.0) (iTOL; http://itol.embl.de/index.shtml) (Letunic and Bork 2011, 2016) featuring the key amino acids in the active pocket and structural features relevant for recognition and binding of tRNA substrates. Substitutions in the key Pus10 features were mapped onto a maximum likelihood tree and color coded. A separate species tree with 131 taxa (including those without detectable PUS10) was generated to illustrate the presence/absence pattern of Pus10 in relation to species ancestry and to determine potential gain/loss events (Supplementary Table 1). A reduced version with of the species tree was generated by acquiring taxonomy numbers from NCBI uploaded onto iTOL. The resulting species tree was color coded with respect to kingdom, except for phylum amoebozoa. Furthermore, presence/absence of Pus10 within clades was emphasized (Letunic and Bork 2011) (refer to illustration shown in Fig. 5).
Homology Modeling and Structural Superimposition
A homology model of the protein structure of M. jannaschii Pus10 was calculated using 3D-JIGSAW (version 2.0; http://bmm.cancerresearchuk.org/~3djigsaw/) (Bates et al. 2001). The crystal structure of H. sapiens (PDB: 2V9K, B factor 38.6, resolution: 2.0 Å) was used as a template (McCleverty et al. 2007). Both the crystal structures of H. sapiens Pus10 and the homology-modeled protein structure of M. jannaschii were uploaded onto the Swiss-PDB Viewer (version 4.1) and superimposed via the ‘generate structural alignment tool’ (Guex et al. 2009). Pus10 of H. sapiens and M. jannaschii are displayed as ribbon and each backbone of the 3D-structure was color coded. Conserved amino acids in all pseudouridine synthases were individually displayed, color coded, and labeled (refer to Fig. 3).
Results
Characterization of Key Conserved Structural Features of Pus10
Members of the Pseudouridine Synthase superfamily have a similar 3D structure and a conserved catalytic Asp (Mueller and Ferré-D’Amaré 2009). Other accessory domains with binding capacity and features (e.g., FFL and thumb loop) are conserved within sub-families and can distinguish between different members (Fig. 1). Crystal structure analysis of RluA and TruA identified FFL and thumb loop in the catalytic site as the ‘pinch mechanism’ holding the substrate in place (Hoang et al. 2006; Hoang and Ferré-D’Amaré 2001). Only three families (TruA, RluA, and Pus10) have both the FFL and thumb loop, whereas other families have either loop individually (Fig. 1).
The THUMP domain is unique to Pus10 within the pseudouridine synthase superfamily (Fig. 2). It is structurally similar to the THUMP domain of bacterial ThiI and present at the N-terminus, and likely involved in substrate recognition and binding (Aravind and Koonin 2001). Modification of the N-terminal THUMP domain in Pus10 was observed throughout different eukaryotic lineages, and extensively so in plants and animals (Fig. 2). Eukaryotic Pus10 THUMP domains contain large insertions to prevent detection by routine search of hidden Markov models (HMM) for a THUMP domain (when using PFAM PF02926 as template). But, all archaeal Pus10 THUMP domains tested fit the PFAM model.
Structural pairwise alignment of bacterial ThiI (PDB: 2C5S) was made to the homology-modeled THUMP domain across eukaryal and archaeal lineages. The structural comparison analysis by the DaliLite (version 3.3) (Holm et al. 2008; Holm and Rosenstrom 2010) showed an overall high similarity Z-score (Table 1). The Z-scores of PUS10 from P. furiosus and H. sapiens are very high, 2.5 and 3.7, respectively. These scores indicate that although observed deletions in P. furiosus and insertions in H. sapiens could theoretically have affected the folding of the protein, this region still remains structurally similar to ThiI THUMP.
In Pus10 of H. sapiens, the zinc-binding site consists of four cysteine (Cys) residues at positions 21, 24, 109, and 112, and is thought to be involved in the maintenance of the native N-terminal structure (McCleverty et al. 2007). Multiple sequence alignment revealed the presence of the four Cys residues in nearly all Pus10 orthologs of Archaea and Eukarya. The exceptions were found in the fungi lineage. Here, all four Cys were present in early fungi Mucoromycotina and in several Chytridiomycota, but entirely absent in Microsporidia. Furthermore, the first Cys pair (Cys21 and 24) was absent in one chytrid fungus (Allomyces macrogynus). Interestingly, the latter was encountered in the following invertebrates: Branchiostoma floridae, Daphnia pulex, and Caenorhabditis elegans. Ala (alanine) substitutions of the second pair, C106/109A in M. jannaschii (equivalent to C109/C112 of human Pus10) showed a reduced Ψ54 modification, but Ψ55 modification was unaffected (Joardar et al. 2013). This suggests Pus10 proteins may retain partial (Ψ55) function despite the absence of the second Cys pair.
In the catalytic region, five amino acids (Asp275, Tyr339, Ile412, Lys413, Leu440 in M. jannaschii) were determined to be conserved throughout all pseudouridine synthase families (Hamma and Ferré-D’Amaré 2006; McCleverty et al. 2007). Superimposition of a homology-modeled structure of M. jannaschii Pus10 onto that of H. sapiens Pus 10 revealed the same orientation and 3D positioning of the conserved amino acids present in the active pocket. This was observed throughout all 116 organisms surveyed (Fig. 3) with the exceptions of B. floridae (Cephalochordata), D. pulex (Brachiopoda), Manihot esculenta, and Solanum lycopersicum. A conserved consensus sequence of the catalytic site in the Pus10 orthologs centered on the catalytic Asp275 and five surrounding residues ‘GREDVD’ (Fig. 1). Substitutions within this catalytic site have been observed throughout Archaea and Eukaryota (Fig. 4). In addition to the active pocket, two loops were experimentally confirmed to be of value for the pseudouridylation of tRNAs at positions U55 and U54. The FFL proved to contribute to modify U55 over U54. Therefore, the length of the loop was crucial, not the identities of the amino acids within the loop. The opposite was observed for the thumb loop, where the amino acid identity (His and Arg) is of importance for pseudouridylation (Joardar et al. 2013).
Presence, Absence, and Copy Number of Pus10 Orthologs in Whole Genomes
Organisms with fully sequenced genomes were surveyed by BLASTP to identify Pus10 orthologs. In the identification of Pus10 orthologs, several eukaryotic and a few archaeal species were found to lack Pus10 and were placed on a species tree (Fig. 5). Four nodes of Pus10 loss were identified (Fig. 5). In fungi, all dikaryotes (Ascomycota and Basidiomycota) lacked Pus10 but had TruB orthologs. The most parsimonious answer would be a single evolutionary event in which Pus10 was lost in the common ancestor of all Dikaryon fungi. Other fungal taxa, including Chytrids and Microsporidia, contain Pus10 orthologs (Fig. 5). Most of Archaea had Pus10 but only two genera lacked functional Pus10 genes (Fig. 5). In Sulfolobus tokodaii, S. acidocaldarius, S. solfataricus, their most similar sequences (to Pus10) all lack the catalytic aspartate residue. The entire conserved ‘GREDVD’ sequence surrounding the catalytic aspartate residue is significantly altered (i.e., “PYSEPSDVR” in S. acidocaldarius). Although most of the residues upstream of this site are somewhat conserved. There is less conservation of the protein downstream of the catalytic site. Many of the other structural features thought to be needed to perform pseudouridylation are also missing or significantly divergent from those conserved between Archaea and Eukaryotes. The Pus10 sequence of Nanoarchaeum showed high overall divergence, with numerous substitutions throughout the sequence and resulted in low bootstrap values upon phylogenetic analysis. For these reasons, these two genera of Archaea were excluded from phylogenetic analysis. In four Archaeal orders and a few members within Desulfurococcales and Halobacteriales, shorter FFLs were observed. However, the majority of Archaea species in this study had Pus10 orthologs with completely intact Pus10 structural features (Fig. 4).
A surprising finding in all the genomes surveyed was the presence of only a single Pus10 gene per genome, despite the numerous genome duplications that have occurred in the evolution of animals, fungi, and especially in plants (where genes frequently duplicate and subfunctionalize) (Koonin and Wolf 2010; Proost et al. 2011). This finding might point to a potential dosage effect or other harmful consequence for higher copy numbers of Pus10.
Pus10 in Archaea
A maximum likelihood tree was calculated for the C-terminal domain of the Pus10 in 44 archaeal species (Supplementary Fig. 1). The tree comprises 4 clades where the outer most clade is represented by members of the Thermococcales: Cenarchaeum symbiosum and Nitrosopumilus maritimus (Thaumarchaeota). These are incorporated into the Methanococcales clade and show long branches, indicating high divergence and possible false placement due to long branch attraction or parallelism (Anderson and Swofford 2004). This sequence convergence indicates some possible alteration of the Pus10 catalytic function. When comparing key features of Pus10 protein sequences—FFL, catalytic ‘GRED[V/X]D,’ and thumb loop with His and Arg—of Thaumarchaeota to other Archaea and Eukaryota, Pus10 of both C. symbiosum and N. maritimus show the same FFL size as that of M. jannaschii. This could explain the incorporation of C. symbiosum and N. maritimus into the clade of Methanococcales. However, the sequence of their catalytic site differs considerably. In addition, the thumb loop lacks a conserved histidine residue. Multiple sequence alignment of Pyrobaculum arsenaticum, P. islandicum, and Thermoproteus neutrophilus also show modification in all structural features for Pus10.
The most important differences in Pus10 features were observed in Methanococcales and Thermococcales. Methanobacteriales, Halobacteriales, and Methanococcales have insertions in the FFL, creating a longer loop than the other clades in Archaea (Fig. 4). Moreover, the Thermococcales clade, within this insert-containing group, had further changes in the length of the FFL and substitutions (His376Asn and Arg377Ser) in the thumb loop.
Pus10 in Fungi
Of the 50 fungi with fully sequenced genomes, a total of nine species were found with orthologous Pus10 sequences (Supplementary Fig. 2). All nine were found in the earliest diverging families of fungi, the Unikaryonidae and Nosematidae (Microsporidia), Mucoraceae (Zygomycota), and Batrachochytrium (Chytridiomycota). The related lineages Ascomycota (as yeasts and sac fungi) and Basidiomycota (as mushroom fungi) did not have Pus10 orthologs in any of the 41 sequenced genomes surveyed, although other pseudouridine synthase genes (e.g., Pus4, Cbf5) were found (Fig. 5). This suggested that Pus10 was secondarily lost in the common ancestor of these fungal species.
Common sequence features among the Pus10 sequences of fungi included the presence of ten amino acid deletions in the 11th α-helix and 13th α-helix C-terminal to the catalytic site (Supplementary Fig. 2) (McCleverty et al. 2007). Microsporidia are also missing the first three secondary structures (α4, α6, β1) which are critical to the THUMP domain (Fig. 2). All early fungi species contain the same sequence length of the FFL, and three conserved amino acids were present in these Pus10 FFLs (‘PxxxxGxxxxxS’).
The catalytic site of Pus10 in R. oryzae and M. circinelloides showed conversion of the key second Asp 277 to Asn. Other minor substitutions in the catalytic site have also been observed (Fig. 4). The thumb loop in all nine fungi species is highly conserved in its sequence, except for in R. oryzae and M. circinelloides (Gln, instead of His). This changes the composition of the thumb loop from a basic amino acid to an acidic amino acid and could affect the binding of tRNA (Fig. 4). A TrmA ortholog was present in B. dendrobatidis but not in any other species of Microsporidian or Zygomycota (Table 2). Interestingly, TruB (Pus4) orthologs are not present in the earlier fungi lineages (Microsporidia, Chytridiomycota and Zygomycota) but are present in Ascomycota (S. cerevisiae).
Pus10 in Plants
Arabidopsis thaliana has a total of 24 pseudouridine synthase-like proteins. Most are paralogous copies of TruA, indicating expansion of this family in plants. Only one copy of Pus10 and TruD was detected in A. thaliana, and in 18 other plant genomes.
Multiple sequence alignment showed the incorporation of Ile (isoleucine), instead of Val (valine), in the catalytic site of ‘GREDID’ in plants. Substitutions were observed in the following species: ‘GREDLD’ in Cucumis sativus; ‘GREDMD’ in the moss Physcomitrella patens and green algae Ostreococcus tauri; ‘GREDAD’ in Chlamydomonas reinhardtii (Fig. 4). A highly modified THUMP domain was present in most plants. Nicotiana tabacum and Solanum lycopersicum (Solanaceae), Carica papaya, and Oryza sativa all lack α4 and β1 of the THUMP domain. In addition, N. tabacum and S. lycopersicum are also missing α6 and β2. Sorghum bicolor Pus10 uncharacteristically lacks the catalytic Asp and several amino acids of the catalytic site (Fig. 4), indicating it is a non-functioning protein, however other grasses all seem to have a functional Pus10 catalytic site.
The molecular phylogenetic analysis of Pus10 has the same topology as the phylogenetic tree of species, with the exception of Ectocarpus siliculosus (Heterokontophyta; brown algae) (Fig. 4). This resulted in a split of the two green algae representatives—C. reinhardtii and O. tauri, Chlorophyta—indicating possible genome fusion in the brown algae (Fig. 4). All plant genomes contain TruB and TrmA orthologs (N-terminus of TruB in C. reinhardtii). Most species within the plant kingdom have functional Pus10; but C. reinhardtii and C. papaya have modifications in the catalytic site that may have an effect on its function.
Pus10 in Animals and the Importance of THUMP Domain
THUMP is a RNA-binding domain present in 17 protein architectures and found in over 3587 proteins according to the PFAM database. Many of these proteins have different biological roles. For example, S-adenosyl methionine-dependent methyltransferase, rhodanese-like proteins, FtsJ-like methyltransferase, SpoU rRNA methylase family, and cytidine zinc-binding region. In this study, three major insertions which potentially disrupt or significantly alter the functional THUMP domain in Pus10 were identified (Fig. 2). All eukaryotic Pus10 THUMP domains were larger than archaeal ones and included numerous lineage-specific insertions between conserved portions of the THUMP-motif (Fig. 2); they make identification of the THUMP domain by sequence alignment difficult. The first insertion was observed on the N-terminal side of the THUMP domain and is found in all taxa except fungi and Archaea. A second insertion within the THUMP domain was observed to be present in animals and in some plants. The third insertion is found shortly before the FFL; it is conserved in higher eukaryotes and in several Protista (Paramecium tetraurelia and Trichoplax adhaerens). However, the insertions were absent in Archaea (Fig. 2). In this study, 42 animal species representing 32 different orders, a few animal-specific modifications in the THUMP domain and key Pus10 structure features (conserved residues in the catalytic site, FFL, and thumb loop) were observed (Fig. 5). The C-terminal catalytic site is highly conserved in all animals with some minor differences in earlier lineages. A maximum likelihood tree showed that the Protista, Tricoplacia, and Dictyostelia, followed by representatives of Nematoda, generally show the most diverse sequences alterations of key Pus10 features (Supplementary Fig. 3). Nematodes, C. elegans and Pristionchus pacificus, have a shorter FFL and the substitution of His to Lys in the thumb loop; same observations were made for the Diptera clade. However, Tribolium castaneum had ‘GREDFD’ in the catalytic motif/core instead of I412 with respect to M. jannaschii (refer to Fig. 5). Although there is no experimental evidence, these modifications to the THUMP domain may potentially alter or weaken non-specific Pus10 RNA binding and potentially alter the range of substrate RNA.
In H. sapiens, Pus10 is also known as Downstream Of Bid (DOBI). It has been shown to be a downstream interactor of the Bid (BH3 interacting-domain death agonist) protein signal involved in TRAIL-induced apoptosis and release of cytochrome C from mitochondria (Aza-Blanc et al. 2003; Jana et al. 2017). TRAIL is tumor necrosis factor-related apoptosis-inducing ligand and TNF-related apoptosis-inducing ligand. Mammalian Pus10 is recognized and cleaved by caspase-3 or caspase-8, which may create the functional unit for TRAIL signaling (Park et al. 2009). To analyze the ancestry of Pus10 involvement in TRAIL-induced apoptosis and tRNA pseudouridylation, 17 animal species were tested for the presence of (a) intact orthologs of redundant functioning pathways (TruB, TrmA) and (b) essential interacting components of TRAIL-induced apoptosis, caspase-3, and caspase-8 (Table 3). TrmA as well as caspase-3 and caspase-8 genome were not detectable in D. pulex, suggesting TRAIL-induced apoptosis is absent and Pus10 is likely to perform pseudouridylation of U54 (Table 3). Caspase-3 and caspase-8 orthologs were found in T. adhaerens, a multicellular organism belonging to the phylum Placozoa which lacks the presence of organs (Table 3). This, together with the absence of caspase-3 and caspase-8 orthologs in all fungi and plants, indicates that TRAIL-induced apoptosis could have begun as early as the last common ancestor of animals with T. adhaerens, at the base of the metazoan clade, approximately 635My ago.
Discussion
Evolutionary Resolution of Functional Redundancy in Pus10 and TruB
Pus10 is the most recently identified member of the pseudouridine synthase family and it has been experimentally shown to be crucial for tRNA pseudouridylation in Archaea (Gurha and Gupta 2008; Mueller and Ferré-D’Amaré 2009; Roovers et al. 2006). While there is significant similarity between eukaryal (H. sapiens) and archaeal (M. jannaschii) Pus10 orthologs, no remotely similar protein sequences were found in Bacteria. On the other hand, TrmA and TruB are absent in Archaea, but are present in Bacteria and Eukarya. This suggests that Pus10 must have originated in Archaea and is required to modify tRNA at both positions U55 and U54. The coexistence of Pus10, with parallel tRNA modifying genes TrmA and TruB, impacts the function of Pus10 in Eukarya. This situation may have also occurred in the observed species of Sulfolobus which have genes similar to bacterial TruB and TrmA, possibly acquired through horizontal gene transfer (HGT). In this case, the presence of both of these genes might have lead to the loss of key amino acids involved in the pseudouridine synthase function, but not the complete loss of the Pus10 protein. In another Archaea, P. furiosus, in which HGT has introduced bacterial RumA to modify tRNA position U54 (but not U55), may have resulted in a more specific loss of Pus10 function for position U54 only. In general, after a duplication event, an organism is less likely to maintain more than one copy of a gene involved in tRNA processing and this coincides with our study of Pus10 across all species examined. Redundancy of Pus10 with non-homologous proteins performing similar functions may have led to its functional diversification.
The most important differences in terms of potential for revealing the mechanism of function in Pus10 were observed in Methanococcales and Thermococcales (refer to Fig. 4). Structural comparison of P. furiosus to M. jannaschii showed these modifications are very likely relevant for hook formation and substrate binding, which may be related to the loss of in vivo modification of U54 without affecting U55 modification (Joardar et al. 2013). Several in vitro-generated mutants of M. jannaschii Pus10 at these residues lost Ψ54 activity, but not activity of Ψ55. P. furiosus Pus10 does have a weak in vitro ability to pseudouridylate at tRNA position 54, whereas M. jannaschii has a robust one (Gurha and Gupta 2008). An in silico mutagenesis of His376 to Asn and Arg377 to Ser in M. jannaschii shows that this change in the thumb loop should increase the gap width between both loops (Joardar et al. 2013). A wider gap generated by changes to the thumb loop, and/or shortening of the FFL, could contribute to a reduction in the tight pinching of the tRNA substrate that might be required for modification of U54.
In the case of P. furiosus, a bacterial rRNA methyltransferase (in the subfamily RumA of Pus superfamily) that is known to be acquired through HGT, is responsible for U54 modification in tRNA (Urbonavičius et al. 2008). Thus, in P. furiosus, the presence of RumA likely resulted in the partial relaxation of selection pressure and subsequent loss of Pus10 Ψ54 synthase ability, whereas its Ψ55 activity is retained. These differences in sequence across Archaea may indicate the extent of possible subfunctionalization and uncoupling of Ψ55 and Ψ54 activity of Pus10 for the order Thermococcales.
Based on our findings, it is likely that only TruB and TrmA orthologs remain functionally unchanged. Pus4 (TruB ortholog) converts U55 to Ψ55, and Trm2 (TrmA ortholog) modifies U54 to ribo-T54 in tRNAs of eukaryotes (Becker et al. 1997; Nordlund et al. 2000; Nurse et al. 1995). A different pseudouridine synthase, Cbf5, performs pseudouridylation of rRNA in a guide-RNA-dependent manner in both eukaryotes and Archaea. However, in eukaryotes, Cbf5 has also neofunctionalized to play a role during mitosis by binding to microtubules and centromeres (Blaby et al. 2011; Jiang et al. 1993; Rashid et al. 2006). Thus, like Cbf5, Pus10 may have neofunctionalized to signal TRAIL-induced apoptosis in humans, if not all animals either in place of or in addition to the pseudouridylation function. Pus10 protein is nuclear localized, and translocates to the mitochondria upon cleavage by caspase-3, which in turn amplifies caspase-3 activity, creating a positive feedback loop that has a central role in apoptosis (Jana et al. 2017). However, some mammalian tRNAs have been shown to have Ψ at position 54 instead of ribo-T. This indicates the existence of a protein to provide Ψ synthase activity for U54 (Roe and Tsen 1977). The obvious candidate is Pus10 (Gurha and Gupta 2008). Thus, we hypothesize that in some eukaryotic lineages Pus10 and TrmA orthologs may both have subfunctionalized, partitioning U54 modification activity, each with its own specific subset of tRNA substrates. Furthermore, we observed in the multiple sequence alignment of all 131 taxa, three insertions in the THUMP domain, which could alter RNA recognition. Interestingly, a few human mitochondrial tRNALEU do contain Ψ at position 55 (Helm 2006). However, the presence of TruB in H. sapiens does not leave out the possibility that TruB performs the aforementioned pseudouridylation rather than Pus10 (Table 2). While it is clear that mammalian Pus10 is part of TRAIL-apoptosis it remains unclear if it continues tRNA pseudouridylation at position 54 in the Mammalian clade, though it is the most obvious candidate. If so, it may still be capable to perform tRNA pseudouridylation at position 55 as it may be impossible to uncouple this reaction from position 54 pseudouridylation.
A RsuA-like Ψ synthase protein is known to be a suppressor of var2 mutant phenotype and plays a role in variegation, an apoptosis of the chloroplast, in A. thaliana (Yu et al. 2008). This suggested a potential TRAIL-like apoptosis pathway might exist in plants. However, no sequences like caspase-3 were found. Plants contain 3 metacaspases, which are involved in the so-called ‘deathosome,’ to regulate programmed cell death (Coll et al. 2010; Vercammen 2004; Vercammen et al. 2007). Metacaspases are also found in the kingdoms of Protozoa, fungi, and plants. Plants contain type I and II metacaspases, but not animal-like caspases. Metacaspase-4 and metacaspase-9 (AtMC4, AtMC9) recognize positively charged amino acids such as Arg or Lys in the motifs of substrate peptides with the amino acids FR, GRR, GKR, and VRPR and cleave after Arg or Lys (Vercammen 2004). Future experimental work will be needed to determine whether Pus10 plays a part of either a mitochondrial or plastid apoptotic-like machinery in plants.
Molecular Evolution of Pus10 in Eukaryota
In Eukarya, the common ancestor likely had three proteins from Bacteria and Archaea through genome fusion (Koonin 2010) that modified residues U54 and U55 of tRNA, which created a functional redundancy. Over time the modification and loss of Archaeal-origin Pus10 interacts with that of bacterial-origin TruB and TrmA orthologs. Surprisingly, unlike most redundancy created through gene duplication, this redundancy was not resolved through gene loss in most lineages (Sémon and Wolfe 2007). Individual eukaryotic lineages were examined and revealed lineage-specific differences in the fate of these three proteins in the fungi, plants, and animals.
A surprising finding in all the sequenced genomes of this study was the presence of only a single Pus10 gene per genome, despite the numerous genome duplications that have occurred in the evolution of animals, fungi, and especially in plants (where genes frequently duplicate and subfunctionalize) (Koonin and Wolf 2010; Proost et al. 2011). This finding might point to a potential dosage effect or other harmful consequence for higher copy numbers of Pus10.
In Fungi, TruB orthologs are not present in the earlier fungi lineages (Microsporidia, Chytridiomycota, and Zygomycota) but are present in Ascomycota. The most parsimonious solution is that the three early-branching fungal lineages have lost both TruB and TrmA, but have retained Pus10 (Table 2), while in Dikaryon fungi (Ascomycota, Basidiomycota), the opposite order of gene loss is observed, TruB and TrmA are retained and Pus10 is lost. Thus, it seems that these two groups of fungi have undergone the opposite selection of the enzymes Pus10, TruB, and TrmA that can modify uridine residues at positions 54 and 55 of tRNA. The pattern of gain and loss here clearly points to the random elimination of redundant enzymes for tRNA U54 and U55 modification that could be due to the loss of selection pressure. Given the phylogenetic analysis, it is unlikely that this pattern was the result of HGTs (refer to Fig. 5). This simple explanation for resolving functional redundancy through random gene loss is however not the case in plants and animals which retained seemingly functional orthologs of all three genes.
A recent duplication event in angiosperms should have resulted in at least two copies of Pus10 and in the 11 eudicot genomes surveyed (Proost et al. 2011). However, it has been shown that the extra paralogous genes coding for signaling and metabolism (e.g., protein kinases) are often maintained in plants while paralogous genes involved in DNA repair and modification are typically thinned to a limited number through rapid gene loss (Thomas et al. 2006). Gene ontology analysis of a tetraploid genome which revealed the preference of keeping or losing duplicated genes is frequently based on their biological role. Indeed, duplicated genes which are responsible for tRNA processing were less likely to be maintained in the genome.
Ala (alanine) substitutions in M. jannaschii differentially reduced Ψ54 activity while retaining most of wild-type Ψ55 activity (Joardar et al. 2013). However, a triple mutant in M. jannaschii (Ile412A/Lys413A/Leu440A) showed no pseudouridine synthase activity at either site. These two species, along with Sorghum bicolor that is lacking a catalytic Asp, indicate that Pus10 activity is lost in some plant species. However, the overall presence of conserved, seemingly functional Pus10 in all other plant species in this study would indicate a positive selection pressure for some function of this enzyme.
A RsuA-like pseudouridine synthase protein is known to be a suppressor of var2 mutant phenotype and plays a role in variegation, an apoptosis of the chloroplast, in A. thaliana (Yu et al. 2008). This suggested a potential TRAIL-like apoptosis pathway might exist in plants. However, no sequences like caspase-3 and caspase-8 were found. Plants contain 3 metacaspases, which are involved in the so-called ‘deathosome,’ to regulate programmed cell death (Coll et al. 2010; Vercammen 2004; Vercammen et al. 2007). Metacaspases are also found in the kingdoms of Protozoa, fungi, and plants. Plants contain type I and II metacaspases, but not animal-like caspases. Metacaspase-4 and metacaspase-9 (AtMC4, AtMC9) recognize positively charged amino acid such as Arg or Lys in the motifs of substrate peptides with the amino acids FR, GRR, GKR, and VRPR and cleave after Arg or Lys (Vercammen 2004). Future experimental work will be needed to determine whether Pus10 plays a part of either a mitochondrial or plastid apoptotic-like machinery in plants.
Conclusion
This study provides evidence of Pus10 subfunctionalization in Thermococcales (Archaea) and possibly in the Chytridiomycota. In both cases, TrmA/Trm2-like enzymes are present in the genome, indicating an alternate route for modification of U54 that co-occurs with mutations that reduce or eliminate Pus10 U54 modification. Apparently, methylation of U54 of tRNA to ribo-T54 by proteins acquired through either HGT or descent can lead to functional diversity in Ψ54 synthase activity of Pus10. A gap in the catalytic site of Pus10 in S. bicolor (plant) was noticed which could indicate that Pus10 is on the verge to become a pseudogene in some plant lineages.
References
Anderson FE, Swofford DL (2004) Should we be worried about long-branch attraction in real data sets? Investigations using metazoan 18S rDNA. Mol Phylogenet Evol 33:440
Aravind L, Koonin EV (2001) THUMP—a predicted RNA-binding domain shared by 4-thiouridine, pseudouridine synthases and RNA methylases. TRENDS Biochem Sci 26:215
Aza-Blanc P, Cooper CL, Wagner K, Batalov S, Deveraux QL, Cooke MP (2003) Identification of modulators of TRAIL-induced apoptosis via RNAi-based phenotypic screening. Mol Cell 12:627
Bates PA, Kelley LA, MacCallum RM, Sternberg MJE (2001) Enhancement of protein modeling by human intervention in applying the automatic programs 3D-JIGSAW and 3D-PSSM. Proteins 45:39
Becker HF, Motorin Y, Planta RJ, Grosjean H (1997) The yeast gene YNL292w encodes a pseudouridine synthase (Pus4) catalyzing the formation of Ψ55 in both mitochondrial and cytoplasmic tRNAs. Nucleic Acids Res 25:4493
Blaby IK, Majumder M, Chatterjee K, Jana S, Grosjean H, De Crécy-Lagard V, Gupta R (2011) Pseudouridine formation in archaeal RNAs: the case of Haloferax volcanii. RNA 17:1367
Bombarely A, Menda N, Tecle IY, Buels RM, Strickler S, Fischer-York T, Pujar A, Leto J, Gosselin J, Mueller LA (2011) The sol genomics network (solgenomics.net): growing tomatoes using Perl. Nucleic Acids Res 39:1149
Chan CM, Huang RH (2009) Enzymatic characterization and mutational studies of TruD—the fifth family of pseudouridine synthases. Arch Biochem Biophys 489:15
Coll NS, Vercammen D, Smidler A, Clover C, Van Breusegem F, Dangl JL, Epple P (2010) Arabidopsis type I metacaspases control cell death. Science 330:1393
Conrad J, Niu L, Rudd K, Lane BG, Ofengand J (1999) 16S ribosomal RNA pseudouridine synthase RsuA of Escherichia coli: deletion, mutation of the conserved Asp102 residue, and sequence comparison among all other pseudouridine synthases. RNA 5:751
Foster PG, Huang L, Santi DV, Stroud RM (2000) The structural basis for tRNA recognition and pseudouridine formation by pseudouridine synthase I. Nat Struct Mol Biol 7:23
Guex N, Peitsch MC (1997) SWISS-MODEL and the Swiss-PdbViewer: an environment for comparative protein modeling. Electrophoresis 18:2714
Guex N, Peitsch MC, Schwede T (2009) Automated comparative protein structure modeling with SWISS-MODEL and Swiss-PdbViewer: a historical perspective. Electrophoresis 30:162
Gurha P, Gupta R (2008) Archaeal Pus10 proteins can produce both pseudouridine 54 and 55 in tRNA. RNA 14:2521
Hamilton CS, Spedaliere CJ, Ginter JM, Johnston MV, Mueller EG (2005) The roles of the essential Asp-48 and highly conserved His-43 elucidated by the pH dependence of the pseudouridine synthase TruB. Arch Biochem Biophys 433:322
Hamma T, Ferré-D’Amaré AR (2006) Pseudouridine synthases. Chem Biol 13:1125
Helm M (2006) Post-transcriptional nucleotide modification and alternative folding of RNA. Nucleic Acids Res 34:721
Hoang C (2004) Crystal structure of the highly divergent pseudouridine synthase TruD reveals a circular permutation of a conserved fold. RNA 10:1026
Hoang C, Ferré-D’Amaré AR (2001) Cocrystal structure of a tRNA55 pseudouridine synthase: nucleotide flipping by an RNA-modifying enzyme. Cell 107:929
Hoang C, Chen J, Vizthum CA, Kandel JM, Hamilton C, Mueller EG, Ferré-D’Amaré AR (2006) Crystal structure of pseudouridine synthase RluA: indirect sequence readout through protein-induced RNA structure. Mol Cell 24:535
Holm L, Rosenstrom R (2010) Dali server: conservation mapping in 3D. Nucleic Acids Res 38:W545
Holm L, Kääriäinen S, Rosenström P, Schenkel A (2008) Searching protein structure databases with DaliLite v.3. Bioinformatics 24:2780
Hubbard TJP, Aken BL, Ayling S, Ballester B, Beal K, Bragin E, Brent S, Chen Y, Clapham P, Clarke L, Coates G, Fairley S, Fitzgerald S, Fernandez-Banet J, Gordon L, Graf S, Haider S, Hammond M, Holland R, Howe K, Jenkinson A, Johnson N, Kahari A, Keefe D, Keenan S, Kinsella R, Kokocinski F, Kulesha E, Lawson D, Longden I, Megy K, Meidl P, Overduin B, Parker A, Pritchard B, Rios D, Schuster M, Slater G, Smedley D, Spooner W, Spudich G, Trevanion S, Vilella A, Vogel J, White S, Wilder S, Zadissa A, Birney E, Cunningham F, Curwen V, Durbin R, Fernandez-Suarez XM, Herrero J, Kasprzyk A, Proctor G, Smith J, Searle S, Flicek P (2009) Ensembl 2009. Nucleic Acids Res 37:D690
Hur S, Stroud RM (2007) How U38, 39, and 40 of many tRNAs become the targets for pseudouridylation by TruA. Mol Cell 26:189
Jana S, Hsieh AC, Gupta R (2017) Reciprocal amplification of caspase-3 activity by nuclear export of a putative human RNA-modifying protein, PUS10 during TRAIL-induced apoptosis. Cell Death Dis 8:e3093
Jiang W, Middelton K, Yoon H-J, Fouquet C, Carbon J (1993) An essential yeast protein, CBF5p, binds in vitro to centromeres and microtubules. Mol Cell Biol 13:4884
Joardar A, Jana S, Fitzek E, Gurha P, Majumder M, Chatterjee K, Geisler M, Gupta R (2013) Role of forefinger and thumb loops in production of 54 and 55 in tRNAs by archaeal Pus10. RNA 19:1279
Kamalampeta R, Keffer-Wilkes LC, Kothe U (2013) tRNA binding, positioning, and modification by the pseudouridine synthase Pus10. J Mol Biol 425:3863
Kaya Y, Ofengand J (2003) A novel unanticipated type of pseudouridine synthase with homologs in bacteria, archaea, and eukarya. RNA 9:711
Koonin EV (2010) The origin and early evolution of eukaryotes in the light of phylogenomics. Genome Biol 11:209
Koonin EV, Wolf YI (2010) Constraints and plasticity in genome and molecular-phenome evolution. Nat Rev Genet 11:487
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG (2007) Clustal W and clustal X version 2.0. Bioinformatics 23:2947
Letunic I, Bork P (2011) Interactive Tree Of Life v2: online annotation and display of phylogenetic trees made easy. Nucleic Acids Res 39:W475
Letunic I, Bork P (2016) Interactive tree of life (iTOL) v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic Acids Res 44:W242
McCleverty CJ, Hornsby M, Spraggon G, Kreusch A (2007) Crystal structure of human Pus10, A novel pseudouridine synthase. J Mol Biol 373:1243
McGinnis S, Madden TL (2004) BLAST: at the core of a powerful and diverse set of sequence analysis tools. Nucleic Acids Res 32:W20
Mueller EG, Ferré-D’Amaré AR (2009) DNA and RNA modification enzymes. In: Grosjean H (ed) Pseudouridine formation the most common transglycosylation in RNA. Landes Bioscience, Austin
Nordlund ME, Johansson JO, von Pawel-Rammingen U, Byström AS (2000) Identification of the TRM2 gene encoding the tRNA(m5U54)methyltransferase of Saccharomyces cerevisiae. RNA 6:844
Nurse K, Wrzesinski J, Bakin A, Lane BG, Ofengand J (1995) Purification, cloning, and properties of the tRNA Ψ55 synthase from Escherichia coli. RNA 1:102
Ny T, Björk GR (1980) Cloning and restriction mapping of the trmA gene coding for transfer ribonucleic acid (5-methyluridine)-methyltransferase in Escherichia coli K-12. J Bacteriol 142:371
Pan H, Agarwalla S, Moustakas DT, Finer-Moore J, Stroud RM (2003) Structure of tRNA pseudouridine synthase TruB and its RNA complex: RNA recognition through a combination of rigid docking and induced fit. Proc Natl Acad Sci 100:12648
Park S-Y, Shin JN, Woo HN, Piya S, Moon AR, Seo Y-W, Seol D-W, Kim T-H (2009) DOBI is cleaved by caspases during TRAIL-induced apoptotic cell death. BMB Rep 42:511
Proost S, Pattyn P, Gerats T, Van de Peer Y (2011) Journey through the past: 150 million years of plant genome evolution. Plant J 66:58
Rashid R, Liang B, Baker DL, Youssef OA, He Y, Phipps K, Terns RM, Terns MP, Li H (2006) Crystal structure of a Cbf5-Nop10-Gar1 complex and implications in RNA-guided pseudouridylation and dyskeratosis congenita. Mol Cell 21:249
Roe B, Tsen H-Y (1977) Role of ribothymidine in mammalian tRNAPhe. Proc Natl Acad Sci 74:3696
Roovers M, Hale C, Tricot C, Terns MP, Terns RM, Grosjean H, Droogmans L (2006) Formation of the conserved pseudouridine at position 55 in archaeal tRNA. Nucleic Acids Res 34:4293
Sémon M, Wolfe KH (2007) Consequences of genome duplication. Curr Opin Genet Dev 17:505
Sivaraman J, Sauvé V, Larocque R, Stura EA, Schrag JD, Cygler M, Matte A (2002) Structure of the 16S rRNA pseudouridine synthase RsuA bound to uracil and UMP. Nat Struct Biol 9:353
Spedaliere CJ, Hamilton CS, Mueller EG (2000) Functional Importance of Motif I of pseudouridine synthases: mutagenesis of aligned lysine and proline residues. Biochemistry 39:9459
Stamatakis A (2006) RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22:2688
Thomas BC, Pedersen B, Freeling M (2006) Following tetraploidy in an Arabidopsis ancestor, genes were removed preferentially from one homeolog leaving clusters enriched in dose-sensitive genes. Genome Res 16:934
Urbonavičius J, Auxilien S, Walbott H, Trachana K, Golinelli-Pimpaneau B, Brochier-Armanet C, Grosjean H (2008) Acquisition of a bacterial RumA-type tRNA(uracil-54, C5)-methyltransferase by Archaea through an ancient horizontal gene transfer. Mol Microbiol 67:323
Vercammen D (2004) Type II metacaspases Atmc4 and Atmc9 of Arabidopsis thaliana cleave substrates after arginine and lysine. J Biol Chem 279:45329
Vercammen D, Declercq W, Vandenabeele P, Van Breusegem F (2007) Are metacaspases caspases? J Cell Biol 179:375
Whelan S, Goldman N (2001) A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol 18:691
Yu F, Liu X, Alsheikh M, Park S, Rodermel S (2008) Mutations in SUPPRESSOR OF VARIEGATION1, a factor required for normal chloroplast translation, suppress var2-mediated leaf variegation in Arabidopsis. Plant Cell Online 20:1786
Acknowledgements
This work was supported by NIH Grant GM55045 to R.G.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Fitzek, E., Joardar, A., Gupta, R. et al. Evolution of Eukaryal and Archaeal Pseudouridine Synthase Pus10. J Mol Evol 86, 77–89 (2018). https://doi.org/10.1007/s00239-018-9827-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-018-9827-y