Introduction

Ribosomes are the ribonucleoprotein complexes that translate the genetic code into proteins. Each ribosome contains two subunits: a small subunit (SSU or 40S) that decodes mRNA and a large subunit (LSU or 60S) that catalyzes protein synthesis. In eukaryotes, ribosomes are assembled along a hierarchical pathway that involves the synthesis of four ribosomal RNAs (rRNA), the incorporation of roughly 80 ribosomal proteins (RPs), and the function of more than 200 trans-acting biogenesis factors (Woolford and Baserga 2013). Many factors act as chaperones that ensure correct ribosomal structure while others drive necessary rRNA processing and catalyze structural rearrangements. Decades of biochemical and genetic studies in the model eukaryote Saccharomyces cerevisiae have informed our understanding of ribosome assembly. The early work pertaining to SSU assembly was primarily devoted to rRNA modification and processing (Venema et al. 1995; Lafontaine et al. 1995; Allmang et al. 1996; Granneman et al. 2006; Bleichert et al. 2006), while other studies identified additional biogenesis factors and roughly ordered the timing of their association with pre-ribosomal particles (Dragon et al. 2002; Grandi et al. 2002; Wehner et al. 2002; Schäfer et al. 2003; Gallagher et al. 2004; Bernstein et al. 2004; Pérez-Fernández et al. 2007). However, this gave us only an amorphous picture of the assembly pathway. It has only been with the advent of high-resolution cryo-electron microscopy (cryo-EM) that we have been able to visualize discrete intermediates of assembly (Barandun et al. 2017; Kater et al. 2017; Ameismeier et al. 2018; Zhou et al. 2019; Cheng et al. 2019, 2020; Du et al. 2020; Rai et al. 2021).

The structures obtained by cryo-EM provide static snapshots of metastable intermediate particles along the assembly pathway. However, these structures do not capture the dynamic events of ribosome assembly to tell us how the transitions between these intermediates are driven by biogenesis factors. To understand these dynamic transitions, we can turn to genetics to animate structures and reveal the functional roles of biogenesis factors. A genetic interaction occurs when an allele of one gene influences the phenotype caused by an allele of another gene. Such an interaction implies a functional relationship but not necessarily a physical relationship; the interpretation of genetic interactions has been extensively reviewed (Dixon et al. 2009; Costanzo et al. 2019). One class of genetic interaction is extragenic suppression, where the defects due to mutation of one gene are alleviated by the deletion or mutation of another. We have found that extragenic suppression is particularly well-suited to study the release of factors from multimeric complexes. On average, a mutation is more likely to disrupt a molecular interaction than promote one. Consequently, if factor A is needed to disrupt a complex of factors B and C, then a mutation in factor B or C that facilitates their separation will tend to bypass the need for factor A. Such a relationship can be observed in numerous transition events in ribosome biogenesis that often rely on the disassociation of biogenesis factors. For example, the loss of the essential kinase Hrr25 blocks the release of the 40S biogenesis factor Ltv1 and can be bypassed by phosphomimetic Ltv1 mutants that promote its release (Ghalei et al. 2015). We and others have previously made extensive use genetic interactions to delineate the cytoplasmic 60S maturation pathway (Hedges et al. 2005; Kemmler et al. 2009; Lo et al. 2009, 2010). When contextualized with structural information, genetic interactions can be used to develop verifiable mechanistic models that explain how a set of factors promote the progression of an intermediate (Patchett et al. 2017; Zhou et al. 2019; Mitterer et al. 2019). Recently, we used the deletion of BUD23 (bud23∆) to understand its role in 40S biogenesis (Black et al. 2020). In this mini-review, we highlight how screening for extragenic suppressors of bud23∆ revealed a network of biogenesis factors that, when considered in the light of recent structures of 40S precursors, reveal that Bud23 promotes a critical transition event in the assembly of the small ribosomal subunit.

The assembly and disassembly of the small subunit processome

Ribosome biogenesis begins with transcription of a primary transcript containing three of the four precursor rRNAs (pre-rRNA) flanked by spacer regions that are removed during processing. The SSU Processome, henceforth “Processome”, is an assemblage of approximately 70 biogenesis factors and RPs that encompass the 5′ portion of the primary transcript, containing the 5′ external transcribed spacer (5′ ETS), the 18S rRNA, and the internal transcribed spacer 1 (ITS1) (Dragon et al. 2002; Grandi et al. 2002; Pérez-Fernández et al. 2007). The Processome forms co-transcriptionally with the modular incorporation biogenesis factors and RPs (Pérez-Fernández et al. 2007; Chaker-Margot et al. 2015; Zhang et al. 2016). The 5′ ETS and its associated factors along with the U3 snoRNA establish the scaffolding upon which the Processome is built (Beltrame et al. 1994; Venema et al. 1995; Hunziker et al. 2016, 2019). The initial folding of the rRNA is driven by its intrinsic ability to form secondary structure resulting in four domains: the 5′, central, 3′ major, and 3′ minor. The Processome factors chaperone the tertiary structure of the rRNA domains by holding them in a splayed-open conformation that allows their independent maturation but prevents them from adopting their more compact conformations seen in downstream pre-40S intermediates (Kornprobst et al. 2016; Chaker-Margot et al. 2017; Sun et al. 2017; Barandun et al. 2017; Cheng et al. 2017). The Processome is present even under conditions in which ribosome assembly is largely halted (Talkish et al. 2016; Kos-Braun et al. 2017; Chaker-Margot et al. 2017), suggesting that it serves as a metastable intermediate that controls further progression of maturation.

The transition from the Processome to the pre-40S particles requires the disassembly of the Processome which entails pre-rRNA cleavage and removal of the 5′ ETS by the RNA Exosome, the release of most Processome factors, and RNA cleavage within ITS1 to separate the SSU and LSU precursors (Schäfer et al. 2003; Thoms et al. 2015; Lau et al. 2021) Recent cryo-EM studies reveal that the Processome disassembles in a stepwise fashion (Cheng et al. 2020; Du et al. 2020; Lau et al. 2021). The shedding of the Processome elements is accompanied by dramatic structural rearrangements that result in compaction of the 18S rRNA as it approaches its mature structure (Fig. 1). A recently described late intermediate of the disassembly pathway, the Dis-C complex (Cheng et al. 2020), retains a handful of Processome factors, some of which prevent the 3′ major and minor domains from adopting their near-final positions. Among these factors is the U3 snoRNA, which scaffolds the Processome through hybridization to the 5′ ETS and pre-18S rRNAs (Beltrame and Tollervey 1995; Méreau et al. 1997; Sharma and Tollervey 1999; Kudla et al. 2011; Dutca et al. 2011; Marmier-Gourrier et al. 2011). Its interaction with pre-18S rRNA prevents the premature formation of the central pseudoknot (CPK), a universally conserved structural hallmark that organizes all four rRNA domains of 18S rRNA as well as the decoding center. The CPK is formed by the long-range base pairing of the loop of helix 1 at the 5′-end of 18S rRNA with nucleotides A1137-U1144 to form helix 2. Displacement of U3 snoRNA by the DEAH-box helicase Dhr1 and its activator Utp14 is required for the CPK to fold (Sardana et al. 2015; Zhu et al. 2016; Black et al. 2018). Strikingly, Dhr1 and Utp14 are primed to unwind U3 to drive its removal in the Dis-C complex (Cheng et al. 2020), but what promotes this final disassembly step was not clear.

Fig. 1
figure 1

Structural overview of small ribosomal subunit biogenesis in eukaryotes. Small subunit biogenesis is a multi-step process; several SSU assembly intermediates are shown to highlight the structural rearrangements that occur during SSU biogenesis. The SSU Processome (composite of PDBs 5WLC and 5WYJ) is an early precursor which contains assembly factors (AFs, light gray) and ribosomal proteins (RPs, dark gray) that independently chaperone the four domains: the 5′ major, central, 3′ major, and 3′ minor (respectively, colored blue, magenta, yellow, and green). A yet unknown signal triggers progression of the Processome which involves the shedding of the 5′-ETS as well as many AFs and RNA rearrangements (colored arrows) that transform it into a more compact final disassembly complex, termed “Dis-C” (PDB 6ZQG) (Cheng et al. 2020). However, a set of factors retained on Dis-C prevent its final transformation into the earliest pre-40S intermediate (PDB 6G4W). We have proposed that the coordinated efforts of Bud23 and the RNA helicase Dhr1 promote the final transition into the pre-40S, allowing the 3′ domains to adopt their near-mature positions (Black et al. 2020). A series of subsequent, less dramatic maturation events fine-tune the rRNA (colored lines; lower panel) to produce the mature 40S (PDB 4V88). The figure is adapted from (Black et al. 2020). Molecular visualizations were generated in UCSF ChimeraX v0.93 (Goddard et al. 2018)

Bud23 promotes the final disassembly of the Processome

BUD23 is a nonessential gene encoding a Trm112-associated methyltransferase that modifies nitrogen 7 of guanosine 1575 (G1575) of the 18S rRNA (White et al. 2008; Figaro et al. 2012; Sardana and Johnson 2012; Létoquart et al. 2014). Its deletion (bud23∆) reduces SSU levels by 70% and causes a strong growth defect (White et al. 2008); however, the methyltransferase activity of Bud23 is fully dispensable for growth and SSU assembly (White et al. 2008; Lin et al. 2012; Létoquart et al. 2014). Thus, the physical presence of Bud23 confers its primary function, but how exactly it promotes SSU biogenesis was unknown. Bud23 is often considered a pre-40S factor, and consistent with this notion human Bud23 was resolved on the earliest pre-40S intermediate (Ameismeier et al. 2018). However, a decade of studies in yeast suggested that Bud23 first binds to an earlier precursor, that retains some Processome factors, and remains associated with the particle during its transition to a pre-40S (Figaro et al. 2012; Sardana et al. 2013, 2014; Létoquart et al. 2014).

To understand the function of Bud23, we took advantage of the remarkable propensity of bud23∆ mutants to generate spontaneous suppressors of its slow growth phenotype. The high rate of generating suppressors appears to be due to the large number of mutations that can suppress bud23∆ cells rather than an elevated mutation rate (unpublished). Through a combination of screening for spontaneous extragenic suppressors and random mutagenesis of several target factors, we identified a total of 67 unique extragenic suppressor mutations of bud23∆ in genes encoding the Processome factors Dhr1, Utp14, Utp2, Imp4, and the GTPase Bms1 as well as the ribosomal protein Rps28 (Fig. 2a) (Sardana 2013; Sardana et al. 2013, 2014; Zhu et al. 2016; Black et al. 2020). The genetic connection of Bud23 to Imp4 and Dhr1 (Sardana 2013; Sardana et al. 2014) was particularly informative since Imp4 was known to be a U3-associated factor (Lee and Baserga 1999; Gallagher and Baserga 2004), and led us to identify Dhr1 as the helicase that unwinds U3 snoRNA from pre-rRNA (Sardana et al. 2015) and Utp14 as its co-factor (Zhu et al. 2016). The genetic interactions of BUD23 suggested a role for Bud23 in Processome disassembly. Indeed, we recently showed that intermediates of Processome disassembly accumulate in Bud23-depleted cells (Black et al. 2020).

Fig. 2
figure 2

Summary of the physical interaction network underlying bud23∆ suppression. a The extragenic suppressors of bud23∆ form a tight network of physical and functional interactions that includes U3 snoRNA. b Mapping this network to the structure of the SSU Processome reveals physical interactions connecting to U3 snoRNA; Bms1 (green), Imp4 (blue), Rps28 (cyan), Utp2 (orange), and Utp14 (brown). Imp4, Utp2, and Bms1 physically connect the U3 snoRNA (magenta) to the 3′ basal subdomain (dark gray), the future binding site of Bud23. The target base of Bud23 methyltransferase, G1575 (red), is shown for reference. c The majority of the Imp4 mutants (magenta sticks) mapped to its interaction interface with the 3′ basal subdomain of 18S RNA. d The Utp2 mutants (green sticks) mapped to its interface with Imp4 near the 3′ basal subdomain. e F58 of Utp2 fits into a hydrophobic pocket in Imp4 (upper) while V170 and P252 of Imp4 help establish this pocket (lower). Amino acid substitutions of these residues disrupt the interaction between Imp4 and Utp2 (Black et al. 2020). Molecular visualizations were generated in MacPyMOL: PyMOL v1.8.2.1 Enhanced for Mac OS X (Schrödinger LLC) using PDB 5WLC

With the recent publication of several structures of the Processome and disassembly intermediates (Kornprobst et al. 2016; Chaker-Margot et al. 2017; Sun et al. 2017; Barandun et al. 2017; Cheng et al. 2017, 2020), we can postulate how the suppressors of bud23∆ impact the disassembly pathway. These structures reveal that, like Bud23, the factors Bms1, Imp4, Utp2, and Rps28 all contact the 3′ basal subdomain, a compact subdomain of the 3′ major domain (Fig. 2b). Bms1, Imp4, and Utp2 also have long helical extensions that pierce deep into the Processome where they embrace, and appear to stabilize, the U3-18S rRNA heteroduplexes that will be unwound by Dhr1 and Utp14. Furthermore, Bms1, Imp4, and Utp2 all make multiple interactions amongst one another. Thus, the bud23∆ suppressing mutations describe a network of factors connecting the 3′ major rRNA domain to the U3 heteroduplexes within the Processome or Processome disassembly intermediates (Fig. 2a, b). Close inspection of the Imp4 mutants revealed that the majority of the amino acid changes cluster in its interface with the rRNA, suggesting that these mutations weaken the affinity of Imp4 for the rRNA (Fig. 2c). Similarly, many of the Utp2 mutants map to its interface with Imp4 (Fig. 2d). Notably, F58 of Utp2 inserts into a hydrophobic pocket of Imp4 (Fig. 2e, upper) and mutation of F58 of Utp2 or V170 or P252 of Imp4, which help to create this pocket (Fig. 2e, lower), weaken the interaction between these two proteins (Black et al. 2020). Meanwhile, Rps28 binds directly adjacent to the binding site of Bud23. The single suppressor mutation that we identified in RPS28A, G24D, lies in a residue that directly interacts with the rRNA of the 3′ basal subdomain (Black et al. 2020). Because Rps28 is a constituent of the mature 40S, we speculate that this amino acid change affects rRNA structure in a manner that facilitates the release of factors that bind the 3′ basal subdomain rather than the mutation promoting Rps28 release. These results led us to conclude that the Imp4, Utp2, and Rps28 mutants destabilize the scaffold that holds the 3′ major domain in place in the Processome. Because these destabilizing mutations bypass the absence of Bud23, it can be inferred that the role of Bud23 is to promote rearrangement of the 3′ major domain.

Bms1 is an essential GTPase of the Processome whose exact molecular function remains unknown (Gelperin et al. 2001; Wegierski et al. 2001; Karbstein et al. 2005; Karbstein and Doudna 2006). The cryo-EM structures revealed that Bms1 provides a rigid strut between the 3′ major and 5′-domains, thereby constraining the movement of the 3′ major domain. In our genetic screen for bud23∆ suppressors, we did not find mutations in Bms1 at its interface with the 3′ basal subdomain. However, we did identify five mutations in BMS1 that change amino acids located within intramolecular interfaces of Bms1 (Black et al. 2020). Comparison of Bms1 to structures of the related GTPase EF-Tu suggests that these amino acid changes likely mimic a conformational change that Bms1 undergoes during GTP hydrolysis (Black et al. 2020), raising the intriguing possibility that Bud23 influences Bms1 GTPase activity. Consistent with this notion, Bms1 is in the GTP-bound state in the Dis-C complex indicating that its GTPase has not been activated (Cheng et al. 2020). We speculate that Bms1 serves as a molecular switch that senses Bud23 binding and initiates disassembly of the scaffold holding the 3′ major domain in place.

A kinetic proofreading model for central pseudoknot formation

The functional connection of Bud23 to Dhr1 is particularly interesting as Dhr1 is a DEAH box helicase that removes the U3 snoRNA (Sardana et al. 2015; Choudhury et al. 2018). Dhr1 requires Utp14 for activation (Zhu et al. 2016; Choudhury et al. 2018; Boneberg et al. 2019) which depends on an interaction between an unresolved region of Utp14 with the catalytic domains of Dhr1 (Zhu et al. 2016). Amino acid changes within this region of Utp14 suppress bud23∆, diminish the Dhr1-Utp14 interaction, and consequentially reduce Dhr1 unwinding activity (Zhu et al. 2016). In our suppressor screen, we also identified numerous mutations in Dhr1, many of which affect residues on the surface of its catalytic domains, and like the Utp14 mutants, these Dhr1 mutants weaken the Dhr1–Utp14 interaction (Black et al. 2020). Thus, diminishing the Dhr1–Utp14 interaction, and consequently Dhr1 activity, leads to suppression of bud23∆.

The result that reduced helicase activity bypasses a defect in disassembly seems counterintuitive; why would reducing Dhr1 unwinding activity, which is needed for disassembly of the Processome, suppress the disassembly defect of bud23∆? Here, we can glean clues from the structures of the disassembly intermediates (Cheng et al. 2020). During Processome disassembly, RNA rearrangements allow the stem-loop of helix 1 of the central pseudoknot (CPK) to fold, while the loop remains paired with U3. In the mature CPK, nucleotides A1137-U1144 of 18S will replace U3 to form helix 2 of the CPK. Residues A1137-U1144 are unresolved in the Dis-C complex, but their position can be inferred because they originate from the 3′ basal subdomain (Fig. 3a). We surmise that productive unwinding of U3 by Dhr1, leading to the formation of helix 2, requires the correct positioning of residues A1137-U1144. Dhr1 binds to U3 on the 3′-side of the U3-rRNA duplexes (Sardana et al. 2015; Cheng et al. 2020). Dhr1 is expected to translocate 3′ to 5′ (Boneberg et al. 2019), essentially pulling U3 towards itself to unwind it from the rRNA. We propose that the concerted actions of Bud23 and Dhr1 promote helix 2 formation to form the CPK (Fig. 3b). In this model, Bud23 binding to the 3′ basal subdomain triggers Bms1-driven rearrangements that bring residues A1137-U1144 into close proximity of helix 1. Here, these residues are poised to base-pair with the stem-loop of helix 1 to form the CPK. Subsequently, the Utp14-dependent activity of Dhr1 unwinds U3 to allow productive CPK folding.

Fig. 3
figure 3

Model for how Bud23 promotes the final disassembly step of the SSU Processome. a The structures of Bms1, Imp4, Rps28, Utp2, Utp14, and Dhr1 as observed in the final disassembly complex (Dis-C, PDB 6ZQG). Factors are colored as in Fig. 2a. Helix 1 of the 18S rRNA has formed. Nucleotides A1137-U1144 of the 18S rRNA are unresolved but originate from the 3′ basal subdomain at G1146. Molecular visualizations were generated, MacPyMOL: PyMOL v1.8.2.1 Enhanced for Mac OS X (Schrödinger LLC). b The model for Bud23 function with proteins and RNAs colored as in panel A. Bud23 binding to the 3′ basal subdomain promotes Bms1 activation (black lines), allowing Imp4 and Utp2 eviction and the rRNA rearrangements (gray arrows) that bring residues A1137-U1144 into proximity with helix 1. Subsequent activation of Dhr1 by Utp14 (green arrow) promotes productive unwinding of U3, which allows the rRNA strand containing residues A1137-U1144 to base-pair with the stem-loop of Helix 1 to form Helix 2, completing the CPK

Our model proposes that Bud23 drives the correct positioning of A1137-U1144 (Fig. 3b). We suggest that this repositioning is critical to promote productive unwinding of U3 by Dhr1 to fold the CPK (Fig. 4a). Therefore, the absence of Bud23 (bud23∆) should hinder the repositioning of these residues, leading to unproductive unwinding of U3, as the CPK cannot fold (Fig. 4b); however, CPK folding must still occur at some low rate in the absence of Bud23 since bud23∆ is not lethal. With this framework for CPK formation, we propose a kinetic proofreading model to explain how Dhr1 and Utp14 mutants bypass bud23∆ to restore 40S assembly. For the mutants that we have tested, their impact on Dhr1 is a reduction in its unwinding activity (Zhu et al. 2016; Black et al. 2020). We suggest that such mutants allow a greater window of opportunity for A1137-U1144 to move into place before Dhr1 unwinds U3, increasing the likelihood of a productive unwinding event that allows the CPK to fold (Fig. 4c). Similar kinetic proofreading models have been used to understand spliceosome dynamics where DEAD/H-box helicase-dependent RNA rearrangements are needed for transitioning through intermediate states (Semlow and Staley 2012). This framework for CPK folding also accommodates how the Utp2, Imp4, and Bms1 mutants bypass bud23∆. We proposed that Bud23 triggers disassembly of the scaffolding that constrains the 3′ basal subdomain to allow its rearrangement in the maturing particle (Fig. 3b). Weakening the scaffolding that holds the subdomain in place, through mutations in Utp2, Imp4, or Bms1 would reduce the energy barrier for the rearrangements necessary that bring A1137-U1144 into proximity of helix 1 for productive unwinding by Dhr1 (Fig. 4c).

Fig. 4
figure 4

Kinetic proofreading model for how extragenic suppressors bypass bud23∆. a In wild-type cells, Bud23 binding to the Dis-C complex leads to release of Imp4 and Utp2 and Bms1-dependent repositioning of residues A1137-U1144 of the 18S rRNA into close proximity of U3. The subsequent unwinding of U3 by Dhr1 allows A1137-U1144 to base pair with the stem-loop of Helix 1 to form Helix 2, thereby completing the CPK. b In cells lacking Bud23, the repositioning of residues A1137-U1144 is hindered and the unwinding of U3 by Dhr1 leads to unproductive folding of the rRNA and failure in CPK formation. However, a low rate of productive folding must occur in bud23∆ cells because they are viable, though very slow growing. c In bud23∆ cells harboring a suppressor mutation, the rate of productive CPK folding is partially restored. This can be due to either (1) amino acid substitutions in Dhr1 or Utp14 that slow the Dhr1 activity to allow more time for residues A1137-U1144 to come into position before U3 unwinding occurs or (2) amino acid substitutions in Bms1, Imp4, or Utp2 that promote the repositioning of residues A1137-U1144

In the disassembly complex, Dhr1 and Bms1 are positioned on opposite sides of the structure (Fig. 3a). Interestingly, the N-terminus of Dhr1 extends across the complex and interacts with Bms1. Thus, Bms1 and Dhr1 can be mechanistically and physically connected. It is tempting to compare the relationship of Bms1 and Dhr1 in Processome disassembly to that of Snu114 and Brr2 that control spliceosome disassembly where the GTPase-dependent conformational changes in Snu114 promote Brr2 helicase activity (Small et al. 2006). It is tempting to speculate that there is a similar coordination between Dhr1 and Bms1 during Processome disassembly.

Leveraging genetic fulcrums to animate structure

In this mini-review, we have highlighted our recent in-depth screen for extragenic suppressors of bud23∆ that we carried out to understand the role of Bud23 in 40S assembly (Black et al. 2020). We identified an unexpectedly large number of suppressors in multiple factors involved in the function of the Processome making bud23∆ a particularly powerful entry point for genetic analysis. We liken the bud23∆ mutant to a kind of “genetic fulcrum”, a tool that allows us to leverage a genetic defect to understand the workings of the Processome. We believe this work also nicely illustrates how genetics can be used to animate structures of macromolecular complexes that undergo dynamic conformational and compositional changes. Assembly of such complexes requires the formation of myriad macromolecular contacts to produce a functional architecture, while its transition into a subsequent state entails the disruption of those contacts. The particular genetic relationship that we uncover, mutations in factors that destabilize the Processome bypass the function of Bud23, is particularly apt for studying disassembly of the Processome, as well as other macromolecular complexes. Such mutations in critical factors must strike a fine balance between sufficient loss of function to destabilize an interface and complete disruption which would result in lethality. Indeed, the suppressors of bud23∆ showed no obvious growth defects in Bud23-replete cells individually but did negatively impact fitness when combined (Zhu et al. 2016; Black et al. 2020). Although Cryo-EM has allowed the unprecedented visualization of many pre-ribosomal intermediates (Klinge and Woolford 2019; Frazier et al. 2020), genetic approaches, such as what we have reported (Black et al. 2020), allow us to understand the dynamics of these complexes. A similar genetic analysis has recently been used to explore splicesosomal dynamics (Brow 2019). Cryo-EM will continue to provide snapshots of numerous complexes, and the use of molecular genetics should be broadly applicable to animate those snapshots and understand their dynamic natures.