Prespliceosome structure provides insights into spliceosome assembly and regulation

Plaschka, Clemens; Lin, Pei-Chun; Charenton, Clément; Nagai, Kiyoshi

doi:10.1038/s41586-018-0323-8

Prespliceosome structure provides insights into spliceosome assembly and regulation

Letter
Published: 11 July 2018

Volume 559, pages 419–422, (2018)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

From

View current issue Submit your manuscript

Prespliceosome structure provides insights into spliceosome assembly and regulation

Download PDF

Clemens Plaschka¹^na1^nAff2,
Pei-Chun Lin¹^na1,
Clément Charenton¹ &
…
Kiyoshi Nagai¹

10k Accesses
103 Citations
39 Altmetric
3 Mentions
Explore all metrics

Abstract

The spliceosome catalyses the excision of introns from pre-mRNA in two steps, branching and exon ligation, and is assembled from five small nuclear ribonucleoprotein particles (snRNPs; U1, U2, U4, U5, U6) and numerous non-snRNP factors¹. For branching, the intron 5′ splice site and the branch point sequence are selected and brought by the U1 and U2 snRNPs into the prespliceosome¹, which is a focal point for regulation by alternative splicing factors². The U4/U6.U5 tri-snRNP subsequently joins the prespliceosome to form the complete pre-catalytic spliceosome. Recent studies have revealed the structural basis of the branching and exon-ligation reactions³, however, the structural basis of the early events in spliceosome assembly remains poorly understood⁴. Here we report the cryo-electron microscopy structure of the yeast Saccharomyces cerevisiae prespliceosome at near-atomic resolution. The structure reveals an induced stabilization of the 5′ splice site in the U1 snRNP, and provides structural insights into the functions of the human alternative splicing factors LUC7-like (yeast Luc7) and TIA-1 (yeast Nam8), both of which have been linked to human disease^5,6. In the prespliceosome, the U1 snRNP associates with the U2 snRNP through a stable contact with the U2 3′ domain and a transient yeast-specific contact with the U2 SF3b-containing 5′ region, leaving its tri-snRNP-binding interface fully exposed. The results suggest mechanisms for 5′ splice site transfer to the U6 ACAGAGA region within the assembled spliceosome and for its subsequent conversion to the activation-competent B-complex spliceosome^7,8. Taken together, the data provide a working model to investigate the early steps of spliceosome assembly.

Structural insights into how Prp5 proofreads the pre-mRNA branch site

Article Open access 04 August 2021

Architecture of the U6 snRNP reveals specific recognition of 3′-end processed U6 snRNA

Article Open access 01 May 2018

Structure of a pre-catalytic spliceosome

Article 22 May 2017

Main

To gain structural insights into early spliceosome assembly, we prepared the yeast prespliceosome A-complex on the UBC4 pre-mRNA that carries a mutation in the pre-mRNA branch point sequence, which was previously used to stall the A-complex⁹ (UACUAAC to UACAAAC, in which A is the branch point adenosine and A is the mutated nucleotide) (Extended Data Fig. 1a, b). The purified A-complex contained stoichiometric amounts of the U1 and U2 snRNP proteins (Extended Data Fig. 1b), and was used to determine cryo-electron microscopy (cryo-EM) densities of the A-complex at 4.0 Å (U1 snRNP, map A2) and 4.9–10.4 Å (U2 snRNP, maps A1 and A3) resolution, respectively (Extended Data Figs. 1c–e, 2). From these densities we built a near-complete atomic model of the A-complex (Fig. 1, Supplementary Videos 1, 2, Supplementary Data, Extended Data Fig. 1f), comprising 34 proteins, U1 and U2 snRNAs, and 34 nucleotides of pre-mRNA. The final model lacks the mobile cap-binding complex, Prp5 or the U1 subunit Prp40 (Extended Data Fig. 1b, d, e; Extended Data Table 1). The elongated U1 and U2 snRNPs bind the pre-mRNA 5′ splice site (5′SS) and branch-point sequences, respectively, and associate in a parallel manner to form the A-complex (Fig. 2a). The U1 snRNP structure contains all the essential regions of the U1 snRNA and 16 proteins (Fig. 1). The U1 snRNP ‘core’ is highly similar to its human counterpart¹⁰ (Extended Data Figs. 3, 4), comprising the seven-membered Sm ring and orthologues of the human U1 snRNP proteins (Snp1, human U1-70k; Mud1, human U1A; Yhc1, human U1C), and is bound to the peripheral yeast U1 proteins Luc7, Nam8, Prp39, Prp42, Snu56 and Snu71¹¹ (Extended Data Figs. 3, 4). The U2 snRNP has a bipartite structure as observed in B-complex⁸, comprising the SF3b subcomplex (‘5′ region’) and the U2 3′ domain and SF3a subcomplex (‘3′ region’) that are organized around the 5′ and 3′ regions of the U2 snRNA, respectively (Figs. 1, 2a, Extended Data Fig. 5). At the current resolution, the conformation of the U2 5′ region appears unchanged from the B-complex⁸, in which the pre-mRNA branch-point sequence is base-paired with the U2 snRNA and the branch point adenosine is bulged out and accommodated in a pocket formed by the U2 SF3b subunits Hsh155 and Rds3. After we completed the A-complex structure, the cryo-EM structure of the free yeast U1 snRNP was reported¹². This model is in good agreement with the U1 snRNP in our A-complex structure, but there are important differences¹².

**Fig. 1: Prespliceosome A-complex structure.**

**Fig. 2: 5′SS recognition and implications for alternative splicing.**

The first ten nucleotides of U1 snRNA are disordered in the free U1 snRNP¹², but become ordered in our A-complex structure by pairing with the pre-mRNA 5′SS (Fig. 2a, b). Additional density appeared adjacent to the U1–5′SS helix, into which we could build a newly ordered Yhc1 peptide (human U1C) that contacts the 5′SS phosphate backbone (+5 and +6 positions, the ‘Yhc1–5′SS loop’) and a near-complete model of Luc7 (in the previous study Luc7 was attributed to what is now assigned as Snu71¹²) (Extended Data Figs. 3a, c, 4a). Although Luc7 is disordered in the free U1 snRNP, it associates stably with the U1–5′SS helix in the A-complex (Extended Data Fig. 4a), suggesting a mechanism for the selection of weak 5′SS sequences¹³. In our structure Luc7 is anchored by its N-terminal α-helix 1 to the Sm ring subunit SmE, and its C₃H-type zinc finger 1 (ZnF1) domain binds where the 5′ exon emerges from the U1–5′SS helix, in excellent agreement with RNA–protein crosslinks¹³ (Fig. 2b). The adjacent Luc7 C₂H₂-type ZnF2 contacts the U1–5′SS helix minor groove and the U1 snRNA phosphate backbone (nucleotides U5–C8). This interaction mirrors that between the Yhc1 ZnF domain and the 5′SS nucleotides +1 to +4 downstream of the 5′SS junction¹⁰ (Fig. 2b). Thus, Yhc1 and Luc7 make no base-specific interactions with the U1–5′SS helix, and instead cradle the U1–5′SS helix phosphate backbone to stabilize 5′SS binding. Consistent with the structure, weakening of any of these interactions can impair splicing and bypass the requirement for Prp28 helicase activity^13,14,15,16.

The A-complex structure reveals structural insights into the functions of the human alternative splicing factors LUC7-like (LUC7L, yeast Luc7) and TIA-1 (yeast Nam8) (Extended Data Fig. 4c, d). Luc7 and its human homologues LUC7L1–3 are highly conserved, suggesting that the LUC7L N-terminal α-helix also anchors it to the SmE protein and that the invariant ZnF2 helix α8 similarly stabilizes the U1–5′SS helix to promote the inclusion of weak alternative splice sites¹³ (Fig. 2b, Extended Data Figs. 3c, 6a). The yeast U1 snRNP subunit Nam8 and its human homologue TIA-1 contain three RNA recognition motif (RRM) domains and a C-terminal Gln-rich extension (Extended Data Fig. 6b). Human TIA-1 binds to uridine-rich sequences downstream of the 5′SS predominantly through the RRM2 domain^17,18 to allow the use of weak 5′SSs. The Nam8 RRM2 shows high sequence similarity to the TIA-1 RRM2, including the nearly identical RNP1 and RNP2 motifs, indicating that Nam8 also binds uridine-rich sequences through its RRM2 also (Extended Data Fig. 6b). In the A-complex structure the Nam8 RRM3 and its C-terminal region bind in a cavity of the Prp39–Prp42 heterodimer and contact the Yhc1 C-terminal region near the U1–5′SS helix (Fig. 2c). From this location, Nam8 could project its mobile RRM2 domain to bind uridine-rich intron sequences downstream of the 5′SS, consistent with crosslinking experiments¹⁷, and thereby promote meiotic pre-mRNA splicing¹⁹ (Fig. 2a, c).

In the A-complex, the U1 snRNP binds to the U2 snRNP through two interfaces, A and B (Fig. 2a). In interface A, the N-terminal helices α1–2 of the U1 protein Prp39 stably bind the U2 3′ domain subunit Lea1 (human U2A′) (Fig. 2a, Extended Data Fig. 5). The Prp39–Prp42 heterodimer binds Yhc1 to anchor the U2 snRNP 3′ domain to the U1 snRNP. Similar interactions were observed biochemically between the human alternative-splicing factor PRPF39 homodimer and U1C¹² (yeast Yhc1), suggesting that PRPF39 may contact the human U2 3′ domain in a similar manner, although it is not an obligate component of the human A-complex²⁰ (Fig. 2a). Different, non-overlapping Lea1 surfaces are used to interact with the NTC protein Syf1 in the yeast C- and C*/P-complex conformations of the spliceosome²¹ (Extended Data Fig. 5c), suggesting that Lea1 aids in the repositioning of the U2 3′ domain in multiple stages of splicing. Interface B is transient and found only in a subset of cryo-EM images (Extended Data Figs. 2a, 5a, b). It involves weak interactions between the yeast-specific U1 snRNA stem loop 3–3 and the U2 SF3b Rse1 subunit β-propellers B and C (BPB and BPC) and the C terminus of U2 SF3a Prp9. The pre-mRNA 5′SS and branch point branching reactants are positioned approximately 150 Å apart in the A-complex, with 40 nucleotides of the UBC4 intron looped out in between (Fig. 2a, Extended Data Fig. 1e, f). The small interfaces between the U1 and U2 snRNPs orient the snRNPs relative to each other, and this may facilitate 5′SS transfer in the assembled spliceosome and the subsequent dissociation of the U1 snRNP, consistent with the structural and biochemical data^7,8. Although the precise U1–U2 snRNP interfaces may differ in the human A-complex, a key function of U1–U2 (alternative) splicing factors could be to ensure that U1 and the U1–5′SS helix are oriented correctly relative to the U2 snRNP.

Before A-complex formation, the yeast Msl5–Mud2 heterodimer recognizes the branch point sequence through Msl5 and binds the U1 snRNP subunit Prp40 (human PRPF40) in the E complex, looping out the intron between the 5′SS and branch point sequences²² (Extended Data Fig. 4e). Although Prp40 was not identified in the free U1 snRNP¹² or in our A-complex structure, Prp40 crosslinks to Luc7 and Snu71¹² and unassigned cryo-EM density in the A-complex may indicate its peripheral location near Luc7 (Extended Data Figs. 1e, 4a, e). Msl5–Mud2 may then be destabilized by the Sub2 helicase, allowing the Prp5 helicase to remodel U2 snRNA for the stable association of the U2 snRNP with the branch point sequence in the A-complex⁹. Prp5 was shown to physically interact with the U2 SF3b subunit Hsh155 HEAT repeats 1–6 and 9–12²³ and with U2 snRNA at and surrounding the branch point-interacting stem loop⁹. Thus, after Prp5 activity, Prp5 needs to dissociate to fully expose the Hsh155 HEAT repeats 11–13 together with the U2 snRNA 5′ end in the A-complex, to allow for the subsequent U4/U6.U5 tri-snRNP association to assemble the spliceosome^7,8,9 (Fig. 2a).

The A-complex structure also provides new insights into formation of the fully assembled pre-B-complex spliceosome, which requires integration of the tri-snRNP with the A-complex. The subsequent Prp28 helicase-mediated transfer of the 5′SS from U1 to U6 snRNA and destabilization of the U1 snRNP produces the B-complex spliceosome²⁴. We first modelled a fully assembled yeast spliceosome, by superimposing the U2 snRNP SF3b-containing domains of the yeast A-complex (from this study) and the yeast B-complex structure⁸. As in the B-complex structure⁸, the U2 snRNP would associate with tri-snRNP via the U2/U6 helix II and Prp3 (Extended Data Fig. 7). The modelling shows that the U1 snRNP would clash with large parts of the Brr2-containing ‘helicase’ domain (‘U1-B-complex’; Extended Data Figs. 7b, 8b), which may be relieved owing to their known flexibilities⁸ (Extended Data Fig. 5a). However the known binding site for Prp28 at the U5 Prp8 N-terminal domain (Prp8^N) observed in human tri-snRNP²⁵ would be sterically occluded by the pre-bound B-complex proteins^7,8,26. We therefore considered an alternative model for the assembled yeast ‘pre-B-complex’ spliceosome, by combining the available data from yeast and human systems^8,25,27,28 (Fig. 3a, Extended Data Figs. 7a, 8a). First, the isolated human²⁵ and yeast tri-snRNP^26,29 structures differ in their protein composition and conformation, indicating that different complexes accumulate at steady-state. In the human tri-snRNP structure²⁵ the BRR2 helicase is held near SNU114 by the SAD1 protein and PRP28 is bound to the PRP8 N-terminal domain. In the yeast tri-snRNP^26,29 and the yeast and human B-complex structures^7,8 Brr2 is repositioned and loaded onto its U4 snRNA substrate and the B-complex proteins replace Prp28 at the Prp8^N domain, ready for spliceosome activation. Second, in humans, an ATPase-deficient PRP28 helicase stalls spliceosome assembly at the pre-B-complex stage, before disruption of the U1–5′SS interaction²⁸ and this complex comprises the U1 and U2 snRNPs, a loosely associated tri-snRNP, and SAD1²⁸. Third, in yeast, Sad1 is essential for splicing and is very transiently associated with the tri-snRNP²⁷. Given the high conservation of the major spliceosome components in yeast and humans, the yeast spliceosome may likewise assemble with a human-like tri-snRNP that contains Prp28, Sad1 and a repositioned Brr2 helicase^25,28. On the basis of these assumptions, we modelled a yeast pre-B-complex spliceosome that comprises all five snRNPs with a combined molecular mass of approximately 3.1 megadalton and with only minor clashes (Fig. 3a, Extended Data Fig. 7a, b). Notably, this model indicates that the U2 snRNP positions the U1 snRNP to deliver the U1–5′SS helix to the exposed U6 ACAGAGA stem in tri-snRNP, only approximately 20 Å away from where Prp28 is likely to mediate 5′SS transfer, consistent with protein–RNA crosslinks³⁰ (Fig. 3b). This suggests that the subsequent repositioning of the Brr2 helicase onto the U4 snRNA, observed in the B complex structure^7,8,would coincide with the release of the U1 snRNP owing to a steric clash, rendering Brr2 competent for spliceosome activation only after successful 5′SS transfer (Extended Data Figs. 7b, 8a). The model thus indicates a new molecular checkpoint to couple 5′SS transfer with U1 snRNP release and formation of the B-complex (Extended Data Figs. 7b, 8a).

**Fig. 3: Spliceosome assembly and 5′SS transfer.**

In summary, the prespliceosome structure reveals how the U1 and U2 snRNPs recognize the two reactants of the branching reaction and associate together with the tri-snRNP into the fully assembled spliceosome. The results further suggest how the human alternative-splicing factors LUC7L and TIA-1 may influence splice-site selection.

Methods

Prespliceosome preparation and purification

To obtain the prespliceosome A-complex for structural studies, we prepared yeast S. cerevisiae containing a genomic TAPS affinity tag on the U2 snRNP subunit Hsh155, essentially as described³¹. Yeast cells were grown in a 120-l fermenter, and splicing extract was prepared using the liquid-nitrogen method, essentially as described³². Capped UBC4 pre-mRNA containing a point mutation (U > A) two nucleotides upstream of the branch point adenosine and three MS2 stem loops at the 3′ end was produced by in vitro transcription^9,33. The RNA product was labelled with Cy5 at its 3′ end to monitor complex purification³⁴. The pre-mRNA substrate was bound to the MS2–MBP fusion protein and added to an in vitro splicing reaction carried out for 90 min at 23 °C, essentially as described³³. The reaction mixture was then centrifuged through a 40% glycerol cushion in buffer A (20 mM HEPES (pH 7.9), 50 mM KCl, 0.2 mM EDTA, 1 mM dithiothreitol (DTT), 0.04% NP-40). The cushion was diluted with buffer A containing 1% glycerol, and applied to amylose resin (NEB) pre-washed with buffer B (20 mM HEPES (pH 7.9), 75 mM KCl, 5% glycerol, 0.2 mM EDTA, 1 mM DTT, 0.03% NP-40). After 12 h incubation at 4 °C, the resin was washed with buffer B and eluted in buffer B containing 50 mM KCl and 12 mM maltose. Fractions containing A-complex were pooled and applied to Strep-Tactin resin (GE Healthcare), pre-washed with buffer B, and incubated for 4 h at 4 °C. The resin was washed with buffer B containing 2 mM MgCl₂, and eluted with buffer B containing 50 mM KCl, 2.5 mM desthiobiotin, and 2 mM MgCl₂. The A-complex fractions were pooled and crosslinked using 1.1 mM BS3 (Sigma) on ice for 1 h, and subsequently quenched with 50 mM ammonium bicarbonate. The sample was concentrated to ~0.4 mg ml⁻¹ and immediately used for EM sample preparation. Mass spectrometry (data not shown), indicated that homogenous A-complex was purified, containing sub-stoichiometric amounts of Prp5 (Extended Data Fig. 1b). The splicing assay in Extended Data Fig. 1a was carried out as for A-complex purification, but in a volume of 25 μl and in the absence of MS2–MBP fusion protein, and was visualized after 30 min of splicing at 23 °C on a denaturing 14% polyacrylamide TBE gel with a Typhoon scanner (GE Healthcare).

Electron microscopy

For cryo-EM analysis the A-complex sample was applied to R2/2 holey carbon grids (Quantifoil), precoated with a 5–7-nm homemade carbon film. Grids were glow-discharged for 20 s before deposition of 2.5 μl sample (~0.4 mg ml⁻¹), and subsequently blotted for 2–3.5 s and vitrified by plunging into liquid ethane with a Vitrobot Mark III (FEI) operated at 4 °C and 100% humidity. Cryo-EM data were acquired on three separate FEI Titan Krios microscopes (datasets one to three) operated in EFTEM mode at 300 keV, each equipped with a K2 Summit direct detector (Gatan) and a GIF Quantum energy filter (slit width of 20 eV, Gatan). Datasets one and three were recorded using ‘Krios 1’ and ‘Krios 2’ at the MRC-LMB, respectively, and dataset three using ‘Krios 2’ at the Astbury Biostructure Laboratory (University of Leeds). For dataset one, 5,935 movies were acquired using EPU (FEI) with a defocus range of –0.4 μm to –4.4 μm at a nominal magnification of 105,000× (1.13 Å pixel^–1). The camera was operated in ‘counting’ mode with a total exposure time of 13 s fractionated into 20 frames, a dose rate of 4.25 e⁻ pixel⁻¹ s⁻¹, and a total dose of 43 e⁻ Å⁻² per movie. Dataset two was collected in the same manner, except that 727 movies were recorded using SerialEM³⁵, at a nominal magnification of 105,000× (1.14 Å pixel⁻¹), a total exposure time of 8 s fractionated into 20 frames, a dose rate of 4.33 e⁻ pixel⁻¹ s⁻¹ and a total dose of 27 e⁻ Å⁻² per movie. Dataset three was collected with EPU (FEI) similar to dataset one, except that 2,745 movies were collected at a nominal magnification of 130,000× (1.07 Å pixel⁻¹), a total exposure time of 8 s fractionated into 20 frames, a dose rate of 7.94 e⁻ pixel⁻¹ s⁻¹ and a total dose of 56 e⁻ Å⁻² per movie.

Image processing

Movies were aligned using MOTIONCOR2³⁶ with 5 × 5 patches and applying a theoretical dose-weighting model to individual frames. Contrast transfer function (CTF) parameters were estimated using Gctf³⁷. Resolution is reported on the basis of the gold-standard Fourier shell correlation (FSC) (0.143 criterion) as described³⁸ and B-factors were determined and applied automatically in RELION 2.1^39,40. Particles from dataset one were automatically picked using Gautomatch (K. Zhang) and screened manually, and were then extracted in RELION with a 5,602 pixel box size and pre-processed. Particles from datasets two and three were picked and pre-processed in the same way, and were then rescaled to the pixel size of dataset one (1.13 Å pixel⁻¹) in RELION 2.1 by Fourier cropping during particle extraction with a 5,602 pixel box. For rescaling, we first calculated 3D refinements in RELION 2.1 for each dataset (one to three) and performed real space correlation fits in UCSF Chimera to identify scaling factors for datasets two and three relative to dataset one. Because the absolute magnification values differed slightly for the different microscopes, we re-determined the CTF values for datasets two and three using the new pixel sizes with Gctf³⁷, and then re-extracted and rescaled the particles to the 5,602 pixel box. Combining datasets one to three yielded a total dataset of 406,272 particles that were used for subsequent processing.

The first 22,319 particles from dataset one were used to generate an ab initio 3D reference for the A-complex using default parameters and three classes in cryoSPARC⁴¹ (Extended Data Fig. 2a). The complete dataset (one to three) was subjected to a ‘heterogeneous’ (multi-reference) refinement in cryoSPARC using default parameters and four classes: the ab initio A-complex reference and three ‘junk’ references (Extended Data Fig. 2a; round 1). Class one contained 153,570 particles (37.8%, percentage of particles form the full dataset) and was used for a 3D refinement in RELION 2.1 with a soft mask in the shape of the A-complex. This yielded a density (map A1) with an overall resolution of 4.9 Å and a B-factor of −188 Å², comprising U1 snRNP and the U2 snRNP 3′ region (Extended Data Figs. 1e, d, 2, 9). To improve the U1 snRNP density, we prepared a soft mask enveloping the U1 snRNP with the volume eraser in UCSF Chimera⁴² and RELION 2.1^39,40. This enabled the focused refinement of the U1 snRNP (map A2) from the same 153,570 particles to an overall resolution of 4.0 Å and a B-factor of −146 Å² (Extended Data Figs. 1e, d, 2, 9). In the A-complex the U2 snRNP 5′ region is flexible relative to the U1 and the U2 3′ region (Extended Data Fig. 2). To position the U2 snRNP 5′ region in the A-complex, we used a soft mask surrounding the U2 5′ region and carried out 3D classification without image alignment with six classes (Extended Data Fig. 2a; round 2). This revealed a class with defined U2 5′ region from 19,937 particles (4.9%) that could be refined to an overall resolution of 10.4 Å (Extended Data Figs. 2, 9). Local resolution was estimated using ResMap⁴³ (Extended Data Fig. 2d, e).

Structural modelling

We prepared a composite model of the A-complex by combining the A1-3 densities (Extended Data Fig. 1e, f). Model building was carried out in COOT⁴⁴. The U1 snRNP coordinates were refined into the sharpened A2 density in PHENIX⁴⁵ using the phenix.real_space_refine routine, and applying secondary structure, rotamer, nucleic acid and metal ion restraints. Homology models for yeast Yhc1, Snp1, and Mud1 were generated using MODELLER⁴⁶ from the human U1 snRNP crystal structures¹⁰ (PDB ID 4PJO, 4PKD) and were fitted and manually adjusted in the A2 map. The yeast B-complex U5 Sm ring model was used as the initial model for the U1 Sm ring, and was manually adjusted in the A2 density. Initial models for Prp39 and Prp42 were generated by I-TASSER⁴⁷ and were subsequently adjusted and extended manually. The Prp39 N-terminal residues 47–339 were modelled as poly-alanine owing to a lower local resolution of ~5–6 Å (Extended Data Figs. 2d, e, 3c). Snu56, the Yhc1 C terminus, the Snu71 N terminus were modelled de novo; Yhc1 residues 48–82 and 135–142 were modelled as poly-alanine. To build the Luc7 model a C₃H-type ZnF (from PDB ID 1RGO) for ZnF1 and a C₂H₂-type ZnF (from Yhc1) for ZnF2 were used to guide modelling in the A2 density, with a local resolution of 4–5 Å (Extended Data Fig. 3c). The helices connecting Luc7 ZnF1 and ZnF2 (α5–7) were modelled as poly-alanine, and were assigned on the basis of density connectivity. The U1 snRNP protein model is in excellent agreement with biochemical and protein crosslinking results¹². The U1 snRNA model was generated on the basis of similarity to the U1 snRNA in the human U1 snRNP crystal structures (PDB ID 3CW1, 4PJO, 4PKD) and according to the yeast U1 snRNA secondary structure prediction⁴⁸. All base-pairing U1 snRNA regions (helix H; SL1; SL2-1 and -2; SL3-1, -2, -3, -4, -5 and -6), except for the SL3-7 and the tip of SL3-3, were modelled (Extended Data Fig. 3f, g). The human SL1 loop (PDB ID 4PKD) was rigid-body-fitted together with the homology model of the yeast Snp1 (described above), and the human U1 snRNA sequence was replaced with that of yeast. The loops connecting SL2-1 to SL2-2 as well as SL3-3 to SL3-4 and SL3-4 to SL3-5 and the tips of SL2-2, SL3-3, -4 and -5 were not built, owing to a lower local resolution (~4.5 Å). The location of a region of U1 snRNA SL3-7 was modelled as a phosphate backbone only and may correspond to the sequence surrounding residues 378–391 and 428–440. The U1–5′SS was modelled de novo, and the UBC4 pre-mRNA contained 12 nucleotides, ten from the intron (+1 to +10) and two from the 5′ exon (−1 to −2).

The U2 snRNP 3′ region (U2 3′ domain and SF3a subcomplexes) from the yeast B-complex structure (PDB ID 5NRL) were fitted into the A1 density using UCSF Chimera⁴², and the positions of Lea1, Msl1 and U2 snRNA residues 139–1169 were adjusted as a rigid body in COOT⁴⁴. The U2 snRNP 5′ region from the yeast B-complex structure (PDB ID 5NRL) was fitted into the A3 density in UCSF Chimera. This provided an excellent fit, suggesting that the U2 5′ region structure is not changed substantially from that observed in the yeast B-complex⁸. To generate the complete A-complex model, the refined U1 snRNP model and the U2 snRNP 3′ region were fitted into the A3 density in UCSF Chimera, together with the fitted U2 snRNP 5′ region. The final model comprises 34 proteins, U1 and U2 snRNAs, and the pre-mRNA substrate.

To generate the alternative pre-B-complex model shown in Fig. 3, we modified and combined structural models using COOT⁴⁴, on the basis of structural and biochemical data from yeast and human systems^8,25,28. We first superimposed our A-complex structure on the yeast B-complex structure⁸ using the U2 SF3b-containing domain. The free human tri-snRNP structure (PDB ID 3JCR), which may resemble the pre-B conformation^7,25, was used to model the yeast tri-snRNP in the pre-B-complex conformation. We first removed the B-complex proteins from the yeast B-complex structure, because these are absent in the purified human pre-B-complex²⁸. Human pre-B instead contained the PRP28 helicase and SAD1, and we therefore placed crystal structures of the yeast Prp28 helicase⁴⁹ (PDB ID 4W7S) and yeast Sad1⁵⁰ (PDB ID 4MSX) in their human tri-snRNP locations²⁵. We then positioned the U4 Sm ring and Brr2 as in the human tri-snRNP structure, in which the Brr2 PWI domain makes a conserved contact with Sad1⁵¹. We removed a Snu66 peptide bound to Brr2 from the model, because its binding at this site is uncertain in the pre-B-complex conformation. Several minor differences remain between the free human tri-snRNP structure²⁵ and the pre-B-complex model, and these were not modelled. The final pre-B model contained only minor clashes, and one observed clash between the highly flexible Prp28 RecA-2 lobe²⁵ and the flexible U6 snRNA 5′ stem loop^8,26 could be resolved by a minor repositioning of either domain. The final pre-B model comprises 66 proteins, five snRNAs, the pre-mRNA substrate, and has a combined molecular mass of ~3.1 MDa.

Figures were generated with PyMol (https://www.pymol.org) and UCSF Chimera.

Reporting summary

Further information on experimental design is available in the Nature Research Reporting Summary linked to this paper.

Data availability

Three-dimensional cryo-EM density maps A1, A2 and A3 have been deposited in the Electron Microscopy Data Bank under the accession numbers EMD-4363, EMD-4364 and EMD-4365, respectively. The coordinate file of the A-complex has been deposited in the Protein Data Bank under the accession number 6G90.

References

Will, C. L. & Lührmann, R. Spliceosome structure and function. Cold Spring Harb. Perspect. Biol. 3, a003707 (2011).
Article PubMed PubMed Central CAS Google Scholar
Papasaikas, P. & Valcárcel, J. The spliceosome: the ultimate RNA chaperone and sculptor. Trends Biochem. Sci. 41, 33–45 (2016).
Article PubMed CAS Google Scholar
Wilkinson, M. E., Lin, P. C., Plaschka, C. & Nagai, K. Cryo-EM studies of pre-mRNA splicing: from sample preparation to model visualization. Annu. Rev. Biophys. 47, 175–199 (2018).
Article PubMed CAS Google Scholar
Behzadnia, N. et al. Composition and three-dimensional EM structure of double affinity-purified, human prespliceosomal A complexes. EMBO J. 26, 1737–1748 (2007).
Article PubMed PubMed Central CAS Google Scholar
Gao, G. & Dudley, S. C. Jr. RBM25/LUC7L3 function in cardiac sodium channel splicing regulation of human heart failure. Trends Cardiovasc. Med. 23, 5–8 (2013).
Article PubMed CAS Google Scholar
Hackman, P. et al. Welander distal myopathy is caused by a mutation in the RNA-binding protein TIA1. Ann. Neurol. 73, 500–509 (2013).
Article PubMed CAS Google Scholar
Bertram, K. et al. Cryo-EM structure of a pre-catalytic human spliceosome primed for activation. Cell 170, 701–713.e711 (2017).
Article PubMed CAS Google Scholar
Plaschka, C., Lin, P. C. & Nagai, K. Structure of a pre-catalytic spliceosome. Nature 546, 617–621 (2017).
ADS PubMed PubMed Central CAS Google Scholar
Liang, W. W. & Cheng, S. C. A novel mechanism for Prp5 function in prespliceosome formation and proofreading the branch site sequence. Genes Dev. 29, 81–93 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kondo, Y., Oubridge, C., van Roon, A. M. & Nagai, K. Crystal structure of human U1 snRNP, a small nuclear ribonucleoprotein particle, reveals the mechanism of 5′ splice site recognition. eLife 4, e04986 (2015).
Article PubMed Central Google Scholar
Neubauer, G. et al. Identification of the proteins of the yeast U1 small nuclear ribonucleoprotein complex by mass spectrometry. Proc. Natl Acad. Sci. USA 94, 385–390 (1997).
Article ADS PubMed CAS PubMed Central Google Scholar
Li, X. et al. CryoEM structure of Saccharomyces cerevisiae U1 snRNP offers insight into alternative splicing. Nat. Commun. 8, 1035 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Puig, O., Bragado-Nilsson, E., Koski, T. & Séraphin, B. The U1 snRNP-associated factor Luc7p affects 5′ splice site selection in yeast and human. Nucleic Acids Res. 35, 5874–5885 (2007).
Article PubMed PubMed Central CAS Google Scholar
Agarwal, R., Schwer, B. & Shuman, S. Structure-function analysis and genetic interactions of the Luc7 subunit of the Saccharomyces cerevisiae U1 snRNP. RNA 22, 1302–1310 (2016).
Article PubMed PubMed Central CAS Google Scholar
Schwer, B. & Shuman, S. Structure–function analysis of the Yhc1 subunit of yeast U1 snRNP and genetic interactions of Yhc1 with Mud2, Nam8, Mud1, Tgs1, U1 snRNA, SmD3 and Prp28. Nucleic Acids Res. 42, 4697–4711 (2014).
Article PubMed PubMed Central CAS Google Scholar
Chen, J. Y. et al. Specific alterations of U1-C protein or U1 small nuclear RNA can eliminate the requirement of Prp28p, an essential DEAD box splicing factor. Mol. Cell 7, 227–232 (2001).
Article PubMed CAS Google Scholar
Puig, O., Gottschalk, A., Fabrizio, P. & Séraphin, B. Interaction of the U1 snRNP with nonconserved intronic sequences affects 5′ splice site selection. Genes Dev. 13, 569–580 (1999).
Article PubMed PubMed Central CAS Google Scholar
Förch, P., Puig, O., Martínez, C., Séraphin, B. & Valcárcel, J. The splicing regulator TIA-1 interacts with U1-C to promote U1 snRNP recruitment to 5′ splice sites. EMBO J. 21, 6882–6892 (2002).
Article PubMed PubMed Central Google Scholar
Spingola, M. & Ares, M. Jr. A yeast intronic splicing enhancer and Nam8p are required for Mer1p-activated splicing. Mol. Cell 6, 329–338 (2000).
Article PubMed CAS Google Scholar
Agafonov, D. E. et al. ATPγS stalls splicing after B complex formation but prior to spliceosome activation. RNA 22, 1329–1337 (2016).
Article PubMed PubMed Central CAS Google Scholar
Fica, S. M. & Nagai, K. Cryo-electron microscopy snapshots of the spliceosome: structural insights into a dynamic ribonucleoprotein machine. Nat. Struct. Mol. Biol. 24, 791–799 (2017).
Article PubMed CAS PubMed Central Google Scholar
Abovich, N. & Rosbash, M. Cross-intron bridging interactions in the yeast commitment complex are conserved in mammals. Cell 89, 403–412 (1997).
Article PubMed CAS Google Scholar
Tang, Q. et al. SF3B1/Hsh155 HEAT motif mutations affect interaction with the spliceosomal ATPase Prp5, resulting in altered branch site selectivity in pre-mRNA splicing. Genes Dev. 30, 2710–2723 (2016).
Article PubMed PubMed Central CAS Google Scholar
Staley, J. P. & Guthrie, C. An RNA switch at the 5′ splice site requires ATP and the DEAD box protein Prp28p. Mol. Cell 3, 55–64 (1999).
Article PubMed CAS Google Scholar
Agafonov, D. E. et al. Molecular architecture of the human U4/U6.U5 tri-snRNP. Science 351, 1416–1420 (2016).
Article ADS PubMed CAS Google Scholar
Nguyen, T. H. D. et al. Cryo-EM structure of the yeast U4/U6.U5 tri-snRNP at 3.7 Å resolution. Nature 530, 298–302 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Huang, Y. H., Chung, C. S., Kao, D. I., Kao, T. C. & Cheng, S. C. Sad1 counteracts Brr2-mediated dissociation of U4/U6.U5 in tri-snRNP homeostasis. Mol. Cell. Biol. 34, 210–220 (2014).
Article PubMed PubMed Central CAS Google Scholar
Boesler, C. et al. A spliceosome intermediate with loosely associated tri-snRNP accumulates in the absence of Prp28 ATPase activity. Nat. Commun. 7, 11997 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Wan, R. et al. The 3.8 Å structure of the U4/U6.U5 tri-snRNP: Insights into spliceosome assembly and catalysis. Science 351, 466–475 (2016).
Article ADS PubMed CAS Google Scholar
Ismaïli, N., Sha, M., Gustafson, E. H. & Konarska, M. M. The 100-kda U5 snRNP protein (hPrp28p) contacts the 5′ splice site through its ATPase site. RNA 7, 182–193 (2001).
Article PubMed PubMed Central Google Scholar
Nguyen, T. H. D. et al. The architecture of the spliceosomal U4/U6.U5 tri-snRNP. Nature 523, 47–52 (2015).
Article ADS PubMed PubMed Central CAS Google Scholar
Umen, J. G. & Guthrie, C. A novel role for a U5 snRNP protein in 3′ splice site selection. Genes Dev. 9, 855–868 (1995).
Article PubMed CAS Google Scholar
Galej, W. P. et al. Cryo-EM structure of the spliceosome immediately after branching. Nature 537, 197–201 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Wang, H., Ach, R. A. & Curry, B. Direct and sensitive miRNA profiling from low-input total RNA. RNA 13, 151–159 (2007).
Article PubMed PubMed Central CAS Google Scholar
Mastronarde, D. N. Automated electron microscope tomography using robust prediction of specimen movements. J. Struct. Biol. 152, 36–51 (2005).
Article PubMed Google Scholar
Zheng, S. Q. et al. MotionCor2: anisotropic correction of beam-induced motion for improved cryo-electron microscopy. Nat. Methods 14, 331–332 (2017).
Article PubMed PubMed Central CAS Google Scholar
Zhang, K. Gctf: Real-time CTF determination and correction. J. Struct. Biol. 193, 1–12 (2016).
Article ADS PubMed PubMed Central CAS Google Scholar
Scheres, S. H. W. & Chen, S. Prevention of overfitting in cryo-EM structure determination. Nat. Methods 9, 853–854 (2012).
Article PubMed PubMed Central CAS Google Scholar
Kimanius, D., Forsberg, B. O., Scheres, S. H. & Lindahl, E. Accelerated cryo-EM structure determination with parallelisation using GPUs in RELION-2. eLife 5, e18722 (2016).
Article PubMed PubMed Central CAS Google Scholar
Scheres, S. H. RELION: implementation of a Bayesian approach to cryo-EM structure determination. J. Struct. Biol. 180, 519–530 (2012).
Article PubMed PubMed Central CAS Google Scholar
Punjani, A., Rubinstein, J. L., Fleet, D. J. & Brubaker, M. A. cryoSPARC: algorithms for rapid unsupervised cryo-EM structure determination. Nat. Methods 14, 290–296 (2017).
Article PubMed CAS Google Scholar
Pettersen, E. F. et al. UCSF Chimera—a visualization system for exploratory research and analysis. J. Comput. Chem. 25, 1605–1612 (2004).
Article PubMed CAS Google Scholar
Kucukelbir, A., Sigworth, F. J. & Tagare, H. D. Quantifying the local resolution of cryo-EM density maps. Nat. Methods 11, 63–65 (2014).
Article PubMed CAS Google Scholar
Emsley, P. & Cowtan, K. Coot: model-building tools for molecular graphics. Acta Crystallogr. D 60, 2126–2132 (2004).
Article PubMed CAS Google Scholar
Adams, P. D. et al. PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D 66, 213–221 (2010).
Article PubMed CAS PubMed Central Google Scholar
Webb, B. & Sali, A. Comparative protein structure modeling using MODELLER. Curr. Protoc. Bioinformatics 5.6.1–5.6.32 (2014).
Yang, J. et al. The I-TASSER Suite: protein structure and function prediction. Nat. Methods 12, 7–8 (2015).
Article PubMed PubMed Central CAS Google Scholar
Kretzner, L., Krol, A. & Rosbash, M. Saccharomyces cerevisiae U1 small nuclear RNA secondary structure contains both universal and yeast-specific domains. Proc. Natl Acad. Sci. USA 87, 851–855 (1990).
Article ADS PubMed CAS PubMed Central Google Scholar
Jacewicz, A., Schwer, B., Smith, P. & Shuman, S. Crystal structure, mutational analysis and RNA-dependent ATPase activity of the yeast DEAD-box pre-mRNA splicing factor Prp28. Nucleic Acids Res. 42, 12885–12898 (2014).
Article PubMed PubMed Central CAS Google Scholar
Hadjivassiliou, H., Rosenberg, O. S. & Guthrie, C. The crystal structure of S. cerevisiae Sad1, a catalytically inactive deubiquitinase that is broadly required for pre-mRNA splicing. RNA 20, 656–669 (2014).
Article PubMed PubMed Central CAS Google Scholar
Absmeier, E., Santos, K. F. & Wahl, M. C. Functions and regulation of the Brr2 RNA helicase during splicing. Cell Cycle 15, 3362–3377 (2016).
Article PubMed PubMed Central CAS Google Scholar
Ester, C. & Uetz, P. The FF domains of yeast U1 snRNP protein Prp40 mediate interactions with Luc7 and Snu71. BMC Biochem. 9, 29 (2008).
Article PubMed PubMed Central CAS Google Scholar
Kimura, E. et al. Serine-arginine-rich nuclear protein Luc7l regulates myogenesis in mice. Gene 341, 41–47 (2004).
Article PubMed CAS Google Scholar
Yan, C., Wan, R., Bai, R., Huang, G. & Shi, Y. Structure of a yeast activated spliceosome at 3.5 Å resolution. Science 353, 904–911 (2016).
Article ADS PubMed CAS Google Scholar
Wilkinson, M. E. et al. Postcatalytic spliceosome structure reveals mechanism of 3′-splice site selection. Science 358, 1283–1288 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Robert, X. & Gouet, P. Deciphering key features in protein structures with the new ENDscript server. Nucleic Acids Res. 42, W320–W324 (2014).
Article PubMed PubMed Central CAS Google Scholar
Sievers, F. et al. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol. 7, 539 (2011).
Article PubMed PubMed Central Google Scholar
Buchan, D. W., Minneci, F., Nugent, T. C., Bryson, K. & Jones, D. T. Scalable web services for the PSIPRED Protein Analysis Workbench. Nucleic Acids Res. 41, W349–W357 (2013).
Article PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We thank C. Savva, S. Chen, G. Cannone, G. McMullan, J. Grimmett and T. Darling for maintaining electron microscopy and computing facilities; the mass spectrometry facility for protein identification; A. Murzin for discussions; E. Hesketh and R. Thompson for assistance with cryo-EM data collection of dataset three and S.-C. Cheng, A. Newman, L. Strittmatter, M. E. Wilkinson for critical reading of the manuscript. We thank J. Löwe, V. Ramakrishnan, D. Barford and R. Henderson for their continuing support. The project was supported by the Medical Research Council (MC_U105184330) and European Research Council Advanced Grant (693087-SPLICE3D). C.P. was supported by an EMBO Long-Term Fellowship (984-2015).

Author information

Clemens Plaschka
Present address: Research Institute of Molecular Pathology (IMP), Vienna BioCenter (VBC), Vienna, Austria
These authors contributed equally: Clemens Plaschka, Pei-Chun Lin

Authors and Affiliations

MRC Laboratory of Molecular Biology, Cambridge, UK
Clemens Plaschka, Pei-Chun Lin, Clément Charenton & Kiyoshi Nagai

Authors

Clemens Plaschka
View author publications
You can also search for this author in PubMed Google Scholar
Pei-Chun Lin
View author publications
You can also search for this author in PubMed Google Scholar
Clément Charenton
View author publications
You can also search for this author in PubMed Google Scholar
Kiyoshi Nagai
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

C.P. and P.-C.L. established complex preparation, performed cryo-EM data analysis, model building, and refinement. C.C. assisted with cryo-EM analysis. C.P., P.-C.L. and K.N. analysed the structure and wrote the manuscript. C.P. and P.-C.L. designed the project and K.N. supervised the project.

Corresponding authors

Correspondence to Clemens Plaschka, Pei-Chun Lin or Kiyoshi Nagai.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data figures and tables

Extended Data Fig. 1 Biochemical characterization and cryo-EM of the prespliceosome A-complex.

a, Mutation of the UBC4 pre-mRNA branch point sequence (UACUAAC to UACAAAC, in which A is the branch-point adenosine and A is the mutated nucleotide) stalls splicing before the first step, as described⁹. Splicing reactions were carried out for 30 min at 23 °C in yeast extract using wild-type (lane one) or mutant (U > A, lane two) pre-mRNA. This experiment was performed three times. The asterisk indicates a degradation product. For gel source data see Supplementary Fig. 1a. b, Protein analysis of purified A-complex (SDS–PAGE stained with Coomassie blue). The U2-associated Prp5 protein is sub-stoichiometric and not observed in the A-complex structure. The purification and analysis of protein compositions were performed at least five times with similar results. For gel source data see Supplementary Fig. 1b. c, Cryo-EM micrograph of the A-complex. Scale bar, 100 nm. d, 2D class averages of the A-complex were determined in RELION 2.1^39,40, and reveal a bipartite architecture, comprising the U1 snRNP and the U2 snRNP 3′ and 5′ regions, respectively. e, Composite cryo-EM density of the A-complex shown in two orthogonal views (compare to Fig. 1). The respective densities used for modelling the U1 snRNP (A2, grey), the U2 3′ region (A1, cyan), and the U2 5′ region (A3, green) are coloured and superimposed on a transparent outline of the full A3 map (Methods). The overall resolution of each map as well as the percentage from the cleaned dataset of 153,556 particles are shown in parentheses. Non-modelled regions are indicated and putatively assigned. f. Composite cryo-EM density with the final A-complex model superimposed in a cartoon representation. The path of 40 nucleotides of the disordered UBC4 pre-mRNA intron are indicated. A-complex components are coloured as in Fig. 1. Views as in e.

Extended Data Fig. 2 Cryo-EM image classification and refinement.

a, Image processing workflow for analysis of the A-complex cryo-EM dataset (see ‘Image processing’ in Methods). To visualize differences between the reconstructions, the U1 snRNP (grey), U2 3′ (cyan) and U2 5′ regions (green) are coloured. For each round of three-dimensional classification, the percentage of the data and the type of soft-edged mask are indicated. The type of mask and overall resolution are indicated for each 3D refinement (blue box). b, Orientation distribution plots for all particles that contribute to the respective A1, A2, and A3 cryo-EM reconstructions. c, Gold-standard Fourier shell correlation (FSC = 0.143) of the respective A1, A2 and A3 cryo-EM reconstructions. d, Two views of the composite A-complex cryo-EM density (maps A1, A2 and A3) coloured by local resolution as determined by ResMap⁴³. e, As panel d, but for a central slice through the composite A-complex cryo-EM map.

Extended Data Fig. 3 Details of the U1 snRNP.

a, U1 snRNP structure with subunits coloured as in Fig. 1, except for Nam8 (orange), Snu56 (light blue), Snu71 (blue), Luc7 (dark purple), Mud1 (red) and the U1 snRNA (various). The pre-mRNA nucleotides are labelled relative to the first nucleotide (+1) of the intron. The Nam8 RRM1 and RRM2 domains are flexible and project downstream of the 5′SS. The protein attributed to Luc7 in the free U1 snRNP structure¹² was re-assigned to Snu71. C-term, C terminus; N-term, N terminus; SL, stem loop. In the structure we do not observe any evidence that the C-terminal tails of SmB, SmD1, and SmD3 interact with the 5′SS, consistent with their absence in the human 5′SS–minimal U1 snRNP crystal structure¹⁰. b, Representative regions of the sharpened U1 snRNP density determined at 4 Å resolution (map A2) are superimposed on the refined coordinate model. The density reveals side-chain details, and here segments from the Prp42 N terminus (TPR repeat 1), the Sm ring subunit SmB, and the Snu56 α-helical domain are shown. c, The A2 cryo-EM density is shown superimposed on the coordinate models of a selection of U1 snRNP proteins: Luc7, Snu71, Yhc1 and Prp39. In the structure most of Snu71 is disordered, except for a small N-terminal domain (residues 2–43) that binds between the Prp42 N terminus and the Snu56 KH-like fold, consistent with protein crosslinking¹². Functional regions and disordered domains are indicated. d, The U1 snRNA–pre-mRNA 5′ splice site (U1–5′SS) model is superimposed on its cryo-EM density (map A2). A secondary structure diagram of the U1–5′SS interaction is shown underneath the model. The register of the U1–5′SS is shifted by one nucleotide with respect to U1C (Yhc1) compared to the minimal human 5′SS–U1 snRNP crystal structure, owing to an additional nucleotide in the yeast U1 snRNA¹⁰ (U11). Lines indicate Watson–Crick base pairs and dots indicate pseudouridine (ψ)-containing base pairs. e, The Prp39–Prp42 heterodimer is coloured to indicate each of their respective TPR repeats. f, Cryo-EM density of U1 snRNA from maps A2 (dark grey) and A3 (light grey) without (top) and with the superimposed coordinate model of yeast U1 snRNA (bottom). The model is labelled and coloured according to functional regions of U1 snRNA (5′ end, pink; H helix, cyan; SL1, dark blue; SL2-1, green; SL3-1, light blue; SL2-2 and SL3-2 to -7, grey; 3′end and Sm site, yellow). g, Secondary-structure diagram of U1 snRNA. Bold letters indicate residues included in the model, lines indicate Watson–Crick base pairs, and dots G–U wobble and pseudouridine-containing base pairs. Compare to e. The conserved U1 snRNA ‘core’ is outlined with a grey box. The region of the putative phosphate backbone model of part of the U1 SL3-7 region is indicated with a grey box.

Extended Data Fig. 4 Comparisons of yeast and human U1 snRNPs and implications for alternative splicing.

a, Formation of the U1–5′SS helix induces stable binding of Luc7. In the absence of a pre-mRNA 5′SS in the free U1 snRNP density (left, EMD-8622), Luc7 and the U1 5′ end are disordered. Upon 5′SS recognition at the U1 5′ end (centre, map A2), Luc7 becomes ordered and stabilizes the U1–5′SS interaction, suggesting a mechanism for the selection of weak 5′SS sequences. The free U1 snRNP and the 5′SS-bound (map A2) cryo-EM densities are superimposed on the right. Although the long α-helical density next to Luc7 cannot be assigned with confidence, protein–protein crosslinking data¹² and protein secondary structure prediction are consistent with the presence of either Prp40 or Snu71. On the basis of additional biochemical data on the interaction between the α-helical Prp40 FF1 domain and Luc7 ZnF2⁵², we would speculate that the Prp40 FF1 domain is the most likely candidate for this density. b, Comparison of the yeast U1 snRNP ‘core’ with the human U1 snRNP crystal structure (PDB ID 3CW1). Protein and RNA (top) and RNA only (bottom) are shown side by side (left and centre) and superimposed by a global alignment in PyMOL (right). Coloured as in Extended Data Fig. 3a. c, The yeast U1 snRNP model suggests regulatory mechanisms for human alternative splicing factors. The human homologues of the peripheral yeast U1 proteins may function through stabilization of the U1–5′SS interaction (region 1), of the U1–U2 3′ region interface (region 2), or the U1–U2 5′ interface (region 3). The yeast U1 snRNP ‘core’ is shown superimposed on a surface representation of the U1 snRNP model (top), compared with the similarly coloured human U1 snRNP (below). Interaction sites with the U2 snRNP are labelled (top). d, The location of yeast U1 snRNP components with homology to human splicing factors are indicated in the U1 snRNP structure. The Prp39–Prp42 heterodimer (human PRPF39 homodimer), Nam8¹⁸ (human TIA-1 and TIA-R), Luc7⁵³ (human LUC7L1–3), and the Yhc1 C terminus (human U1C) have clear counterparts in the human system. The yeast-specific U1 snRNA insertions may be replaced in the human system by alternative splicing factors that modulate interactions with the U2 5′ region. e, Model of the yeast E complex on the basis of the U1 snRNP structure and biochemical data²². Luc7, Snu71 and Prp40 form a heterotrimer in vitro⁵², and their interacting regions may be located near unassigned density (compare to Extended Data Fig. 1e) at the tip of an unassigned 40-residue α-helix next to Luc7 ZnF2. This helix is likely to belong to the U1 subunit Snu71 or Prp40, consistent with protein crosslinking¹² and protein secondary-structure prediction. Prp40 could then bind the yeast branch point-binding protein (BBP, human SF1), which in turn interacts with Mud2 (human U2AF65) to tether the pre-mRNA branch-point sequence in the E complex²².

Extended Data Fig. 5 Conformational flexibility of the U2 snRNP.

a, Two defined positions of the U1 snRNP-U2 3′ region could be identified relative to the U2 5′ region. A-complex models were fitted into class two and four from round two of the 3D image classification (compare Extended Data Fig. 2a). The classes are aligned via their U2 5′ region, illustrating their relative flexibility. b, Cartoon schematic of observed positions of the U2 3′ region relative to the U2 5′ region in the A-complex (left), B-complex⁸ (centre), and activated B-complex (B^act) (right, modelled from previously published work⁵⁴). Although in the B-complex the U2 3′ region is free, in the A- and B^act-complexes the position of the U2 3′ region is influenced by interactions with Prp39 as well as Syf1 and Clf1, respectively. c, The U2 snRNP subunit Lea1 (human U2A′) aids to position the U2 snRNP 3′ domain in different spliceosome states. In our A-complex structure, the Prp39 TPR repeat T1 contacts the helical C terminus of Lea1. In the yeast C-complex structure, the non-modelled density for the Syf1 N terminus binds a neighbouring but non-overlapping surface of Lea1 (PDB ID 5LJ5). In the C*/P-complex⁵⁵ (PDB ID 6EXN), the Syf1 N terminus binds yet another Lea1 surface and the U2 3′ domain is repositioned relative to its C-complex location. Together, this suggests that the Lea1 provides multiple interfaces that can be used to position the U2 3′ domain in different spliceosomal complexes. d, Fit of the U2 3′ region coordinate model to the A1 cryo-EM density. The dashed black lineseparates the U2 3′ domain (Sm ring, Msl1 and Lea1 subunits and U2 snRNA, left) and the SF3a subcomplex (Prp9, Prp11 and Prp21, right). Two orthogonal views are shown (Supplementary Video 2). e, Fit of the U2 5′ region coordinate model to the A3 cryo-EM density. A density consistent with the U2 snRNA stem IIa/b and the branch helix is observed. Two density thresholds are shown side by side (left, 0.0163; right, 0.0121), and orthogonal views are shown underneath (Supplementary Video 2).

Extended Data Fig. 6 Luc7 and Nam8 sequence alignments.

a, The Luc7 (human LUC7-like) amino-acid sequence alignment comparing S. cerevisiae, Kluyveromyces lactis, Schizosaccharomyces pombe, Danio rerio, Xenopus tropicalis, Mus musculus, Bos taurus and Homo sapiens was generated with Clustal Omega and visualized with ESPript 3^56,57. For the human sequence, LUC7L1 was used. Secondary structure elements are indicated above the sequence and derive from the A-complex structure (purple) or PSIPRED⁵⁸ secondary structure prediction (grey). Modelled regions (dashed line) and the Zn-coordinating residues of ZnF1 and ZnF2 (asterisks) are indicated. Invariant or conserved residues are highlighted with a red box or red letter font, respectively. b. As in panel a but for Nam8 (human TIA-1) comparing S. cerevisiae, K. lactis, S. pombe, Drosophila melanogaster, D. rerio, X. tropicalis, M. musculus, B. taurus, and H. sapiens amino acid sequences.

Extended Data Fig. 7 Details of the pre-B-complex model.

a, Multiple views of the pre-B-complex model, generated by combining functional and structural data from yeast and human systems^8,25. The mobility of the U1 snRNP relative to the U2 snRNP in the A-complex (this study) as well as of the U2 snRNP relative to tri-snRNP in the B-complex structure⁸ are indicated (left). The pre-B model contained only minor clashes, and a clash between the highly flexible Prp28 C-terminal RecA-2 lobe (from the human tri-snRNP²⁵) and the highly flexible U6 snRNA 5′ stem loop (from the yeast B-complex⁸) may be resolved by small movements of either domain. b, Structural comparisons of the yeast pre-B model (from this study) and the yeast B-complex structure (PDB ID 5NRL⁸) suggest the existence of a molecular checkpoint to couple 5′SS transfer to U1 snRNP release and formation of the activation-competent B-complex. In the pre-B model (left) Sad1 tethers Brr2 through its interaction with the conserved Brr2 PWI domain⁵¹, and the U1 snRNP and its U1–5′SS helix are positioned near the U6 ACAGAGA region and the helicase Prp28. Subsequent to Prp28-mediated 5′SS transfer, Brr2 is repositioned onto its U4 snRNA substrate, guided by the B-complex-specific proteins (right). In this conformation the Brr2 helicase and its associated factors would clash with the U1 snRNP, consistent with U1 snRNP destabilization and release yeast and human B-complexes^7,8. Brr2 is now ready to initiate spliceosome activation and formation of the active site in the B^act-complex. Regions that are changed between pre-B- and B-complex models (black outline) and the clash between the Brr2-containing ‘helicase’ domain and the U1 snRNP in B-complex (red X) are indicated. The lower right panel would conform to the alternative ‘U1-B-complex’ model.

Extended Data Fig. 8 Model for early splicing events.

a, Cartoon schematic of proposed early splicing events, detailing (i) assembly of the pre-B-complex spliceosome from the A-complex and the U4/U6.U5 tri-snRNP and (ii) the subsequent conversion to the pre-catalytic B-complex spliceosome. In the pre-B model the mobile U1 snRNP is next to Prp28, which is bound at the Prp8^N domain. To initiate 5′SS transfer, Prp28 could clamp the pre-mRNA at, or next to, the U1–5′SS helix to destabilize it and to hand over the 5′SS to the U6 ACAGAGA region of tri-snRNP, consistent with protein–RNA crosslinks³⁰. Transfer of the 5′SS may induce the binding of the B-complex proteins to replace Prp28 at the Prp8^N domain and induce the large movement of Brr2 to its B-complex location on U4 snRNA. The U1 snRNP, now loosely tethered to U2, may dissociate from the B-complex owing to the steric clash with the Brr2-containing ‘helicase’ domain⁸ (Extended Data Fig. 7b). Consistent with this, the human pre-B-complex converts to a B-complex-like state in the presence of a 5′SS oligonucleotide, which coincides with U1 snRNP release²⁸. This model can explain how Brr2 is kept inactive to prevent premature U4/U6 duplex unwinding²⁶. The model thereby implies the existence of a molecular checkpoint, coupling 5′SS transfer from U1 to U6 snRNA with Brr2 helicase repositioning and U1 snRNP release to generate the activation-competent B-complex spliceosome. b, Cartoon schematic of an alternative model for spliceosome assembly and 5′SS transfer that relies only on the yeast A-complex (from this work), tri-snRNP^26,29 and B-complex structures⁸. In this model the tri-snRNP that associates with the A-complex already contains the Brr2 helicase bound to the U4 snRNA substrate and the yeast B-complex proteins at the Prp8 N-terminal domain. The tri-snRNP then binds the A-complex (transition I, ‘Assembly’), requiring a substantial readjustment to avoid a steric clash of the Brr2-containing ‘helicase’ domain and the U1 snRNP (‘U1-B-complex’). The Prp28 helicase is then recruited to the U1 snRNP directly as the Prp28-binding site on the Prp8 N-terminal domain in human tri-snRNP is occupied by B-complex proteins²⁵. Prp28 then disrupts the U1–5′SS helix, leading to 5′SS transfer (transition II, ‘Transfer’). Similar to the ‘pre-B-complex’ assembly model in a, the U1 snRNP, now freed from the 5′SS, may then be released owing to a steric clash with the Brr2-containing ‘helicase’ domain. This model does not require Sad1. Compare to a.

Extended Data Fig. 9 Data collection, refinement statistics and validation.

a, Cryo-EM data collection and refinement statistics of the A-complex structure. Maps A1 and A3 were used to position the U2 snRNP 3′ and 5′ regions, respectively. b. FSC between the A2 cryo-EM density and the refined A-complex U1 snRNP coordinate model.

Extended Data Table 1 Summary of the components modelled into the A-complex cryo-EM densities

Full size table

Supplementary information

Supplementary Figure 1

Uncropped gels shown in Extended Data Figure 1a (a) and 1b (b).

Reporting Summary

Supplementary Data

PyMol session of the A complex structure (PDB 6G90). Subunits are coloured as in Figure 1.

Video 1: 360° rotation of the A complex coordinate model (PDB 6G90).

Coloured as in Figure 1.

Video 2: 360° rotation of the composite A complex cryo-EM density.

A1, grey, EMD-4363; A2, cyan, EMD-4364; A3, green, EMD-4365. Coloured as in Extended Data Figure 1e.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Plaschka, C., Lin, PC., Charenton, C. et al. Prespliceosome structure provides insights into spliceosome assembly and regulation. Nature 559, 419–422 (2018). https://doi.org/10.1038/s41586-018-0323-8

Download citation

Received: 11 January 2018
Accepted: 16 May 2018
Published: 11 July 2018
Issue Date: 19 July 2018
DOI: https://doi.org/10.1038/s41586-018-0323-8
Springer Nature Limited

This article is cited by

Structural insights into the cross-exon to cross-intron spliceosome switch
- Zhenwei Zhang
- Vinay Kumar
- Reinhard Lührmann
Nature (2024)
Structural basis of human U5 snRNP late biogenesis and recycling
- Daria Riabov Bassat
- Supapat Visanpattanasin
- Clemens Plaschka
Nature Structural & Molecular Biology (2024)
Intramolecular autoinhibition regulates the selectivity of PRPF40A tandem WW domains for proline-rich motifs
- Santiago Martínez-Lumbreras
- Lena K. Träger
- Michael Sattler
Nature Communications (2024)
Variation of C-terminal domain governs RNA polymerase II genomic locations and alternative splicing in eukaryotic transcription
- Qian Zhang
- Wantae Kim
- Y. Jessie Zhang
Nature Communications (2024)
Co-transcriptional gene regulation in eukaryotes and prokaryotes
- Morgan Shine
- Jackson Gordon
- Karla M. Neugebauer
Nature Reviews Molecular Cell Biology (2024)

Prespliceosome structure provides insights into spliceosome assembly and regulation

Abstract

Similar content being viewed by others

Main

Methods

Prespliceosome preparation and purification

Electron microscopy

Image processing

Structural modelling

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Extended data figures and tables

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation