Main

Termination is the last stage of gene transcription, during which RNA polymerases stop RNA extension, release RNA transcript and dissociate from the DNA template. In eukaryotes, Pol II uses distinct pathways to terminate transcription of protein-coding and non-coding genes6,7,8,9,10. The pathway for transcription termination of mRNAs is conserved in yeast, mammals and plants, requiring prior cleavage at the polyadenylation site (PAS) of the pre-mRNAs by the endonuclease subunit of the cleavage and polyadenylation complex11,12,13 to release mRNA and subsequent cleavage of the Pol II-associated RNA by a 5′−3′ exoribonuclease1,2,3,4,5 (Rat1 in yeast, XRN2 in humans and XRN3 in plants). The exoribonuclease is also involved in terminating transcription of premature mRNA at promoter-proximal PASs or decapped transcripts near transcription start sites14,15,16,17.

Two prevailing models were proposed for the exoribonuclease-mediated termination of mRNA transcription by Pol II: the allosteric model and the torpedo model7,8,9. In the allosteric model, an exchange of elongation factors and termination factors causes conformational change of Pol II, leading to the dissociation of RNA and DNA from Pol II18,19,20,21. In the torpedo model, the exoribonuclease degrades the RNA in a 5′-to-3′ direction, catches up with the elongating Pol II, releases the RNA and dislodges Pol II from DNA1,2,3,4. Recently, it has become apparent that both mechanisms play an important role in the termination of mRNA transcription by exoribonucleases. This insight has led to the development of unified models, such as the unified allosteric–torpedo model and the sitting duck torpedo model22,23,24. Owing to the lack of structural information on Pol II transcription termination, how the 5′−3′ exoribonuclease Rat1 interacts with Pol II and the elongation factors to terminate mRNA transcription remains elusive.

Here we determined two cryogenic electron microscopy (cryo-EM) structures of S.cerevisiae Pol II pre-termination transcription complexes (Pol II Rat1–PTTC1 and Pol II Rat1–PTTC2) bound to the exoribonuclease Rat1–Rai1 heterodimer and the elongation factor Spt5 at 2.7 and 2.8 Å (Extended Data Table 1), respectively. Our structures provide insights for understanding the exoribonuclease-mediated termination of mRNA transcription by Pol II.

The overall structure of Pol II Rat1–PTTC1

To understand the structural basis for Rat1-mediated Pol II termination, we reconstituted a pre-termination complex comprising S. cerevisiae Pol II, the termination exoribonuclease—Rat1–Rai1 heterodimer, the elongation factors Spt4 and Spt5, and a nucleic acid scaffold that contains a 23-nucleotide (nt) RNA harbouring phosphorus-sulfur-bond modifications at the 5′ termini of the nucleotides (Fig. 1a,b, Methods and Extended Data Fig. 1). The cryo-EM structure of Pol II Rat1–PTTC1 was determined through a single-particle cryo-EM method at a nominal resolution of 2.7 Å (Extended Data Fig. 2).

Fig. 1: The cryo-EM structure of Pol II Rat1–PTTC1.
figure 1

a, Sequences of the nucleic acid scaffold used in Pol II Rat1–PTTC1. Phosphorus-sulfur-bond modification of the RNA nucleotides is shown in green, and the mono-phosphorylation modification of the RNA 5′ terminus is shown in yellow. Nucleotides in grey are disordered in the structure. NT, non-template DNA; T, template DNA. b, Domain architecture of Rat1, Rai1 and Spt5. CR, conserved region; CTR, C-terminal region. The colour code presented here is used throughout all figures. All modelled regions are shown with colour. Rat1, magenta; Rai1, light blue; Spt5, dark red. c, The front and side views of the cryo-EM map (map 1a) of Pol II Rat1–PTTC1. Rpb1, green; Rpb2, cyan; Rpb4, yellow; Rpb7, orange; Pol II other subunits, light grey. d, The front and side views of the Pol II Rat1–PTTC1 structure. e, The local refined map (map 1b) and model of Rat1, Rai1, the Pol II subunits Rpb4 and Rpb7, and structure elements around the RNA exit channel of Pol II.

The cryo-EM map (map 1a) calculated from Pol II Rat1–PTTC1 shows unambiguous signals near the RNA exit channel of Pol II that could be assigned to the Rat1–Rai1 heterodimer and the KOW5 domain of Spt5 (Fig. 1c,d and Extended Data Fig. 2). Local refinement focusing around the Rat1–Rai1 heterodimer resulted in a local map with a much-improved resolution of the Rat1–Rai1 heterodimer and the Pol II stalk domain, and continuous RNA signals from the Rat1 catalytic site to the RNA exit channel of Pol II (map 1b at 3.5 Å of Pol II Rat1–PTTC1; Fig. 1e and Extended Data Fig. 2e). We also obtained a cryo-EM map (map 1c) showing clear signals for all domains of Spt4 and Spt5 but not the Rat1–Rai1 heterodimer (Extended Data Fig. 2a,j), suggesting that Spt4 and Spt5 preclude interaction of the Rat1–Rai1 heterodimer with Pol II.

In the structure of Pol II Rat1–PTTC1, the Rat1–Rail heterodimer docks at the Pol II stalk domain and shields the RNA exit channel of Pol II; Rat1 and Pol II interact with the 5′ terminus and 3′ terminus of the nascent RNA, respectively; and only the KOW5 domain of Spt5 is associated with Pol II (Fig. 1c,d).

Rat1 displaces Spt5 to bind Pol II

Rai1 is a fungus-specific protein that stimulates Rat1 activity25. Rai1 is located at the opposite face of the active site of Rat1 and interacts with Rat1 through a 1,550 Å2 otherwise exposed surface. Two concave surfaces of Rai1 embrace the carboxy-terminal structure elements of Rat1 (Extended Data Fig. 4). The smaller concave surface of Rai1 mainly interacts with three loops of Rat1 (residues 217–221, 313–318 and 853–865, respectively). This interaction has been observed in the crystal structure of the Schizosaccharomyces pombe Rat1–Rai1 complex26 (Extended Data Fig. 4a–c). The larger concave surface of Rai1 interacts with another loop and a helix at the C terminus of Rat1 (residues 913–925; Extended Data Fig. 4a,d). This interaction has not been observed in the crystal structure of the S. pombe Rat1–Rai1 complex26. In summary, our structure shows that Rai1 forms a tight heterodimer with Rat1 through interaction with its C-terminal region.

Our structure of Pol II Rat1–PTTC1 reveals the interactions among the termination factor Rat1–Rai1 heterodimer, the elongation factor Spt5, and Pol II (Fig. 2). Both the Rat1–Rai1 heterodimer and the Spt5 KOW5 domain localize around the RNA exit channel of Pol II (Fig. 2a). The Rat1–Rai1 heterodimer binds on top of the RNA exit channel by making interactions with the stalk, clamp, dock and wall domains of Pol II (Fig. 2a,b). The Spt5 KOW5 domain fills in a cleft between the dock and wall domains of Pol II (Fig. 2a). The termination and elongation factors of Pol II together shield the RNA exit channel and direct the route of nascent RNA after it exits from Pol II (Fig. 2a,b).

Fig. 2: Protein interactions in Pol II Rat1–PTTC1.
figure 2

a, The RNA exit channel of Pol II. The dashed line represents the location of the Rat1–Rai1 complex. dsDNA, double-stranded DNA. b, The Rat1–Rai1 complex anchors on Pol II and covers the RNA exit channel of Pol II. c, Overview of the interface of Pol II and Rat1 in three view orientations. d, A detailed view of the interactions between Pol II and Rat1.

Rat1 of the Rat1–Rai1 heterodimer is mainly responsible for interactions with Pol II. Five arms of Rat1 make extensive contacts with the Rpb7 subunit of Pol II stalk, the dock and clamp domains of Rpb1, and the wall domain of Rpb2 (Fig. 2c,d and Extended Data Fig. 5), among which arm 1 and arm 2 of Rat1 form the largest interface with Pol II. Arm 1 and arm 2 anchor on the surface of the Rbp7 subunit of Pol II stalk. The interface residues probably include 15 residues of Rat1 (F180, T184, P186, P836, N837, S839, T841 and L842 in arm 1; Q27, V29, D30, V32, L34, P35 and D37 in arm 2) and 16 residues of Rpb7 (C94, S95, Q96, F121, A123, S125 and N126 interact with arm 1; K107, F109, T111, H113, I151, S152, S156, H158 and I160 interact with arm 2). The short helix of arm 3 contacts the clamp domain of Rpb1, and the interface residues probably include two residues of Rat1 (E276 and F279) and three residues of Rpb1 (I50, R57 and Q68). Arm 4 of Rat1 probably contacts residue Y931 of the Rpb2 wall domain through residue K402. The helix of arm 5 contacts the dock domain of Rbp1 through four residues of Rat1 (T873, Y875, D878 and Y882) and four residues of Rpb1 (R407, I413, R420 and D423). Sequence alignment shows that about half of the residues on arms 1 and 2 of Rat1 are conserved in the functionally equivalent proteins in other eukaryotes (XRN3 in Arabidopsis thaliana and XRN2 in humans; Extended Data Fig. 6), consistent with the finding that arms 1 and 2 are the major dock site for the Rat1–Rai1 complex. The truncation of arms 1 and 2 impaired the interaction between Pol II and the Rat1–Rai1 complex (Extended Data Fig. 8c,d), a finding supporting our structure. In summary, our results show that the Rat1–Rai1 complex interacts extensively with Pol II, and the Pol II stalk domain is the main anchor site.

Our structure shows that Rat1 makes tight interactions with Pol II, raising the possibility that Rat1 can interact with Pol II independently of RNA. To test the hypothesis, we measured interactions between the recombinant S.cerevisiae Rat1–Rai1 complex and the S.cerevisiae Pol II core enzyme without DNA and RNA. The results of the size-exclusion chromatography show that the Rat1–Rai1 complex and Pol II were eluted in the same peak, and the result of the native polyacrylamide gel electrophoresis analysis indicates that the Rat1–Rai1 complex migrates with Pol II (Extended Data Fig. 8a–c). The results support the hypothesis that Pol II can recruit the Rat1–Rai1 complex without prior interaction with RNA.

Structure superimposition between our structure of Pol II Rat1–PTTC1 and the Pol II transcription elongation complex (TEC) shows that Rat1 occupies the surface patches used for interaction with Spt6 and the KOW1–4 domains of Spt5 (Extended Data Fig. 7), suggesting that the interaction of Pol II termination factors is incompatible with the interaction of these Pol II elongation factors, consistent with the previous chromatin immunoprecipitation results showing that Spt6 dissociates after Pol II passes the PASs18,27,28. The cryo-EM map of Spt4-Spt5-bound Pol II TEC (map 1c) shows unambiguous signals for Spt4 and Spt5 but not the Rat1–Rai1 complex (Extended Data Fig. 2j), indicating that Spt4 and Spt5 have to be displaced (or at least partially displaced) to allow interactions of the Rat1–Rai1 heterodimer with Pol II, explaining the reduced elongation rate of Pol II at the termination zone24. Our structure of Pol II Rat1–PTTC1 shows that the Spt5 KOW5 domain still attaches on the pre-termination complex (Fig. 1c,d), consistent with the retained chromatin immunoprecipitation signals of Spt5 after Pol II passes PASs18,27,29, and supported by our pulldown result (Extended Data Fig. 8e, lane 8). Notably, the presence of Spt4 and Spt5 do not affect the amount of the Rat1–Rai1 complex bound to Pol II (Extended Data Fig. 8e, lanes 5 versus 8), which could be explained by the much stronger binding of the Rat1–Rai1 complex with Pol II than the Spt4-Spt5 complex(∆KOW5-C-terminal region; ∆755–1063) (lanes 5 versus 7), an Spt4-Spt5 derivative containing the NGN and KOW1–4 domains of Spt5 that shows little interaction with Pol II under the same experimental conditions (lane 7). In summary, our results show that the Rat1–Rai1 complex competes with the elongation factors for binding to Pol II.

Rat1 guides RNA into its active site

The interactions among Pol II, Rat1 and Spt5 KOW5 connect the RNA exit channel of Pol II and the RNA entry channel of Rat1 (Fig. 3). In the semi-closed continuous channel, the nascent RNA exits from the RNA exit channel of Pol II and enters the active site of Rat1 (Fig. 3a), where three ribonucleotides of the RNA 5′ terminus are stacked between two aromatic residues of Rat1 (W732 and H59; Fig. 3b). A similar substrate interaction has also been observed in the crystal structure of a binary complex of Xrn1 and single-stranded DNA30 (Fig. 3b). The 5′-phosphate group is stabilized by K109, Q113, R116 and R117 (Extended Data Fig. 9a). A triphosphate group would introduce severe steric clash in the active site, explaining the preference of RNA substrate with the mono-phosphate group at its 5′ terminus31 (Extended Data Fig. 9c). The phosphate backbone between the first and the second ribonucleotide is exposed to the catalytic loop of Rat1 (Fig. 3b and Extended Data Fig. 9a). The polar residues (residue R63 of the Pol II Rbp1 subunit, residues R879, R884, K886, H887 and R1124 of the Pol II Rpb2 subunit, and residues R8, R12, R344, R402, R891 and R892 of Rat1) along the channel probably define a charge-complementary path directing the nascent RNA from the Pol II exit channel to the Rat1 active site by making direct interactions with its phosphate backbones (Fig. 3c,d and Extended Data Fig. 9b).

Fig. 3: The RNA channel in Pol II Rat1–PTTC1.
figure 3

a, The cryo-EM map (map 1a) of the nucleic acid scaffold in the structure of Pol II Rat1–PTTC1. ssRNA, single-stranded RNA. b, Left: the Rat1 active site. RNA is stacked by W732 and H59 in Rat1. The cryo-EM density (map 1b) of RNA is coloured grey. Right: the Xrn1 active site. c, A detailed view of the interactions between the RNA and the polar residues in the RNA channel. d, The electrostatic potential surface of the RNA channel. e, Comparison of the DNA–RNA hybrid in the Pol II Rat1–PTTC1 structure (red and blue) with the post-translocated hybrid in Pol II TEC (Protein Data Bank (PDB): 5XON (ref. 32); grey). f, Conformational comparison between Pol II Rat1–PTTC1 and Pol II TEC in front and side views. g, The electrostatic potential surface (left) and a structural comparison (right) of the RNA exit channel between Pol II Rat1–PTTC1 and Pol II TEC.

Structural superimposition between our Pol II Rat1–PTTC1 structure and the previously reported Pol II TEC suggests similar conformations of Pol II, DNA and RNA (Fig. 3e–g). The DNA–RNA hybrid in the structure of Pol II Rat1–PTTC1 adopts a post-translocation state (Fig. 3e); the clamp, jaw-lobe, protrusion and shelf modules around the main cleft in the structure of Pol II Rat1–PTTC1 adopt a closed conformation as observed in Pol II TEC32 (Fig. 3f). The interactions of the Rat1–Rai1 complex with Pol II do not induce a noticeable conformational change of the clamp, wall and dock domains around the RNA exit channel of Pol II (Fig. 3g). The structural comparison indicates that the exchange of termination and elongation factor does not induce a conformational change of Pol II, arguing against the allosteric model of Rat1-mediated transcription termination.

Detailed analysis suggests that the RNA in the semi-closed channel of Pol II Rat1–PTTC1 adopts a relaxed conformation, in which the binding of ribonucleotides at the 5′ terminus of the nascent RNA in the Rat1 active site does not affect the binding of ribonucleotides at the 3′ terminus of the nascent RNA to the active site of Pol II, where the DNA–RNA hybrid adopts the post-translocated state (Fig. 3e). We predict that further cleavage of the nascent RNA in the 5′-to-3′ direction by Rat1 would pull ribonucleotides towards its active site, stretch the RNA in the semi-closed channel and apply mechanical force to the pre-termination complex of Pol II.

Rat1 rotates to Pol II during RNA cleaving

To investigate how Pol II pre-termination complex accommodates nascent RNA further shortened by Rat1, we reconstituted Pol II Rat1–PTTC2 with a nascent RNA of 20 nt in length, 3 nt shorter than that in Pol II Rat1–PTTC1 (Fig. 4a, Extended Data Fig. 1c–d and Extended Data Fig. 3). The cryo-EM map (map 2a) shows clear signals for Rat1, Rai1 and Spt5 KOW5 that surround the RNA exit channel of Pol II (Fig. 4b). To improve the resolution of Rat1–Rai1 and the nascent RNA, we obtained a local map focusing around the Rat1–Rai1 heterodimer at 3.4 Å (map 2b; Extended Data Fig. 3c). The overall and local cryo-EM maps (maps 2a and 2b) show clear signal for 10-nt RNA in the active-site cleft at the post-translocated state, a 7-nt RNA in the semi-closed extended RNA channel, and 3-nt RNA in the active site of Rat1 (Fig. 4c and Extended Data Fig. 3f). Structure superimposition between Pol II Rat1–PTTC2 and Pol II Rat1–PTTC1 shows that the Rat1–Rai1 complex together with Pol II stalk as a structure module rotates about 4° along an axis perpendicular to the plane of the transcription bubble (Fig. 4d). The rotation retains the interactions of arms 1 and 2 with Pol II stalk (Fig. 4e), but slides on the surface where arms 3, 4 and 5 make contact, consistent with the notion that the stalk is the primary dock point for Rat1 (Fig. 4f).

Fig. 4: Rat1 rotates towards the RNA exit channel of Pol II as it shortens the nascent RNA.
figure 4

a, Sequences of the nucleic acid scaffold used in Pol II Rat1–PTTC2. Phosphorus-sulfur-bond modification of the RNA nucleotides is shown in green, and the mono-phosphorylation modification of the RNA 5′ terminus is shown in yellow. Nucleotides in grey are disordered in the structure. b, The cryo-EM map (map 2a) of Pol II Rat1–PTTC2. c, The cryo-EM map (map 2a) of the nucleic acid scaffold in the structure of Pol II Rat1–PTTC2. d, Rat1–Rai1–stalk in Pol II Rat1–PTTC2 rotates about 4° towards the RNA exit channel compared with that in Pol II Rat1–PTTC1. e, Superimposition of the two Pol II Rat1–PTTC structures suggests that the interaction of arms 1 and 2 with the stalk is retained during rotation. f, Arms 3, 4 and 5 slightly slide on the Pol II surface following the rotation of the Rat1–Rai1–stalk unit. g, Structure superimposition of the Rat1 active site of the two structures. h, Structure superimposition of the Pol II active site of the two structures. i, Structure superimposition of the semi-closed RNA channel of the two structures.

The rotation of the Rat1–Rai1–stalk module is attributed to the shortened RNA length. A detailed comparison shows that, in both structures, the three ribonucleotides at the 5′ terminus of nascent RNA are secured in the active site of Rat1 and make similar interactions with Rat1 (Fig. 4g) and that the ten nucleotides at the 3′ terminus of the nascent RNA are secured in the active site of Pol II adopting the post-translocated state (Fig. 4h). Therefore, to retain the above interactions with the shortened RNA, the Rat1–Rai1–stalk module rotates towards the RNA exit channel, which reduces the distance between the active site of Rat1 and the RNA exit channel of Pol II from 54 Å in Pol II Rat1–PTTC1 to 51 Å in Pol II Rat1–PTTC2 to accommodate the shortened linker region of the nascent RNA (Fig. 4i).

The structure of Pol II Rat1–PTTC2 shows that the linker RNA in the semi-closed channel is in a stretched state, and the Rat1–Rai1 complex is tightly engaged with Pol II (Fig. 4i). We predict that further cleavage and subsequent translocation of the nascent RNA in the 5′-to-3′ direction by Rat1 would interrupt interactions between ribonucleotides in the RNA exit channel and apply mechanical force to the DNA–RNA hybrid, leading to disruption of the DNA–RNA hybrid and RNA release.

Discussion

Our work supports the unified sitting duck torpedo model of the Rat1-mediated transcription termination22,23. We show that the Rat1–Rai1 complex is incompatible with the elongation factors for Pol II binding, including Spt5 and Spt6, and therefore, the recruitment of Rat1–Rai1 would cause dissociation of the elongation factors, explaining the reduced elongation rate of Pol II at the termination zone in the sitting duck torpedo model of Pol II termination7,8,9,24,33,34. However, the exchange of termination/elongation factor does not cause a conformational change of Pol II, arguing against the allosteric mechanism of Rat1-mediated transcription termination.

Furthermore, our structures show that Rat1 interacts extensively with Pol II by anchoring on the stalk domain and nascent RNA by trapping three nucleotides of the RNA 5′ terminus in its active site. The structures further show and that Rat1–Rai1–Rbp4-Rpb7 rotates as a structure unit towards the RNA exit channel as it shortens the length of nascent RNA. We predict that further cleavage of RNA in the 5′-to-3′ direction would generate mechanical force to pull the RNA out from the Pol II active centre and eventually release it from Pol II, highlighting the essential role of the Pol II interaction and exonuclease activity of the Rat1 torpedo in the sitting duck torpedo model1,2.

Our work suggests that the Rat1-mediated transcription termination pathway probably involves three stages—the recruitment stage, the engagement stage and the termination stage. In the recruitment stage, the Rat1–Rai1 complex is first recruited to the Pol II transcription complex by Rtt103 recognizing the Pol II Ser2P C-terminal domain after Pol II passes through the PAS1,12. In the engagement stage, the Rat1–Rai1 complex engages with Pol II along with dissociation of Pol II elongation factors, and the 5′ terminus of the cleaved nascent RNA is trapped into the active site of Rat11,2,24,33,34. Once the Rat1–Rai1 complex is engaged with Pol II, a race occurs between Rat1, which shortens the 5′ terminus of RNA, and Pol II, which extends the 3′ terminus of RNA1,2,4,7,8. In the termination stage, the cleavage and translocation of Rat1 pull RNA towards itself and release the RNA from Pol II. Our structures of Pol II Rat1–PTTC1 and Rat1–PTTC2 probably represent two intermediate states at the termination stage of Pol II transcription termination before RNA release (Fig. 5). We propose that any factors impeding Pol II elongation would increase the probability of RNA release, consistent with previous in vivo and in vitro evidence showing that the RNA release is facilitated by the physical obstacle of a nucleosome on the path35, the RNA sequence-induced Pol II pausing or backtracking36,37 and NTP misincorporation38.

Fig. 5: The proposed model of Rat1–Rai1-mediated Pol II termination.
figure 5

The Rat1–Rai1 complex engages with Pol II along with the dissociation of Pol II elongation factors and traps the 5′ terminus of the nascent RNA in its active site of Rat1 (Pol II Rat1–PTTC1). Subsequently, Rat1 cleaves the nascent RNA in a 5′-to-3′ direction and pulls the RNA towards its active centre. The cleavage stretches the RNA in the semi-closed RNA channel and causes rotation of Rat1 towards Pol II (Pol II Rat1–PTTC2). Rat1 continues cleaving the RNA in the 5′-to-3′ direction, pulls the RNA out from the Pol II active centre and digests the released RNA. EC, elongation complex.

By comparing Rat1-dependent termination of Pol II mRNA transcription in eukaryotes, Rho-dependent termination of bacterial transcription39,40, FttA-dependent termination of archaeal transcription41 and integrator-initiated termination of premature mRNA42,43, we show that all of the RNA-associating termination motor proteins mentioned above anchor near the RNA exit channel, make physical interactions with RNA polymerase and apply mechanical force on the DNA–RNA hybrid to release RNA. The similarity suggests a general principle of transcription termination by multiple-subunit RNA polymerases. Despite the similarity, the termination pathways differ in the working mode of the motor proteins, the requirement of elongation factors and the fate of released RNA.

Methods

Cloning and protein expression

DNA fragments encoding full-length S.cerevisiae Rat1, Rai1, Spt4, Spt5, Rpb4 and Rpb7 were amplified from S.cerevisiae genomic DNA using Phanta Max Super-fidelity DNA Polymerase (Vazyme Biotech). Rat1 with Rai1, Spt4 with Spt5, and Rpb4 with Rpb7 were cloned into the pETDuet1 vector, respectively, by a homologous recombination method (NovoRec Plus One Step PCR Cloning Kit; Novoprotein). The pETDuet1-Rat1(∆1/2; ∆23–43/823–843)-Rai1 and pETDuet1-Spt4-Spt5(∆KOW5-CTR; ∆755–1063) were constructed by site-directed mutagenesis (Transgene Biotech). The expression of the Rat1-Rai1 complex, Rat1(∆1/2; ∆23–43/823–843)-Rai1 complex, Spt4-Spt5 complex, Spt4-Spt5(∆KOW5-CTR; ∆755–1063) complex, and Rpb4-Rpb7 complex was induced by 0.4 mM isopropyl β-d-1-thiogalactopyanoside at 18 °C for 16 h in BL21 (DE3) codon plus cells. The cells were collected by centrifugation (3 min, 16 °C, 8,000 r.p.m.) and stored at −80 °C until protein purification.

A modified S.cerevisiae BJ2168 strain for expressing endogenous Pol II with an affinity tag at the C terminus of Rpb2 was generated as described previously44. Briefly, a DNA fragment encoding the last 15 residues of Rpb2 followed by an affinity tag (TEV-6×His-3×Flag) was inserted into the pESC-his vector through homologous recombination. The target DNA fragment amplified from the above vector was transformed into S.cerevisiae BJ2168. Positive colonies were screened by SD-URA medium, and protein expression was detected by western blot. The yeast cells were cultured in YPDA medium, collected at an optical density of 18 at 600 nm by centrifugation (3 min, 16 °C, 8,000 r.p.m.), and stored at −80 °C.

Protein purification

The cell pellets of wild-type and derivatives of the Rat1-Rai1 complex, Rpb4-Rpb7 and Spt4-Spt5 were suspended in lysis buffer (50 mM Tris-HCl pH 7.7, 0.2 M NaCl, 5% (v/v) glycerol, 5 mM β-mercaptoethanol), and lysed using an Avestin EmulsiFlex-C3 cell disrupter. The lysate was centrifuged at 16,000g for 50 min at 4 °C, and the supernatant was loaded onto a nickel nitrilotriacetic acid (Ni-NTA) affinity column (Smart-Lifesciences). The column was washed with lysis buffer supplemented with 20 mM imidazole and eluted with lysis buffer supplemented with 300 mM imidazole. For the Rat1–Rai1 complex, the protein sample was further purified by a Heparin column (HiTrap Heparin HP 5-ml column, Cytiva) using a salt gradient of buffer A (20 mM Tris-HCl pH 7.7, 0.1 M NaCl, 5% (v/v) glycerol, 1 mM dithiothreitol (DTT)) and buffer B (20 mM Tris-HCl pH 7.7, 1 M NaCl, 5% (v/v) glycerol, 1 mM DTT). The eluted fractions containing the target protein were diluted with buffer A, loaded onto a Q HP column (HiPrep Q HP 16/10, Cytiva), and eluted with a salt gradient of buffer A and buffer B. The eluted fractions containing target protein were collected and loaded onto a Superdex 200 10/300 GL column (Cytiva) in S200 buffer C (20 mM Tris-HCl pH 7.7, 0.1 M NaCl, 5% (v/v) glycerol, 1 mM DTT). Fractions containing the Rat1–Rai1 complex were collected, concentrated to 10 mg ml−1 and stored at −80 °C. For the Spt4-Spt5 and Rpb4-Rpb7 complexes, after Ni-NTA purification, the protein sample was purified by a Q HP column (as above) with a salt gradient of buffer A and buffer B. The fractions containing Spt4-Spt5 and Rpb4-Rpb7 were collected and loaded onto a Superdex 200 10/300 GL column (Cytiva) in S200 buffer C, and target proteins were concentrated at 3 mg ml−1 and stored at −80 °C.

For Pol II purification, cell pellets were suspended using lysis buffer (100 mM Tris-HCl pH 7.5, 0.25 M ammonium sulfate, 5% (v/v) glycerol, 1 mM EDTA, 2 mM DTT, 0.01 mM ZnCl2, 0.1 mM phenylmethylsulfonyl fluoride and protease inhibitor cocktail (Biomake)) and lysed with an EmulsiFlex-C3 cell disrupter (Avestin). The lysate was centrifuged at 16,000g for 60 min at 4 °C; then the supernatant was precipitated with ammonium sulfate (30 g per 100 ml) and stirred at 4 °C overnight. The pellet was collected and dissolved with TEZ buffer (50 mM Tris-HCl pH 7.5, 50 mM ammonium sulfate, 5% (v/v) glycerol, 2 mM DTT). Flag resin (Genscript) was subsequently added to the solution, followed by stirring at 4 °C for 3 h. Flag resin with target protein was collected, washed with lysis buffer and then eluted with 500 μg ml−1 Flag peptide (Genscript). The eluted protein was loaded onto a Mono Q column (Mono Q 10/100 GL, Cytiva) followed by a salt gradient of Mono Q buffer A (50 mM Tris-HCl pH 7.5, 50 mM NaCl, 5% glycerol, 2 mM DTT) and Mono Q buffer B (50 mM Tris pH 7.5, 1 M NaCl, 5% (v/v) glycerol, 2 mM DTT). Fractions containing target protein were collected and loaded onto a Superose 6 10/300 GL column (Cytiva) in S6 buffer C (20 mM Tris-HCl pH 7.7, 0.1 M NaCl, 5% (v/v) glycerol, 1 mM DTT). Fractions containing Pol II were collected, concentrated to 6 mg ml−1 and stored at −80 °C.

Gel-filtration assay

Complexes (500 μl) containing 1 μM Pol II and 4 μM Rat1-Rai1 were incubated for 30 min at room temperature in complex buffer B (20 mM HEPES pH 7.4, 0.1 M NaCl, 1 mM DTT). The sample was applied to a Superose 6 Increase 10/300 GL column (Cytiva) equilibrated with complex buffer B. Peak fractions were analysed by SDS–PAGE followed by Coomassie staining.

Gel-shift assay

Complexes (10 μl) contained 0.2 μM Pol II with a gradient (0.2–3.2 μM) of the wild-type Rat1–Rai1 complex or the Rat1(∆1/2; ∆23–43/823–843)–Rai1 derivative was incubated for 60 min at room temperate in complex buffer A (20 mM Tris-HCl pH 7.7, 0.1 M KCl, 5% (v/v) glycerol, 3 mM MgCl2, 1 mM DTT). The complexes were separated by 5% Tris-glycine gel in the Tris-glycine buffer (Sangon Biotech), followed by Coomassie staining.

Pol II release assay

The sequences of the nucleic acid scaffold used in the Pol II release assay are: 5′-CATGCGTACACGCGTTTGTACAGAAAAAAAAGAAAAATCCTGGTCGTTTCCTATA-3′ (template strand), 5′-TATAGGAAACGACCAGGATTTTTCTTTTTTTTCTGTACAAACGCGTGTACGCATGTAAC-biotin-3′ (non-template strand), 5′-p-rUrUrUrUrUrGrArCrCrArGrGrArUrU-3′ (RNA; RNA was mono-phosphorylated at the 5′ terminus). Template-strand DNA and RNA were mixed in a 1:1.2 molar ratio and annealed by 95 °C, 5 min followed by 2 °C-step cooling to 25 °C in annealing buffer (5 mM Tris-HCl pH 8.0, 200 mM NaCl). All following steps were carried out at ambient temperature unless otherwise indicated. The reactions were carried out in transcription buffer (TB; 20 mM Tris-HCl pH 8.0, 100 mM NaCl, 2.5% glycerol, 5 mM MgCl2, 1 mM DTT). Pol II (50 nM) was mixed with template DNA–RNA scaffold in a 1:2 molar ratio and incubated for 20 min with gentle shaking. Next, a fourfold molar excess of non-template DNA was added and incubated for 20 min with gentle shaking to form ECs. Streptavidin beads (Smart-Lifesciences) pre-washed three times in TB with 100 μg ml−1 BSA were added into the reaction system and incubated for 30 min with gentle shaking. Then, the EC-bound beads were sequentially washed with TB + 100 μg ml−1 BSA, TB + 0.1% Triton X-100, TB + 0.5 M NaCl and TB to remove the unbound Pol II, DNA and RNA. The EC-bound beads were then resuspended in TB. A subset of NTP (100 μM UTP and 100 μM CTP) or a combination of the Rat1–Rai1 complex (1,200 nM) and a subset of NTP (100 μM UTP and 100 μM CTP) was subsequently incubated with the EC-bound beads for 1 h with gentle shaking at 30 °C. The ‘total’ (bead-bound and released Pol II) and ‘supernatant’ (released Pol II) fractions were analysed by 8% SDS–PAGE. Protein gel was transferred to PVDF membranes, and immunoblotted for Pol II with the anti-Flag antibody (YEASEN Biotechnology).

RNA cleavage assay

The sequence of the RNA used in the cleavage assay is: 5′-p-rArCrArGrArUrCrGrU*rG*rUrCrCrArUrCrGrArGrArGrGrU-3′ (RNA was mono-phosphorylated at the 5′ terminus; the asterisks represent phosphorus–sulfur-bond modifications to prevent Rat1 cleavage). The reaction mixture (20 μl) contained 1 μM RNA and 2 μM Rat1-Rai1 were incubated for 1 h at 30 °C in reaction buffer (20 mM Tris-HCl pH 8.0, 0.1 M KCl, 5% (v/v) glycerol, 3 mM MgCl2, 1 mM DTT). The reaction was stopped by stopping buffer (1× TBE, 8 M urea) and boiling for 5 min at 95 °C. The sample was analysed with 20% urea gel in the TBE buffer (90 mM Tris-borate, pH 8.0, and 2 mM EDTA), and stained by SYBR Gold Nucleic Acid Gel Stain (Invitrogen).

Pulldown assay

The reaction mixtures (20 μl) containing Flag-tagged Pol II (0.8 μM) and 1.2 μM DNA–RNA scaffold (the same scaffold used in the Pol II Rat1–PTTC1 cryo-EM study) in binding buffer (20 mM Hepes pH 7.5, 100 mM NaCl, 5% glycerol, 2 mM DTT, 50 μg ml−1 BSA) were incubated for 30 min. Subsequently, the Spt4-Spt5 complex (2.4 μM) or the Spt4-Spt5(∆KOW5-CTR; ∆755–1063; 2.4 μM) complex were added and incubated for 30 min. The mixtures were then incubated with the Rat1–Rai1 complex (2.4 μM) for 30 min. All steps were carried out at ambient temperature unless otherwise indicated. Anti-Flag affinity resin (Genscript) was added to reaction mixtures and incubated at 30 °C for 1 h with shaking. After incubation, the anti-Flag affinity resin was washed four times with the wash buffer (20 mM Hepes pH 7.5, 100 mM NaCl, 0.05% Triton X-100, 2 mM DTT), and eluted with 500 μg ml−1 Flag peptide. The eluted fractions were mixed with SDS loading buffer and subjected to electrophoresis at 150 V for 70 min on a 4–20% SDS–PAGE gel. Gels were analysed by Coomassie blue staining.

Cryo-EM sample preparation and image processing

The sequences of the nucleic acid scaffold used in Pol II Rat1–PTTC1 are: 5′-CACTCTACCGATAAGCAGACATACCTCTCGATCCTGTGCTAGACACGG-3′ (template strand), 5′-CCGTGTCTAGCACAGGTAAATGGTTTGTGTCTGCTTATCGGTAGAGTG-3′ (non-template strand), 5′-p-rA*rC*rA*rG*rArUrCrGrUrGrUrCrCrArUrCrGrArGrArGrGrU-3′ (RNA; RNA was mono-phosphorylated at the 5′ terminus; the asterisks represent phosphorus–sulfur-bond modifications to prevent Rat1 cleavage).

The sequences of the nucleic acid scaffold used in Pol II Rat1–PTTC2 are: 5′-CACTCTACCGATAAGCAGAATTACCTCTCGATCCTGTGCTAGACACGC-3′ (template strand), 5′-GCGTGTCTAGCACAGGATCGAGAGGTAATTCTGCTTATCGGTAGAGTG-3′ (non-template strand), 5′-p-rA*rC*rA*rG*rArUrGrUrCrCrUrCrGrArGrArGrGrUrA-3′ (RNA; RNA was mono-phosphorylated at the 5′ terminus; the asterisks represent phosphorus–sulfur-bond modifications to prevent Rat1 cleavage).

Unless otherwise indicated, all of the following steps were carried out in EM buffer (20 mM Hepes pH 7.4, 100 mM KCl, 3 mM MgCl2, 2 mM DTT). Pol II and the nucleic acid scaffold (Pol II TEC) were assembled at a molar ratio of 1:1.25, incubated at 4 °C for 1 h, and then purified by a Superose 6 10/300 GL column (Cytiva) equilibrated in EM buffer. Fractions containing Pol II TEC were collected, and then Rpb4-Rpb7 (a molar ratio of 5:1 with Pol II) was added and incubated at 4 °C for 30 min, Rat1–Rai1 (a molar ratio of 3:1 with Pol II) and Spt4-Spt5 (a molar ratio of 3:1 with Pol II) were added and incubated at 4 °C for 1 h. The complex was again purified by a Superose 6 10/300 GL column (Cytiva). Fractions containing the target proteins were collected, concentrated to 0.8 mg ml−1, and then crosslinked with 0.15% glutaraldehyde on ice for 10 min. Subsequently, the crosslinking reactions were quenched with 100 mM Tris pH 8.0 and dialysed overnight in EM buffer. The sample was concentrated at 1.5 mg ml−1.

The freshly purified sample was mixed with 0.05% Tween20 (final concentration) before grid preparation. A 3-μl sample was applied to glow-discharged UltraAuFoil R1.2/1.3 300 mesh holey Au grids (Quantifoil Micro Tools), and the grid was plunge-frozen in liquid ethane using a Vitrobot Mark IV (FEI) with 100% chamber humidity at 22 °C.

Cryo-EM imaging was carried out on an FEI Titan Krios equipped with a Gatan K3 Summit direct electron detector at the National Center for Protein Sciences Shanghai. Data were collected at a nominal magnification of 64,000 (1.1 Å per pixel) with a dose rate of 22.5 electrons per pixel per second on the sample.

For Pol II Rat1–PTTC1, 7,327 images were recorded using EPU software in counting mode (2.67 exposures in 40 subframes resulting in a total dose of 50 electrons per square ångström, defocus range of −1.0 to −1.8 μm). Frames of individual videos were aligned using MotionCor2 (ref. 45), and contrast transfer function estimations were carried out using CTFFIND46. About 1,000 particles were manually picked and subjected to two-dimensional (2D) classification in RELION 3.0 (ref. 47). The resulting distinct 2D classes served as templates for autopick. Protein aggregates and ice spots were removed, and the resulting particles were subjected to 2D classification in RELION 3.0 by specifying 50 classes. Poorly populated types were removed. We used a 60-Å low-pass-filtered map from our test data as the starting reference model for 3D classification (n = 6). Classes were combined, and 416,223 single particles were used for constructing the final cryo-EM map for Pol II Rat1–PTTC1. The steps of 3D auto-refinement, contrast transfer function refinement, Bayesian polishing and post-processing were finished in RELION 3.1. To improve the Rat1–Rai1 map signal, polished particles of Pol II Rat1–PTTC1 were imported to cryoSPARC v3.3.2 (ref. 48). Particles were then subjected to a non-uniform refinement, followed by particle subtraction using a mask that excludes Rat1-Rai1, Rp4-Rpb7 and structure elements around the Pol II RNA exit channel, and finally refined by local refinement. Gold-standard Fourier shell correlation indicated a mean map resolution of 2.7 Å for the Pol II Rat1–PTTC1 overall map (map 1a) and a resolution of 3.5 Å for the Rat1-Rai1-Rpb4-Rpb7 local refinement map (map 1b).

For Pol II Rat1–PTTC2, 5,313 images were recorded using EPU software in counting mode (2.67 exposures in 40 subframes resulting in a total dose of 50 electrons per square ångström, defocus range of −1.0 to −2.0 μm). Frames of individual videos were aligned using MotionCor2 (ref. 45). The following processes were primarily carried out in cryoSPARC v4.2.1 (ref. 48) unless specified otherwise. The Patch CTF module estimated each summed image’s contrast transfer function. A total of 3,506,025 particles were picked from motion-corrected images using Template Picker (template from Pol II Rat1–PTTC1) and extracted with a box size of 100 pixels using particle extraction. The resulting particles were first sorted by 2D classification (n = 50) followed by ab initio reconstruction (n = 4). Particles from the good class were then classified by heterogeneous refinement (n = 4). Classes containing the Rat1-Rai1 or Spt4-Spt5 complex density were re-extracted using a box size of 400 pixels, followed by non-uniform refinement. For the Pol II Rat1–PTTC2 class, particles were further processed using RELION 3.1 Bayesian polishing. Polished particles were refined using non-uniform refinement, resulting in an overall map (map 2a) with a nominal resolution of 2.8 Å. The Rat1-Rai1-Rpb4-Rpb7 map signal was improved by particle subtraction (using Pol II core as a mask) and local refinement, resulting in a local map (map 2b) with a nominal resolution of 3.4 Å.

Model building and refinement

The cryo-EM maps were first fitted with the Pol II EC structure (PDB: 5VVS (ref. 49)), Spt5 structure (PDB: 7NKX (ref. 50)) and an AlphaFold2-predicted model51 of Rat1-Rai1 using USCF Chimera52, followed by manual adjustment in Coot53 and real-space refinement in Phenix54.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.