Main

Despite decades of study, the transition from translation initiation to elongation in eukaryotes remains poorly understood. During translation initiation, the 40S ribosome subunit in complex with initiation factors is recruited to a 7-methylguanosine (m7G)-capped mRNA and scans the 5′ untranslated region until the correct AUG start codon is recognized (termed ‘scanning’), at which point a post-scanning 48S pre-initiation complex (PIC) is formed1,2. The subsequent joining of the 60S subunit to this 48S PIC is catalysed by the universally conserved factor eIF5B, independently of its GTPase activity6,7,8,9. The conformation of yeast eIF5B when bound to GTP is remodelled after association with a ribosome, and this conformational change enables eIF5B to contact directly both Met-tRNAMeti and the GTPase activation centre on the 60S subunit10,11,12. These contacts are probably essential for triggering the hydrolysis of GTP and the subsequent release of eIF5B–GDP, which allows the ribosomal A site to accept the first elongator aminoacyl-tRNA (aa-tRNA) and enter the elongation phase12. Bypassing the hydrolysis of GTP by eIF5B with a mutation that weakens the ribosome-binding affinity of the factor increases the frequency of the first AUG codon in an mRNA being bypassed (‘leaky scanning’) in vivo8—which implies that the timing of the dissociation of eIF5B from the 80S complex and the subsequent entry into elongation is critical to start-site selection. The steps that govern this transition—and their timescales—have not been elucidated in eukaryotes because the complex, multi-step nature of the transition challenges traditional biochemical approaches.

Here we overcome these limitations by using single-molecule fluorescence microscopy to observe translation initiation directly, and thereby elucidate the dynamics and pathways that are involved in the assembly of the 80S complex and subsequent acceptance of the first elongator aa-tRNA. We first reconstituted an active yeast translation system from purified ribosomes and core eukaryotic initiation and elongation factors (Extended Data Fig. 1) that is suitable for single-molecule analyses13. We then applied this system to study the assembly of the 80S complex in real time using a zero-mode waveguide (ZMW)-based instrumentation platform14, which is capable of multi-colour single-molecule fluorescence imaging at high concentrations of fluorescently labelled ligands.

Given that we observed efficient and stable formation of the 48S PIC on an uncapped, unstructured model mRNA that has been well-characterized previously9 (Fig. 1a, Extended Data Fig. 2a), we formed and immobilized single 48S PICs (containing the cyanine 3-labelled 40S subunit, Cy3–40S) within ZMWs and delivered Cy5–60S along with the other required factors (Extended Data Fig. 3a). We observed efficient joining of the 60S subunit on single 48S PICs—around 90% (n = 267 out of 300) of the 48S PICs bound to Cy5–60S to form the 80S complex (Extended Data Fig. 3b, c). This required eIF5B; in the absence of eIF5B, no 80S formation was observed in the 15-min time window that was used for imaging (n = 500); of note, yeast that lack eIF5B are able to survive (albeit with a severe growth defect8) despite the critical role of this factor, which may be because the presence of polyamines can lead to functional subunit joining in the absence of eIF5B (Extended Data Fig. 3d, e). The association rate of 60S subunits was comparable to that from previous bulk measurements9 (Extended Data Fig. 3c). When we labelled the first elongator Phe-tRNAPhe with Cy3.5 and included this in the delivery mix as a ternary complex with eukaryotic elongation factor 1A (eEF1A) and GTP (Cy3.5–Phe-TC) (Fig. 1b), around 80% (n = 164 out of 200) of the 60S joining events (signalled by single-molecule fluorescence resonance energy transfer, smFRET) were followed by a subsequent Cy3.5–Phe-TC arrival event (Fig. 1c). This association of aa-tRNA with the ribosomal A site was codon- and tRNA-specific and led to further elongation (Extended Data Fig. 4). These results demonstrate that real-time 80S assembly occurs in the correct reading frame and that our single-molecule system for translation initiation is fully active.

Fig. 1: Real-time observation of eukaryotic translation initiation and the transition to elongation.
figure 1

a, mRNA constructs used in single-molecule assays (Extended Data Fig. 5d). All of the mRNAs contain a UUC Phe codon after the AUG start codon and are biotinylated at their 3′-ends. p, phosphate group; nt, nucleotide. b, Experimental setup for single-molecule assays. 48S PICs containing Cy3–40S, Met-tRNAMeti and the 3′-end-biotinylated mRNA of interest were immobilized in ZMWs in the presence of the required initiation factors (eIFs). Experiments were started by illuminating ZMWs with a green laser and delivering Cy5–60S, Cy3.5–Phe-TC and initiation factors. c, Example experimental trace (bottom) and schematic (top) of the molecular events along the reaction coordinate. IC, initiation complex; EC, elongation complex. d, The dwell times between the 60S joining and the arrival of Phe-TC to the A site were measured and fitted to single-exponential distributions to estimate the average time, Δt, of the transition from initiation to elongation. For bars from bottom to top: n = 118, n = 130, n = 130, n = 121, n = 189, n = 159, n = 164 and n = 136 molecules. Error bars show 95% confidence intervals.

Source data.

To determine an initial rate for the transition from initiation to elongation, we measured the dwell times between 60S joining and the arrival of Cy3.5–Phe-TC, and determined the average time of this transition (Δt) (Fig. 1c). The Δt value was 30.2 ± 1.0 s (n = 147) (± 95% confidence interval) when assayed under the same conditions as were used in previous bulk kinetic analyses of yeast translation initiation9 (26 °C and buffer containing 3 mM free Mg2+). This is substantially longer than that in bacterial translation, for which the Δt value was reported to be approximately 1 s at 20 °C3,4,5. For direct comparisons with the bacterial rates, we also determined the Δt value at 20 °C (92.2 ± 2.5 s) (Fig. 1d). The Δt value remained similar after addition of 150 nM eIF3 and 200 nM eEF3 (and 1 mM ATP:Mg2+) or addition of 500 nM hypusine-modified eIF5A15—indicating that the transition from initiation to elongation is independent of these factors (Extended Data Fig. 5a). Changing the concentration of Cy3.5–Phe-TC did not notably alter the Δt value (Extended Data Fig. 5b). Thus, the rate of the transition is limited by a step that occurs before binding of the first aa-tRNA to the A site.

We next questioned whether the slow transition was a result of the m7G-cap-independence of initiation on the model mRNA (that is, the lack of the m7G cap on the model mRNA and eIF4F proteins in the reaction). We formed 48S PICs on RPL41A or RPL30 mRNAs that were completely capped with m7G (cap-RPL41A or cap-RPL30, respectively) (Fig. 1a, Methods) in the presence of eIF4F, eIF4B and eIF3 proteins and performed the same experiments as above. The 48S PICs readily formed in a cap-dependent manner (Extended Data Fig. 2b), and cryo-electron microscopy (cryo-EM) analyses of the complexes showed that they were in the post-scanning, mRNA-channel-‘closed’ state16 (Extended Data Fig. 2c–e). Thus, the 48S PIC that was central to our single-molecule assays is an authentic and on-pathway complex. However, the Δt values at 20 °C were 81.4 ± 2.5 s and 65.7 ± 3.4 s for cap-RPL41A and cap-RPL30, respectively—similar to the Δt value that was measured on the model mRNA (Fig. 1d). Increasing the Cy3.5–Phe-TC concentration did not markedly change the Δt value (Extended Data Fig. 5c), which further demonstrates that the rate-limiting step occurs before aa-tRNA binding to the A site. Our findings suggest that the slow transition to elongation may be a general feature of yeast translation.

We next tested whether the sequence context near the start codon affected entry into the elongation phase of translation. When the optimal yeast Kozak sequence17 was inserted upstream of the AUG codon in the model mRNA (designated as model mRNA-Kozak), the Δt value was reduced by around threefold to 34.1 ± 1.3 s at 20 °C (Fig. 1a, d). This effect was largely abolished by an A to U substitution at the −3 position (Extended Data Fig. 5d), consistent with the critical role of that residue within the Kozak sequence18. We have thus uncovered a role of the Kozak sequence in the control of this transition. However, even the fastest rate we observed at 20 °C was still more than 30-fold slower than that observed in E. coli3,4,5. To understand the energetics of the rate-limiting step in the eukaryotic transition, we increased the reaction temperature to 30 °C (which is within the optimal temperature range for growth of S. cerevisiae19), and found that Δt depended on temperature across all assayed mRNAs (Fig. 1d, Extended Data Fig. 5d). At 30 °C, the Δt value decreased by approximately 4–5-fold compared to the value at 20 °C for the model, cap-RPL41A and cap-RPL30 mRNAs, and by around 2.5-fold for the model mRNA-Kozak.

The temperature dependence of the Δt value suggested that slow conformational rearrangements and/or factor dissociation that are predominantly driven by changes in enthalpy limited the transition. We hypothesized that dissociation of eIF5B was the rate-limiting step, given that its binding site on the ribosome overlaps with that of the elongator aa-tRNAs12. Previous kinetic analyses of translation in rabbit reticulocyte lysate suggest that GTP hydrolysis by an initiation factor limits the rate at which the first peptide bond is formed after the joining of ribosomal subunits20. This GTP hydrolysis could plausibly be that by eIF5B. If eIF5B indeed gates the transition to elongation, we would expect its lifetime on the 80S complex to be similar to the Δt values that we measured above.

To test this hypothesis, we fluorescently labelled eIF5B (Extended Data Fig. 6a) and tracked the protein throughout the reaction pathways using a similar experimental scheme as above. A Cy5.5 dye was attached to N-terminally truncated eIF5B (remaining residues 396–1002) through a ybbR tag fused to the N terminus. Note that this truncated eIF5B is the standard construct in the eukaryotic translation field6,9,21,22; it was used in all of our measurements above that involved eIF5B (Extended Data Fig. 6b) and neither the tag nor the dye affected the function of eIF5B (Extended Data Fig. 6c, d). After immobilizing pre-formed Cy3–48S PICs in ZMWs, we co-delivered Cy5.5–eIF5B, Cy5–60S and Cy3.5–Phe-TC to the immobilized complexes in the presence of other required factors. With the model mRNA and direct illumination of all fluorophores, we observed transient sampling events of Cy5.5–eIF5B to the 48S PIC before 60S joining (mean lifetime = 0.59 ± 0.06 s, n = 903 events; Extended Data Fig. 7a), which is consistent with the dynamic interaction of eIF5B with the 48S PIC that has been suggested by previous measurements with ensemble fluorescence approaches9. After the joining of Cy5–60S to form the 80S complex (as shown by the smFRET signal), the average lifetime of Cy5.5–eIF5B on the ribosome was prolonged and nearly equal to Δt (Fig. 2a–c, Extended Data Fig. 7a). The Δt value measured here matched that measured with unlabelled eIF5B, which confirms that labelling did not affect eIF5B function (Extended Data Fig. 6e). Furthermore, dissociation of Cy5.5–eIF5B always preceded the binding of Cy3.5–Phe-TC to the A site of the 80S ribosome, which occurred very rapidly (within 1 s) after eIF5B departure (Fig. 2a, d). Similar results were observed for all mRNAs at both 20 °C and 30 °C (Fig. 2c). Notably, the Δt value was not reduced by using full-length eIF5B, which indicates that the lack of the non-conserved N-terminal region8,10 of the protein did not interfere with the transition rate (Extended Data Fig. 6b, f, g). Thus, the presence of eIF5B on the de novo-assembled 80S ribosome limits the rate of the transition from initiation to elongation.

Fig. 2: eIF5B gates the transition between initiation and elongation.
figure 2

a, b, Sample trace (a) and schematic (b) showing the correlation of Cy5.5–eIF5B occupancy on the ribosomal complex with Cy5–60S joining and the arrival of Cy3.5–Phe-TC. c, For all assayed mRNAs, the Δt value, average eIF5B lifetime on 80S and average tRNA arrival time after eIF5B departure were estimated by fitting the dwell times to single-exponential distributions (Extended Data Fig. 9g). From bottom to top for each group of three bars: n = 141, n = 164, n = 133, n = 131 and n = 134 molecules. Error bars show 95% confidence intervals. d, Contour plots of the departure of Cy5.5–eIF5B (bottom; normalized Cy5.5 fluorescence intensity (norm. Cy5.5 int.) changing from 1 to 0) and the arrival of Cy3.5–Phe-TC (top; normalized Cy3.5 intensity (norm. Cy3.5 int.) changing from 0 to 1) in experiments performed with model mRNA in the presence of GTP. Plots were generated by superimposing all the analysed fluorescence traces such that the time point of Cy3.5–Phe-TC arrival was set at 0. Contours are plotted from tan (lowest population) to red (highest population); n = 134. e, The same types of average dwell times as in c were determined for experiments performed with the model mRNA in the presence of 3 mM free Mg2+ at 20 °C. Experiments were performed either in the presence of GTP with unlabelled wild-type (WT) eIF5B (n = 164; data taken from Fig. 1d), Cy5.5-labelled wild-type eIF5B (n = 134; data taken from c), unlabelled eIF5B(H505Y) (n = 119) or Cy5.5-labelled eIF5B(H505Y) (n = 80); or separately in the presence of GDPNP with unlabelled eIF5B(H505Y) (n = 80) (Extended Data Fig. 9h).

Source data.

We hypothesize that the rate of GTP hydrolysis by eIF5B defines its residence time on the 80S complex. The Δt value that was measured here on the model mRNA was similar to the mean time of GTP hydrolysis by eIF5B during initiation on the same type of model mRNA (approximately 30.2 s versus 28 s at 26 °C)21. Notably, inclusion of a non-hydrolysable GTP analogue (GDPNP) trapped eIF5B on the 80S complex (average Cy5.5–eIF5B lifetime on the 80S was around 850 s and was limited by photobleaching and imaging time; n = 105), and prevented the association of aa-tRNA with the ribosomal A site (Extended Data Fig. 7b, c). To further investigate this finding, we used two previously reported mutant versions of eIF5B that exhibit disrupted GTPase activity8. In the first of these mutants (eIF5B(T439A)), tRNA association with the A site was inhibited (only 17 out of 131 assembled 80S ribosomes underwent A-site tRNA association) and the Δt value increased to around 370 s at 20 °C (Extended Data Fig. 8). By contrast, in the second mutant (eIF5B(H505Y)), which also has weakened affinity for the ribosome, the Δt value decreased by about 15-fold (n = 119) compared to wild-type eIF5B (Fig. 2e, Extended Data Fig. 8). The H505Y mutation instead reduced the occupancy time of eIF5B on the ribosome compared to wild type independent of GTP hydrolysis (Fig. 2e). Thus, eIF5B departure is indeed controlled by GTP hydrolysis.

The marked dependence of the eIF5B departure rate from the 80S complex on temperature and on the concentration of free Mg2+ (Extended Data Fig. 9) suggests that the GTP hydrolysis activity of eIF5B is sensitive to the conformation of the ribosome. We therefore applied cryo-EM, guided by our single-molecule kinetics, to analyse the conformations of on-pathway ribosomal complexes that were assembled in the presence of wild-type eIF5B and GTP. We added eIF5B–GTP and 60S subunits to assembled 48S PICs (on the model mRNA) (Fig. 1a) and allowed the reaction to proceed at about 20 °C for 45 s (from mixing to grid plunge-freezing) (Methods), which corresponds to a pre-steady-state time point according to our single-molecule measurements (Extended Data Fig. 10a, Methods). Two distinct classes of 80S complexes were resolved: (1) the eIF5B-bound 80S initiation complex; and (2) the 80S elongation complex (Extended Data Fig. 10c, d). The two states were populated in agreement with predictions from our single-molecule kinetics (Extended Data Fig. 10a, b). Similar to that of the reported 80S complex bound to eIF5B–GDPCP12, the eIF5B-bound 80S initiation complex (class 1) adopts a semi-rotated inter-subunit conformation, with the Met-tRNAMeti in the 80S peptidyl/initiation configuration. Moreover, eIF1A density was absent in this complex. Its dissociation therefore occurs before the departure of eIF5B and is not the rate-limiting step9. After dissociation of eIF5B, the 80S initiation complex matured to the 80S elongation complex (class 2) in a non-rotated-state, with Met-tRNAMeti shifted to the 80S peptidyl/peptidyl configuration—poising the ribosome for rapid association of tRNA with the A site. Together, we propose that the irreversible hydrolysis of GTP by eIF5B and its subsequent dissociation serves as a final kinetic quality-control checkpoint of the assembly of the 80S complex before entry into the energy-expensive elongation phase.

In summary, we have demonstrated that in S. cerevisiae (and probably in other eukaryotic systems20), eIF5B gates the transition from translation initiation to elongation (Fig. 3). The shortest transition time that we measured in m7G-cap-dependent translation was approximately 8 s (in the presence of 2 mM free Mg2+ at 30 °C) (Extended Data Fig. 9b)—a timescale that is relevant to the estimated rates of translation initiation in yeast in vivo (which vary from several seconds to minutes at 30 °C)23,24. The transition to elongation is probably gated by the hydrolysis of GTP by eIF5B, as shown by the correspondence between the rate of GTP hydrolysis21 and the occupancy time of eIF5B on the 80S complex, and by our observation that the 80S complex bound to eIF5B (probably with GTP) is in a similar conformation to that observed for non-hydrolysable GTP12. We speculate that the long lifetime of eIF5B-bound 80S complexes could partially account for the enrichment of ribosome-protected mRNA fragments at the start codon that has been observed in ribosome profiling experiments in eukaryotic systems25,26,27. The relatively slow transition between translation initiation and elongation in the eukaryotic system compared with the bacterial counterpart provides a mechanism to control the density of ribosomes in the early part of the mRNA open reading frame23,28 to prevent ribosome collisions29,30, and a final kinetic checkpoint for accurate start-codon selection. Consistent with the latter proposal, disruption of GTPase activity results in a higher propensity for leaky scanning in vivo8, and the residence time of eIF5B on the 80S is strongly dependent on the sequence context that surrounds the start codon, with non-optimal Kozak sequences leading to longer residence times. Subtle conformational signalling—perhaps through the initiator tRNA—may modulate the final conformation of the 80S–eIF5B complex for rapid hydrolysis of GTP and subsequent formation of an elongation-competent 80S complex. Finally, we expect that there are many kinetic checkpoints in the early stages of eukaryotic translation. The application of similar single-molecule approaches will unveil the complicated dynamics that underlie mRNA selection, scanning and start-codon selection and the mechanisms by which they are regulated1,2.

Fig. 3: Model of the late events of eukaryotic translation initiation and the transition to elongation.
figure 3

eIF5B catalyses the joining of the 60S subunit to the 48S PIC to form the 80S initiation complex (80S IC). Dissociation of eIF5B from the 80S initiation complex requires the hydrolysis of GTP, and this plausibly leads to a change in the conformation of eIF5B that lowers its affinity for the 80S. Thus, the dissociation of eIF5B from the 80S initiation complex gates the transition to elongation, which is marked by the binding of an elongator aa-tRNA to the 80S elongation complex (80S EC). The effects of free Mg2+ concentration, temperature and the sequence context that surrounds the start codon on the rate of the transition indicate that conformational rearrangements may have key roles in governing the rate of eIF5B dissociation, probably by controlling its GTPase activity.

Methods

No statistical methods were used to predetermine sample size. The experiments were not randomized. The investigators were not blinded to allocation during experiments and outcome assessment.

The strategy for labelling with a ybbR tag

The ybbR-tag labelling strategy has previously been established31. In brief, 4′-phosphopantetheinyl transferase (SFP synthase) specifically transfers a CoA conjugate to a serine residue in the ybbR tag (an 11-amino-acid peptide sequence), resulting in the covalent linking of the CoA conjugate to the ybbR tag. The SFP synthase that was used in this study was purified as previously described31, and the Cy3-like dye analogue CoA 547 was purchased from New England BioLabs (S9349). Following the published protocol, we synthesized CoA-Cy5.5 dye by reacting CoA trilithium salt (Sigma-Aldrich, C3019-25MG; 0.52 mg, 0.66 μmol, 0.75 equivalent (equiv.)) with sulfo-cyanine5.5 maleimide (Lumiprobe, 17380; 1 mg, 0.88 μmol, 1 equiv.) at room temperature for 2 h in a 1-ml volume of 100 mM potassium phosphate pH 7.05 and 10% DMSO. This was followed by quenching the excess amount of maleimide with 8 mM DTT at room temperature for 10 min, which enables the product to be used without the need of further purification.

Unlabelled and labelled tRNAs and yeast ribosomal subunits

Native yeast Met-tRNAMeti was purchased from tRNA Probes, LLC (MI-60) and E. coli Phe-tRNAPhe and Lys-tRNALys labelled with Cy3.5 or Cy5 were previously characterized32. Unlabelled yeast ribosomal subunits were purified by sucrose density-gradient centrifugation as previously described13. Cy3–40S was prepared by the ybbR-tag labelling method as described in the next two sections below.

Genetically tagging the 40S subunit at uS19 with a ybbR tag

The yeast uS19 (RPS15) strain JD1346 (his3Δ1; leu2Δ0; ura3Δ0; YOL040c::KanMX4; pFL38(Pgal::RPS15; URA3)) and plasmid pJD1055 (pRPS15-HIS3) were provided by J. D. Dinman33. A ybbR-tag sequence (GATTCTCTTGAATTTATTGCTAGTAAGCTTGCG)31 was inserted at the N-terminal end of the uS19 sequence in the plasmid pJD1055, resulting in the plasmid pJFW1. This was introduced into yeast JD1346 using a standard lithium acetate yeast-transformation protocol34 followed by a 5′-FOA plasmid shuffle35, to obtain the yeast strain yJFW11 in which all the 40S ribosomal subunits were tagged. For ybbR-uS19-40S purification, the yeast strain yJFW11 was streaked onto a selection plate containing synthetic complete–His (SC-His) medium. A single colony was used to inoculate 5 ml of fresh SC-His liquid medium, and the culture was grown at 30 °C overnight with shaking at 200 r.p.m. The next morning, the stationary-phase yeast culture was used to inoculate a 1.8-l culture of YPAD medium with a 1:4,000 dilution for large-scale growth until an optical density at 600 nm (OD600 nm) of around 0.6 was reached, before collecting cells by fast centrifugation at 4 °C and 7,808g in a Fibrelite F9-6x1000 LEX Fixed Angle Rotor (Thermo Fisher Scientific) for 12 min. The cell pellet was flash-frozen in liquid nitrogen before storage at −80 °C for further ribosome purification.

Labelling of ybbR-uS19-40S

For labelling of the 40S subunit through the ybbR tag, a 300-μl labelling reaction was prepared by mixing 1 μM 40S, 2 μM SFP synthase and 10 μM CoA-547 dye in a buffer containing 50 mM HEPES-KOH pH 7.5, 1 mM DTT and 10 mM MgCl2. The reaction was incubated at 30 °C for 30 min. To purify the labelled 40S from free dye, 150 μl of the reaction mixture was loaded on top of a 750-μl sucrose cushion (30 mM HEPES-KOH pH 7.5, 2 mM DTT, 100 mM KOAc, 5 mM Mg(OAc)2 and 0.5 M sucrose) for ultracentrifugation at 351,955g at 4 °C for 90 min in a TLA100.2 rotor. The pellet containing labelled 40S was washed once with and then resuspended in the ribosome storage buffer (50 mM HEPES-KOH pH 7.5, 2 mM DTT, 100 mM KOAc, 3 mM Mg(OAc)2, 0.25 M sucrose) and stored at −80 °C after flash-freezing with liquid nitrogen.

Labelling of 60S

The Cy5-labelled SNAP-tagged 60S was prepared as previously described and characterized32. The Cy5.5-labelled ybbR-tagged 60S was prepared in the same way as described above for the Cy3–40S. For yeast genetics, the pRS315 plasmid used for the SNAP-tagged 60S yeast strain was mutated to carry the native uL18 (RPL5) gene sequence with a ybbR-tag sequence fused in frame at the C terminus before the stop codon. The new plasmid was transformed into JWY3733 strain (MATα; ura3-52; trp1Δ101; leu2Δ1; ade1; his3Δ200; rpl1::TRP1+ pRS316-RPL5) and plated onto SC-Leu plates, followed by screening for the loss of the pRS316-RPL5 on SC-Leu plates containing 5′-FOA. The resulting strain (MATα; ura3-52; trp1Δ101; leu2Δ1; ade1; his3Δ200; rpl1::TRP1+ pRS315-RPL5-ybbR) expresses uL18-ybbR as the sole source of the protein. The ybbR ribosome labelling efficiencies were generally around 75% based on absorbance measurements.

Purification of protein factors

Published strains, constructs and protocols were used for the purification of yeast eIF2, eIF3, eIF4E, eIF5, eIF5A, eIF5B (residues 396–1002), eEF1A, eEF1Bα, eEF2 and eEF313,15,32,36.

Purification of eIF1

The SUI1 gene was cloned into a pET28 vector such that the overexpressed protein had the following sequence: MGSSHHHHHHMENLYFQG–native eIF1, wherein the underlined sequence is a tobacco etch virus (TEV) protease cleavage site that cleaves between the Q and the G and this cleavage site is followed by the native eIF1 sequence. The final purified protein sequence is therefore the native eIF1 sequence, preceded by a Gly residue.The plasmid was transformed into Rosetta2(DE3) E. coli cells. Protein overexpression was performed at an OD600 nm of around 0.6 by induction with 1 mM IPTG at 30 °C for 4 h. Cells were pelleted and stored at −80 °C. The pellet was rapidly thawed and suspended in 50 mM potassium phosphate (pH 7.5), 500 mM KCl, 10 mM imidazole, 5 mM β-mercaptoethanol (βME), 10% (v/v) glycerol and 1 mM phenylmethylsulfonyl fluoride (PMSF). The cells were lysed by sonication and the lysate was clarified by centrifugation at 41,656g for 30 min at 4 °C in a F21-8x50y rotor (Thermo Fisher Scientific). The clarified lysate was loaded onto a 2-ml Ni-NTA column and washed with 50 ml wash buffer (50 mM potassium phosphate (pH 7.5), 300 mM KCl, 20 mM imidazole, 5 mM βME, 10% (v/v) glycerol). Histidine (His)-tagged protein was eluted with wash buffer containing 250 mM imidazole. The eluted protein was diluted five times in volume by addition of 20 mM HEPES-KOH pH 7.5, 10% (v/v) glycerol and 2 mM DTT, and loaded to a 5-ml HiTrap Heparin HP column for further purification as previously described13. The eluate was treated with TEV protease overnight on ice to cleave off the 6×His tag and eIF1 was recovered by flowing through a Ni-column to remove the free tag and TEV protease. The final eIF1 protein was purified on a HiLoad 26/600 Superdex 200-pg gel filtration column in storage buffer (20 mM HEPES-KOH pH 7.5, 100 mM KOAc, 10% (v/v) glycerol, 2 mM DTT), flash-frozen with liquid nitrogen and stored at −80 °C.

eIF1A

The TIF11 gene was cloned into a pET28 vector such that the overexpressed protein had the following sequence: MGSSHHHHHH MENLYFQG–native eIF1A, and the final purified protein sequence is the native eIF1A sequence preceded by a Gly residue. The protein overexpression and purification were performed in exactly the same way as for eIF1.

eIF4A

The TIF1 gene was cloned into a pET28 vector such that the overexpressed protein had the following sequence: MGSSHHHHHH MENLYFQG–native eIF4A, and the final purified protein sequence is the native eIF4A sequence preceded by a Gly residue. The protein overexpression and Ni-NTA purification were performed in the same way as for eIF1. After Ni-NTA purification, the protein was treated with TEV protease overnight to cleave off the 6×His tag, while dialysing at 4 °C against a buffer containing 50 mM potassium phosphate (pH 7.5), 300 mM NaCl and 5 mM βME. The eIF4A protein was recovered by flowing through a Ni-column to remove the cleaved 6×His tag and TEV protease. The final protein was purified on a HiLoad 26/600 Superdex 200-pg gel filtration column in storage buffer (20 mM HEPES-KOH pH 7.5, 100 mM KOAc, 2 mM DTT), flash-frozen with liquid nitrogen and stored at −80 °C.

eIF4B

The TIF3 gene was cloned into a pET22b vector such that the overexpressed protein had the following sequence: native eIF4B–ENLYFQGAAAELHHHHHH, and the final purified protein sequence is the native eIF4B sequence followed by ENLYFQ. The protein overexpression and purification were performed in exactly the same way as for eIF4A.

eIF4G

The TIF4631 gene was cloned into the multiple-cloning site 1 of a pETDuet-1 vector such that the overexpressed protein had the following sequence: native eIF4G-ENLYFQGHHHHHH, and the final purified protein sequence is the native eIF4G sequence followed by ENLYFQ. On the same plasmid, the native eIF4E gene (CDC33) was cloned into the multiple-cloning site 2, which has been proven to facilitate the overexpression of the full-length eIF4G in E. coli37. The protein overexpression and purification were performed in the same way as for eIF4A, except that 500 mM KCl was included in all the buffers used for the Ni-column purification. The final protein contained both eIF4G and wild-type eIF4E and the full-length eIF4G migrates slower than expected on SDS–PAGE for its molecular weight, which has previously been shown37,38,39. The full-length nature of eIF4G was further verified by Edman degradation and mass spectrometry.

eIF5B variants

When applicable, a ybbR-tag sequence was introduced into the pMGAHisTev5B plasmid13 after the TEV cleavage site, with or without a T439A or H505Y mutation, by site-directed mutagenesis. The protein was overexpressed and purified following the previously described protocol13. For full-length eIF5B, we started with the plasmid pC100540, deleted the Flag-tag sequence, inserted in a 8×His tag and a TEV cleavage site before the native eIF5B sequence, and cloned into the multiple-cloning site 1 of the pETDuet-1 vector. Proteins were overexpressed in Rosetta2(DE3) cells as described for eIF1 and the purification was performed as previously described13. To label the ybbR-tagged proteins with Cy5.5, 10 μM protein was incubated with 2 μM SFP synthase and 15 μM CoA-Cy5.5 dye in a buffer containing 50 mM HEPES-KOH pH 7.5, 1 mM DTT and 10 mM MgCl2 at room temperature for 30 min. The reaction mixture was loaded onto a Ni-NTA column equilibrated with the same buffer to remove His-tagged SFP synthase, and the flow-through was passed through a 10-DG desalting column to remove free dye from the protein. The final protein was purified on a Superdex 75 10/300 GL column in storage buffer (20 mM HEPES-KOH pH 7.5, 100 mM KOAc, 10% glycerol, 2 mM DTT) and stored at −80 °C. The ribosome-dependent GTPase activity was determined as previously described using the malachite green assay32.

mRNAs

All the mRNAs were prepared by T7 RNA polymerase run-off transcription. The RNA transcript sequences are as follows (with the start and stop codons underlined): model mRNA: GG(UC)8UAUGUUCAAAUAA(UC)16; model mRNA-Kozak: GG(UC)8UAAAAAAAUGUUCAAAUAA(UC)16; model mRNA-Kozak_-3U: GG(UC)8UAAAUAAAUGUUCAAAUAA(UC)16; RPL41A: GGAGACCACAUCGAUUCAAUCGAAAUGUUCGCCAAGUGGAGAAAGAAGAGAACUAGAAGACUUAAGAGAAAGAGACGGAAGGUGAGAGCCAGAUCCAAAUAA; RPL30: GGAAACAGACCGGAGUGUUUAAGAACCUACAGCUUAUUCAAUUAAUCAAUAUACGCAGAGAUGUUCCCAGUUAAAUCCCAAGAAUCUAUCAACCAAAAGUUGGCUUUGGUUAUCAAGUCUGGUAAGUACACCUUAGGUUACAAGUCCACUGUCAAGUCUUUGAGACAAGGUAAGUCUAAGUUGAUCAUCAUUGCCGCUAACACUCCAGUUUUGAGAAAGUCCGAAUUGGAAUAUUACGCUAUGUUGUCCAAGACUAAGGUCUACUACUUCCAAGGUGGUAACAACGAAUUGGGUACUGCUGUCGGUAAGUUAUUCAGAGUCGGUGUUGUCUCUAUUUUGGAAGCUGGUGACUCUGAUAUCUUGACCACCUUGGCUUAAAUAAGGUAAGUUCAAACGAUUU.

For the model mRNA, model mRNA-Kozak and model mRNA-Kozak_-3U, 3′-end biotinylation or Cy5-labelling was performed using periodate chemistry followed by hydrazide reaction as previously described41. The RNA was purified from the free hydrazide on a Superdex 75 10/300 GL column in storage buffer (20 mM HEPES-KOH pH 7.5 and 100 mM KOAc) and stored at −80 °C. The hydrazide analogues used were EZ-Link Biotin-LC-Hydrazide (Thermo Fisher Scientific, 21340) and cyanine5 hydrazide (Lumiprobe, 13070).

To obtain 3′-end-biotinylated or Cy3-labelled, fully m7G-capped RPL41A mRNA, the RNA transcript was: (1) ligated to a DNA oligonucleotide (5′pd(CT)12-biotin-3′ or 5′pd(CT)6-Cy3-d(CT)6-3′) (TriLink Biotechnologies) and gel purified; (2) m7G-capped using the Vaccinia capping system (New England BioLabs, M2080S); and (3) treated with RNA 5′polyphosphatase (Epicentre, RP8092H) and terminator 5′-phosphate-dependent exonuclease (Epicentre, TER51020) to remove uncapped RNA, followed by phenol and chloroform extraction and ethanol precipitation.

The 3′-end-biotinylated, fully m7G-capped RPL30 mRNA was prepared by: (1) 3′-end biotinylation using the periodate chemistry and hydrazide reaction41; and (2) m7G-capping and uncapped-RNA depleting as described above for RPL41A mRNA.

Native gel-shift analyses

The formation of pre-initiation and initiation complexes was monitored by 4% native PAGE analyses using protocols as previously described13. Samples were prepared in 1× Recon buffer (30 mM HEPES-KOH pH 7.5, 100 mM KOAc, 3 mM Mg(OAc)2). A ternary complex was prepared by incubating 3.9 μM eIF2 with 1 mM GTP:Mg2+ at 30 °C for 10 min, followed by addition of 2.7 μM Met-tRNAMeti and incubation at 30 °C for another 5 min.

For reactions with the model mRNA (Extended Data Fig. 2a), the 48S PIC was formed by mixing a final concentration of 1 μM eIF1, 1 μM eIF1A, threefold-diluted ternary complex (final 1.3 μM eIF2 and 0.9 μM Met-tRNAMeti), 50 nM Cy3–40S and 2 μM model mRNA–Cy5, and incubating at 30 °C for 15 min; the 80S PIC was formed by addition of a final concentration of 1 μM eIF5, 1 μM eIF5B and 200 nM 60S to the 48S PIC and further incubation at 30 °C for 5 min. Typically, 10 μl of each sample was mixed with 2 μl of native gel dye (50% sucrose, 0.02% bromophenol blue in 1× Recon buffer) and loaded into the 4% native PAGE gel for electrophoresis13; this was followed by scanning the gel for the Cy3 and Cy5 fluorescence with a Typhoon scanner.

For reactions with the cap-RPL41A mRNA (Extended Data Fig. 2b), the samples were prepared similarly to above, except 1 mM ATP:Mg2+ was supplemented into the reaction buffer, the cap-RPL41A–Cy3 mRNA was at 100 nM, 40S was unlabelled and, when applicable, 1 μM eIF4A, 1 μM eIF4B, 400 nM eIF4G, 200 nM eIF4E and 200 nM eIF3 were added.

Single-molecule inter-subunit FRET experiments using a total internal reflection fluorescence microscope

The ribosomal inter-subunit FRET signal was characterized on a previously described42,43 home-built, prism-based total internal reflection fluorescence microscope. The 80S complexes were assembled using Cy3–40S and Cy5–60S on the model mRNA–biotin in the same way as for the native gel shift assays. The 80S complexes were immobilized on a neutravidin-derivatized quartz slide according to reported protocols32,43. Unbound components were washed from the slide with 1× Recon buffer supplemented with 1 mM GTP:Mg2+ and an oxygen-scavenging system (containing 2.5 mM PCA (protocatechuic acid), 2.5 mM TSY and 2× PCD (protocatechuate-3,4-dioxygenase), purchased from Pacific Bioscience). Cy3 fluorophores were excited with a diode-pumped solid-state 532-nm laser at 1 kW cm−2. Fluorescence intensity trajectories were recorded at 10 frames per second with a total of 600 frames collected per movie by using the MetaMorph imaging software (Molecular Devices). Analysis of fluorescence intensity trajectories was performed in MATLAB using custom scripts42. Baseline correction of raw fluorescence traces and Cy3 signal bleed-through into the Cy5 channel were corrected on a trace-by-trace basis. The FRET states were assigned with a hidden Markov model-based approach as previously described and visually corrected42.

Real-time single-molecule experiments on the ZMW instrument

All the experiments were performed in the 1× Recon buffer supplemented with 1 mM GTP:Mg2+ (and 1 mM ATP:Mg2+ for m7G-capped mRNAs) unless otherwise specified. A ZMW-based PacBio RSII instrumentation14 was used for all the real-time measurements at 20 °C, 26 °C or 30 °C. For experiments performed with the model mRNA, model mRNA-Kozak and model mRNA-Kozak_-3U, in general, immediately before each experiment, the 48S PIC was formed as described for the native gel shift assays using final concentrations of 80 nM Cy3–40S, 500 nM biotinylated mRNA, 1 μM eIF1, 1 μM eIF1A, 300 nM eIF5, 1.3 μM eIF2 and 0.9 μM Met-tRNAMeti. In parallel, a ZMW chip was treated with 0.2% (w/v) Tween20 at room temperature for 10 min followed by treatment with neutravidin at room temperature for 5 min, which bound to the biotin-PEG (polyethylene glycol) on the chip. After immobilizing the 48S PIC via the biotin on the mRNA onto the neutravidin-coated surface, the chip was washed twice using an imaging buffer containing the oxygen-scavenging system and 62.5 μg ml−1 casein to remove unbound complexes, and loaded onto the RSII instrument in the presence of 20 μl of the imaging buffer. At the same time, a delivery mixture containing 200 nM Cy5-60S, 1 μM eIF5, eIF5B (2 μM unlabelled or 40 nM Cy5.5-labelled), Cy3.5–Phe-TC (pre-formed by incubating 8 μM eEF1A, 10 μM eEF1Bα with 1 mM GTP:Mg2+ at room temperature for 5 min, followed by the addition of 0.8 μM Cy3.5–Phe-tRNAPhe and another incubation at room temperature for 5 min; diluted eightfold in the delivery mixture for final 50 nM ternary complex or fourfold for final 100 nM ternary complex), oxygen-scavenging system and 62.5 μg ml−1 casein was prepared and loaded onto the instrument. For each experiment, the instrument delivered 20 μl of the delivery mixture to the chip at the beginning of the experiment, and recorded a 15-min movie at 10 frames per second, with 0.32 μW μm−2 of 532-nm laser illumination (for unlabelled eIF5B experiments), or 0.6 μW μm−2 of 532-nm laser and 0.1 μW μm−2 of 642-nm laser dual illumination (for Cy5.5-labelled eIF5B experiments). For the elongation assay with tRNA–tRNA smFRET, both the imaging buffer and the delivery mixture were supplemented with 1 mM ATP, and the delivery mixture contained 115 nM Cy5.5–60S, 1 μM eIF5, 2 μM eIF5A, 2 μM eIF5B, 1 μM eEF2, 400 nM eEF3, 100 nM Cy3.5–Phe-TC and 100 nM Cy5–Lys-TC (pre-formed separately by incubating 8 μM eEF1A, 10 μM eEF1Bα with 1 mM GTP:Mg2+ at room temperature for 5 min, followed by the addition of 0.8 μM Cy3.5–Phe-tRNAPhe or Cy5–Lys-tRNALys and another incubation at room temperature for 5 min), oxygen-scavenging system and 62.5 μg ml−1 casein. The imaging was performed at 20 °C with 0.32 μW μm−2 of 532-nm laser illumination.

Experiments with the cap-RPL41A and cap-RPL30 mRNAs were performed in a similar manner, except that 1 μM eIF4A, 1 μM eIF4B, 0.67 μM eIF4E, 300 nM eIF4G and 200 nM eIF3 were also included in the second incubation step for the 48S PIC formation, and 300 nM eIF3 was also added to the delivery mixture. When titrating free Mg2+ concentrations, the corresponding amount of Mg(OAc)2 was used in both the imaging buffer and the delivery mixture. When additional protein factors were added to each experiment, they were added to the delivery mixture in the denoted amounts. For experiments performed with GDPNP, the 48S PIC was formed in the presence of GTP as described above. After immobilization, the imaging buffer used contained 0.2 mM GTP; no GTP was added in the delivery mixture and 6 mM GDPNP:Mg2+ was added instead.

The resulting experimental movies were analysed using custom MATLAB scripts, as previously described14. In brief, fluorescence traces from the ZMWs were filtered on the basis of the presence of the fluorophores of interests at different time points (the signal from immobilized 48S PICs was expected to be present at the beginning of the movie, whereas signals from fluorophores attached to the 60S, eIF5B and tRNA were expected not to be). The different arrival and departure events were manually assigned based on the appearance and disappearance of the corresponding fluorescent signals.

Cryo-EM analysis of the 48S PIC formed with cap-RPL30 mRNA

The 48S PIC sample for the cryo-EM analysis was prepared in the same way as that for the real-time single-molecule experiments in the presence of ATP, GTP and 3 mM free Mg2+. After the incubation at 30 °C, 30 μl of the sample was diluted with 100 μl 1× Recon buffer and then concentrated to 30 μl using a Vivaspin 500 concentrator (100, 000 MWCO PES, Sartorius, VS0141) by centrifugation at 6,973g for 5 min at 4 °C. This step mimics the washing step in our single-molecule experiments for removing unbound components. A 3 μl sample was applied to each 25-s glow-discharged (in a PELCO easiGlow glow discharger, Ted Pella, Inc.) 200-mesh Quantifoil R2/1 gold grid (Electron Microscopy Sciences, Q250AR1) at 21 °C and 95% humidity. The grid was blotted for 2 s before plunging into liquid ethane using a Leica EM GP (Leica Microsystems). Automated data acquisition was performed using SerialEM44 on a Tecnai F20 microscope operated at 200 kV at a dose rate of 10 e per pixel per s using a defocus range of −2.0 to −4.0 μm. Eighty movie-mode images with 10-s exposure time and 50 frames for each were recorded on a Gatan K2 Summit direct electron detector camera in a counting mode at a nominal magnification of 14,500 (yielding a pixel size of 2.53 Å on the specimen level). The drift correction was performed using MotionCor245 and the contrast transfer function (CTF) estimation was performed using CTFFIND-4.146. The images with bad CTF fitting were excluded and 78 images were selected for further data processing. The initial particle-picking was performed using EMAN247 and the saved box files were imported into RELION-3.0-beta-248 for the first 2D classification to generate some good 2D class averages as a new template for automatic particle-picking in RELION. The 3D initial model was built using RELION. After the first six-class 3D classification, 11,478 particles within one class from a total of 17,654 good particles were selected for the 3D auto-refine to obtain a map at 9.0-Å resolution (not shown). Then a soft mask around the Met-tRNAMeti region was created to perform the particle subtraction followed by the focused classification (without image alignment) into two classes: with Met-tRNAMeti and without Met-tRNAMeti. Finally, 4,930 particles were selected for the final 3D auto-refine to reconstruct the map at 9.9-Å resolution. The scanning-competent open (EMD 3049) or scanning-incompetent closed (EMD 3048) 48S PIC structures were low-pass-filtered to the same resolution (9.9 Å) and scaled to the same pixel size using the command relion_image_handler in RELION for visual comparisons in Chimera (UCSF) (see Extended Data Fig. 2c–e).

Cryo-EM analysis of the on-pathway 80S complexes during initiation and the transition to elongation

To probe the conformational states of the 80S complexes along the native initiation and transition to elongation pathways, we decided to analyse the sample corresponding to the reaction time point 45 s (under pre-steady-state conditions), when the 80S initiation complex should account for around 64.5% (and the 80S elongation complex account for around 35.5%; see Supplementary Note) of the total 80S complexes assembled on the model mRNA at 20 °C and 3 mM Mg2+ in the presence of wild-type eIF5B and GTP. Also, to ease the post-acquisition particle-classification procedure, we decided to use excess 48S PICs to 60S, such that free 60S particles would be minimized. To this end, we assembled 48S PICs in the same way as in the single-molecule experiments with the model mRNA at 3 mM free Mg2+, and the only difference was that the concentration of the unlabelled 40S was 300 nM (assuming 100% complex assembly, the 48S PIC concentration should be 300 nM). In parallel, an initiation mixture containing 150 nM unlabelled 60S, 1 μM eIF5 and 2 μM eIF5B was prepared. After mixing equal volumes of the 48S PIC with the initiation mixture, the final concentrations were 150 nM 48S PIC, 75 nM 60S and 1 μM eIF5B. At 20 °C under these conditions, we would expect the reaction kinetics to be similar to that of the single-molecule experiments described above.

While the 48S PIC and initiation mixtures were kept on ice, the 200-mesh R2/1 Quantifoil grids were glow-discharged for 25 s. When the Leica EM GP system was ready for grid plunging, 5 μl of the 48S PIC mixture was mixed with 5 μl of the initiation mixture, and incubated at about 20 °C for 30 s before approximately 3 μl of the sample was applied to each grid. The grid was immediately blotted for 2 s using a filter paper before being plunged into liquid ethane with 95% chamber humidity at about 21 °C (resulting in a reaction time from mixing to grid-freezing at 45 s). Automated data acquisition was done using SerialEM44 on a Tecnai F20 microscope operated at 200 kV at a dose rate of 10 e per pixel per s using a defocus range of −0.7 to −2.5 μm. A total of 226 movie-mode images with 10-s exposure time and 50 frames for each were recorded on a Gatan K2 Summit direct electron detector camera in a counting mode at a nominal magnification of 14,500 (yielding a pixel size of 2.53 Å on the specimen level). The drift correction was performed using MotionCor245 and the CTF estimation was done using CTFFIND-4.146. The images with bad CTF fitting, too much drift or that were over-focused were excluded, and eventually 134 images were selected for further 2D and 3D classifications. The initial particle-picking was performed using EMAN247 and the saved box files were imported into RELION-3.0-beta-248 for the first 2D classification, to generate some good 2D class averages as a new template for better 80S particle-picking in RELION. The 3D initial model of the 80S complex was built using RELION. The following 2D and 3D classifications were used to eliminate junk or bad particles, or even free 40S or 60S particles. In total, 25,144 80S particles were selected for the 3D auto-refine to obtain the map at 7.8-Å resolution (not shown). The eIF5B Protein Data Bank (PDB) model was selected from the PDB file 4V8Z12 and fitted into the 7.8-Å 80S map. A zone with radius 6 around the fitted eIF5B PDB model was generated using Chimera. Then a soft mask around the eIF5B density was made using mask creation in RELION. The particle subtraction was done with this soft mask to perform the focused 3D classification (without image alignment) into two classes of 80S: 17,602 particles with eIF5B bound (class 1, 80S initiation complex; Extended Data Fig. 10c) and 7,542 without eIF5B (class 2, 80S elongation complex; Extended Data Fig. 10d). Finally 3D auto-refine reconstructed the 80S initiation complex map at 8.4-Å resolution and the 80S elongation complex map at 10.1-Å resolution.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this paper.