Main

Secretory proteins are proposed to target to the endoplasmic reticulum (ER) membrane either co- or post-translationally for subsequent translocation10,11,12. Mechanistic models of ER targeting and the role of the SRP derive primarily from cell-free systems using model proteins10,13, raising the question of how these pathways function in the cell. To investigate membrane targeting in vivo, we fractionated soluble and membrane-attached ribosomes from yeast cells, and then used ribosome profiling (termed Ribo-seq)9 to compare the ribosome-protected mRNA footprints from polysomes obtained from both fractions (Extended Data Fig. 1a). We derived a cotranslational membrane enrichment score for each coding sequence (Methods, Extended Data Fig. 1b and Supplementary Table 1). Transcripts encoding cytosolic or nuclear (cytonuclear) proteins were preferentially translated on cytosolic ribosomes and not enriched on membrane polysomes (Fig. 1a). Tail-anchored proteins, whose single or transmembrane domain (TMD) at the carboxyl terminus is only revealed posttranslationally14, were also translated on cytosolic ribosomes. By contrast, many nuclear-encoded mitochondrial protein transcripts were enriched in the membrane-bound ribosome fraction, as expected15. Transcripts encoding ER-destined secretory proteins were highly enriched on membrane-bound ribosomes. Proteins containing a signal sequence (SS) or TMD had comparable cotranslational membrane enrichment, conflicting with the idea that the targeting signal itself distinguishes which proteins are targeted co- or post-translationally to the ER11,12 (Fig. 1a).

Figure 1: Cotranslational membrane enrichment.
figure 1

a, Distributions of the open reading frame (ORF) enrichment of ribosome-protected reads in the membrane fraction compared to the soluble fraction. ORFs were alternatively classified by expected SRP dependence11. Values are the mean from two biological replicates. **P ≤ 0.01, Wilcoxon rank-sum test. b, Ribosome-protected reads at each codon of an example transmembrane protein OLE1. Membrane topology is indicated above, with the first TMD in lavender. c, Metagene analysis of soluble fraction polysome-protected reads from transcripts that were at least twofold membrane enriched. ORFs were aligned at the targeting signal and scaled. d, Cotranslational membrane targeting is in competition with elongation. e, Elongation inhibitors provide additional time for polysomes exposing a targeting signal to localize to the membrane. f, Membrane enrichment was limited by the length of the reading frame remaining after the encoding of targeting signals. The vertical dashed line indicates 50 codons.

PowerPoint slide

Ribosome profiling provides a snapshot of the abundance of ribosomes at each codon of each mRNA9, revealing the dynamics of translation on soluble versus membrane-bound ribosomes. For cytonuclear proteins, soluble ribosome-protected reads were distributed across the entire reading frame, consistent with complete translation in the cytosol (Extended Data Fig. 1c). For secretory proteins, both soluble and membrane-bound polysomes produced protected reads. Cytosolic translation represented only a small fraction of any given secretory transcript, and most of the secretory mRNA pool was membrane anchored. In the classical understanding of cotranslational targeting, secretory protein RNCs bind to the membrane only after exposing a targeting signal4. Thus, there should be fewer RNCs found on the membrane translating the portion of transcripts not yet targeted, that is, at codon positions upstream of the first SS or TMD. However, the membrane-bound ribosome-protected reads were evenly distributed across the entire transcript (Fig. 1b and Extended Data Fig. 1c, d). This suggests that once targeted, secretory mRNAs remain associated to the ER and their translation initiates at the membrane. This is consistent with the observed proximity of secretory RNCs to the translocon before synthesis of the targeting signal16. The small fraction of secretory mRNA in cytoplasmic pre-targeted RNCs probably represents the pioneer round of targeting.

The positioning of soluble ribosomes along mRNA provides insight into how secretory transcripts are targeted to the membrane. The highest read density for these messages mapped 5′ of the region encoding the first SS or TMD; read density declined after the first targeting signal was exposed by the ribosome, as expected from cotranslational signal-dependent targeting of soluble RNCs to the membrane (Fig. 1b, c and Extended Data Fig. 1d). Surprisingly, the loss of reads after signal emergence was gradual, resulting in many RNCs that remained soluble for hundreds of residues after SS or TMD exposure. This result was inconsistent with the elongation attenuation activity proposed for the SRP17,18 and suggests that elongation continues on cytosolic RNCs upon exposure of a targeting signal (see Supplementary Discussion).

The idea that there is a kinetic competition between continuing elongation in the cytosol and RNC targeting to the membrane makes two testable predictions (Fig. 1d). First, pharmacological inhibition of elongation with cycloheximide (CHX) should decouple these processes, enhancing targeting of translocation-competent RNCs and promoting their depletion from the soluble fraction (Fig. 1e). Cells were subjected to a brief, two-minute CHX incubation before Ribo-seq analysis of soluble and membrane-bound polysomes. Importantly, such brief incubation did not perturb non-secretory polysomes (Extended Data Fig. 1c). By contrast, CHX treatment markedly reduced the soluble secretory reads, but only after cytosolic RNCs exposed the first SS or TMD, that is, 40 codons after its synthesis (Fig. 1b, c and Extended Data Fig. 1d).

The kinetic competition between targeting and elongation predicts that cotranslational membrane attachment is influenced by translation termination. In the absence of an elongation arrest, the probability of RNCs reaching the membrane cotranslationally will decrease as the first SS or TMD is found closer to the C terminus (Fig. 1f and Extended Data Fig. 1e). Indeed, we observed a decline in the maximum membrane enrichment of secretory RNCs when the first targeting signal is near the C terminus. Thus, secretory proteins with a late targeting signal, SS or TMD, must be targeted to the ER posttranslationally (Supplementary Discussion). Overall, our data suggest that cotranslational targeting to the ER in yeast is accomplished via a pioneer round of translation on soluble ribosomes that establishes a pool of ER-residing mRNA that initiate translation at the membrane (Extended Data Fig. 1f).

We next determined which RNCs are substrates of the SRP in vivo. Immunoprecipitation of Srp72p from total soluble RNCs was followed by ribosome profiling of both SRP-associated polysomes and monosomes (Fig. 2a and Extended Data Fig. 2a). Few transcripts encoding cytonuclear or mitochondrial proteins were enriched on SRP, confirming its specificity towards ER-destined transcripts. Notably, the SRP bound to all secretory RNCs that were cotranslationally targeted to the membrane, including SRP-dependent and SRP-independent proteins (Fig. 2b, c).

Figure 2: Cotranslational enrichment of SRP.
figure 2

a, Srp72p-TAP was immunoprecipitated from the total soluble fraction. SRP-bound monosomes and polysomes were separated by sucrose gradient ultracentrifugation. b, Distributions of the ORF enrichment of ribosome-protected reads from SRP-bound soluble polysomes over the total soluble polysomes. ORFs were alternatively classified by expected SRP dependence11. Values are the mean from two biological replicates. TA, tail-anchored. *P ≤ 0.05, Wilcoxon rank-sum test. c, Cotranslational membrane-fraction enrichment compared to SRP enrichment. d, Metagene analysis of soluble SRP-bound polysome-protected reads from transcripts that are at least twofold SRP-enriched. ORFs were aligned at the targeting signal and scaled.

PowerPoint slide

The number of ribosome-protected reads from soluble, SRP-bound transcripts diminished after ribosome exposure of the first SS or TMD, as expected from its targeting function (Fig. 2d). The loss was gradual and many SRP–RNCs remained soluble well after the targeting signal became fully exposed to the cytosol. This supports the notion that elongation proceeds on cytosolic ribosomes even after SRP binds, in contrast with the expected SRP-induced elongation arrest. Indeed, blocking elongation with CHX for 2 min before lysis caused a marked depletion in SRP-bound reads, but only for RNCs exposing their first targeting signal (Fig. 2d). In principle, the delayed targeting of soluble RNCs to the membrane after SS/TMD emergence could reflect a delay in SRP binding rather than a lack of elongation arrest. Comparing the SRP and membrane enrichment to transcripts indicated that this is not the case. RNCs encoding late targeting signals, that is, near the C terminus, still bound SRP but did not target to the ER membrane (Supplementary Discussion, Fig. 2c and Extended Data Fig. 2b–d). Addition of CHX allowed these late-signal RNCs to enrich at the membrane, indicating the SRP–RNC complexes are competent for ER-targeting. We conclude that the SRP binds the nascent chain quickly, and continued elongation causes termination of late signals before targeting.

Although elongation arrest is not a general consequence of SRP binding in vivo, recent work showed that a rare-codon-directed slowdown of elongation facilitates SRP binding19. An intrinsic, non-SRP-dependent elongation slowdown should increase ribosome-protected reads at the same codon in both soluble SRP-bound and membrane-bound polysomes. Indeed, several transcripts presented such local increases in ribosome-protected reads at sites corresponding to exposure of a targeting signal on the ribosome (Extended Data Fig. 3a–c). Distinct elongation attenuation mechanisms observed at these sites included clusters of rare codons19 and stalling polypeptide elements, such as stretches of positively charged amino acids, or proline motifs, positioned within the exit tunnel20,21. While most secretory transcripts were not significantly enriched in these attenuator elements compared to the proteome (Extended Data Fig. 3d, e), the few non-secretory proteins that cotranslationally bound to the SRP were enriched in elongation attenuation elements positioned at sites that exposed a near-cognate hydrophobic sequence for SRP binding (Extended Data Fig. 3d, f). We speculate that the presence of such elements enhances SRP recognition of the near-cognate hydrophobic tracts in these non-secretory proteins.

To understand the basis for the specificity of the SRP in vivo, we next determined the initial point of SRP recruitment to ribosomes translating secretory proteins. Because polysomes require only a single SRP-bound ribosome to co-purify with Srp72p, additional strategies were necessary to identify mRNA footprints that originated from a single SRP-bound ribosome. We developed a protocol using in vivo monosomes to identify the initial SRP binding event on RNCs (Fig. 2a). At any given time, a fraction of transcripts contains only a single actively translating ribosome (Extended Data Fig. 4a). Total soluble monosomes yield a similar distribution of protected reads compared to polysomes (Extended Data Fig. 4b–e and Supplementary Discussion). We separated soluble SRP-bound monosomes from SRP-bound polysomes and subjected both fractions to Ribo-seq analysis (Extended Data Fig. 5a, b). Of note, the monosomes were necessarily bound to the SRP during the purification, and thus should reveal which codons are responsible for the initial SRP recruitment step.

The canonical model that the SRP recognizes the nascent chain after the targeting signal exits the ribosome4 (Fig. 3a) makes several predictions. First, there should be few monosome-protected reads relative to polysomes before the first SS/TMD emerging from the ribosome tunnel; second, ribosome footprints should increase beginning approximately 40 codons after the first codon in the targeting signal, and third, monosome reads should decrease after full exposure of the SS/TMD, as SRP–RNCs are delivered to the membrane. Indeed, these patterns were observed in a subset of secretory transcripts with significantly more hydrophobic signals (Fig. 3b, Extended Data Figs 2e, f and 5c). SRP recruitment to these RNCs only occurred when the translated signals were fully exposed, and not while still in the exit tunnel22,23 (Extended Data Fig. 5d and Supplementary Discussion).

Figure 3: Distinct mechanisms of SRP recruitment.
figure 3

a, Recruitment of SRP to RNCs is expected to increase ribosome-protected reads from SRP-bound monosomes when an SS or TMD is exposed to the cytosol (orange). b, Distributions of SRP-bound ribosome reads on representative transcripts from CHX-treated cultures. Selected transcripts are RCR2 and VBA4. c, Most secretory proteins demonstrated SRP enrichment before signal exposure. d, Metagene plot of the median value of enrichment of SRP-bound monosomes compared to polysomes. Included transcripts encode TMDs at least 40 codons from the start codon. Shaded areas represent enrichment before the TMD is encoded (cyan), while the TMD is in the ribosome exit tunnel (lavender), and after the TMD is exposed (orange). e, Two mechanisms for SRP to select secretory mRNA. f, PMP1 and PMP2 were the only tail-anchored proteins that enriched SRP. g, The GFP ORF was fused to the indicated 3′ UTRs and expressed in vivo. Srp72p-TAP was immunoprecipitated from the total soluble fraction and RNAs were subject to quantitative PCR (qPCR). n = 3 biological replicates; **P ≤ 0.01, Welch’s t-test. h, Puromycin treatment of lysate from yeast expressing GFP with the PMP1 3′ UTR was followed by SRP immunoprecipitation and qPCR. n = 3 biological replicates; *P ≤ 0.05, Welch’s t-test.

PowerPoint slide

Notably, most secretory transcripts did not conform to the predictions of the model (Fig. 3b, c and Extended Data Fig. 5e). Instead, ribosome footprints from most SRP-bound monosomes were abundant well before translation of the first targeting signal. For instance, the RNCs of DAP2 were enriched on SRP from the start codon. For membrane proteins, SRP enrichment could be observed up to hundreds of codons before translation of the first TMD (Fig. 3d and Extended Data Fig. 5e). Thus, the exquisite selectivity of SRP towards secretory transcripts occurs via RNCs that have not yet translated any SS or TMD. Of note, the SRP-bound monosome reads did diminish upon full signal exposure by the ribosome. Thus, SRP is pre-recruited to secretory RNCs before the synthesis of an SS or TMD, but only after the emergence and recognition of the targeting signal can it promote membrane targeting, presumably owing to a conformational change in the SRP–RNC complex22,24. Our findings show that the SRP stably and preferentially binds ribosomes translating secretory mRNAs in a manner independent from the sequence of the exposed nascent chain (Fig. 3e). Models in which SRP scans all ribosomes with high affinity and rapid kinetics25 do not explain our findings, as discussed in the Supplementary Discussion.

To begin to understand the determinants that confer specific recruitment of SRP without an exposed SS or TMD, we examined the most extreme cases of nascent-chain-independent SRP recruitment. PMP1 and PMP2 encode two abundant, small membrane proteins of 40 and 43 amino acids, respectively. Even though the entire proteins are smaller than the length of the ribosomal tunnel, PMP1 and PMP2 RNCs bind to the SRP throughout translation (Fig. 3f and Extended Data Fig. 6a, b).

We considered whether non-coding mRNA determinants could confer nascent-chain-independent SRP recruitment. PMP1 and PMP2 contain long 3′ untranslated regions (UTRs) implicated in membrane attachment26. We thus tested the effect of fusing the 3′ UTR of either PMP1 or PMP2 to the mRNA of cytosolic green fluorescent protein (GFP) lacking any targeting signal (Fig. 3g). The 3′ UTR of cytosolic protein TUB2 served as a control. Notably, the 3′ UTRs of either PMP1 or PMP2 conferred cotranslational SRP binding to the GFP transcripts, as well as membrane localization, whereas the 3′ UTR of TUB2 did not (Fig. 3g, Extended Data Fig. 6c). For all constructs, GFP protein was diffuse and cytosolic, indicating that the 3′ UTR alone is insufficient to promote substantial translocation of GFP into the ER (Extended Data Fig. 6d). Notably, the 3′ UTR of endogenous PMP1 is functionally important in vivo. Thus, replacing the 3′ UTR of the PMP1 gene with the 3′ UTR of TUB2 resulted in a growth defect more severe than complete deletion of the entire PMP1 gene (Extended Data Fig. 6e). Perhaps mislocalization of the TMD in the absence of the 3′ UTR is more toxic than the loss of gene function.

Two non-exclusive models can account for SRP recruitment by the PMP1 and PMP2 3′ UTRs. First, SRP binds to the mRNA, either directly or through other RNA-binding proteins. Alternatively, ribosomes translating PMP1 or PMP2 recruit SRP in a 3′ UTR-mediated manner (Extended Data Fig. 6f). To distinguish between these possibilities, a puromycin incubation was used to disrupt elongating27 ribosomes before fractionation and SRP immunoprecipitation. This treatment caused a significant reduction in the GFP-PMP1 mRNA that copurified with SRP (Fig. 3h). Thus, translating ribosomes promote SRP recruitment to the GFP–PMP1 transcript. Of note, puromycin also disrupted the SRP interaction with the SEC61 mRNA control. We thus next examined the general role of translation in SRP recruitment.

We assessed the global ribosome dependency of SRP binding to secretory transcripts using either puromycin or CHX incubations to disrupt or stabilize elongating ribosomes, respectively. Srp72p-bound transcripts isolated from the soluble fraction were examined by RNA-seq (Fig. 4a). SRP association with all secretory mRNAs was sensitive to puromycin. Transcripts that only recruit SRP through a canonical nascent chain interaction were more dependent on elongating ribosomes. The reduced puromycin sensitivity observed for pre-enriched transcripts may arise from the inability of puromycin to disrupt initiating ribosomes27, which appear able to recruit SRP (Extended Data Fig. 6g).

Figure 4: Translation and the role of SRP.
figure 4

a, Distributions of RNA-seq SRP enrichment scores from secretory protein transcripts (SS, TMD, SS-TMD or tail-anchored), with or without puromycin treatment. Included ORFs have at least twofold SRP enrichment without puromycin. b, Transcripts are retained on the membrane through binding of the RNC to the translocon. It is also possible that mRNA binding proteins at the ER bind transcripts. c, The prt1-1 allele prevents initiation at non-permissive temperatures. Translational run-off removes all ribosomes from transcripts. d, Distributions of RNA-seq membrane-enrichment scores of secretory protein transcripts (n = 584). e, After mRNA export, a pioneer round of targeting directs secretory transcripts to the ER membrane. SRP is specifically pre-recruited to transcripts that will present a functional targeting signal. After emergence of an SS or TMD, SRP directs RNCs to the ER membrane. Once at the ER membrane, transcripts are retained over several rounds of translation.

PowerPoint slide

We next examined whether the membrane association of secretory transcripts similarly depends on continuing translation. In principle, ER-localized proteins could recruit secretory transcripts to the membrane in the absence of translation26,28 (Fig. 4b). Membrane and soluble mRNAs were fractionated in the presence and absence of puromycin treatment and subjected to RNA sequencing (RNA-seq) analysis (Extended Data Fig. 7). Disruption of translating ribosomes reduced membrane enrichment for all secretory protein transcripts, including those of PMP1 and PMP2. This result was confirmed using the GFP-PMP1 reporter (Extended Data Fig. 6h).

The translation-dependence of membrane association for secretory transcripts was further examined using a temperature-sensitive allele of eIF3 subunit PRT1, prt1-1. Shifting cells to the non-permissive temperature precludes mRNA binding to the 40S subunit29, allowing elongating ribosomes to run off (Fig. 4c). After displacing ribosomes from the mRNA, soluble and membrane fractions were analysed by RNA-seq. Notably, the only mRNAs that remained in the membrane fraction corresponded to mitochondrially encoded proteins. The membrane enrichment of all ER-destined secretory transcripts was abolished in the absence of translation (Fig. 4d). Thus, translation is required for the observed association of mRNAs with membranes.

Our findings define the principles of cotranslational membrane targeting and the role of SRP in this process and provide a solution to the paradox of how SRP achieves exquisite specificity in vivo despite its low abundance, its substrate binding promiscuity, and despite the competition from abundant cytosolic chaperones30 that could potentially bind SSs or TMDs. For most mRNAs, SRP does not need to scan translating ribosomes rapidly for binding of targeting sequences while ignoring near-cognate cytosolic hydrophobic sequences6. Instead, several mechanisms bias towards the correct SRP–RNC interactions (Fig. 4e). For most secretory mRNAs, SRP binds before targeting signals are synthesized, in a pioneer round of cytoplasmic translation. Pre-recruited SRP is thus poised to recognize the SS or TMD after emergence of a targeting signal from the ribosome22 and facilitate membrane attachment. For a smaller fraction of clients with more hydrophobic-targeting signals, SRP recruitment is initiated by binding RNCs that fully expose SS or TMD in the nascent chain. We do not observe an SRP-induced elongation arrest, but some mRNAs have intrinsic elements attenuating elongation upon signal exposure. Since membrane targeting is in kinetic competition with continued elongation, posttranslational targeting dominates for proteins with a late targeting signal. Once at the membrane, secretory mRNAs remain bound through subsequent rounds of initiation and translocon engagement. These hydrophobic proteins will no longer compete with soluble proteins for cytosolic quality control components. Conversely, transcripts not captured in this first round of selection become more likely to encounter cytosolic chaperones. One important and surprising conclusion is that cotranslational events governing nascent polypeptide fate are not only guided by the nascent chain itself, but also rely on additional aspects of translation, such as mRNA itself and cellular organization. These findings illustrate the multi-layered nature of protein biogenesis fidelity.

Methods

Data reporting

No statistical methods were used to predetermine sample size. The experiments were not randomized, and the investigators were not blinded to allocation during experiments and outcome assessment.

Yeast strains

Ribosome profiling (Ribo-seq) and qPCR assays were performed using BY4741 Srp72-TAP::His3MX, obtained from OpenBioystems31. BY4741 SEC61-GFP::His3MX was obtained from Invitrogen32. Strain CY2522, containing the prt1-1 temperature-sensitive allele33, was provided by E. Craig. BY4741 PMP1Δ::kanMX4 was obtained from the yeast deletion library34.

Ribosome profiling

For each biological replicate, six 500-ml cultures of YPD were grown with shaking at 30 °C to OD600 nm = 0.8–1.0, and collected one at a time by filtering through a 0.22-μm membrane. Cells were scraped off the filter in a single motion using a metal scoopula and then immersed in liquid nitrogen. When indicated, CHX treatments were performed by adding the drug to the culture to 100 μg ml−1 immediately before filtering. Filtration completed in approximately 2 min, and cells were scraped and immersed in liquid nitrogen within 3 s.

Lysis buffer comprised 50 mM potassium MOPS, pH 7.2, 275 mM potassium glutamate, 5 mM magnesium acetate, 1 mM DTT, 100 μg ml−1 CHX, and 20 U ml−1 Superase•In (Ambion). Two 3-ml aliquots were supplemented with Complete Protease Inhibitor Cocktail, EDTA-free (Roche) and frozen dropwise in liquid nitrogen. One 3-ml aliquot of frozen lysis buffer was combined with cells from 1.5 l of culture in a 50 ml ball mill chamber chilled in liquid nitrogen (Retsch). Cells were pulverized for 1 min at 20 Hz in a MM-301 mixer mill. Pulverized cells from 3 l of culture were combined and thawed in a room temperature water bath. Lysates were immediately centrifuged in a Type 70.1 Ti rotor (Beckman) for 10 min at 12,000 r.p.m. The following were then added to the supernatant: Triton X-100 to 0.01%, heparin sulfate to 0.2 mg ml−1, and PMSF to 1 mM. Heparin was added as an RNase inhibitor only after fractionation as it may dislodge ribosomes from the membrane35. A low concentration of Triton X-100 reduces bead clumping during immunoprecipitation, and prevents aggregation upon elution. A portion of the supernatant was retained as the total soluble fraction. Three millilitres of lysis buffer supplemented with Triton X-100 to 1% and heparin sulfate to 0.2 mg ml−1 were added to the pellets. Pellets were resuspended using a glass dounce homogenizer fit to the internal diameter of the tube. Membrane extracts were centrifuged as before, and the detergent-extracted supernatant is recovered as the total membrane-bound fraction.

One millilitre of streptavidin-conjugated magnetic beads (Pierce) was saturated with biotinylated total rabbit IgG (Calbiochem). Beads were incubated with the total soluble fraction for 1 h at 4 °C. Beads were then washed three times with 1 ml of wash buffer, which consisted of lysis buffer supplemented with 0.2 mg ml−1 heparin and 0.01% Triton X-100. Beads were then incubated with 100 μl wash buffer, 2 μl Superase•In and 3 μl AcTEV protease (Invitrogen) for 1 h at room temperature. The first eluate was retained on ice, the digest was repeated, and the eluates combined, yielding the total SRP-bound fraction. The eluate was immediately quantified using A260 nm, and the total soluble and membrane-bound fractions were diluted in wash buffer to equivalent concentrations in 200 μl. Samples were layered on a 7–47% sucrose gradient prepared in wash buffer omitting RNase inhibitors. Gradients were centrifuged in a SW 41 Ti rotor (Beckman) for 2.5 h at 39,000 r.p.m., and fractionated using a UA-6 detector and Foxy Jr. fraction collector (ISCO). Five 1-ml fractions containing the polysomes were combined, as were two 1-ml fractions containing the monosomes. Samples were diluted to 6 ml in wash buffer without RNase inhibitors and centrifuged for 12 h at 50,000 r.p.m. in a Type 70.1 Ti rotor.

Ribosome pellets were resuspended in 250 μl of cutting buffer of 20 mM Tris, pH 7.5, 140 mM potassium chloride, 1.5 mM magnesium chloride, and 0.01% Triton X-100. Concentrations were typically 10–100 ng μl−1 as measured by A260 nm. Samples with greater concentration were diluted to 100 ng μl−1 in 250 μl. Fifty units of RNase I (Ambion) were added to each sample, and digests proceeded for 1 h at room temperature. Digests were stopped with the addition of 2 μl Superase•In, transferred to a MLA-130 centrifuge tube (Beckman) and underlaid with 750 μl of 35% sucrose in cutting buffer. Ribosomes were pelleted by centrifugation for 4.5 h at 70,000 r.p.m. Total RNA was extracted from the pellets using a miRNeasy kit (Qiagen). Libraries were prepared as previously described36, quantified by qPCR (Kapa Biosciences), and sequenced using a HiSeq 2500 (Illumina).

RNA-seq

To test disruption of elongating ribosomes, yeast from one 500-ml culture of BY4741 Srp72-TAP::His3MX were lysed in ribosome profiling lysis buffer, which includes CHX. Yeast from a second culture were lysed in buffer prepared without CHX that was supplemented with 0.5 mM puromycin, 1 mM ATP, 1 mM GTP, 10 mM creatine phosphate and 40 μg ml−1 creatine kinase. Both lysates were incubated at room temperature for 10 min after thawing.

To test the effect of initiation in vivo, two 500-ml cultures of CY2522 (prt1-1) and two 500-ml cultures of W303 were grown with shaking at 25 °C to OD600 nm = 0.6. One culture of each strain was shifted to 37 °C, and the four cultures were grown one additional hour followed by harvesting by fast filtration.

Samples of the total soluble, total membrane, and total SRP-bound fractions were prepared as described for ribosome profiling. RNA was extracted using the hot SDS-phenol-chloroform method, and mRNA was purified using oligo-dT beads according to the manufacturer’s instructions (NEB). Eluted mRNA was then fragmented under alkaline conditions9, and fragments of 35–50 nucleotides were purified by PAGE. Libraries were constructed as for Ribo-seq and sequenced using a HiSeq 4000 (Illumina).

Data processing for enrichment scores

Adaptor sequences were trimmed from sequencing reads using Cutadapt37. Two rounds of alignment are performed using Bowtie, and Tophat38,39. First, sequences are aligned against a library comprising mature ribosomal rRNA with Bowtie. Unaligned reads are retained and then aligned against S288C Release 64-1-1 (http://www.yeastgenome.org) using Tophat. Any read with more than one match was removed, and reads were assigned to ORFs and counted using the GenomicAlignments package in Bioconductor40,41. Dubious ORFs were omitted.

Identification of targeting signals and classification of ORFs

Different SS and TMD prediction programs vary in their output, and so we included only consistently predicted targeting signals in our analysis. Protein sequences were given to SignalP V3 (ref. 42), Phobius43, Philius44, and TMHMM45. The following scheme was used for classification.

Mitochondrial

All mitochondrially encoded proteins, and nuclear-encoded proteins that localized to this organelle in at least one of two fluorescence-based screens32,46.

SSs

SSs from non-mitochondrial proteins predicted by SignalP, Phobius and Philius that did not contain any TMDs predicted by Phobius or Philius. The first residue of the hydrophobic domain, as predicted by Phobius, was designated as the first signal residue. Predictions by TMHMM within the first 50 residues were ignored. SSs were defined as ‘looped’ if they enriched on Ssh1p at least 90 codons after the first SS codon16. Every GPI-anchored protein (as previously annotated11) satisfies conditions for an SS, and was included. Some GPI-anchored proteins have predicted TMDs near the stop codon; these do not expose cotranslationally.

TMDs

The first TMDs of non-mitochondrial proteins were predicted by TMHMM, Phobius, and Philius within five amino acids for each pair of predictors (that is, Philius versus Phobius, Philius versus TMHMM, Phobius versus TMHMM). The first signal location was designated as the average of the three predictions, rounded down. If the TMD was within the first 50 codons, predictions by SignalP are ignored. Phobius and Philius did not predict a cleavable SS.

Tail-anchored

First TMDs that begin within 50 amino acids of the stop codon were designated tail-anchored.

Signal sequences with transmembrane domains (SS-TMD)

SSs were predicted as above with at least one TMD predicted by both Phobius and Philius.

Cytonuclear

ORFs that had no predicted SS or TMD by SignalP, TMHMM, Phobius, or Philius, and which did not appear in the mitochondria. Since only fluorescence localization data were used to designate mitochondrial proteins, this set includes some true mitochondrial proteins.

Exceptions

All remaining sequences had an SS or TMD predicted by SignalP, Phobius, Philius, or TMHMM that was not predicted by the other programs. Because of the ambiguity in type and location of the targeting signal, these proteins are excluded from our analysis. Other exceptions included proteins with predicted TMDs from position 50 or later, as well as a SignalP SS prediction but no Phobius or Philius SS. We considered this ambiguity in the prediction of an SS, and excluded these ORFs.

Enrichment analysis

Count-based sequencing assays report changes in a transcript’s abundance as changes in its proportion of the total sequencing reads. Thus, when a subset of transcripts is enriched in a sample, the proportion of reads from all other transcripts decreases by a corresponding amount. Here we assume that enrichment on the membrane or SRP represents active selection of certain transcripts, and the depletion of all others is passive. In other words, we assume that few transcripts will be specifically prevented from appearing in the membrane or SRP. The distributions of the ‘Cytonuclear’ sets in Figs 1a and 2b, which skew towards enrichment, are consistent with this assumption. Thus, we expect the distribution of non-enriched transcripts to be similar to their overall expression or translation. This makes direct comparison of different enrichment scores (that is, SRP enrichment vs membrane enrichment, under various drug treatments etc.) calculated from proportional abundance15,16,47 unintuitive, because a component of enrichment appears as depletion of non-enriched mRNA.

In current approaches for differential gene expression, most genes are assumed to have unaltered abundance, and library sizes are normalized by a robust estimator such as median ratio method48 and trimmed mean of M-values (TMM)49. However, we expected changes for up to a third of ORFs. We used the TMM method of DESeq to derive library scale factors using reference ORFs, selected as those designated as ‘cytonuclear’. We applied these scale factors to the counts for every ORF, and calculated enrichment as the ratio of scaled reads between sets. Biological replicates were scaled separately and then averaged. The robust nature of scale estimation allowed for extreme cases of the reference set to have high enrichment scores. Scores were only reported for ORFs that have at least 100 total reads between replicates in each of the compared samples9,47.

Figure 2c included proteins designated as SS, TMD and SS-TMD that have at least 100 reads in all four data sets, with the following exceptions: YEL050C, YLR077W, YML061C, YOL053W and YPL132W, which were all observed on the surface on the mitochondria15 and had at least fourfold membrane enrichment here. Membrane enrichment after CHX treatment was determined using total ribosome-protected reads from the soluble and membrane fractions.

Mapping of ribosome-protected reads to codons

Reads were mapped to codons using an alternate method. After filtering rRNA, reads were aligned to a Bowtie library comprising coding sequences, plus the stop codon and 21-nucleotides flanking upstream and downstream. Using combined data from the SRP-pulldown and membrane polysome replicates, ORFs for which at least 20% of reads could map to a second ORF were removed, leaving a working set of 5,441 genes. Footprints of 26–35 nucleotides were processed separately for each length. The nucleotide that mapped to the centre of each read (rounded down) was given a value of 1, and reads were summed at each nucleotide position. A metagene analysis was performed50 and for each footprint length, an integer offset was determined so that the characteristically large peak at the start codon was maximized at the second nucleotide position (that is, aTg). Then, reads of all lengths were offset and combined. Nucleotide reads were summed for each codon.

Read distributions

For each sample, the total reads from elongating ribosomes were determined by adding counts from all codons excluding the first two and last two sense codons. Reads from biological replicates were summed to increase overall read depth, but owing to the high overall reproducibility, all of our conclusions can be demonstrated by treating replicates separately. The reads at each codon position are then divided by this total and multiplied by one million to yield reads per million (RPM). Values at each codon are smoothed using an 11 residue rolling average. The positions of TMDs in topology diagrams were taken from the TMHMM prediction. We caution that predictors may differ in the number and position of subsequence TMDs. Positions of SSs are the H-region predicted by Phobius. The point at which an SS or TMD begins to emerge is considered 40 codons after the first encoded residue of the signal.

For metagene plots, reads at each codon are smoothed using a 5 residue rolling average. ORFs are then aligned as indicated, and the median and interquartile ranges are calculated at each position. For each ORF in Figs 1c and 2d, reads at each codon position are divided by the mean reads per codon within the range +20 to +40 after first signal codon. Included ORFs have at least 20 reads within this window in each data set shown. The first 30 codons of each ORF are excluded to avoid the universally observed low-density region near the start codon.

Identification of SRP recruitment to the nascent chain

Increases in ribosome-protected reads in the soluble SRP-bound polysome data set were observed for a subset of ORFs at codon positions coincident with the exposure of targeting signals. We developed a clustering scheme that sorted ORFs by the shape of the distribution of ribosome-protected reads specific to SRP-bound polysomes. The test set comprised 568 SS, TMD or SS-TMD proteins with at an average of at least 3 reads per codon in both soluble SRP-bound polysome and membrane-bound polysome sets. For each ORF, we first smooth read counts from each data set with a leading 15-residue moving average window. We corrected for local features intrinsic to the sequence (that is, appearing in all fractions) by dividing the smoothed SRP-bound polysome reads by the smoothed membrane-bound polysome reads at each codon; positions with fewer than 3 reads in either smoothed set were omitted. Peak codon positions were identified as the maximum value within 30–180 residues after the first predicted targeting signal codon. Each ORF was scaled by dividing the value at each codon position by the mean value over the range from 50 codons before to 200 codons after the peak. Codon positions outside this range are discarded. Scaled values are then used to generate an empirical cumulative distribution function (ECDF). The ECDF is sampled from 0.0 to 2.0 in 0.1 steps. The samplings from the ECDFs were used for agglomerative hierarchical clustering using a Euclidean distance function and Ward’s minimum variance method51. The first split in the population distinguished ORFs having strong peaks from ORFs with weak or no peaks.

To analyse the distances between the first signal codon and the peak, peaks must be unambiguously assigned to the first targeting signal; otherwise the peak may be due to a later TMD. The distance to the peak was compared to the distance between the first TMD and the second (as determined by TMHMM) or the distance between the SS and the first TMD (as determined by Phobius). Peaks were considered unambiguous only if no more than 8 residues of the next TMD were translated. This value is the length of the shortest functional signal sequence in our set, controlling for any affect that an additional signal within the exit tunnel may have on SRP.

An alternative approach was used to determine codons with significant recruitment of SRP in mitochondrial transcripts and transcripts lacking an ER-targeting signal. For every ORF, a count matrix was built with codon position as rows, and replicates of SRP-bound polysome ribosome-protected reads, total soluble polysome reads and total membrane polysome reads as columns. These matrices were individually input to DESeq2 (ref. 48), and a linear model was fit using the presence or absence of SRP co-immunoprecipitation as the coefficient; in this application, codons were treated as ‘genes’. Reads at each codon were used for fitting local dispersion trends. Genes with at least one codon that had at least threefold enrichment with P < 0.001 were selected. We note that all six SRP subunits had local enrichment towards the C terminus; since this may be cotranslational particle assembly, we omitted them from further analysis. Thirteen other genes were identified, and binding sites were assigned to the first significantly enriched codon preceding the position with maximum enrichment.

Quantification of early SRP enrichment

Secretory protein ORFs which were used in the analysis of SRP recruitment to the nascent chain (see previous section) were also tested for SRP pre-enrichment by comparing ribosome-protected reads from SRP-bound polysomes and monosomes from a CHX-treated culture. The RPM values from codon 10 to the position of the first SS or TMD, plus 40, were added, and the monosome sum was divided by the polysome sum. The first 9 sense codons were omitted to avoid artefacts near the start of transcripts. Ratios of greater than 1 were designated pre-recruited.

The enrichment in Fig. 3d was determined by first smoothing SRP-bound monosome and polysome reads (in RPM) using an 11 codon window, and then dividing the monosome values by the polysome values.

GFP reporter constructs

Sequences of the yeast TUB2 5′ UTR (300 nucleotides preceding the start codon), the TUB2 3′ UTR (300 nucleotides following the stop codon), the PMP1 3′ UTR (600 nucleotides), and the PMP2 3′ UTR (500 nucleotides) were PCR amplified from BY4741 genomic DNA with flanking overlaps to the M13 (−20) forward or M13 reverse sequences, and to the beginning or end of the sfGFP ORF sequence52. The sequence of sfGFP was amplified from a pET33b-derived expression vector provided by W. Clemons. Plasmids were assembled in a single reaction using Gibson assembly53 using the M13 (−20) forward and M13 reverse sequences to amplify pRS315. Plasmids were transformed into BY4741 Srp72-TAP::HIS3MX.

qPCR

For each biological replicate, 500 ml of synthetic complete media lacking leucine were inoculated with an overnight culture to OD600 nm of 0.05. Cultures were grown at 30 °C to OD600 nm 0.8–1.0 and then collected by fast filtration. Cells were lysed and fractionated, and Srp72p was immunoprecipitated as described for RNA-seq. Purified RNA were subjected to TURBO DNase digestion (Ambion). Concentrations were determined using A260 nm, and 100 ng of RNA was used to synthesize cDNA using iScript (Bio-Rad). qPCR was performed on a CFX-96 thermocycler using iTAQ Universal SYBR Green Supermix (Bio-Rad). ACT1 was used as a reference for each fraction from the same culture, and enrichments were determined as fold difference of mRNA in the membrane or SRP-bound fractions over the soluble fraction. Enrichment scores from three technical replicates of the qPCR step were averaged, and biological replicates are shown.

Statistical hypothesis testing

All analysis was performed using the R programming language (https://www.r-project.org). Statistical significance in comparing distributions of SRP or membrane enrichment scores from ribosome profiling (Figs 1a and 2b), as well as in comparing hydrophobicity scores (Extended Data Fig. 2e), codon usage, or residue abundance (Extended Data Fig. 3d, e) was determined using two-sided Wilcoxon rank-sum tests. This test assumes independence of observations and does not require a normal distribution. Enrichment distributions are multimodal and so mean and variance estimates are not provided. Significance of SRP or membrane enrichment from qPCR (Fig. 3g, h and Extended Data Fig. 6c, h) was determined using a two-sided Welch’s t-test on log-transformed enrichment values. This test assumes normal distributions but allows unequal variance. In all tests, the null hypothesis is the distributions of tested populations are equal.

Code availability

Scripts for data processing and analysis in R are available upon request.