Introduction

Expression of proteins for study by structural biology is widely achieved using Escherichia coli due to several advantages relative to other expression hosts. Factors include high expression yields, rapid doubling time, ease in handling and manipulating strains, and the ability to use growth media of low complexity. This last advantage is particularly important for proteins targeted for study by NMR methods, since it facilitates incorporation of stable isotopic labels such as [15N], [13C] and [2H] into the expressed proteins. Many proteins produced for structural and mechanistic studies require post-translational modifications to achieve biologically relevant states, which is particularly true for mammalian proteins. Unfortunately, many putative drug discovery targets are difficult to express in functional or soluble forms in E. coli, which lacks the cellular machinery to affect requisite folding and post-translational modifications. Therefore, proteins that are not expressable in E. coli or require post-translational modifications, such as phosphorylation or glycosylation, are typically expressed in eukaryotic host systems such as insect cells (e.g. Sf9, Sf21, T. ni and Hi5), mammalian cells (e.g. CHO, HEK, COS) or yeast (e.g. Pichia, S. cerevisiae). Significant advances in protein expression for pharmaceutical research have been made in recent years (Assenberg et al. 2013).

Among the eukaryotic expression systems, BV-infected insect cells is perhaps the most straightforward, least labor-intensive and most commonly employed system. In BV-infected insect cells, heterologous over-expression of proteins is achieved by replacing the naturally highly expressed polyhedrin gene with the gene of interest. This mode of protein expression imparts additional flexibility to expressing isotopically-labeled proteins in insect cell by enabling cell growth and infection in unlabeled optimal growth media prior to transferring infected cells to a type-specifically labeled medium (Kelly et al. 2007). Overall, the relative ease of use has rendered the baculovirus expression system a preferred method for protein expression for X-ray crystallography. The use of the baculovirus expression system for the expression of isotopically enriched proteins for NMR applications has proved more challenging due to the considerable complexity of the required growth media in insect cell cultures (Weisset al. 1981; Maiorella et al. 1988; Schlaeger 1996; Gossert and Jahnke 2012). The development of serum-free growth media enabled the incorporation of common amino acid substitutes (such as selenomethionine, or Se-Met) or isotopically enriched amino acids for naturally occurring amino acids. Initial uses of baculovirus expression for incorporating isotopically labeled amino-acids were carried out by DeLange et al. who incorporated [2H]-labeled tyrosine into rhodopsin (DeLange et al. 1998), and Creemer et al. who reported the incorporation of [15N]-labeled lysine into rhodopsin (Creemers et al. 1999). The use of BV-infected insect cell cultures for the production of Se-Met labeled protein for crystallographic phasing was first reported by (Fremont et al. 1998; Bellizzi et al. 1999).

The baculovirus expression system is particularly well suited for type-specific labeling proteins with one or more of its ten essential amino acids (i.e. Arg, His, Ile, Leu, Lys, Met, Phe, Thr, Trp, and Val). Selective isotope labeling of these residue types can be achieved without the risk of isotope scrambling due to the inability of eukaryotic cells to produce these amino acids. [15N]-label dilution and scrambling occurs only for amino acids that are related biosynthetically to the citric cycle (i.e. Ala, Asn, Asp, Cys, Gln, Glu, Gly, Pro, and Ser). [13C]-dilution originates from unlabeled carbohydrates to a varying degree (Strauss et al. 2005). In addition to amino acids, insect cell growth media contain multiple carbon sources, including lipids, carbohydrates and yeastolate. Yeastolate, a nutritional supplement generally comprised of a mixture of peptides, amino acids, carbohydrates, and vitamins, is included in standard insect cell media at levels up to 6 g per liter, and therefore constitutes a major source of dilution of isotope labels. Due to the restricted metabolism in insect cells, for [15N]-labeling, only amino acids and yeastolate need to be considered as sources of isotopic dilution; for [13C]-labeling, carbohydrate may be an additional source for the dilution of [13C]-enrichment.

Two approaches have been used for preparing growth media for BV-infected protein expression in insect cell culture: preparing media from scratch (Bruggert et al. 2003), and augmenting a commercial medium that has been depleted in a targeted set of amino acids with isotopically enriched amino acid(s) (Creemers et al. 1999; Strauss et al. 2003; Walton et al. 2006; Gossert et al. 2011). The former approach enables high levels of uniform isotope enrichment, but suffers the drawback of being expensive if uniform enrichment is the desired outcome. In addition, preparing media from scratch may adversely impact reproducibility of protein expression (Gossert and Jahnke 2012). In contrast, augmenting a commercially available medium that has been depleted of targeted amino acids affords good reproducibility and high target protein expression yields; hence, it appears most suited for the preparation of type-specifically labeled protein samples. Most recently, Gossert and coworkers (Sitarska et al. 2015) developed protocols for uniform stable isotope labeling of proteins using [15N]- and [13C,15N]-labeled algal extracts supplemented with the addition of [15N]-ammonium salt as a nitrogen source and [13C]-glucose as a carbon source. This protein expression method represents the most cost effective approach to uniform labeling yet published.

In this study we refine the method of producing type-specifically labeled protein samples utilizing modified versions of the insect cell growth media ESF 921 (Expression Systems, LLC) in which the media were depleted in targeted amino acid types. Similar to Gossert et al. (2011), we focused on improving the growth medium by altering the yeastolate content. We first quantify the level of residual unlabeled amino acids in various media and their consumption during cell growth and protein expression. We then explore the use of yeastolate that had been dialyzed to remove ingredients having molecular weights less than 1 kDa. Our rationale was to eliminate many low molecular weight compounds in yeast extracts that may constitute unlabeled sources for amino acid synthesis during protein biosynthesis. We then show that the use of dialyzed yeastolate combined with a ten-fold dilution delivered enhanced isotope labeling efficiency as well as at least maintaining similar protein expression yields relative to expression in rich, unlabeled medium.

Results and discussion

In recognition that yeastolate may represent a major source of unlabeled biomaterial limiting the percentage of incorporation of [13C]- and [15N]- isotopes in baculovirus-expressed proteins, we pursued the strategy of using modified yeast extracts that were dialyzed against a filter with a cutoff pore size of 1 kDa. The rationale was to remove amino acids and low molecular weight peptidic fragments from the yeastolate which could potentially be hydrolyzed and converted to amino acids by the insect cells. This step should remove a source of unlabeled amino acids during the biosynthesis of a heterologously expressed protein. Our chosen protein expression protocol resembles the method by Cristiani, Pharmacia (Cristiani 2003). Sf9 or Hi5 cells were grown in commercial ESF 921 growth medium. Once the cell density reached the targeted level, cells were infected with baculovirus. Following an infection period of 16 h the cells were pelleted, washed and transferred to the amino acid-depleted medium. After a 4 h “starvation” period, labeled amino acid stocks were added (see Experimental section for further details). This sequence of events differs from the protocol of Gossert et al. who initiated viral induction after the transfer of cells to the labeled growth medium. In a more recent paper, Gossert et al. (2011) cited improved infection yields in the rich medium by employing the protocol of inducing viral infection 16 h prior to the transfer of cells to the labeled growth medium.

Quantitation of amino acid levels in yeastolate by analysis of growth media

To quantify the residual unlabeled (natural abundance [13C]) amino acid content in growth media, HC-HSQC spectra were recorded on four media formulations which were derived from the following stock solutions: (a) ESF 921; the “rich” undepleted commercial growth medium containing standard level of 8 g/l (100%) yeastolate, (b) ESF 921-Δ11, 10%D, ESF 921 medium depleted in 11 amino acids (i.e. Arg, His, Ile, Lys, Leu, Met, Phe, Thr, Tyr, Trp, Val) and containing 10% (0.8 g/l) dialized yeastolate, (c) dialized yeastolate stock, (d) undialized yeastolate stock. From these stock solutions, the following four media test solutions were examined by NMR:

  1. 1.

    ESF 921; the “rich” undepleted ESF 921 medium containining 100% undialyzed yeastolate,

  2. 2.

    ESF 921-Δ11 10%D stock (containing 0.8 g/l dialized yeastolate),

  3. 3.

    ESF 921-Δ11 10%D stock + 100% undialized yeastolate stock (containing 0.8 g/l dialized yeastolate + 8 g/l undialized yeastolate),

  4. 4.

    ESF 921-Δ11 10%D stock + 90% dialized yeastolate stock (containing 8 g/l yeastolate).

Using the procedure described in the Experimental Details section employing the extraction of cross-peak volumes in HC-HSQC spectra, the percentages of amino acids in the various media formulations were calculated relative to the amino acids content in ESF 921 and listed in columns 3–5 of Table 1. Figure 1 qualitatively illustrates that the undialyzed yeastolate in the ESF 921 growth medium contains a considerable amount of residual amino acids. Figure 1a depicts the super-position of the expanded region of the HC-HSQC spectrum of the rich medium indicated by the blue contours and the ESF 921-Δ11 medium indiated by the red contours illustrating the disappearance of the methyl peaks of the depleted amino acids. Figure 1c shows how cross-peaks were assigned by spiking ESF 921-Δ11 media samples with individual amino acids. With the exception of methionine, all tested amino acids are present in sizeable amounts in ESF 921. This helps to explain why initial attempts to produce labeled proteins in baculovirus expression system yielded relatively poor incorporations of isotope labels (Creemers et al. 1999; DeLange et al. 1998; Strauss et al. 2003). Column 3 in Table 1 shows that the approach by Gossert et al. of diluting the content undialized yeastolate ten-fold does indeed significantly reduce the percentage of amino acids originating from the yeastolate. In each case, the level of residual amino acids is reduced close to 10-fold, indicating the yeastolate is the primary source of unlabeled amino acids in the media. Our original approach of using undiluted, dialyzed yeastolate (ESF 921-Δ11) also led to a significant reduction in all tested amino acids in the 11-amino acid type depleted medium (Table 1, column 4). On average, the use of 100% of dialyzed yeastolate led to a 10−20% higher amino acid content than the medium which contained 10% undialyzed yeastolate (compare columns 3 vs. 4). This finding is in qualitative agreement with our observation of moderately lower 13C-enrichment levels than those reported by Gossert et al. (see below) in the growth media containing undiluted, dialyzed yeastolate. The Δ11 growth medium containing 10% dialized yeastolate, Table 1, Column 5, exhibited at least another four-fold reduction in background amino acids, suggesting that an expression medium containing 10% dialized yeastolate should produce the highest levels of isotope enrichments.

Table 1 Quantitation of unlabeled amino acids in ESF 921-Δ11 with various formulations of dialyzed or undialyzed yeastolate
Fig. 1
figure 1

Expanded region of HC-HSQC spectra in ESF 921. a blue contours: ESF 921, red contours: ESF 921-Δ11 containing 10% dialyzed yeastolate, b blue contours: ESF 921, red contours: ESF 921-Δ11 containing 10% dialyzed yeastolate + 100% undialyzed yeastolate, c Same as B + green contours: spectrum of ESF 921-Δ11 + 0.25 gr L-threonine

Quantitation of isotopic label incorporation

To overcome the reported dilution in isotope labeling efficiency in proteins by the required presence of yeastolate in insect cell cultures, we tested the replacement of yeastolate with dialyzed yeastolate. As shown above, undialyzed yeastolate contains a significant level of amino acids as illustrated in Fig. 1b that can be incorporated into an expressed protein during bio-synthesis. An initial customized medium from Expression Systems, LLC was depleted in seven essential amino acids (i.e. Ile, Leu, Val, Met, Thr, Phe, Tyr) as well as yeastolate (ESF 921-Δ7). Subsequent formulations of the growth medium included the depletion of all ten essential amino acids plus tyrosine (i.e. Arg, His, Ile, Lys, Leu, Met, Phe, Thr, Tyr, Trp, Val) as well as yeastolate (ESF 921-Δ11). Prior to use, ESF 921-Δ7 and ESF 921-Δ11 were replenished with the unlabeled missing amino acids plus dialyzed yeastolate except for those amino acids which were to be added in labeled form. Initial tests proved encouraging. [13C]-incorporation efficiency in ubiquitin type-specifically labeled in threonine and in valine was compared by performing protein expressions with ESF 921-Δ7 supplemented with either dialyzed or undialyzed yeastolate. As expected the [13C]-incorporation efficiency was significantly improved when dialyzed yeastolate was used. For example, in threonine labeled ubiquitin expressed while supplying the medium with 0.2 gram per liter of labeled amino acid, the [13C]-incorpation was 62.4% (Table 2A) when using undialyzed yeastolate, whereas the [13C]-incorporation level jumped to 89.6% when using dialyzed yeastolate. Having determined the residual levels of unlabeled amino acid of the depleted residue types in ESF 921-Δ11 media (Table 1), we also calculated the maximum achievable [13C]-incorporation levels when a set amount of labeled amino acid was added to the depleted medium. For example, the addition of 0.2 grams per liter of labeled threonine is expected to produce a maximum [13C]-level of 62.5% in the medium containing undialyzed yeastolate (Table 2 row 2, column 5). This closely matches the achieved incorporation of 62.4% (column 4). In expression runs in which ubiquitin was type-specifically labeled with valine, the observed [13C]-incorporation levels were 53.3 and 70.5% in media supplied with undialyzed yeastolate and 0.2 and 0.4 grams of labeled valine per liter of media respectively. The estimated upper limits of [13C]-incorporation based on the residual levels of unlabeled valine in the medium were 50.7 and 67.3%. These results suggest that measuring the residual level of unlabeled amino acids, including random coil peptides containing amino acids, provides a good measure of the maximum level of stable isotope enrichment which may be achievable.

Table 2 13C and 15N incorporation efficiency

Additional expressions of single amino acid type-specific labeled ubiquitin using the ESF 921-Δ7 or ESF 921-Δ11 were performed with dialyzed yeastolate at the concentration present in the commercial ESF 921 growth medium to assess [13C]-incorporation levels (Table 2B). The replacement of undialyzed yeastolate with dialyzed yeastolate significantly improved the level of [13C]-enrichment in baculovirus expressed type-specific labeled ubiquitin samples. For most expression runs, the achieved [13C]-incorporation levels were within two percentage points of the calculated upper limits of [13C]-incorporation levels. In expression runs of isoleucine labeled ubiquitin the difference between observed and expected 13C-incorporation levels was about 5%. In the valine expression runs, a large gap of ~10% between expected and observed [13C]-incorporation levels in the expression run in which valine was supplied at 0.2 g per liter. Conversely, in the expression run where labeled valine was supplied at 0.4 g per liter there again was a close match between expected and observed [13C]-incorporation level (92.65% observed with 94.9% expected). The highest level of [13C]-incorporation level was achieved for tyrosine; the [13C]-incorporation level in tyrosine labeled ubiquitin was 92% and 94% in expressions where the growth medium supplied with 0.2 and 0.4 grams per liter of labeled tyrosine respectively. Interestingly, [13C]-labeled tyrosine was already present in ESF 921-Δ11 upon addition of cell-paste while the other 13C-labeled amino acid types were added following a four hour “starvation period” (see Materials and methods section for details).

While our work was still ongoing, a paper appeared in print by Gossert et al. (2011) describing a new protocol for enhancing the isotope labeling efficiency. Previously, Gossert et al. showed that a complete removal of yeastolate produced a high isotope incorporation efficiency in excess of 90% at the expense of significant loss in protein expression yields (Strauss et al. 2003). Using a new protocol whereby only 10% of yeast extract was added to the growth medium, an almost complete restoration in protein expression yield resulted compared to expression in a rich medium.

These results by Jahnke and co-workers inspired us to test whether it might be possible to combine the two approaches whereby the amino acid depleted medium was supplemented with 10% of dialyzed yeastolate. Analysis of ESF 921-Δ11 supplemented with a ten-fold reduced amount of dialyzed yeastolate relative to the standard addition (ESF 921-Δ11 10%D) produced on average close to four-fold further reduction in the content of residual “unlabeled”-amino acid compared to using dialyzed yeastolate at 100% of the concentration of the commercial ESF 921 growth medium. Expression of [13C,15N]-leucine labeled ubiquitin showed a significant increase of [13C]-content from 80% to over 90% using 0.2 g of labeled leucine in ESF 921-Δ11 10%D (Table 2B, rows 11 & 13).

This [13C]-labeling efficiency in [13C,15N]-LEU-ubiquitin is comparable to the results of Gossert who expressed [13C,15N]-LEU-Abl kinase in a growth medium supplied with 0.25 g of labeled leucine and 10% undialyzed yeastolate (Gossert et al. 2011). This raises the question as to possible causes limiting labeling efficiencies of proteins expressed using Δ11 media supplied with ten-fold reduced amounts of dialyzed yeastolate.

One factor reducing isotope enrichment level is the incomplete enrichment levels in commercially available [13C,15N]-amino acids which typically have enrichment levels of about ~99%, giving rise to a ~1% reduction in labeling efficiency. A second factor is the presence of unlabeled amino acids in the cell paste that is transferred to the labeled medium. This should give rise to a further reduction in labeling efficiency of an additional 2–5%. A third potential source of reduction in incorporation levels of isotope labels is the expression of target protein between the time of viral infection of insect cell and the addition of labeled amino acid(s) to the 11-medium. In the presented setup this time lag is 16 h for tyrosine and 20 h for other amino acid types. Only low levels of expression of target proteins is expected during the first 16–20 h post baculovirus-infection since the baculovirus polyhedrin promoter is activated only in the late stages of the viral life cycle (Passarelli and Guarino 2007), (Jarvis 2009).

Furthermore, a significant drop in protein expression levels was reported when viral incubation period was reduced to less than 16 h (Meola et al. 2014). Possibly, a moderate reduction of the baculovirus incubation period may produce an incremental increase in protein labeling efficiency. Inspection of Table 2B shows the highest [13C]-incorporation yield was observed in tyrosine. Using 0.4 g of [13C]-tyrosine produced a [13C]-incorporation yield of 96.8%. Assuming that the 0.4 g of tyrosine, which is the prescribed amount in the rich medium, produced close to 100% [13C]-labeling efficiency suggests that 1–2% of unlabeled tyrosine may be due to protein expression during the viral incubation period. The lower than 96% [13C]-incorpoation yield in [15N,13C]-leucine labeled ubiquitin which was expressed in the medium containing 10% dialyzed yeastolate may be due to residual expression of ubiquitin during the 4 h “starvation” period prior to the addition of fresh medium containing [13C]-labeled leucine. Another potentially significant source of unlabeled amino acids may be cell paste which are transferred from the rich medium and which may comprise of 2–5% of the labeled media volume.

Depletion of amino acids in the growth medium during baculovirus protein expression

To reduce cost of expression with labeled amino acids, we assessed the level of amino acid consumption during two expression runs. We quantified the depletion of amino acids by comparing their levels in aliquots of the growth media taken at the beginning and end of fermentation runs. Levels of amino acids were determined both for labeled amino acids and for unlabeled amino acids containing a methyl group by integration of cross peak volumes in the HC-HSQC spectra. Very little consumption of amino acids was observed when adding the amino acids at levels corresponding to the amounts in rich media. Table 3A lists the amount of each amino acid remaining at the end of the expression of [13C,15N]-Thr, [15N]-Val-CD4 as a percentage of its level at the beginning of the run. In every case, over 85% of the initial amino acid remained (Table 3A). Given the large amounts of amino acids remaining, we lowered the levels of amino acids down to the 0.12–0.20 g per liter range.

Table 3 Consumption of amino acids during growth and protein expression

Table 3B lists the percentage of amino acids remaining after the expression of FAK1. Again, a large percentage of the individual amino acids remained at the end of the expression, demonstrating the expression is not limited by the depicted levels of amino acids being added to the medium. Not surprisingly, the depletion of an amino acid during an expression run is correlated with the concentration of the amino acid in the medium. For example, a reduction in the concentration of isoleucine (Ile) from 0.80 to 0.12 g per liter reduced the residual concentration at the end of the expression from 96.5 to 75.6%—see rows 2 and 3 in Table 3A, B respectively. The most pronounced depletion was observed in arginine (Arg), whose concentration declined almost two-fold during the protein expression when its concentration was reduced from 0.80 grams (rich medium) to 0.12 grams. Even so, greater than 50% of the original Arg in the sample remained at the end of the expression when only 0.12 g were added to the media at the start of expression. Interestingly, reducing the concentration of valine (Val) from 0.6 to 0.2 g per liter produced no noticeable reduction in Val content during a fermentation run.

Inspection of HC-HSQC spectra of growth media also revealed that apart from the amino acids, the concentrations of most components changed very little during the expression runs with a few notable exceptions (highlighted in Figure S1). Figure S1a shows a cross peak which indicates the reduction in concentration of one component by almost half. The [1H] and [13C] chemical shift of this cross peak were subsequently assigned to phosphocholine (PhCh). Figure S2B points to a second component that changes considerably during protein expression. A five-fold increase in intensity was seen for the alanine (Ala) methyl peak due to active synthesis of Ala during the expression run. This observation closely resembles the result by Sitarska et al. (2015) who reported a four-fold increase in Ala concentration during an expression run.

To make reasonable estimates in the accuracy of amino acid depletion Table 3 only depicts depletion of either [13C]-labeled amino acids or methyl group containing amino acids. These results suggest that prior to performing protein expression runs using alternate labeling schemes, it may be prudent to run a trial expression to estimate amino acid depletion and assess minimal concentrations of nutrients in the growth medium prior to setting up a large-scale fermentation run.

Protein expression yields

In spite of the fact that insect cells do not grow well in the ESF 921-Δ11 medium, which was replentished with dialyzed yeastolate and amino acids, we observed good protein expression yields. Expression of labeled KATII, FAK1 and CD4 all produced yields which exceeded what was observed in the expression runs of unlabeled protein samples. However, all expressions of these proteins in unlabeled form were done at larger scale (i.e. 12–40 L), using wave bag fermentation. These differences in growth and expression conditions hampers accurate and meaningful comparions in expression yields between labeled and unlabeled protein sample for these proteins.

For ubiquitin, however, expression expression yields of labeled and unlabeled were both based on shake flask expressions of identical volumes. We observed expression of type-specifically labeled ubiquitin was moderately higher than unlabeled protein.

In summary, it appears that the presented protocol for expressing type-specifically labeled protein appears qualitatively comparable to expression of unlabeled protein in the rich medium (ESF 921).

Application of baculovirus expression system to probe the interaction of an engineered adnectin-based inhibitor to  CD4

In order to probe the site of interaction of the extra-cellular domains of CD4 with an engineered adnectin (Lipovsek 2010), the extracellular domain of CD4 was expressed in insect cell cultures with pairs of amino acid types isotopically enriched such that amino acid type 1 was [13C,15N]-labeled and type 2 was [15N]-labeled. Residue type selection was guided by choosing sequential 13C(i-1),15N(i)-residue pairs which occur at a low abundance in CD4 and for which at least one of the sequentially [13C] and [15N]-labeled amino acid residue pairs are expected to reside near the putative binding site of the adnectin. In order to highlight HN-resonance pairs in 15N-labeled amino acid residues which are preceded by a 13C-labeled residue, CO-edited HN-TROSY spectra were recorded on a Bruker Avance III 700 MHz spectrometer equipped with a TCI-cryogenic probe. CO-edited HN-TROSY spectra were recorded on type-specifically [13C,15N]-Thr, [15N]-Val labeled CD4 in the absence and presence of unlabeled adnectin. At room temperature only poor-sensitivity HN-TROSY spectra were obtained. Raising the sample temperature to 40 °C produced high quality spectra (Fig. 2), which depicts the expanded spectral region of the superposition of CO-edited HN-TROSY spectra in samples of [13C,15N]-Thr, [15N]-Val-CD4 in the absence (blue contours) and presence (red contours) of unlabeled adnectin which was present at a 1.2-fold molar excess. The HN-resonances of all three expected Thr-Val-pairs were observed with one exhibiting a large shift perturbation highlighting the spatial proximity of respective Val residue to the adnectin binding interface. The quality of the depicted spectra is commensurate with the reported high efficiency of [13C]- and [15N]-labeling.

Fig. 2
figure 2

Superposition of CO-edited HN-TROSY spectra in [13C,15N]-Thr, [15N]-Val-CD4 in blue contours: apo form & red contours: in complex with unlabeled adnectin

Conclusions

We have presented an improved protocol for producing type-specifically labeled protein samples in insect cell cultures. Enhancements in isotope labeling efficiency have been demonstrated by replacing yeast extracts in the growth medium by a ten-fold reduced amount of dialyzed yeast extract. In addition, we have shown that the ten-fold reduction in yeastolate reported by Gossert et al. in growth media still produces high level protein expression when dialyzed yeastolate is used. Hence, the use of ten-fold reduced amounts of dialyzed yeastolate offers the potential for enhanced levels of levels of incorporation in type-specifically labeled proteins.

Materials and methods

Protein expression vector generation

Sf9 cells (Invitrogen, Carlsbad, CA) were used to generate recombinant baculovirus and for virus amplification, according to the manufacturer’s instructions. Sf9 cells (Expression Systems LLC, Davis, CA) for protein expression and production were routinely maintained in ESF 921 medium (Expression Systems, LLC: ESF 921-96-001) at 27 °C in shake flasks. Further details a provided in the supplemental materials section.

Preparation of type-selectively labeled growth media

Preparation of amino acid depleted stocks are described in detail in the supplemental materials section.

Protein expression

Sf9 or Hi5 cells were grown to the targeted density of 2 × 106 cells/ml in the rich medium. Cell growth was monitored and analyzed under a microscope, and as well by Vi-CELL XR Cell Viability Analyzer (Beckman Coulter, Indianapolis IN). After reaching the targeted density, the cells were inoculated with a high-titer secondary-passage baculovirus. After 16-hour post infection, cells were transferred and spun down in sterile centrifugation bottles (Corning, Tewksbury MA) by mild centrifugation. The cell pellets were washed, followed by a resuspension into media stock 2. Following a 4 h “starvation”-period, the labeled amino acids of concentrated stock solutions were added into the cell culture at the targeted concentration. Hence, labeled amino acids, with the exception of labeled tyrosine were added to the growth media 20 h after viral infection. The fermentation process continues for another 40–48 h followed by harvesting cells by centrifugation. The cell paste was stored at −80 °C for protein purification. For the production of CD4, the conditioned medium from cell culture was processed immediately following harvesting.

Protein purification

Labeled Ubiquitin, FAK1 and KATII samples expressed in Sf9 or Hi5 cells as cleavable N-terminal Hexa-His tagged constructs. CD4 was expressed in Sf9 cells containing a non-cleavable C-terminal Hexa-His tag. The adnectin, ATI-1637, was expressed in E. coli with a non-cleavable C-terminal Hexa-His tag. Further details on the respective purification protocols are provided in the supplemental materials section.

NMR

Analysis of residual amino acid contents in the Δ11 amino acid growth medium

To determine the residual level of depleted unlabeled amino acids in ESF 921-11, test samples were prepared with ESF 921-11 supplemented with various amounts of undialyzed and/or dialyzed yeastolate. Table S3 in the supplemental Material section depicts the compositions of the four media test samples.

HC-HSQC spectra in the growth media test samples were recorded on an AvanceIII HD 600 MHz spectrometer which was equipped with 5 mm TCI cryo probe. Spectra processed with cosine2 apodization in both spectral dimensions, twofold zero-filling in both dimension, followed by baseline correction along F2 using the automatic polynomial baseline correction tool in topspin. Table 1 depicts the residual percentages of unlabeled (natural abundance) of the depleted eleven amino acids were generated as follows: HC-HSQC spectra were uploaded into nmrView version 9.0.-b114 (OneMoon Scientific, Inc.). Cross-peaks were automatically picked using elliptic integration footprints followed by manual editing. Cross-peak integrals were exported into MS-Excel tables for further analysis of amino acid contents in growth media. The translation of cross-peak integrals of hc-hsqc spectra of growth media test samples (depicted in Table S3) into residual amino acid contents is depicted in Table S4.

NMR data on [13C,15N]-Thr,[15N]-val-CD4 n type-specifically labeled CD4

Sample preparation

Doubly type-specific [13C,15N]-Thr and [15N]-Val CD4 and unlabeled adnectin were exchanged into a buffer 25 mM HEPES, 300 mM NaCl, pH 7.0, 90% H2O + 10% D2O, NMR sample 1:0.35 mM apo [13C,15N]-Thr and [15N]-Val CD4 in a 4 mm Shigemi mico sample tube, Sample 2: 0.5 mM [13C,15N]-Thr and [15N]-Val CD4 + 0.6 mM (unlabeled) adnectin.

NMR data acquisition

All spectra were acquired on a Bruker AvanceIII 700 MHz spectrometer operating at 700.13 MHz equipped with a TCI 5 mm triple resonance cryo probe. Two-dimensional HNCO-TROSY (M. Salzman et al. 1998) were acquired, 15N-evolution in the indirect detection domain and proton detection at 40 °C were acquired on an Bruker Avance III 700 MHz spectrometer employing the following NMR data acquisition parameters: number of t1-increments = 112, number of t2-data points = 1024, ns = 352, relaxation delay = 1.6 s, sw3 = 15 ppm, sw1 = 35 ppm, total acquisiton time = 19 h, processed in topspin 3.2 with cosine square apodization in both time domains with two-fold zero filling. Spectra are displayed with geometric spacing of contour-levels with a coefficient of 1.3.

Mass spectrometric analysis of labeled protein samples

Protein digestion

Each sample was diluted to ~1 mg/mL with 100 mM NH4HCO3 (pH 7.5). 25 μg of sample was diluted :1 (v/v) with 0.1% RapiGest (Waters, Milford MA), and heated at 80 °C for 15 min. Then each sample was digested with trypsin (Promega, Madison WI) at a ratio of protein: trypsin of 50:1. The digestion was kept at 37 °C overnight, and inactivated by 6 μL 5% trifluoroacidic acid. Each digested solution was centrifuged at 14,000 RPM for 5 min and supernatant was analyzed by LC/MS.

LC/MS instrumentation and data analysis

Digested samples were separated using an ultra high pressure LC (ACQUITY UPLC, Waters) equipped with BEH C18 column, 2.1 × 150 mm, 130 Å with 1.7 µm particle. The mobile phase A was 0.1% formic acid in water, and mobile phase B was 0.1% formic acid in acetonitrile. The injection volume for each digested sample was 1 µL. Tryptic peptides were separated in a gradient of 1–40% B in 23 min at 45 °C column temperature.

Peptides were detected using SYNAPT G2 (Waters) with positive ion electrospray. Mass acquisition range was 50–2000 m/z, resolution was 20,000. Peptide identification was performed with accurate mass at both precursor ion level and fragment ion level. The relative labeling efficiency for each labeling site was calculated from the ratio of the peak intensity of labeled peptide containing the labeling site divided by the sum intensities of the labeled and unlabeled peptides.