Introduction

The COVID-19 pandemic and the subsequent global social and economic disruption have generated great interest in improved laboratory testing for respiratory viruses. Widespread testing for viral infection is a cornerstone of the response to COVID-19, with nearly 2.76 billion Reverse Transcriptase-Polymerase Chain Reaction (RT-PCR) tests for the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) carried out worldwide from 1st of January, 2020 to 1st of November, 20211. A rapid, sensitive, and accurate test for viral infection is critical in reducing widespread transmission of the virus especially where asymptomatic infections with SARS-CoV-2 frequently occur2.

Nucleic Acid Test assays are ubiquitous in clinical virology. Their utility for clinical decision making is limited, however, by the inability to distinguish replication-competent (i.e. live, transmissible) virus from residual viral nucleic acid3. In a significant number of cases, SARS-CoV-2 infected patients continue to test positive for viral RNA for days to months after the known window of infection has passed. This significantly impairs decision making around transmission risk and release from quarantine4.

Rapid antigen tests have been developed for SARS-CoV-2 and other respiratory viruses, where the antigen target is a viral protein. While providing potential value as a screening tool, reports of the sensitivity and specificity of such tests have been mixed5,6 and are most likely manufacturer dependent7. Additionally, over time these tests may face lot-to-lot variation, false negatives due to patient-specific antibodies, and other issues already described for antibody-based tests8.

Mass spectrometry (MS) has been used as a complementary tool to PCR and other assays in virology research9, but so far has not been routinely used in high-throughput clinical testing for viral infection. Liquid chromatography-tandem MS (LC–MS/MS) has become widely used in clinical chemistry for the analysis of small molecules and proteins in a range of biological specimens10. Matrix-Assisted-Laser-Desorption-Ionisation coupled to Time-of-Flight (MALDI-TOF) mass spectrometry is used extensively in clinical microbiology to rapidly, accurately, and inexpensively identify the causative agents of bacterial infections11. The widespread adoption of MS and the availability of generic sample preparation methods for detection of proteins marked both MALDI-TOF–MS and LC–MS/MS as possible solutions to diagnostic problems in the COVID-19 pandemic. This included early predicted disruptions in supply-chains for RT-PCR reagents and diagnostic problems around transmission.

Direct detection of respiratory viruses by MS in clinical samples had rarely been reported prior to the emergence of SARS-CoV-212, with one report from 2015 describing a 100-fold analytical sensitivity deficit13. However, since March 2020 several groups have demonstrated the capability of MS to detect SARS-CoV-2 in small cohorts of clinical specimens including nasopharyngeal swabs14,15,16 and gargle solution17 using nanoflow LC, high resolution MS and shotgun proteomics techniques. Studies analysing larger cohorts of SARS-CoV-2 positive and negative nasopharyngeal swabs used validated analytical techniques more common in current routine clinical LC–MS/MS analysis18 and provided strong evidence for the extent of correlation between viral RNA and protein concentration in nasopharyngeal swab samples after performing immunocapture of the target protein19 or peptide20,21 before LC–MS/MS analysis. Later, a consortium of academic laboratories and instrument vendors demonstrated a process to standardise analysis of SARS-CoV-2 proteins across different LC–MS/MS instruments and laboratories but were limited in detecting viral proteins in samples with an RT-PCR cycle threshold (Ct) of greater than 2022. MALDI-TOF MS was applied to analysis of clinical samples, targeting either protein23 or amplified nucleic acids24 with significant discriminating power. However, it has been suggested that direct detection of viral proteins in clinical samples remains beyond the sensitivity of current MALDI-TOF–MS techniques25.

As SARS-CoV-2 is predicted to become an endemic virus26, there is value in incorporating its detection into a panel of other circulating respiratory viruses. In response, we developed a rapid multiplexed LC–MS/MS test to assess the sensitivity and selectivity of instruments that are currently available in clinical laboratories. Furthermore, in creation of this method, we established a fast bottom-up proteomics workflow for discovery of target virus peptides using ‘serum-free’ culture conditions, providing high coverage of viral proteins without the need for protein or peptide fractionation techniques used when standard culture conditions are employed. Finally, in applying the targeted method to the analysis of 30 SARS-CoV-2 RT-PCR positive nasopharyngeal swab extracts, we provide new evidence for the ORF9B protein as a target for detection of SARS-CoV-2 infection in clinical samples.

Results

Serum-free viral culture reduces protein background and improves virus detection

Concentrated virus samples (~ 106 TCID50/mL) were generated using cell culture. The SARS-CoV-2 was grown in Vero E6 cells cultured in standard conditions with 2% fetal bovine serum (FBS). High pH fractionation of culture supernatant digests gave adequate coverage of the SARS-CoV-2 proteome in the high bovine serum background from untargeted analysis. There were 44 peptides identified from 7 SARS-CoV-2 proteins (Table 1), representing 0.5% of the 15,813 peptide spectral matches recorded. By contrast, 84.7% of these spectral matches were made to bovine proteins.

Table 1 Performance comparison of peptide discovery methods in various cell culture conditions.

A baseline for unfractionated analysis of culture supernatant digests was set by a single 2-h analysis on the TripleTOF 6600, resulting in 28 SARS-CoV-2 peptides identified at > 95% confidence. The bovine protein background was then reduced by culturing the virus in Vero E6 cells under serum-free conditions. This greatly improved virus detection from a single run on a TripleTOF 6600, generating higher sequence coverage than the fractionated 2% FBS sample (Table 1), comparable to other published studies27,28,29 using fractionation techniques on viral cultures (Supplementary Table S2). To demonstrate the impact of serum-free conditions on relative virus concentration in culture supernatants, the summed peak area from the best four fragment chromatograms from each SARS-CoV-2 peptide detected during SWATH analysis was expressed as a percentage of the area under the TOF–MS total ion chromatogram (TIC). On average, virus peptides generated 26 times higher signal intensity from when considered as a proportion of the total precursor intensity from each run (Fig. 1). An overlay of the TOF–MS TIC shows the difference in total protein loaded from each sample and the plot of percentage peak areas demonstrates that the total peak area from SARS-CoV-2 peptides is an order of magnitude higher in the serum-free culture compared to the 2% FBS virus culture (Fig. 1). Sequence coverage and peptide numbers from other respiratory virus cultures grown under serum free conditions and assayed by either the nanoflow LC–MS or the alternate microflow LC–MS method (as noted), are presented in Table 2.

Figure 1
figure 1

(Left) Box-and-whisker plots of % peak area (TIC-normalised total SWATH fragment peak area) for all detected SARS-CoV-2 peptides in culture digests, showing improved detectability of virus proteins despite overall lower sample loading. (Middle) Overlay of total ion chromatograms (TIC) for SARS-CoV-2 culture digests under standard culturing conditions (2% FBS) and serum-free conditions. Overall higher intensity of signal for FBS culture represents higher total protein loading on the LC–MS. (Right) Box-and-whisker comparison of normalised peak areas for all targeted peptides in spiked samples. The area of each peak in the MRM chromatogram was normalised to the total area under the chromatogram within the detection window.

Table 2 Sequence coverage for all respiratory viruses from analysis of serum-free cultures. Viruses 229E, OC43, RSV-A, RSV-B and SARS-CoV-2 were analysed with a short microflow method, whereas Influenza A and B were analysed with nanoflow LC, resulting in inferior sequence coverage for the former group.

Transport medium protein source generates differing sensitivity and background interferences

Spiking viral cultures into negative swab extracts stored in VTM containing either FBS or bovine gelatin allowed an additional round of culling based on the presence of interfering signals in MRM traces, which were more prevalent in FBS VTM extracts than in gelatin VTM. For the same relative concentration (v/v) of virus spiked into each matrix type, gelatin VTM extracts gave a mean relative response of target peak area to total background area more than 2 times higher than FBS VTM extracts (Fig. 1). Interestingly, one peptide (SNLKPFER from the Spike protein) produced significant overlapping peaks for all MRMs in RT-PCR negative swabs from gelatin VTM but not in FBS VTM swabs, despite being confidently identified (> 99% confidence) in SARS-CoV-2 cultures. As such, this peptide was removed from further analyses. Finally, identical spiked and cultured samples were compared using the original and rapid sample preparation methods and a final peptide list was determined from peptides efficiently liberated during rapid digestion. The two most sensitive and selective MRMs for each peptide were included in the final method (Fig. 2 and Supplementary Table 4).

Figure 2
figure 2

(Left) Heat map of signal-to-noise ratio for combined peak areas of virus spikes (1:9 v/v) versus peak area average of blank samples. (Right) Overlay of MRM chromatograms from human background proteins (upper panel) and virus cultures (lower panel) showing efficient use of MS detection time and minimal concurrency through careful peptide selection and maximising chromatographic separation.

Targeted MRM method is specific for each virus

To assess combined method selectivity, we spiked cultures of all eight virus targets into negative nasopharyngeal swabs in all possible paired combinations. Our final targeted MRM method could detect six of the eight target viruses with 100% specificity at a 1:9 (v/v) spiking concentration, with no cross-reactivity of viruses or background proteins (Fig. 2). Peptides for H1N1 and H3N2 that were detected in undiluted culture with below average peak areas, potentially indicating comparatively low concentration, fell below the limit of detection at this spiked concentration. Sensitivity of high-flow chromatography, was assessed by scaling flow rate, gradient program and run time on the same column. Similar sensitivity was achieved with all methods; therefore, the flow-split technique was selected as it offered a crucial increase in sample throughput (Supplementary Fig. 1).

The optimised LC–MS/MS method required a similarly rapid sample preparation procedure to ensure a high throughput total sample analysis. A short acetone protein precipitation was incorporated to remove small molecule interferences (particularly phenol red), disrupt virions, and pre-concentrate proteins. Consistent with other published methods, reduction and alkylation steps were omitted as we had deliberately avoided the use of cysteine-containing peptides. Finally, SDC and n-propanol denaturants were replaced with a commercial rapid digest buffer to allow the direct injection of the digest on to the trap column without the acid precipitation step required for SDC. Digestion time was reduced from overnight to 1 h for the first set of clinical samples (n = 20) and then increased to 2 h for the second clinical sample set (n = 40). The total workflow from viral culture to targeted MRM method is summarised in Fig. 3.

Figure 3
figure 3

Graphical summary of workflow from viral cultures to rapid targeted MRM method. Use of serum-free culture avoids the need for extensive sample fractionation and use of both data-dependent and data-independent-acquisition methods during discovery phase streamlines peptide selection process. Created with BioRender.com.

Sensitivity of multiplexed virus detection in clinical specimens was consistent with other published methods targeting only SARS-CoV-2

Prior to development of the multi-virus targeted method, an initial cohort of ten clinical specimens RT-PCR positive for SARS-CoV-2 were assayed by a method including only SARS-CoV-2 peptide targets. Cycle thresholds (Ct) values ranged between 17 and 37. Out of ten RT-PCR positive infections, LC–MS/MS detected viral proteins in two clinical specimens that had Ct values of 17 and 19. Virus peptides were not detected in specimens with Ct values greater than 19 (Supplementary Table 5).

Following addition of MRMs for the virus panel to the targeted method, a second cohort of four RSV-A positive and two OC43 positive clinical specimens were subjected to a protein precipitation step and the rapid digestion protocol before analysis. Three out of four RSV-A infections were detected by LC–MS/MS with Ct values of 15, 17, and 22. One clinical specimen with a Ct of 24 was not detectable. The OC43 samples were not detectable by LC–MS/MS, consistent with their relatively high Ct values of 29 and 30 (Supplementary Table 5).

The final cohort of SARS-CoV-2 positive (n = 30) and negative (n = 10) samples was tested with the multi-virus LC–MS method, but with an increased digestion time. After an initial analysis showed detectable signals for the ORF9B peptide LVDPQIQLAVTR in 24 of the 30 RT-PCR positives, a confirmatory analysis including an additional MRM for that peptide as well as transitions for two other ORF9B peptides (VYPIILR and LGSPLSLNMAR) was performed. Considering the two most discriminating MRMs for each of 6 SARS-CoV-2 peptides and using an ion ratio tolerance of 30% and a retention time (RT) tolerance of ± 1% from the measured internal standard RT, 23 of 30 (77%) positive samples had detectable peaks for at least two peptides. There were no false positives from the 10 RT-PCR negative samples when the same criteria were applied. Reducing the threshold to 1 detected peptide increased the positive rate to 28/30 (93%) but increased the false positive rate to 20%. The best discriminating peptides were GFYAEGSR from the nucleoprotein and LVDPQIQLAVTR from the ORF9B protein (Fig. 4).

Figure 4
figure 4

Targeted method performance for six peptides targeting 3 proteins from SARS-CoV-2. (Top panel) Spread of Ct values for 3 SARS-CoV-2 genes vs # peptides detected in clinical sample cohort 3. (Lower panel) From left; (A) Violin plot comparing software generated signal-to-noise ratio between positive (left) and negative (right) clinical specimens. (B) Correlation between reported N-gene Ct and peak area of the most intense MRM transition. Grey dots are peak areas detected in RT-PCR negative samples, assigned an arbitrary Ct of 25 for ease of reading. (C) linear regression of best two MRMs for each peptide for all RT-PCR positive samples. (D, top panel) linear regression and correlation for raw peak area and analyte/IS peak area ratio for peptide N.AYNTQAEFGR, which has been reported in other published targeted methods. (D, lower three panels) linear regression and correlation of other SARS-CoV-2 peptide peak areas to N.AYNTQAEFGR.

Targeted method imprecision and robustness was briefly tested by analysis of 5 separate extractions of the same pooled sample, as well as replicate injections (n = 6) of a pooled extract. Method linearity was assessed across a small dynamic range by dilution of a pool of samples that showed high responses to SARS-CoV-2 proteins. Serial dilutions were made 1:1 with a pool of negative samples to mimic the log2 nature of Ct values. Unweighted linear least-squares regression showed the method was linear across the range surveyed (25), with correlation coefficients ranging from 0.9669 to 0.9985 for IS-normalised peak areas vs relative concentration and from 0.97356 to 0.99992 for raw peak areas vs relative concentration (Supplementary table 6). Imprecision for peak areas, IS peak areas and IS area ratios ranged from 4.4% to 26.2% for SARS-CoV-2 peptides and 3.1% to 11.4% for background peptide peak areas. Repeatability ranged from 2.3% to 33.5% for peak areas, IS peak areas and IS area ratios of SARS-CoV-2 peptides and 1.6% to 12.1% for background peptide peak areas (Fig. 5 and Supplementary Table 7).

Figure 5
figure 5

Targeted method imprecision (repeated preparations of pooled sample, n = 5) and repeatability (repeated injections of pooled sample extract, n = 6). Results are combined for 6 SARS-Cov-2 peptides using CVs for peak area, IS peak area and peak area ratio, and for 6 background peptides for peak area only.

Discussion

We utilised viral cultures to develop a targeted LC–MS/MS method as these contain acceptably high concentrations of viral proteins for testing of method parameters and require viral proteins to be liberated from virions for successful analysis. While it is possible to develop such method using recombinant viral proteins30, this approach omits the critical virion-extraction requirement for clinical specimens. Serum-free culture conditions removed the need for extensive sample fractionation and multiple LC–MS runs, yielding peptide identifications comparable to published studies where SDS-PAGE fractionation and 5 to 20 LC–MS runs were used in the analysis of FBS-supplemented viral cultures27,29,31. One notable omission in our protein list was the ORF 1ab polyprotein, detected with high sequence coverage in studies where infected host cells were harvested and digested29,31. In the search for clinical targets, however, viral proteins found in culture supernatants are more likely to be representative of those that can be easily sampled in the extracellular matrix of an infected patient, being either from secreted virions or viral proteins rather than from liberated endothelial cells. Further work is required to determine the relative contribution to the much-improved detection of virus proteins of; 1) smaller dynamic range of protein concentration being more compatible with LC–MS analysis, and 2) increased virus replication and secretion of virions occurring under “serum-free” conditions. However, these data suggest that serum-free cell culture conditions should be utilised where possible to rapidly acquire an in-depth snapshot of the viral proteome relevant to an infection.

Using viral culture supernatants to supply proteins for method development also revealed that the ORF9B protein held potential as a target in clinical samples, showing both high sequence coverage in discovery experiments and good sensitivity and specificity in targeted analysis of spiked nasopharyngeal swab extracts. ORF9B has so far been excluded from published targeted LC–MS methods, which have focused almost exclusively on the nucleoprotein, based on data suggesting that the latter exists in far greater abundance in the assessable virus proteome31. Although our initial analysis of targeted samples showed low method sensitivity overall, detecting SARS-CoV-2 proteins in only 2 of 10 RT-PCR positive samples, we noted that the ORF9B protein, via peptide LVDPQIQLAVTR, gave good responses in both positive detections and was thus retained for later analysis of a larger cohort of samples with lower RT-PCR Ct values. Confident detection of at least 1 peptide from ORF9B in approximately 80% of the cohort of low Ct SARS-CoV-2 RT-PCR positive samples is in agreement with reports of anti-ORF9B antibodies in serological analysis32,33 and suggests an under-explored role of this protein in SARS-CoV-2 infection.

Where previous works analysing clinical samples focused only on detection of SARS-CoV-2, the multiplexing capacity of MRM analysis was effectively demonstrated by our targeted method. For confident detection of the complete virus panel, a minimum of 16 proteotypic peptides (two peptides per virus, adapted from the accepted ‘two peptides per protein’ guideline for clinical protein MS34) were included, as well as transitions for isotope-labelled internal standard peptides and background proteins (Supplementary Table 1). Using short desalting and analytical columns ensured method washing and re-equilibration times were minimised. The relatively high flow rate applied to the core–shell analytical column ensured target peptides were optimally resolved as narrow peaks spread across the elution time. The final method required less than 1.5 min of MS acquisition time with minimal MRM concurrency and a total run time of ~ 3.25 min (Fig. 2). Combined with the rapid sample preparation procedure and simple data processing (compared to traditional proteomics database searching35), this method allows for the full analysis of a 96-well plate of samples in approximately 8 h. Linearity, imprecision, and repeatability under these conditions appear acceptable based on performance of the higher performing SARS-CoV-2 peptides and the background peptides, however further work is required to assess performance of the whole virus panel.

Monitoring peptides from human proteins allowed for assessment of differences in sampling of the clinical specimens. Peptides from human background proteins were selected from those known to be expressed abundantly in the nasopharyngeal space36,37 after analysis of a pool of SARS-CoV-2 negative nasopharyngeal swabs (n = 20). Inappropriate swab sampling has been confirmed to cause false negative RT-PCR results38. The addition of multiple background proteins to our targeted multi-virus detection method provided protection against falsely low results caused by sub-optimal sampling without the need for a secondary technique or extra reagents, an advantage that warrants further investigation.

The SARS-CoV-2 has been successfully cultured from clinical specimens with Ct values as high as 3339, indicating viral proteins may be present in samples with Ct values beyond those which we were able to designate as infected. However, the probability of successful culture is also less than 100% even with low Ct values40, suggesting a basis for the relatively poor correlations between Ct and viral protein concentration in the analysis reported here. Our detection limit corresponding with a Ct value of approximately 20 is consistent with other groups who assayed clinical samples using similar equipment and techniques and without virus protein enrichment18,22 when considered in light of the inter-assay variation amongst RT-PCR tests41. While we have not yet assessed the sensitivity of our method in terms of absolute protein concentration, we believe it would be comparable to other methods where similar samples, extraction techniques and instrumentation were used. In the absence of more sensitive and selective LC–MS instrumentation, immunocapture of viral proteins19 or peptides21 prior to LC–MS analysis has been shown to improve detection limits. However, while this may be possible for a single virus or small number of viruses, it may be prohibitively expensive for a virus panel applied to routine diagnostics. Strategies for depletion of background proteins may allow for higher volume of patient sample, and thus viral proteins, to be injected into the LC–MS. However, as a first step, nasopharyngeal swabs intended for MS analysis should not be extracted into VTM or other, similar protein rich solutions that limit sample loading and increase chromatographic interferences from isobaric species.

Conclusion

We have presented an updated method for MS-based multi-virus testing proposed by others13. The sample throughput of our method improves on traditional proteomics approaches and compares well to the turnaround time of RT-PCR, while being accessible to many routine clinical LC–MS laboratories without the need for investment in new equipment. While the selectivity of our method is high, it is limited by the current sensitivity as a tool to be applied in the case of an inconclusive RT-PCR result for respiratory virus infection. However, our multiplexed test for the presence of eight respiratory viruses could potentially accommodate additional viruses and be quickly altered to track relevant viral mutations, indicating that LC–MS/MS may have future utility as a tool for differential diagnostics in suspected viral infection. As such, considering the importance of rapid and accurate diagnosis of viral infection to the current and future pandemics, efforts to incorporate the use of LC–MS/MS in clinical virology should continue.

Materials and methods

Clinical samples

Respiratory tract samples from nasopharyngeal swabs were collected in viral transport medium (VTM). All samples were initially stored at 4˚C, and those identified as positive for SARS-CoV-2 by RT-PCR were stored at -80 °C within 48 h of collection. Samples negative for SARS-CoV-2 were stored at room temperature in sample cohorts 1 and 2, and at -80 °C within 48 h of collection for cohort 3. Samples positive to other respiratory viruses were stored at 4 °C. This study was approved by the South Eastern Sydney Local Health District HREC (reference number 2021/ETH00108) and the NSW Health Pathology Research Governance Office (2021/STE01365). Experiments were conducted in accordance with the National Statement on Ethical Conduct in Human Research (2007) [https://www.nhmrc.gov.au/about-us/publications/national-statement-ethical-conduct-human-research-2007-updated-2018#toc__725 ]. Due to the use of deidentified specimens collected as part of standard diagnostic protocols, informed consent from all patients was waived.

Samples were tested for presence of SARS-CoV-2 RNA using standard commercial assays, predominantly with the Seegene Allplex platform for SARS-CoV-242 and other respiratory viruses43. Any samples positive for SARS-CoV-2 RNA were further tested using the Roche Cobas 6800 platform (Roche USA) to confirm the presence of SARS-CoV-2 RNA and infer presence of SARS-CoV-2 virus.

Viral cell culture

For viral culture, appropriate cell lines (Supplementary Table 1) were seeded in tissue culture plates 24 h prior. At the time of infection, tissue culture medium was removed, sub-confluent cellular monolayers were washed with PBS, and then inoculated with viral isolate. Plates were incubated for one hour before the inoculum was removed and replaced with viral culture media. Viral cultures were incubated at 37 °C (5% CO2) for 3–5 days and checked daily for cytopathic effect. Culture supernatants were collected, centrifuged at 2000rcf for 10 min to remove cellular debris, and stored at -80 °C as single-use aliquots.

Protein extraction from viral cell culture

Virus-containing cell culture supernatant (150 µL) was mixed with methanol, chloroform, and water. All solvents were ice-cold at the time of addition. Centrifugation separated the phases, and the upper phase was removed and discarded. Methanol was added and the solution was vortexed for 30 s then centrifuged. The protein pellet was washed with methanol and allowed to air-dry before storage at −80 °C.

Clinical specimen sample preparation

Proteins from respiratory tract samples (100 µL) were precipitated by mixing with sodium deoxycholate (SDC) and acetone44. After centrifugation, the pellet was washed with acetone, re-centrifuged and the protein pellet was air-dried.

Protein preparation: standard conditions

Protein pellets were reconstituted in SDC (1% w/w), n-propanol (5% v/v), and triethylammonium bicarbonate (100 mM) and heated at 95 °C for 15 min at 600 rpm. Cysteine residues were reduced with dithiothreitol (DTT) and alkylated with iodoacetamide by incubating in the dark. The alkylation was quenched with DTT and the sample digested with trypsin (5 µg) overnight at 37 °C. Addition of formic acid (1:50 v/v) precipitated the SDC, which was removed by centrifugation. The supernatant was desalted using either Oasis MCX or HLB cartridges (Waters, Milford MA) according to the manufacturer’s protocol. Samples were dried and re-suspended in water with 3% acetonitrile and 0.1% formic acid (100 µL) for LC–MS/MS analysis.

Protein preparation: rapid protocol

Protein pellets were reconstituted in Rapid Digest Buffer (100 µL, Sigma-Aldrich, St. Louis, MO) and sonicated for 5 min before the addition of SOLu-trypsin (5 µg, Sigma-Aldrich). Digestion was performed for 1 h (cohorts 1 and 2) or 2 h (cohort 3) at 60 °C, quenched with formic acid and 80 µL transferred to a 96-well plate for LC–MS/MS.

High pH peptide fractionation

Peptides (approximately 15 µg) were fractionated using an Agilent 1290 UHPLC system (Agilent Technologies, Santa Clara CA) with an in-house packed capillary column. LC mobile phases were: (A) ammonium formate (10 mM, pH 7.9) and (B) water (10% v/v) and acetonitrile (90% v/v). Fractions were monitored by total UV absorbance.

Nanoflow high performance liquid chromatography

Fractionated and unfractionated peptides were separated on self-packed fused-silica columns with incorporated emitter tips. Mobile phases were water (A) and acetonitrile (B) both with 0.1% formic acid. For Orbitrap analysis of fractionated peptide mixtures, a Thermo Scientific Ultimate 3000 LC (Thermo Fisher Scientific, Waltham MA) system operating in direct-injection mode was used at a flow rate of 400 nL/min with a 60 min gradient. For experiments using the TripleTOF 6600 (SCIEX, Framingham MA), the LC system was an Eksigent NanoLC 425 operating in direct-injection mode at a flow rate of 500 nL/min over a 75 min gradient.

High-resolution mass spectrometry

Fractionated samples were analysed using a Q-Exactive Plus quadrupole-orbitrap mass spectrometer (Thermo Fisher Scientific) equipped with a nanoflow ion source operating in positive electrospray mode. The Orbitrap was operated in data-dependent acquisition mode. Unfractionated samples were analysed on a TripleTOF 6600 Quadrupole-Time-of-Flight (QTOF) mass spectrometer equipped with a NanoSpray III source (SCIEX). Untargeted analyses were performed in Information Dependent Acquisition (IDA) mode. A Data Independent Acquisition method (Sequential Windowed Acquisition of All Theoretical fragments, SWATH) for the TripleTOF 6600 was created using PeakView 2.2 and the SWATH Variable Window Calculator v1.0 (SCIEX) as described45.

Discovery data processing

Orbitrap data were analysed using Proteome Discoverer v2.4 (Thermo) and Mascot v2.7 (Matrix Science, London) and TripleTOF IDA data was analysed using ProteinPilot software (v5.0.3, SCIEX, https://sciex.com/products/software/proteinpilot-software). Results were filtered to a global false discovery rate (FDR) of 1%. To facilitate cross-platform comparisons, PRIDE data sets were downloaded as .mgf (where available) or as .raw, processed with Proteome Discoverer v2.4 and exported as .mgf, then searched with ProteinPilot with settings appropriate for instruments used in each study.

Skyline (20.2.0.343, University of Washington) was used to extract chromatographic profiles and peak areas for virus peptides from SWATH raw data files for peptide selection.

Peptide selection

Peptide sequences identified as unique in ‘no-species’ searches against the full Uniprot database with respiratory virus sequences appended were selected from DDA results as potential targets after confirmatory BLASTP searches. The filtered peptide list was assessed for peak shape and raw signal in SWATH data. Four MRM transitions per peptide were selected based on intensity and Q3 m/z being greater than Q1 m/z. Initial collision energy (CE) was calculated from precursor m/z and the rolling CE equations in Analyst TF 1.8 then optimised per MRM by repeated injections of cultures with CE set to 25%, 50%, 100%, 125% and 150% of the initial value. Peak areas at each CE were plotted in SCIEX OS Analytics (v1.7, SCIEX) and the optimum CE extrapolated from CE values generating the three highest responses.

Targeted LC–MS/MS

Targeted multiple reaction monitoring (MRM) data acquisition was performed on a SCIEX QTRAP 6500+ coupled to an ExionLC AD UHPLC (SCIEX). The QTRAP 6500+ was operated in MRM mode with positive electrospray ionisation. The advanced scheduled MRM function of Analyst 1.7 (SCIEX) was used to schedule 62 MRMs across the acquisition time of 0.6 to 1.8 min after sample injection. The total sample run-time was completed in 3 min and 16 s.

Targeted data processing

MRM data was processed using SCIEX OS 1.7 (SCIEX) with the MQ4 algorithm. Peak integration parameters were set to be appropriate for the peak shape and background noise found in the chromatogram from each MRM transition. Additional processing was performed using Microsoft Excel and GraphPad Prism v9.