Nanopore sequencing provides snapshots of the genetic variation within salmonid alphavirus-3 (SAV3) during an ongoing infection in Atlantic salmon (Salmo salar) and brown trout (Salmo trutta)

Roh, HyeongJin; Skaftnesmo, Kai Ove; Kannimuthu, Dhamotharan; Madhun, Abdullah; Patel, Sonal; Kvamme, Bjørn Olav; Morton, H. Craig; Grove, Søren

doi:10.1186/s13567-024-01349-z

Nanopore sequencing provides snapshots of the genetic variation within salmonid alphavirus-3 (SAV3) during an ongoing infection in Atlantic salmon (Salmo salar) and brown trout (Salmo trutta)

Research article
Open access
Published: 03 September 2024

Volume 55, article number 106, (2024)
Cite this article

Download PDF

You have full access to this open access article

Veterinary Research Aims and scope Submit manuscript

Nanopore sequencing provides snapshots of the genetic variation within salmonid alphavirus-3 (SAV3) during an ongoing infection in Atlantic salmon (Salmo salar) and brown trout (Salmo trutta)

Download PDF

HyeongJin Roh ORCID: orcid.org/0000-0002-1825-2375¹,
Kai Ove Skaftnesmo¹,
Dhamotharan Kannimuthu¹,
Abdullah Madhun¹,
Sonal Patel^1,2,
Bjørn Olav Kvamme¹,
H. Craig Morton¹ &
…
Søren Grove¹

103 Accesses
Explore all metrics

Abstract

Frequent RNA virus mutations raise concerns about evolving virulent variants. The purpose of this study was to investigate genetic variation in salmonid alphavirus-3 (SAV3) over the course of an experimental infection in Atlantic salmon and brown trout. Atlantic salmon and brown trout parr were infected using a cohabitation challenge, and heart samples were collected for analysis of the SAV3 genome at 2-, 4- and 8-weeks post-challenge. PCR was used to amplify eight overlapping amplicons covering 98.8% of the SAV3 genome. The amplicons were subsequently sequenced using the Nanopore platform. Nanopore sequencing identified a multitude of single nucleotide variants (SNVs) and deletions. The variation was widespread across the SAV3 genome in samples from both species. Mostly, specific SNVs were observed in single fish at some sampling time points, but two relatively frequent (i.e., major) SNVs were observed in two out of four fish within the same experimental group. Two other, less frequent (i.e., minor) SNVs only showed an increase in frequency in brown trout. Nanopore reads were de novo clustered using a 99% sequence identity threshold. For each amplicon, a number of variant clusters were observed that were defined by relatively large deletions. Nonmetric multidimensional scaling analysis integrating the cluster data for eight amplicons indicated that late in infection, SAV3 genomes isolated from brown trout had greater variation than those from Atlantic salmon. The sequencing methods and bioinformatics pipeline presented in this study provide an approach to investigate the composition of genetic diversity during viral infections.

Nanopore sequencing for rapid diagnostics of salmonid RNA viruses

Article Open access 05 November 2018

Ultra-deep sequencing of VHSV isolates contributes to understanding the role of viral quasispecies

Article Open access 08 January 2016

High-Resolution Screening of Viral Communities and Identification of New Pathogens in Fish Using Next-Generation Sequencing

Introduction

The emergence of new viral strains with increased virulence is of great concern to the aquaculture sector. Salmonid alphavirus (SAV) is the causative agent of pancreas disease (PD) in Atlantic salmon (Salmo salar) and of sleeping disease (SD) in rainbow trout (Oncorhynchus mykiss). SAV is an enveloped, spherical, single-stranded positive-sense RNA virus with a diameter of ~70 nm belonging to the Togaviridae family. The SAV genome is approximately 12 kb long and comprises two open reading frames (ORF1 and ORF2) that both encode polyproteins [1]. ORF1 encodes four nonstructural proteins (nsP1, nsP2, nsP3, and nsP4) that are required for RNA synthesis [2]. Like for other alphaviruses, SAV ORF2 likely encodes six structural proteins, i.e., C, E2, E3, 6 k, E1 and TF, where C is the capsid protein and E1, E2 and E3 are constituents of the heterotrimeric spike proteins in the envelope [3, 4]. 6 k is an ion channel protein [5], whereas the TransFrame (TF) protein, known from several alphaviruses, is produced by a ribosomal –1 frameshift in 6 k. The TF protein has the same N-terminus as 6 k but a unique C-terminus, which may be relevant to virion stability, antigenicity, fusion, and tropism [4, 6].

Since SAV was first identified in 1995, at least six subtypes have been described based on nucleotide sequence analysis of nsP3 and E2 [7, 8]. More recently, the existence of a seventh genotype has been proposed based on an SAV isolate from Ballan wrasse (Labrus bergylta) [3]. The SAV subtypes show differences in geographical distribution, host range, and clinical manifestations [1, 9, 10]. SAV1 (salmon pancreas disease virus; SPDV) and SAV2 (sleeping disease virus; SDV) were characterized as two separate subtypes from approximately 1999–2000 [11, 12]. The SAV3 subtype (Norwegian salmonid alphavirus; NSAV) was first characterized by Hodneland et al. [13]. Over the whole genome, the subtypes have been shown to share ~86–96% genetic identity [3, 13].

Gallagher et al. [8] reported SAV sequencing data suggesting that individual farmed fish may become coinfected with different SAV subtypes. Infection of a host with two or more viral subtypes may be a basis for viral genetic changes via recombination. Similarly, a single SAV subtype transmitted from one host species or region to another may undergo genetic changes during adaptation [8, 14, 15]. RNA viruses generally have high mutation rates of between ~10^–6 and 10^–4 substitutions per nucleotide site per cell infection. A previous study estimated the SAV substitution rate to be approximately 1.70 (± 1.03) × 10^–4 nt substitution/site/year [16]. A more recent study of the genome-wide substitution rate for SAV3 estimated 7.351 × 10^–5 substitutions per site per year, with a 95% highest posterior density range of 5.33 × 10^–5–9.994 × 10^–5 [17]. In addition, there is evidence that SAV can frequently undergo mutations and deletions even within a single host [8, 18]. Petterson et al. [18] reported that many genome deletions are generated during natural SAV infection, and subsequent verification of frequent deletion mutations was achieved using nanopore sequencing methods [17]. The low fidelity of the RNA-dependent RNA polymerase (RdRp) and the high incidence of recombination via template switching during replication both contribute to this high mutation rate [19,20,21]. The copy choice model is a widely accepted mechanistic model for viral recombination and is particularly relevant for single-stranded positive-sense RNA viruses such as SAV [22, 23]. In an infected cell, erroneous replication may produce considerable variation in the virus genome sequence and thus in the expressed viral proteins. In addition to this type of variation, selective pressure may also lead to “intracellular adaptations” that improve viral fitness in a particular host cell environment, including adaptations to codon and codon pair usage, improved suppression of the IFNα/β response and more [24]. Viral particles exiting infected cells may differ in the amino acid (aa) sequence of their capsid and spike proteins, leading to possible changes in their receptor binding affinities and specificities and hence potentially to changes in cell, tissue and host tropism. Virus particles with altered protein sequences may also be less prone to recognition by specific antibodies. With such variation and the inferred potential differences in viral function, fitness and adaptability, the viral consensus sequence may be insufficient to characterize a virus. Instead, the variation can be better understood as a mutant spectrum or quasispecies, which may provide a better definition of wild-type virus [25].

Long-read deep sequencing technologies, such as single-molecule real-time sequencing by Pacific Biosciences and Oxford Nanopore, have significantly contributed to the understanding and profiling of genetic variations in pathogens [26,27,28,29]. In particular, Oxford Nanopore long-read sequencing technology has proven useful for identifying new SAV genotypes and for profiling SAV mutation sites [3, 8, 30]. Until recently, a prevailing issue with long-read sequencing platforms has been the inherent low base-calling accuracy [31], which may lead to the misidentification of mutations in individual nanopore reads. Several methods have been proposed to complement and overcome this limitation. Gallagher et al. [32] demonstrated that sequencing errors generated from the Oxford nanopore platform can be minimized by achieving a sufficient sequencing depth. They found that a sequencing depth of more than 50 × was sufficient to accurately sequence the SAV genome. Aligning long reads to a consensus sequence is a standard pipeline for identifying single nucleotide polymorphisms (SNPs) and structural variants. However, the relatively high error rate in individual reads can pose a challenge in distinguishing rare minor variants from within the cloud of nonvariant reads. As an alternative, unique molecular identifiers (UMIs) have been utilized to address sequencing errors, but other technical challenges, such as accurate titration of input templates and sequencing depth, remain a challenge [33, 34]. In the most recent advancements, due to improvements in the chemistry of sequencing library preparation kits, the structural and functional properties of nanopores, and recent changes in base-calling algorithms, the accuracy of each raw read can now be over 99.9% (> Q30) with the duplex basecalling algorithm [35]. By excluding reads found in low numbers, likely representing random sequencing errors, the sequencing fidelity of reads included in the analysis can be increased.

With such high accuracy of single reads, sequence diversity can be profiled by de novo clustering using high thresholds of sequence identity, a technique that is widely applied in microbiome studies from PCR amplicons. In such studies, sequence reads from PCR amplicons (e.g., from the 16S or 18S rRNA gene) can be clustered and classified as operational taxonomic units (OTUs) based on sequence identity [36, 37]. Alongside the advantage of amplicon clustering, the high accuracy of single long reads enables the relatively precise profiling of minor variants within a sample. In other words, it allows for both the identification of genetic variation within a sample and de novo assembly of multiple complete genomes for viral variants, strains, and/or quasispecies within a sample. In this study, nanopore sequence reads were clustered based on sharing at least 99% sequence identity. The cluster containing the largest number of reads was designated the “major cluster”, while clusters with fewer sequence reads were defined as “minor clusters”. The consensus that can be generated from each cluster may provide an overview of the most frequent variants present in the analysed samples. In this study, we aimed to 1) develop an SAV3 variant identification method within a sample using high-accuracy nanopore reads; 2) identify major and minor SAV3 variants that arise during an active infection; and 3) explore potential genetic variations that occur when SAV3 infects either Atlantic salmon or brown trout.

Materials and methods

Fish and viral challenge

Atlantic salmon and brown trout were reared at the Institute of Marine Research (IMR), Research Station in Matre (Masfjorden, Norway). Prior to viral challenge, the fish were transported to IMRs fish disease laboratories in Bergen (Norway). The salmon and trout were acclimated in 400 L tanks supplied with freshwater at a flow rate of approximately 400 L h⁻¹. Commercial feed was provided twice daily, and the water temperature was maintained at 10–12 °C. The photoperiod was maintained at 12 h light and 12 h dark during both the acclimation and experiment. Viral challenge was performed as a cohabitation challenge. In brief, naïve salmon shedder fish were injected intramuscularly with a 2 × 50 µL of 1 × 10⁴ TCID₅₀ mL⁻¹ SAV3 inoculum [38]. The virus was propagated in CHH-1 cells, and passage 3 of the virus was used in this trial. The shedder fish were marked by the adipose fin clipping method for selective sampling of cohabitant fish during the subsequent sampling period. Then, 30 salmon shedders and 70 naïve salmon or trout were transferred to 250 L experimental tanks where they remained for the duration of the cohabitation challenge experiment. At 2, 4, and 8 weeks after cohabitation started, sixteen cohabitation fish of each species were euthanized using an overdose of Benzocaine (160 mg L⁻¹; Apotekproduksjon AS, Norway). Sampling was performed at 2-, 4-, and 8-weeks post-challenge (wpc), producing six experimental groups consisting of specific combinations of sampling time points and fish species (2wpc_Salmon, 4wpc_Salmon, 8wpc_Salmon, 2wpc_Trout, 4wpc_Trout, and 8wpc_Trout). Hearts were dissected from all the fish, transferred to RNALater (Ambion, TX, USA) and stored at −80 °C until further analysis. All experiments involving live animals were approved by the Norwegian Food Safety Authority (FOTS approval number 11260).

RNA extraction and quantitative PCR (qPCR)

Total RNA was extracted from the heart following the standard protocol of the Promega ReliaPrep^™ simply RNA HT 384 kit (Promega, WI, USA) on a Biomek 4000 Laboratory Automated Workstation (Beckman Coulter, CA, USA). The total RNA concentration was quantified using a NanoDrop^™1000 spectrophotometer (Thermo Scientific, MA, USA), and the RNA samples were diluted to 100 ng µL⁻¹ using a Biomek 4000 Laboratory Automated Workstation (Beckman Coulter, CA, USA). Quantitative RT-PCR was conducted using the AgPath-ID One Step RT-PCR kit (ThermoFisher, MA, USA) according to the manufacturer’s instructions with primers targeting the SAV3 nsP1 gene (F: 5′-CCGGCCCTGAACCAGTT-3′; R: 5′-GTAGCCAAGTGGGAGAAAGCT-3′ and probe: 6FAM-TCGAAGTGGTGGCCAG-MGBNFQ)[39]. Briefly, 200 ng of total RNA was added to a reaction mixture containing 400 nM forward and reverse primers and 160 nM probe in a total volume of 10 µL on a 384-well plate [39]. The qPCR protocol included reverse transcription (1 cycle: 45 °C/10 min), predenaturation (1 cycle: 95 °C/10 min), 40 cycles of amplification (95 °C/15 s and 60 °C/45 s) and fluorescence detection using a QuantStudio 5 real-time PCR system (Applied Biosystems, MA, USA).

Nanopore sequencing library preparation

Only heart samples with Ct values below 35 were included for analysis via nanopore sequencing. A total of 22 heart samples from salmon and trout at 2, 4, and 8 wpc were included in this experiment. Each experimental group (i.e., fish species at a specific sampling time point) included 3–4 samples, given the maximum of 24 barcodes available in the nanopore sequencing library used in this study (Additional file 1). From each sample, 1 µg of total RNA was added to a total of 10 µL of cDNA reaction mix containing 10X SuperScript reverse transcriptase, 5X VILO reaction and random hexamers (SuperScript VILO cDNA synthesis kit (Invitrogen, MA, USA)). The cDNA mixture was then sequentially incubated at the following conditions: 25 °C for 10 min, 42 °C for 60 min, 50 °C for 30 min, and 85 °C for 5 min. For each sample, eight sets of PCR primers were used to produce eight amplicons (amplicon1—amplicon8; amp1—amp8) that covered most of the SAV genome (Figure 1A; Additional file 2). Briefly, the PCR mixture was prepared using the following components: 2 µL of 5X Q5 reaction buffer, 0.2 µL of 10 mM dNTPs, 0.1 µL of Q5 hot-start DNA polymerase (20 units mL⁻¹), primers (forward and reverse; 5 µM), 1 µL of cDNA (synthesized from 100 ng of total RNA), and DNase-free water up to 10 µL. The PCR conditions were as follows: 1 cycle of denaturation (98 °C for 30 s), 35 cycles of amplification (98 °C for 10 s, 62 °C for 30 s, and 72 °C for 3 min), and 1 cycle of post-extension (72 °C for 8 min). Amplicons were cleaned using AMPure XP beads according to the manufacturer’s guidelines (Beckman Coulter, CA, USA). Blunt end repair and DNA ligation were carried out using the NEBNext End Repair Module and NEBNext Ligation Sequencing Kit (NEBNext, MA, USA). A Native Barcoding Kit 24 (Q20 + and duplex enabled, Oxford Nanopore, UK) was used to obtain a unique barcode for all eight amplicons from each sample. All the barcoded samples were then pooled together and sequenced using a MinION flow cell (R10.4, Oxford Nanopore, UK).

Bioinformatics

Basecalling

Basecalling was performed using the GPU-enabled guppy6.06 basecaller with the super accuracy configuration dna_r10.4_e8.1_sup.cfg. Since the accuracy of the raw reads is important for downstream variant calling analyses, we further implemented the newer duplex basecalling capability introduced by the Oxford Nanopore Company (Oxford Nanopore, UK). Duplex tools were used to identify duplex pairs. The guppy duplex basecalling command was then executed with the super accuracy configuration (dna_r10.4_e8.1_sup.cfg), and the duplex pair information identified in the prior step was used as input. The flags “–barcode_kits “SQK-NBD112-24”–trim_barcodes –trim_adapters –trim_strategy dna –require_barcodes_both_ends” were included in this command to ensure proper demultiplexing and trimming of adapter sequences.

Single nucleotide variant (SNV) identification

To identify single nucleotide variants (SNVs) (Table 1) occurring in salmon samples at 4 and 8 wpc and all trout samples, a consensus genome was constructed from the reads from the salmon samples at 2 wpc. Briefly, the sequence reads from the 2wpc_Salmon experimental group were mapped onto the published SAV3 genome (SAV3-2-MR/10 isolate; GenBank accession: KC122926), after which Tablet (ver. 1.21.02.08) [40] was used to generate the “2wpc consensus genome”. All variant analyses were conducted using the 2wpc consensus genome. The FastQ files for each sample, identified by the barcodes, were mapped onto the 2wpc consensus genome using Bowtie2 with the “very sensitive option” [41]. The SAM file was converted to a sorted BAM file using samtools, and the variant calling file (vcf) was produced using BCFtools call with the command “-m” or “-mv" [42, 43]. The terminology related to the analysis of SNVs conducted in this study is defined in Table 1. Excluding primer binding site sequences, SNVs were identified using the variant calling command with the “-mv” option. Any of the three possible nucleotides that differed from the nucleotide in the reference genome at a polymorphic site were defined as “SNV alleles” (Table 1). SNV-alleles with an SNV allele frequency ranging from 5–60% were considered minor SNV-alleles while SNV-alleles with an SNV-allele_freq above 60% were considered major (Table 1). For each sampling time point and fish species (i.e., experimental group), the number of major SNV-alleles was counted (Figure 2).

Table 1 Definition of terminologies for the analysis of single nucleotide variants (SNVs) and sequence read clustering investigated in this study

Full size table

Identification of major and minor SAV3 cluster(s) in each amplicon

For each sample, all the sequence reads in the FastQ files were mapped onto each of the eight individual amplicons using Bowtie2 with the same options as described in the subsection “Single nucleotide variant (SNV) identification”. The reads from amplicon (amp) 7 and amp8 were pooled together for clustering because the amplicons overlapped somewhat (Figure 1). Antisense reads in the sets were transformed to complementary sense reads using FASTX-Toolkit [44, 45]. The reads from each amplicon were de novo clustered (i.e., amp1-cluster to amp8-cluster) using qiime2 and a 99% sequence identity threshold [46]. In detail, the sample information and FastQ files were processed (“tools” option with the flags “– type SampleData[SequencesWithQuality]” and “–input-format SingleEndFastqManifestPhred33V2”) to.qza file using qiime2. Then, the individual sequences and table files were extracted with the flag “vsearch dereplicate-sequences”, and finally, de novo clustering was carried out through “vsearch cluster-features-de-novo”, with the flag “–p-perc-identity 0.99″. Only reads not shorter than 90% of the amplicon length were included in the clustering, and only clusters that contained at least 0.5% of all reads for the given amplicon were used for further analysis. For each amplicon, the clusters passing the above criteria were then aligned, and phylogenetic trees were produced using the maximum likelihood phylogenetic method with 1000 bootstrap replicates in MEGA11 [47, 48].

Visualization of the location of selected deletions and SNVs in the SAV3 spike protein

The amino acid sequences for E1, E2, and E3 from the 2wpc_consensus genome were used. The SAV3 spike protein structure was modelled using homology modelling in SWISS-MODEL in automated mode [49]. The 3D structure of the SAV3 spike protein model was visualized using PyMOL software [50, 51]. The predicted 3D structure was used to visualize the location of the deletions observed in those of the minor clusters that contained at least 10% of the reads (i.e., a proportion > 10%). Additionally, the sites with nonsynonymous minor or major SNVs are also shown in the 3D structure.

Statistical analysis

Duncan’s HSD one-way ANOVA was used for the statistical analysis of Ct values and relative cluster size data. Welch’s two-sample t test was used for the SNV_freq and SNV-allele_freq analyses. The threshold of the p value was set to less than 0.05. All the statistical analyses were carried out using the “haven” library in R [52]. The statistical significance of the frequency of major SNV-alleles compared to the amino acid composition of the SAV3 2wpc_Salmon consensus genome was confirmed using chi-square testing in R.

Results

Viral load

The viral load in the samples included in the sequencing was assessed using qPCR. For Atlantic salmon, the mean Ct values were 28.9 ± 6.3, 22.6 ± 3.9, and 26.8 ± 0.4 at 2, 4, and 8 wpc, respectively. For trout, the parallel Ct values were 25.9 ± 4.0, 21.9 ± 0.8, and 33.4 ± 1.0, respectively. Significant differences in viral load measured by the Ct values between species were observed at 8 wpc (Additional file 3).

Nanopore sequencing

More than five million raw nanopore reads were contained in the Fast5 file obtained from the sequencing experiment using a single R10.4 nanopore flow cell. The Fast5 file was converted to nucleotide sequences using guppy 6.06 with the super accuracy base-calling algorithm, resulting in 5,278,494 reads with a median Phred quality score of 16.412 (equivalent to ~97.72% estimated accuracy). Using the duplex basecalling algorithm, we obtained 166740 reads that passed the more rigorous filtering implemented in this method, corresponding to less than 3.2% of the total reads. However, the median Phred quality score was much greater at 24.109, equivalent to ~99.61% estimated accuracy (mean Phred quality score ± standard deviation = 25.116 ± 7.392). Among them, 97,761 reads could be properly identified by the barcode. This study exclusively employed high-quality sequence reads that were accurately identified by barcodes after duplex basecalling. On average, ~50% of the high-quality sequence reads (45,318 out of 97,791 reads) were successfully mapped onto the reference genome (Additional file 1). Upon examination of unmapped sequences, sequences harboring high similarity to SAV were identified but were characterized by the presence of sequence transpositions, inversions, large insertions, or deletions. Whether these unmapped sequences were PCR artefacts or originated from viral variation was not examined in this study.

Major and minor mutation changes in SAV

Among the 22 samples, a total of 16 major SNV-alleles were identified in this study, and some of the major SNV-alleles were present in multiple samples (Figures 2, 3). Most of these major SNV-alleles appeared to be randomly distributed across the sampling time points and between fish species. However, two major, nonsynonymous SNV-alleles were identified in two out of four fish (50%) in the same experimental group. These mutations, which are located in nsP2 (SNV-nsP2₃₄₁₄-T/C) and E2 (SNV-E2₁₁₈₇-T/C), resulted in changes from tyrosine to histidine and valine to alanine, respectively (Figure 3). We also noted that while arginine constituted only 6.3% (248/3906) of the amino acids in the 2wpc_Salmon consensus genome, 18.8% (3/16) of the major SNVs occurred in codons for arginine (Table 2). Arginine codons, therefore, were the site of major SNVs three times more frequently than would be expected based on their relative frequency in the genome (P = 0.0431). The remaining 19 amino acids did not harbor major SNVs at a frequency that was significantly higher or lower than their frequency within the 2wpc_Salmon consensus genome (Table 2). We also identified 7 minor SNV-alleles distributed in both nonstructural and structural genes (Figure 4, Additional file 4). Most of the minor SNV-alleles resulted in nonsynonymous mutations. The trout group tended to show more frequent changes than did the salmon group, especially in the E2 gene. In the trout experimental groups, the two minor SNV-alleles, SNV-E2₄₁₂ and SNV-E2₄₃₂, increased in SNV_freq during the experiment. There was a distinctly greater proportion of SNV-E2₄₁₂-T/C. For SNV-E2₄₃₂, two specific variants, both of which produce a glutamic acid (E) to aspartic acid (D) change (SNV-E2₄₃₂-G/T and SNV-E2₄₃₂-G/C), had a distinct, though not significant, increase in proportion (Additional file 4).

Table 2 The occurrence of major SNV-alleles in codons for amino acids

Full size table

Amplicon clusters and phylogenetic analysis

Through de novo clustering, we identified 9,613 clusters comprising both mapped and unmapped sequences (Additional file 5). Among them, only 7 clusters in amp1, 3 in amp2, 3 in amp3, 8 in amp4, 2 in amp5, 4 in amp6, and 9 in amp7&8 met the thresholds defined for this study (Figures 5, 6 and 7; Additional file 6). For each amplicon, there was a single major cluster that contained the majority (>45%) of reads, along with one or more minor cluster(s), each with a relatively small number of reads. As the clustering analysis applied a 99% identity threshold, larger deletions (> ~20 bp) influenced the resulting clusters much more than did shorter deletions and SNVs. The proportion of reads in each cluster varied across genome location, sampling time point, and host species. The 4wpc_Trout and 8wpc_Trout experimental groups had a significantly greater proportion of reads in some minor clusters than did the other experimental groups (Figures 5, 6 and 7; Additional file 6). This was most prominent for Amp7&8_cluster2 and Amp7&8_cluster3 for 8wpc_Trout (Figure 7). Most of the minor clusters predominantly exhibited frameshift deletions; however, each cluster was composed of sequences with 99% identity, resulting in the practical coexistence of both in-frame and frameshift deletion reads. In addition, in some raw clusters that did not pass the threshold, sequence inversion, transposition, insertion, and deletion were observed (Additional file 5).

Nonmetric multidimensional scaling (NMDS) analysis of variation between experimental groups

NMDS analysis was used to analyse the variation (dissimilarity) between the experimental groups. In the NMDS analysis, 36 dimensions (i.e., the number of clusters) were condensed into two dimensions where the distance between experimental groups (and specimens) in an NMDS plot indicates the degree of similarity. At two weeks post-challenge, the experimental groups partially overlapped, and each showed relatively little variation between specimens (Figure 8A). At four weeks post-challenge, the experimental groups no longer overlapped but still showed relatively little variation between specimens (Figure 8B). At eight weeks post-challenge, the experimental groups were again partially overlapping but showed a distinct difference in variation between specimens (Figure 8C).

Visualization of selected mutations in the spike protein

A homology model of the SAV spike protein was constructed using SWISS-MODEL, and the model was subsequently used to visualize the location of selected mutations (Figure 9). Amp6_cluster2, Amp78_cluster2, and amp78_cluster3 exceeded a mean proportion of reads of 10% in at least one experimental group, showing statistically significant differences. The consensus sequences from both clusters are frameshift deletions located at the apical region of the spike protein. However, in reality, reads containing both in-frame and frameshift deletions coexist (Figures 8B–D). The major nonsynonymous SNVs identified in the SAV spike protein are highlighted in green and yellow in Figure 9E and Additional file 7. The QMEANDisCo global score, ranging from 0 to 1, expresses the quality of a predicted model [53]. Higher QMEANDisCo scores indicate better quality and accuracy in the predicted protein structure. While the acceptable range for the QMEANDisCo global score may vary depending on the types of predicted proteins, a score above 0.50 generally implies that the predicted model is likely acceptable based on the established threshold [54]. The predicted SAV spike protein model based on the 2wpc_consensus sequence had a QMEANDisCo global score of 0.60 ± 0.05, which is comparable to that of other models of alphavirus spike proteins deposited (e.g., Q5WQY5; Chikungunya virus- 0.65 ± 0.05 QMEANDisCo global score). The deletions (Amp6_cluster2, Amp78_cluster2, Amp78_cluster2) and nonsynonymous mutations did not affect the QMEANDisCo global score, as they showed the same values.

Discussion

In the present study, we used the Nanopore long-read sequencing platform to sequence the salmonid alphavirus-3 (SAV3) genome from tissue samples collected from Atlantic salmon and brown trout at various time points during a virus challenge experiment. The primary source of SAV3 infection in cohabitants was the shedder fish. SAV3 sequences from the 2wpc_Salmon experimental group were analysed and used as a reference genome for the remaining experimental time points. The cohabitation challenge applied in this study has both advantages and disadvantages as a method for investigating SAV3 variants. The advantage of the cohabitation model is that it accurately replicates the actual route of waterborne SAV3 infection. However, cohabitation challenges also have potential limitations regarding two parameters: the actual dose of SAV3 to which cohabitant fish are exposed and the exact timing of their initial infection. These potential limitations should be noted when considering the population diversity of sequences within quasispecies at different time points post-infection.

Among the major nonsynonymous SNV-alleles, only two (SNV-nsP2₁₆₇₂-T/C and SNV-E2₁₁₈₇-T/C) were found in more than one fish. Among them, the SNV-E2₁₁₈₇-T/C, located within the spike protein, represented a nonsynonymous mutation that converts valine to alanine. This valine-to-alanine substitution may significantly influence viral fitness, leading to notable phenotypic changes. Interestingly, Tsetsarkin et al. [55] investigated the impact of an alanine-to-valine mutation at position 226 in the E1 fusion protein of Chikungunya virus (CHIKV). Compared with yellow fever mosquitos (Ae. aegypti), CHIKV with an alanine at this position (E1-226A) showed relatively rapid infection and an increased ability to infect Asian tiger mosquitos (Ae. albopictus). Conversely, CHIKV with valine at this position (E1-226 V) was significantly better at infecting yellow fever mosquitos. This study highlights how a single substitution can significantly alter the phenotypic characteristics of alphaviruses. Among several minor SNV-alleles identified between the experimental groups, only SNV-E2₄₁₂-T/C was consistently and significantly more abundant in the trout experimental group and exhibited a distinct increase over time. At another site, two minor SNV-alleles (SNV-E2₄₃₂-G/C and SNV-E2₄₃₂-G/T) that both led to an E (glutamic acid) to D (aspartic acid) aa change also increased in SNV-allele_freq over time in the trout experimental group, but this increase was not statistically significant. In general, SNVs could alter viral tropism towards different hosts. The E2 protein is one of the three glycoproteins that makes up the SAV spike protein and is one of the structural proteins where most immunogenic epitopes are located [56, 57]. Karlsen et al. [58] observed the influence of a mutation at position E2₂₀₆, from proline (E2_206p) to serine (E2_206s), which is located in the receptor binding site. The authors found that viral growth and replication differed significantly between these mutants. The E2_206s mutant also reverted to E2_206p when the virus was inoculated into a cell line (BF2), indicating that SAV3 may adapt to its host and environment. In the present study, the minor SNVs (E2₄₁₂ and E2₄₃₂) identified in the E2 gene are located in the middle of the spike protein rather than in the receptor binding site. Hence, the effect of these nonsynonymous mutations is likely less pronounced/direct than that of the variant observed in the study by Karlsen et al. [58]. On the other hand, most deletion mutations identified from minor clusters in the spike protein (Amp6_cluster2, Amp7&8_cluster2, and Amp7&8_cluster3) are located in a region that faces outwards from the viral membrane. Deletions in these regions could influence cellular tropism. In addition, introduction of minor SNV-nsP2₄₈₆ may lead to the introduction of premature stop codons (TAG and TAA). Given that nonstructural proteins such as nsP2 regulate viral RNA synthesis, premature stop codons will result in a defective viral polyprotein unable to perform its role in viruses.

In the cluster analysis, the reads in each identified cluster had at least 99% sequence identity. Given that the genetic identity among SAV subtypes ranges from ~86–96% [3], we used the threshold of 99% sequence identity in the cluster analyses to allow the study of intrasubtype variation. If, in contrast, a threshold lower than ~96% sequence identity had been used, the cluster analysis would not have been able to differentiate between SAV subtypes. Since the amplicons (and hence the reads) had an average length of approximately 2000 bp, the clusters, on average, differed from each other in at least 20 nucleotides. Using these threshold conditions inadvertently led to all the identified clusters being predominantly defined by larger deletions. When the reads in each identified cluster were “merged” into a defining consensus sequence, these deletions mostly led to a shift in the reading frame. This would suggest that these deletion-defined clusters should be considered nonproductive dead ends. It should be noted, however, that among the reads in these clusters, there were sequences with in-frame deletions that, in principle, could retain (some) functionality. Similarly, Gallagher et al. [17] identified many deletion mutations based on nanopore sequencing, and ~34% of deletions did not disrupt the protein-coding frame (in-frame mutation), which leaves open the possibility that not all observed deletions result in defective viral particles. In addition, the sizes of the complete SAV genomes varied slightly (SAV1 (AJ316244.1; 11,919 bp), SAV2 (AJ316246.1; 11,900 bp), SAV3 (KC122926.1; 11,887 bp), SAV4 (MH708651.1; 11,762 bp), SAV5 (MH708650.1; 11,804 bp), and SAV6 (MH238448.1; 11,726 bp)). This difference may ultimately stem from the frequent occurrence of deletion mutations in SAV. Overall, the cluster analysis of each of the 8 amplicons revealed little directional development (i.e., adaptation) at different sampling time points or between fish species. The only exception was for amplicons 1 and 7/8, where the frequency of some minor clusters increased for brown trout at 8 wpc.

NMDS analysis integrating the cluster data over all eight amplicons indicated that late in infection, SAV3 genomes from brown trout had higher levels of variation than did SAV3 genomes from salmon. At the first sampling time point (2wpc), little difference was observed in the NMDS plot. By 4 wpc, the experimental groups had similar levels of variation but were still separated in the NMDS plot. In contrast, the groups overlapped at 8 wpc, but the brown trout experimental group showed distinctly more variation. Considering the distinct kinetics observed between salmon and trout at 8 wpc, the susceptibility of brown trout to SAV3 may be lower than that of other trout species. The observed higher variation in brown trout could be interpreted as the SAV3 exploring the virus fitness landscape in a host to which it is not well adapted.

In conclusion, this study provides insight into the genetic variation in SAV3 in infected fish, revealing mostly random variation with no development in SNV_freq during the experiment. Nevertheless, a few specific variants, such as SNV-E2₄₁₂ and SNV-E2₄₃₂, increased in frequency with time, potentially showing viral adaptation to trout. We believe that this approach and bioinformatics pipeline will be useful for studies of viral variation and evolution.

Data Availability

The datasets used in this study are available from the corresponding author upon reasonable request.

References

Deperasińska I, Schulz P, Siwicki AK (2018) Salmonid alphavirus (SAV). J Vet Res 62:1
Article PubMed PubMed Central Google Scholar
Pietilä MK, Hellström K, Ahola T (2017) Alphavirus polymerase and RNA replication. Virus Res 234:44–57
Article PubMed Google Scholar
Tighe AJ, Gallagher MD, Carlsson J, Matejusova I, Swords F, Macqueen DJ, Ruane NM (2020) Nanopore whole genome sequencing and partitioned phylogenetic analysis supports a new salmonid alphavirus genotype (SAV7). Dis Aquat Organ 142:203–211
Article PubMed Google Scholar
Firth AE, Chung BY, Fleeton MN, Atkins JF (2008) Discovery of frameshifting in Alphavirus 6K resolves a 20-year enigma. Virol J 5:108
Article PubMed PubMed Central Google Scholar
Melton JV, Ewart GD, Weir RC, Board PG, Lee E, Gage PW (2002) Alphavirus 6K proteins form ion channels. J Biol Chem 277:46923–46931
Article CAS PubMed Google Scholar
Ramsey J, Mukhopadhyay S (2017) Disentangling the frames, the state of research on the alphavirus 6K and TF proteins. Viruses 9:228
Article PubMed PubMed Central Google Scholar
Nelson R, McLoughlin M, Rowley H, Platten M, McCormick J (1995) Isolation of a toga-like virus from farmed Atlantic salmon Salmo salar with pancreas disease. Dis Aquat Organ 22:25–32
Article Google Scholar
Gallagher MD, Matejusova I, Ruane NM, Macqueen DJ (2020) Genome-wide target enriched viral sequencing reveals extensive ‘hidden’ salmonid alphavirus diversity in farmed and wild fish populations. Aquac 522:735117
Article CAS Google Scholar
Herath TK, Thompson KD (2022) Salmonid alphavirus and pancreas disease Aquac Pathophysiol. Elsevier, Amsterdam
Google Scholar
Herath TK, Ashby AJ, Jayasuriya NS, Bron JE, Taylor JF, Adams A, Richards RH, Weidmann M, Ferguson HW, Taggart JB (2017) Impact of Salmonid alphavirus infection in diploid and triploid Atlantic salmon (Salmo salar L.) fry. PLoS One 12:e0179192
Article PubMed PubMed Central Google Scholar
Weston JH, Welsh MD, McLoughlin MF, Todd D (1999) Salmon pancreas disease virus, an alphavirus infecting farmed Atlantic salmon, Salmo salar L. Virol 256:188–195
Article CAS Google Scholar
Villoing S, Béarzotti M, Chilmonczyk S, Castric J, Brémont M (2000) Rainbow trout sleeping disease virus is an atypical alphavirus. J Virol 74:173–183
Article CAS PubMed PubMed Central Google Scholar
Hodneland K, Bratland A, Christie K, Endresen C, Nylund A (2005) New subtype of salmonid alphavirus (SAV), Togaviridae, from Atlantic salmon Salmo salar and rainbow trout Oncorhynchus mykiss in Norway. Dis Aquat Organ 66:113–120
Article CAS PubMed Google Scholar
Bruno D, Noguera P, Black J, Murray W, Macqueen D, Matejusova I (2014) Identification of a wild reservoir of salmonid alphavirus in common dab Limanda limanda, with emphasis on virus culture and sequencing. Aquac Environ Interact 5:89–98
Article Google Scholar
Macqueen DJ, Eve O, Gundappa MK, Daniels RR, Gallagher MD, Alexandersen S, Karlsen M (2021) Genomic epidemiology of salmonid alphavirus in Norwegian aquaculture reveals recent Subtype-2 transmission dynamics and novel Subtype-3 lineages. Viruses 13:2549
Article CAS PubMed PubMed Central Google Scholar
Karlsen M, Hodneland K, Endresen C, Nylund A (2006) Genetic stability within the Norwegian subtype of salmonid alphavirus (family Togaviridae). Arch Virol 151:861–874
Article CAS PubMed Google Scholar
Gallagher MD, Karlsen M, Petterson E, Haugland Ø, Matejusova I, Macqueen DJ (2020) Genome sequencing of SAV3 reveals repeated seeding events of viral strains in Norwegian aquaculture. Front Microbiol 11:524801
Article Google Scholar
Petterson E, Stormoen M, Evensen Ø, Mikalsen AB, Haugland Ø (2013) Natural infection of Atlantic salmon (Salmo salar L.) with salmonid alphavirus 3 generates numerous viral deletion mutants. J Gen Virol 94:1945–1954
Article CAS PubMed Google Scholar
Patterson EI, Khanipov K, Swetnam DM, Walsdorf S, Kautz TF, Thangamani S, Fofanov Y, Forrester NL (2020) Measuring alphavirus fidelity using non-infectious virus particles. Viruses 12:546
Article CAS PubMed PubMed Central Google Scholar
Peck KM, Lauring AS (2018) Complexities of viral mutation rates. J Virol 92:e01031-17
Article CAS PubMed PubMed Central Google Scholar
Stapleford KA, Rozen-Gagnon K, Das PK, Saul S, Poirier EZ, Blanc H, Vidalain P-O, Merits A, Vignuzzi M (2015) Viral polymerase-helicase complexes regulate replication fidelity to overcome intracellular nucleotide depletion. J Virol 89:11233–11244
Article CAS PubMed PubMed Central Google Scholar
Poirier EZ, Mounce BC, Rozen-Gagnon K, Hooikaas PJ, Stapleford KA, Moratorio G, Vignuzzi M (2016) Low-fidelity polymerases of alphaviruses recombine at higher rates to overproduce defective interfering particles. J Virol 90:2446–2454
Article CAS PubMed Central Google Scholar
Simon-Loriere E, Holmes EC (2011) Why do RNA viruses recombine? Nat Rev Microbiol 9:617–626
Article CAS PubMed PubMed Central Google Scholar
Sumpter R Jr, Wang C, Foy E, Loo Y-M, Gale M Jr (2004) Viral evolution and interferon resistance of hepatitis C virus RNA replication in a cell culture model. J Virol 78:11591–11604
Article CAS PubMed PubMed Central Google Scholar
Domingo E, Sheldon J, Perales C (2012) Viral quasispecies evolution. Microbiol Mol Biol Rev 76:159–216
Article CAS PubMed PubMed Central Google Scholar
Rhoads A, Au KF (2015) PacBio sequencing and its applications. Genom Proteom Bioinform 13:278–289
Article Google Scholar
Takeda H, Yamashita T, Ueda Y, Sekine A (2019) Exploring the hepatitis C virus genome using single molecule real-time sequencing. World J Gastroenterol 25:4661
Article CAS PubMed PubMed Central Google Scholar
Freed NE, Vlková M, Faisal MB, Silander OK (2020) Rapid and inexpensive whole-genome sequencing of SARS-CoV-2 using 1200 bp tiled amplicons and Oxford Nanopore rapid barcoding. Biol Methods Protoc 5:bpaa014
Boldogkői Z, Moldován N, Balázs Z, Snyder M, Tombácz D (2019) Long-read sequencing–a powerful tool in viral transcriptome research. Trends Microbiol 27:578–592
Article PubMed Google Scholar
Karst SM, Ziels RM, Kirkegaard RH, Sørensen EA, McDonald D, Zhu Q, Knight R, Albertsen M (2021) High-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing. Nat Methods 18:165–169
Article CAS PubMed Google Scholar
Rang FJ, Kloosterman WP, de Ridder J (2018) From squiggle to basepair: computational approaches for improving nanopore sequencing read accuracy. Genome Biol 19:90
Article PubMed PubMed Central Google Scholar
Gallagher MD, Matejusova I, Nguyen L, Ruane NM, Falk K, Macqueen DJ (2018) Nanopore sequencing for rapid diagnostics of salmonid RNA viruses. Sci Rep 8:16307
Article PubMed PubMed Central Google Scholar
Boone M, De Koker A, Callewaert N (2018) Capturing the ‘ome’: the expanding molecular toolbox for RNA and DNA library construction. Nucleic Acids Res 46:2701–2721
Article CAS PubMed PubMed Central Google Scholar
Brodin J, Hedskog C, Heddini A, Benard E, Neher RA, Mild M, Albert J (2015) Challenges with using primer IDs to improve accuracy of next generation sequencing. PLoS One 10:e0119123
Article PubMed PubMed Central Google Scholar
Sanderson ND, Kapel N, Rodger G, Webster H, Lipworth S, Street TL, Peto T, Crook D, Stoesser N (2023) Comparison of R9. 4.1/Kit10 and R10/Kit12 Oxford Nanopore flowcells and chemistries in bacterial genome reconstruction. Microb Genom 9:000910
Blaxter M, Mann J, Chapman T, Thomas F, Whitton C, Floyd R, Abebe E (2005) Defining operational taxonomic units using DNA barcode data. Philos Trans R Soc Lond B Biol Sci 360:1935–1943
Article CAS PubMed PubMed Central Google Scholar
Nguyen N-P, Warnow T, Pop M, White B (2016) A perspective on 16S rRNA operational taxonomic unit clustering using sequence similarity. NPJ Biofilms Microbi 2:1–8
Article CAS Google Scholar
Xu C, Guo T-C, Mutoloki S, Haugland Ø, Evensen Ø (2012) Gene expression studies of host response to Salmonid alphavirus subtype 3 experimental infections in Atlantic salmon. Vet Res 43:1–10
Article Google Scholar
Hodneland K, Endresen C (2006) Sensitive and specific detection of Salmonid alphavirus using real-time PCR (TaqMan®). J Viro Methods 131:184–192
Article CAS Google Scholar
Milne I, Stephen G, Bayer M, Cock PJ, Pritchard L, Cardle L, Shaw PD, Marshall D (2013) Using Tablet for visual exploration of second-generation sequencing data. Brief Bioinform 14:193–202
Article CAS PubMed Google Scholar
Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9:357–359
Article CAS PubMed PubMed Central Google Scholar
Danecek P, Schiffels S, Durbin R, Multiallelic calling model in bcftools (-m) (2014), June
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, Handsaker RE, Lunter G, Marth GT, Sherry ST (2011) The variant call format and VCFtools. Bioinformatics 27:2156–2158
Article CAS PubMed PubMed Central Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Subgroup GPDP (2009) The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079
Article PubMed PubMed Central Google Scholar
Gordon A, Hannon GJ (2010) Fastx-toolkit. FASTQ/A short-reads preprocessing tools (unpublished). http://hannonlab.cshl.edu/fastx_toolkit/. Accessed May 2022
Bolyen E, Rideout JR, Dillon MR, Bokulich NA, Abnet CC, Al-Ghalith GA, Alexander H, Alm EJ, Arumugam M, Asnicar F (2019) Reproducible, interactive, scalable and extensible microbiome data science using QIIME 2. Nat Biotechnol 37:852–857
Article CAS PubMed PubMed Central Google Scholar
Tamura K, Stecher G, Kumar S (2021) MEGA11: molecular evolutionary genetics analysis version 11. Mol Biol Evol 38:3022–3027
Article CAS PubMed PubMed Central Google Scholar
Sievers F, Wilm A, Dineen D, Gibson TJ, Karplus K, Li W, Lopez R, McWilliam H, Remmert M, Söding J (2011) Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol Syst Biol 7:539
Article PubMed PubMed Central Google Scholar
Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, Heer FT, de Beer TAP, Rempfer C, Bordoli L (2018) SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res 46:W296–W303
Article CAS PubMed PubMed Central Google Scholar
Bramucci E, Paiardini A, Bossa F, Pascarella S (2012) PyMod: sequence similarity searches, multiple sequence-structure alignments, and homology modeling within PyMOL. BMC Bioinformat 13:1–6
Article Google Scholar
Yuan S, Chan HS, Hu Z (2017) Using PyMOL as a platform for computational drug design. Wiley Interdisciplinary Rev Comput Mol Sci 7:e1298
Article Google Scholar
Wickham H, Miller E, haven: Import and Export'SPSS','Stata'and'SAS'Files. R package version 1.1. 2, 2018 (2017)
Studer G, Rempfer C, Waterhouse AM, Gumienny R, Haas J, Schwede T (2020) QMEANDisCo—distance constraints applied on model quality estimation. Bioinformatics 36:1765–1771
Article CAS PubMed Google Scholar
Gupta K (2023) In silico structural and functional characterization of hypothetical proteins from Monkeypox virus. J Genet Eng Biotechnol 21:46
Article PubMed PubMed Central Google Scholar
Tsetsarkin KA, Vanlandingham DL, McGee CE, Higgs S (2007) A single mutation in chikungunya virus affects vector specificity and epidemic potential. PLoS Pathog 3:e201
Article PubMed PubMed Central Google Scholar
Hunt AR, Frederickson S, Maruyama T, Roehrig JT, Blair CD (2010) The first human epitope map of the alphaviral E1 and E2 proteins reveals a new E2 epitope with significant virus neutralizing activity. PLoS Negl Trop Dis 4:e739
Article PubMed PubMed Central Google Scholar
Hikke MC, Braaen S, Villoing S, Hodneland K, Geertsema C, Verhagen L, Frost P, Vlak JM, Rimstad E, Pijlman GP (2014) Salmonid alphavirus glycoprotein E2 requires low temperature and E1 for virion formation and induction of protective immunity. Vaccine 32:6206–6212
Article CAS PubMed Google Scholar
Karlsen M, Andersen L, Blindheim SH, Rimstad E, Nylund A (2015) A naturally occurring substitution in the E2 protein of Salmonid alphavirus subtype 3 changes viral fitness. Virus Res 196:79–86
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors would like to acknowledge the invaluable contributions from the technicians at the IMR and personnel at the fish disease laboratory.

Funding

Open access funding provided by Institute Of Marine Research. This study was funded by the Institute of Marine Research (Bergen, Norway) in the context of the Disease Transmission Project (15821) and the VIRAQ Project (15533).

Author information

Authors and Affiliations

Institute of Marine Research, Nordnes, PO Box 1870, 5817, Bergen, Norway
HyeongJin Roh, Kai Ove Skaftnesmo, Dhamotharan Kannimuthu, Abdullah Madhun, Sonal Patel, Bjørn Olav Kvamme, H. Craig Morton & Søren Grove
Norwegian Veterinary Institute, Bergen, Norway
Sonal Patel

Authors

HyeongJin Roh
View author publications
You can also search for this author in PubMed Google Scholar
Kai Ove Skaftnesmo
View author publications
You can also search for this author in PubMed Google Scholar
Dhamotharan Kannimuthu
View author publications
You can also search for this author in PubMed Google Scholar
Abdullah Madhun
View author publications
You can also search for this author in PubMed Google Scholar
Sonal Patel
View author publications
You can also search for this author in PubMed Google Scholar
Bjørn Olav Kvamme
View author publications
You can also search for this author in PubMed Google Scholar
H. Craig Morton
View author publications
You can also search for this author in PubMed Google Scholar
Søren Grove
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Conception and design of the study: HR, DK, HCM, and SG; acquisition and analysis of data: HR, KOS, AM, SP, and BOK; interpretation of data: HR, KOS, DK, HCM, and SG; drafting of manuscript: HR and SG; and revision of the manuscript: HR, KOS, DK, AM, SP, BOK, HCM, and SG. All authors read and approved the final manuscript.

Corresponding author

Correspondence to HyeongJin Roh.

Ethics declarations

Competing interests

The authors declare that they have no competing interests.

Additional information

Handling editor: Stéphane Biacchesi.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1. Basic characteristics of the reads and data used for sequencing and read mapping of the reference genome.

Additional file 2. Primers used in this study.

Additional file 3.The Ct values (mean ± SD) determined by RT-qPCR targeting the SAV3 nsP1 gene in the samples sequenced in this study.

Additional file 4. The frequency of minor SNVs

in the experimental groups. A total of 7 SNVs were identified as minor, as they had an SNV_freq between 5 and 60% in at least one experimental group. For each minor SNV, the table shows the frequency observed in the experimental groups and the results of Welch’s t test comparison of frequencies in the experimental groups and the 2wpc_Salmon consensus genome.

Additional file 5. All the raw de novo clusters identified in this study.

Additional file 6. Consensus nucleotide sequences of clusters that passed the threshold in this study.

Additional file 7. Visualization of the locations of selected deletions and SNVs in the SAV3 spike protein.

A 3D structural model of the SAV3 spike protein consisting of the E1, E2 and E3 subunits was constructed via homology modelling and visualized in videos. (A) Space-filling model of the SAV3 spike protein, shown as a 12-meric protein including four E1 subunits (white), four E2 subunits (orange), and four E3 subunits (gray). (B, C and D) The deletions identified in Amp6_cluster2, Amp7&8_cluster2, and Amp7&8_cluster3, respectively, are highlighted in blue. (E) Nonsynonymous major SNVs (SNV-E2₁₁₈₇ and SNV-E1₁₃₂₁) are highlighted in green and purple, and two minor SNVs (SNV-E2₄₁₂ and SNV-E2₄₃₂) are shown in cyan and yellow.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Roh, H., Skaftnesmo, K.O., Kannimuthu, D. et al. Nanopore sequencing provides snapshots of the genetic variation within salmonid alphavirus-3 (SAV3) during an ongoing infection in Atlantic salmon (Salmo salar) and brown trout (Salmo trutta). Vet Res 55, 106 (2024). https://doi.org/10.1186/s13567-024-01349-z

Download citation

Received: 19 February 2024
Accepted: 24 June 2024
Published: 03 September 2024
DOI: https://doi.org/10.1186/s13567-024-01349-z

Nanopore sequencing provides snapshots of the genetic variation within salmonid alphavirus-3 (SAV3) during an ongoing infection in Atlantic salmon (Salmo salar) and brown trout (Salmo trutta)

Abstract

Similar content being viewed by others

Introduction

Materials and methods

Fish and viral challenge

RNA extraction and quantitative PCR (qPCR)

Nanopore sequencing library preparation

Bioinformatics

Basecalling

Single nucleotide variant (SNV) identification

Identification of major and minor SAV3 cluster(s) in each amplicon

Visualization of the location of selected deletions and SNVs in the SAV3 spike protein

Statistical analysis

Results

Viral load

Nanopore sequencing

Major and minor mutation changes in SAV

Amplicon clusters and phylogenetic analysis

Nonmetric multidimensional scaling (NMDS) analysis of variation between experimental groups

Visualization of selected mutations in the spike protein

Discussion

Data Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation