Vitis vinifera L. (grapevine) is a significant crop in the South African agricultural sector, with the country being in the top ten for both wine and table grape production [1]. Vegetative propagation and global trade of cultivars has led to grapevine being disproportionately affected by more viruses than any other crop [2]. The exponential increase in the use of metaviromics has accelerated the discovery of grapevine pathogens, with the number of currently known viruses approaching 90 [3]. South Africa has a long history of viticulture and, consequently, a diverse set of viruses circulating within the industry.

Members of the genus Vitivirus (family Betaflexiviridae) typically have genomes of approximately 7,500 nt with five open reading frames coding for a replication-associated protein (RAP), a 22-kDa protein associated with vector transmission, a movement protein (MP), a coat protein (CP), and a nucleic acid binding protein (NABP) that functions as a suppressor of RNA silencing [4]. Grapevine vitiviruses are common components of viral populations in South African vineyards [5], with grapevine virus A (GVA), grapevine virus B, and grapevine virus E (GVE) having been reported previously [6,7,8]. In general, single infections with grapevine vitiviruses are associated with very mild symptoms [9], but synergistic coinfections with other viruses result in several severe and economically important disease phenotypes [10]. GVA and GVB are part of a complex of viruses associated with rugose wood [11] and in South Africa are associated with Shiraz decline and corky bark disease, respectively [7, 12]. In recent years, the application of metaviromics approaches has led to the discovery of a number of new grapevine vitiviruses, including grapevine virus F [11], grapevine virus G (GVG) [13], grapevine virus H [14], and grapevine virus L [15].

In December 2019, a total of 229 samples were collected from the Vitis cultivar collection at the Nietvoorbij Campus of the Agricultural Research Council (ARC), as described previously [16] (GPS coordinates: -33.912053, 18.862291). Total RNA was isolated according to White et al. [17], and RNAtag-seq libraries were prepared as described [18] and sequenced using an Illumina HiSeq 2500 instrument (Illumina, San Diego, CA, USA; ARC - Biotechnology Platform, Onderstepoort, Pretoria, South Africa) as paired ends (2 × 125 nt) using TruSeq V4 chemistry (Illumina, San Diego, CA, USA).

Datasets were demultiplexed using Je software [19]. Trimming was performed with CLC Genomics Workbench 9 (QIAGEN Bioinformatics, Aarhus, Denmark), using the following parameters: minimum read length, 20 nt; quality limit, 0.05; adapter trimming with Illumina universal (5’-AGATCGGAAGAG-3’) and RNAtag-seq (5’-TACACGACGCTCTTCCGATCTNNNNNNNNT-3’) adapters. Assembly of trimmed reads into contigs was done using SPAdes 3.14.0 [20] with the meta option. The cultivars and datasets associated with this study are listed in Supplementary Table S1. These datasets are associated with NCBI BioProject PRJNA763365. Contigs of putative viral origin were identified using BLASTn [21] and the viral fraction of the NCBI Refseq database. Plant virus contigs were then submitted to the browser-based version of BLASTx, using the NCBI nr database, for further confirmation of their identity.

A novel vitivirus, provisionally named "grapevine virus N" (GVN), was associated with the V. vinifera cultivars Azal tinto, Bourboulenc, Cinsault noir, Crouchen blanc, and Palamino, while the provisionally named "grapevine virus O" (GVO) was associated with Roter Zierfandler and Royal Molenberg. The genome organization of GVN and GVO and associated amino acid sequences of each gene was determined using ORF Finder [22], showing the typical arrangement of members of the genus Vitivirus [11]. The average amino acid sequence identity (AAI) between each gene product and that of the closest relative was determined using the AAI calculator from the enveomics collection [23] and is shown in Table 1. The amino acid sequences of ORF1 (RAP) of both GVN and GVO were aligned with cognate NCBI RefSeq sequences of extant vitiviruses, using BioEdit 7.2.5 [24]. A best-fit maximum-likelihood phylogeny, based on the Le Gascuel model + G + I + F, was generated using MEGA X [25] and is shown in Figure 1.

Table 1 Data on grapevine virus N (GVN) and grapevine virus O (GVO) genomes and deduced genome products
Fig. 1
figure 1

Maximum-likelihood phylogeny based on the amino acid sequences derived from ORF 1 (replicase-associated proteins; RAP) of grapevine virus N and grapevine virus O from this study (indicated by solid circles) and references derived from other extant vitiviruses. The cognate RAP sequence from potato virus T was used as an outgroup. The phylogeny represents the tree with the highest log likelihood and was generated in MEGA X using the Le Gascuel model with frequencies, invariant sites and gamma distribution (n = 5). Bootstrapping was applied (1000 replicates), and the percentage of trees in which the associated taxa clustered together is shown next to the branches. Bootstrap percentages lower than 50 are not shown.

Portions of the GVN and GVO CP genes from each sample were amplified using RT-PCR. Primers with the following sequences were used to set up two-step RT-PCR reactions: GVN-CP-F (5’-TCGCTGAGATAATAAGGAGGATTGAG-3); GVN-CP-R (5’-GACTTGAATCACACTGGCTTCAGA-3’); GVO-CP-F (5’-GGTGTGATAGAGGAT AACCACAGT-3’); and GVO-CP-R (5’-TACACTCTAAACGACCACAACAGT-3’). Two-step RT-PCR reactions were carried using Promega GoScriptTM Reverse Transcriptase and GoTaq® Taq Polymerase (Promega, Madison, WI, USA) according to the manufacturer's instructions. Amplicons of the expected size were visualized on an agarose gel (572 and 789 bp for GVO and GVN, respectively), and their identities were confirmed using bidirectional Sanger sequencing (Inqaba Biotechnical Industries, Pretoria, South Africa).

The 5’-terminal nucleotides of GVN and GVO were confirmed using the 5' RACE System for Rapid Amplification of cDNA Ends, version 2.0 (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s specifications. Total RNA from Palamino and Roter Zierflander was used as the input for the GVN and GVO 5’ RACE reactions, respectively. The following gene-specific primers were used: For GVN, GVN-GSP1 (5’- CACTATATCTTAACTCATCT-3’) and GVN-GSP2 (5’- CCTCTACAATATGACTAGATATGCT-3’). For GVO, GVO-GSP1 (5’- AGGGTCGTCTTATCTTCATC-3’) and GVO-GSP2 (5’-TCCCTATACCTTAGGTTATC CTTAGC-3’). 3’ RACE amplification was not performed, as all of the GVN and GVO contigs contained a poly-A tract between 48 and 52 nt in length (data not shown). Confirmation of the 5’ ends showed that the GVN and GVO genomes are 7,486 and 7,560 nt in length, respectively, excluding the poly-A tract.

Considering the RAP phylogeny, GVN and GVO are most closely related to GVG and GVE, respectively, which was also confirmed by the AAI values obtained for the various genes of these viruses. Phylogenetically, GVN and GVO both clustered within the recently described GVE superclade [26]. The AAI values indicate that GVN and GVO are novel members of the family Betaflexiviridae, for which the species demarcation limit is 80% AAI for either the RAP or CP gene [27]. However, some additional criteria may need to be implemented in order to determine whether GVN and GVO are representatives of two new species or distinct members of already established taxa. GVN and GVO showed limited incidence and genetic diversity within the virus population of the vineyard under study, which could suggest a recent introduction. However, more widespread surveys of South Africa’s viticultural regions are needed in order to get a complete view of their distribution nationwide. Finally, the possible role of GVO and GVN in grapevine disease requires further investigation, as both viruses were found in coinfections with other viruses, including grapevine leafroll-associated virus 3, grapevine leafroll-associated virus 2, and at least two additional grapevine vitiviruses (data not shown). Given the potential for grapevine vitiviruses to be associated with synergistic coinfections [9], GVN and GVO should be considered potentially harmful pathogens for South African viticulture.