The genus Vitivirus (family Betaflexiviridae; subfamily Trivirinae) consists of several ssRNA (+) viruses characterized from grapevine and other perennial plants. To date, 15 species have been assigned to the genus (https://talk.ictvonline.org/files/master-species-lists/m/msl/7992). Eleven vitiviruses were discovered in grapevine (Vitis spp.), the host that inspired the genus epithet. The grapevine-infecting vitiviruses may not elicit discernible foliar symptoms, but infection by some, such as grapevine virus A (GVA) and grapevine virus B (GVB), has been associated with detrimental modifications of the vine woody cylinder and Shiraz disease [1]. Mealybugs (Hemiptera: Pseudococcidae) and/or soft scale insects (Hemiptera: Coccidae) are vectors for GVA, GVB and grapevine virus E (GVE) [1] whereas aphid transmission has been reported for heracleum latent virus (HLV) and mint virus 2 (MV-2) [2, 3]. The genomes of vitiviruses are organized into five ORFs, except for those of arracacha virus V (AVV) and HLV, which lack an ORF2. The ORF1 encodes a large replication-associated protein (REP), ORF2 encodes a hypothetical protein of unknown function, ORF3 encodes a movement protein (MP), ORF4 encodes the coat protein (CP), and ORF5 encodes a nucleic-acid-binding protein (NABP) [4].

In May 2018, we explored the virome of a 32-year-old Blanc du Bois (Vitis spp.: ‘Florida D 6-148’ × ‘Cardinal’) vineyard located in Austin County, Texas. Leaf tissue samples (with intact petioles) were collected from 51 vines across the vineyard, flash frozen in liquid N2, and stored in -80 °C until processing. Total RNA was extracted from each of the 51 samples using a SpectrumTM Plant Total RNA Kit (Sigma-Aldrich) and equimolar amounts of RNA from each sample were pooled into a single RNA sample, which was subjected to high-throughput sequencing (HTS) on an Illumina NextSeq 500 platform [5]. The approximately 23.5 million raw HTS reads generated from the sample (TX-WAT) were bioinformatically processed as described earlier [6]. The analysis revealed contigs specific to five viruses and three viroids, among which only 10 contigs (ranging from 206 to 4,405 nt) showed similarities to vitivirus sequences, particularly the recently characterized grapevine virus H (GVH) [7] (query coverage, 100%; average identity, 68%). A quick screen of each of the 51 RNA samples by RT-PCR with two primer pairs designed from the HTS-derived sequences (data not shown) revealed a 11.8% (6/51) incidence of the vitivirus.

Independently, the near-complete genome of the virus was amplified from three positive samples as two pieces of overlapping DNA fragments using two sets of primer pairs designed from the HTS-derived sequences of TX-WAT. The primer pair GVM-264v (5’-CCAgTTAgCAgCgAAAggTAT) and GVM-4086c (5’-gTTgTACCTCCTTCggTTCTTC) amplified a 3,823-bp DNA fragment specific to the REP, and the primer pair GVM-3820v (5’-CCgACgAAgACCCATTTgTA) and GVM-7211c (5’-CTCTACCgTATTTCACCCAgTC) amplified a 3,392-bp DNA fragment that encompassed the C-terminal sequences of the REP and other genes. RT-PCR, cloning, and Sanger sequencing were done as described earlier [8]. Sequences of the 5’ and 3’ extremities of the virus were determined using a Clontech SMARTer RACE 5’/3’ kit (Takara-Bio). Pairwise comparisons of the fragment-specific sequences derived from the three samples showed > 99% sequence identity among them; hence, the genome sequence of the virus was assembled using BioEdit software [9] from fragments obtained from one of the isolates. The programs ORF Finder and SMART BLAST (https://www.ncbi.nlm.nih.gov/orffinder/) were used to predict and verify potential protein-encoding segments of the DNA sequence.

The complete genome of the vitivirus isolate TX-WAT, excluding the polyA tail, was determined to be 7,387 nt in length (MK492703), a typical size for a vitivirus genome [4]. The sequence obtained by Sanger sequencing shared 99.7% nt identity with the HTS-derived sequence (~ 88% genome coverage), thus validating the HTS results. The genome organization and lengths of the encoded genes (Fig. 1) are typical for members of the genus Vitivirus [1, 4]. The REP-encoding ORF1 is 5,151 nt long (1,716 amino acids [aa]; 194.5 kDa), and it shared the highest nt (65.8%) and aa (74.7%) sequence identity with the REP of GVH; it contains all of the typical conserved motifs for members of the family Betaflexiviridae. ORF2 is 492 nt long (163 aa; 18.1 kDa) and encodes a hypothetical protein of unknown function that shares 55% nt and 65.6% aa sequence identity with ORF2 of GVH. ORF3 encodes an 804-nt MP (267 aa; 29.7 kDa) that shares 70.2% nt and 69.7% aa sequence identity with the MP of GVH. The CP-encoding ORF4 is 594 nt long (197 aa; 21.5 kDa) and it shared greatest similarity (nt: 78.7%; aa: 88.8%) with the CP of GVH. ORF5 encodes a 336-nt NABP (111 aa; 13.0 kDa) that shares 71.9% nt and 70.5% aa sequence identity with the NABP of GVH. Phylogenetic analysis of the predicted REP and CP aa sequences supports the placement of TX-WAT within the genus Vitivirus in the same clade as GVH and other grape-infecting vitiviruses (Fig. 2).

Fig. 1
figure 1

Genome organization of grapevine virus M. The positions of the five open reading frames are shown as rectangular boxes. REP, replicase; ORF2, a hypothetical protein of unknown function; MP, movement protein; CP, coat protein; NABP, nucleic-acid-binding protein; A(n), polyA 3’ tail

Fig. 2
figure 2

Neighbor-joining phylogenetic trees (1,000 bootstrap replications) based on the amino acid sequences of the replicase (A) and coat protein (B) of members of the genus Vitivirus. The proteins of the newly characterized grapevine virus M (GVM) are indicated in bold font with a colored background

The taxonomic status of isolate TX-WAT was interrogated vis-à-vis the current ICTV species demarcation threshold for the family Betaflexiviridae [4]. While an analysis of the CP sequences of TX-WAT would suggest that it is a variant of GVH, the nt and aa sequence identity levels in its REP are well below the set threshold of < 72% nt or < 80% aa identity for new species [4]. A similar situation exists for the recently characterized GVL isolate KA (MH643739), which also had sequence identity values above (nt: 71.6%; aa: 77.8%) and below (nt: 64.2%; aa: 69%) the species demarcation threshold values when compared with GVE isolate TvAQ7 in the taxonomically relevant CP and REP, respectively. Thus, we posit that isolate TX-WAT represents a new species in the genus Vitivirus, and we propose the name “grapevine virus M” (GVM) for this new virus. Further evidence that GVM is distinct from GVH could be seen in the relatively low levels of nt and aa sequence similarity between their ORF2, MP and NABP and the shorter length (12 nt) of its RACE-verified 3’ UTR sequence.

In future work, we will explore the biological properties of GVM and attempt to develop robust diagnostic assays for its accurate discrimination from GVH and other grapevine-infecting vitiviruses in epidemiological studies.