Introduction

Crinivirus is a genus in the family Closteroviridae, which also contains the genera Closterovirus, Ampelovirus and Velarivirus. Members of the family Closteroviridae have positive-sense, single-stranded genomes encapsidated in long flexuous virions [14]. Criniviruses have bipartite genomes and are transmitted by whiteflies, whereas viruses belonging to the genera Closterovirus and Ampelovirus, which have monopartite genomes, are transmitted by aphids and mealybugs, respectively [5]. Unlike the other three genera in the family Closteroviridae, no known insect vector has been identified for viruses in the genus Velarivirus [6]. Tetterwort, otherwise known as Chelidonium majus, in the family Papaveraceae, is a valuable herbaceous plant used for the treatment of various diseases, including gastroenteritis, and as an anodyne. Tetterwort produces small, bright yellow flowers and is a common flowering plant cultivated in gardens in South Korea. Here, we present the complete genomic sequence of TVCV, identified by RNA sequencing, and its relationships to other Crinivirus members.

Materials and methods

Tetterwort plants showing typical symptoms of vein chlorosis and leaf distortion were collected in Yesan, South Korea. Total RNA was extracted using TRI Reagent (MRC, Cincinnati, OH, USA)n and the ribosomal RNA was removed using a Ribo-Zero rRNA Removal Kit (Plant Leaf) (Epicentre, Madison, WI, USA), according to the manufacturer’s instructions. The remaining RNA was then used for construction of an RNA sequencing library using a TruSeq RNA Sample Prep Kit (Illumina, San Diego, CA, USA). The prepared sample was sequenced on an Illumina HiSeq 2500 Sequencing System, generating approximately 40 gigabase pairs of DNA sequence. BLAST results indicated that sequences of the seven viral fragments produced after contig assembly were homologous to, but distinct from, those of the criniviruses lettuce chlorosis virus (LCV), cucurbit yellow stunting disorder virus (CYSDV), and cucurbit chlorotic yellows virus (CCYV). Four of the seven sequences were related to RNA1, and the other three were related to RNA2. Reverse transcription polymerase chain reaction (RT-PCR) was used to reamplify the complete sequences of those seven fragments from an RNA template isolated from tetterwort. The PCR amplicons were sequenced directly and shown to be consistent with the original sequences. This indicates that the tetterwort plant was probably infected with an uncharacterized crinivirus. To obtain the continuous sequence, specific primers originating from two adjacent fragments were designed to amplify the missing regions. All PCR procedures for joining the fragments were performed with Pfu-X DNA Polymerase (Solgent, Daejeon, South Korea). The 5′-terminal sequences of both RNAs were determined using a previously reported method [7]. The 3′-terminal regions of the bipartite genomes were generated with the addition of poly(A) tails to the purified RNA with a poly(A) polymerase tailing kit (Epicentre), which was then used as the template for RT with an oligo(dT)18 primer. The nucleotide sequences of RNA1 and RNA2 of TVCV were deposited in GenBank under accession numbers KR002686 and KR002687, respectively. Open reading frames (ORFs) were identified in RNA1 and RNA2 with ORF Finder (http:// http://www.ncbi.nlm.nih.gov/gorf/gorf.html), and transmembrane helices in the predicted proteins were identified with the TMHMM program (http://www.cbs.dtu.dk/services/TMHMM/) [8]. The deduced amino acid sequences of selected proteins were aligned using ClustalW [9] to construct phylogenetic trees by the neighbor-joining method using MEGA 6 (Fig. 2) [10].

Sequence properties

TVCV RNA1 is 8467 nucleotides (nt) long and contains four ORFs, designated (by analogy to other criniviruses) as ORF1a (nt 67–6045), ORF1b (nt 6044–7561), P6a (ORF2, nt 7577–7738) and P19 (ORF3, nt 7741–8232). ORF1a encodes a predicted polyprotein of 229 kDa, with papain-like protease (P-PRO), methyltransferase (MT), and helicase (HEL) domains (Fig. 1). In an alignment of the TVCV P-PRO sequence with those of other criniviruses, the presumed cleavage site of the putative protease domain is located between Gly476 and Val477, and the two catalytic residues identified in the protease domain are Cys408 and His457. Four transmembrane helices (amino acids 1240–1259, 1266–1283, 1287–1309 and 1353–1375) were predicted in the region between the MT and HEL motifs. As in most other criniviruses, ORF1a terminates with UUUGA and overlaps ORF1b, which putatively encodes an RNA-dependent RNA polymerase (RdRp) that is presumably expressed through a +1 ribosomal frameshift. P6a shows low similarity to its orthologue in bean yellow disorder virus (BnYDV) and includes a transmembrane helix. P19 encodes a putative 19-kDa protein with no significant sequence homology to any other viral protein available in the public databases. TVCV RNA2 (8113 nt) includes the hallmark gene array of the family Closteroviridae, encoding a 62.7-kDa heat shock protein 70 family homologue (Hsp70h, nt 1331–3004), a 22.7-kDa major coat protein (CP, nt 5055–5780) and a 54-kDa minor coat protein (CPm, nt 5780–7204) (Fig. 1). P6b (6.4 kDa), P60 (59.7 kDa), and P9b (9.7 kDa), encoded between the Hsp70h and CP ORFs, together with P27 (26.9 kDa), encoded downstream from CPm, are similar to the corresponding proteins in other criniviruses. No transmembrane helix was found in these four ORFs (Fig. 1). Two unique small ORFs, encoding two putative proteins of 9.5 kDa (P9a) and 5.4 kDa (P5), were also identified at the 5′-terminal region of RNA2. P9a is likely to be a component of the membrane, whereas P5 has a transmembrane domain between amino acids 2 and 24, but its function is uncharacterized. The 3′-untranslated region (UTR) of TVCV RNA2 is 213 nt long and shares 82.1 % identity with that of RNA1, which is 235 nt long.

Fig. 1
figure 1

Genomic structures of TVCV RNA1 and RNA2. Rectangles represent open reading frames encoding putative viral proteins. The ORFs encoding proteins P6a and P19 in RNA1 and P5, P6b, P9a/b, and P27 in RNA2 are named according to the predicted molecular mass of each protein

Pairwise comparisons of the amino acid sequences of the highly conserved RdRp and CP of TVCV and those of other criniviruses indicated that TVCV is most closely related to BnYDV, sharing the highest amino acid identities of 75.8 % and 62.4 %, respectively, with the BnYDV orthologues. Phylogenetic analysis of the RdRp, Hsp70h, CP, and CPm amino acid sequences of criniviruses placed TVCV in a cluster including BnYDV, CYSDV, LCV, and CCYV (Fig. 2). In conclusion, the differences and similarities in the genome organization (ORF numbers, sizes, and positions) of TVCV and other criniviruses available in the database, as described above, support the proposition that TVCV is a new member of the genus Crinivirus [5].

Fig. 2
figure 2

Phylogenetic relationships between TVCV and other members in the family Closteroviridae. Phylogenetic trees were constructed based on the RdRp (A), Hsp70h (B), CP (C), and CPm (D) deduced amino acid sequences using the neighbor-joining method. The GenBank accession number of each sequence is given on the tree. Bean yellow disorder virus, BnYDV; beet pseudo-yellows virus, BPYV; blackberry yellow vein-associated virus, BYVaV; cucurbit chlorotic yellows virus, CCYV; cucurbit yellow stunting disorder virus, CYSDV; diodia vein chlorosis virus, DVCV; little cherry virus, LChV1; lettuce chlorosis virus, LCV; lettuce infectious yellows virus, LIYV; strawberry pallidosis-associated virus, SpaV; sweet potato chlorotic stunt virus, SPCSV; tomato infectious chlorosis virus, TICV; tomato chlorosis virus, ToCV