Viruses classified within the sole genus Orthotospovirus of the family Tospoviridae cause significant economic losses to many crops worldwide [9]. Orthotospoviruses are characterized by quasi-spherical enveloped particles of 80-120 nm in diameter that have a tripartite antisense or ambisense RNA genome, named L, M, and S according to their size. The first eight nucleotides of the 5ʹ-termini and 3ʹ-termini are highly conserved among the three RNA segments and can form non-covalently closed pseudo-circular structures [2]. With the advent of next-generation sequencing (NGS) technologies for virus discovery, the number of sequenced virus genomes, including orthotospoviruses, is rapidly expanding [11]. At present, at least 29 distinct orthotospoviruses have been identified [9]. Calla lily chlorotic spot virus (CCSV) was first discovered on calla lily (Zantedeschia spp.) showing chlorotic spots in Taiwan area of China [4]. Later, it was found on spider lily (Hymenocallis litteralis) and tobacco (Nicotiana tabacum) in Yunnan province of China [5].

In May 2017, several celtuce plants (Lactuca sativa var. augustana) showing chlorotic spots on newly emerged leaves and vein banding on older leaves were collected in Changshang County of Zhejiang Province, China (Fig. 1). A small RNA (sRNA) library was constructed using symptomatic celtuce leaves and sequenced by Hangzhou Lianchuan Biological Technology Co., Ltd. (www.lc-bio.com). A total of 10,313,188 clear sRNA reads were obtained and de novo assembled as described earlier [10]. A total of twenty-six contigs with high similarities to the genome of orthotospoviruses, e.g. CCSV and tomato zonate spot virus (TZSV), were identified. No contig or deduced amino acid sequence that was homologous to genomic or protein sequences from other plant-infecting viruses was identified. Using SeqMan Pro 7.1.0 (Lasergene, GATC Biotech), the twenty-six contigs were further assembled into four fragments, which represented the near full length sequence of RNA L (8869 nt) and M (4732 nt), and two small fragments (1684 and 1047 nt) of RNA S, respectively. The total RNA was extracted from symptomatic celtuce leaves using the Plant RNA Purification Kit (Tiangen Biotech, Beijing, China) and reverse transcribed into cDNA using the primer Tosp-AUAP that contains the eight conserved nucleotides among all orthotospoviruses (Supplementary Table 1). The full genome of this celtuce-infecting orthotospovirus was then amplified as described in the Supplementary text using the primers listed in Supplementary Table 1. 5’/3’ RACE was used to complete the sequences of the genome segments. The amplified fragments were ligated into the pEASY-Blunt vector (Transgen, Beijing, China) and sequenced. At least 4 clones representing each fragment were sequenced to exclude possible artificial mutations that may have been introduced during PCR amplification.

Fig. 1
figure 1

The symptoms caused by CCSV-Cel on the top (A) or lower (B) leaves of celtuce

Table 1 Relative protein similarities for CCSV-Cel RdRp, NSm, G, NSs, and N when compared to other orthotospoviruses and other CCSV strains

The RNA S, M, and L of this celtuce-infecting othotospovirus are 3146, 4734, and 8934 nt, respectively (accession nos: MG252780-MG252782). The RNA S has 30.4-72.5% overall nucleotide sequence identities with other orthotospoviruses (Supplementary Table 3). The NSs and N proteins share 15.31-89.13% and 16.55-90.25% amino acid (aa) sequence identities with that of known orthotospoviruses, respectively (Table 1). Moreover, the N protein has 91.9-93.27% aa sequence identities with the partial N protein of four other CCSV isolates (Table 1). With reference to the N protein criterion from the International Committee on Taxonomy of Viruses (ICTV) [1, 6], the N protein of this celtuce-infecting orthotospovirus has ≥ 90% aa sequence identity with that of CCSV; therefore, it should be classed as a divergent strain of CCSV, which we have named CCSV-Cel [China: Zhejiang: 2017]. The CCSV-Cel RNA M and L segments share 43.4-80.8% and 29.84-82.9% nucleotide sequence identities with the respective segments from known orthotospoviruses, respectively (Supplementary Table 3). The deduced amino acid sequence of the NSm and G precursor proteins share 35.24-89.32% and 28.99-89.94% aa sequence identities with that of other orthotospoviruses (Table 1). The amino acid sequence of the CCSV-Cel RdRp shares the highest aa sequence identity with of the corresponding sequence from CCSV-LJ1 [China: Yunnan: 2014] (93.55%) and the lowest identity with the RdRp from bean necrotic mosaic virus (BeNMV) (41.39%; Table 1). Phylogenetic analyses were also performed with the N, NSs, NSm, G, and RdRp protein sequences of CCSV-Cel as well as other orthotospoviruses (Supplementary Table 2) using MEGA 6 software [8]. In the reconstructed Neighbor-Joining phylogenetic trees, CCSV-Cel is closely associated with CCSV isolates and was clustered with CCSV, TZSV, and tomato necrotic spot-associated virus (TNSaV) in a sub-branch (Supplementary Figure 1). These results further confirmed that CCSV-Cel is phylogenetically related to CCSV.

Orthotospoviruses are characterized by a wide host range. For instance, the tomato spotted wilt virus (TSWV) can infect more than 800 species [7]. To date, including celtuce more than 27 species of plants have been reported to be susceptible to CCSV [3,4,5, 12]. Considering the wide geographic distribution and high degree of genetic diversity of CCSV, special attention should be paid to the damage that it may cause.