Introduction

Potyvirus is the largest virus genus of the family Potyviridae, including nearly 200 definitive and tentative members [4]. Most potyviruses infect dicots and a few infect monocots. To date, only six grass-infecting potyviruses have been recognized: Maize dwarf mosaic virus (MDMV), Sorghum mosaic virus (SrMV), Sugarcane mosaic virus (SCMV), Johnsongrass mosaic virus (JGMV), Zea mosaic virus (ZeMV) and Cocksfoot streak virus (CSV), and the first five viruses forming the SCMV subgroup in the genus Potyvirus [1, 4, 16].

An isolate (isolate A) of a novel potyvirus from whitegrass (or flaccid pennisetum) (Pennisetum centrasiaticum) has been characterized and was tentatively designated as Pennisetum mosaic virus (PenMV) [7]. This virus not only infects whitegrass, but also infects maize (Zea mays) in the field and causes mosaic symptoms on the leaves of infected plants [10]. The variation in both nucleotide (nt) and deduced amino acid (aa) sequences of the capsid protein (CP) of some isolates from both whitegrass and maize has been investigated, and it was found that the extent of variation in maize isolates was slightly less than in those from whitegrass [9]. Although the available 3′ partial sequence data are helpful for the classification of PenMV, they are not sufficient to confirm its taxonomic status as a member of a distinct species in view of the fact that the potyviral CP cistron is not the best representative of the whole genome [3] and that there is the possibility of interspecific and intraspecific viral RNA recombination events, as reported recently [1, 6, 18]. The determination and analysis of the complete genomic RNA sequence of PenMV would be essential to discriminate it from other closely related potyviruses as well as to advance our understanding of its evolutionary history. In this paper, we report the complete nucleotide sequence of PenMV and compare the genomic and deduced amino acid sequences of this virus with those of other closely related potyviruses, and we also investigate the host range and major transmission vectors of this virus.

Materials and methods

Virus sources and maintenance

The virus isolates (B and C) were from the same batch of diseased whitegrass plants as the previously reported isolate (A) in Xinzhou city of Shanxi province in northern China [7] and maintained in sorghum (Sorghum bicolor) in an insect-proof greenhouse. Virus purification and mechanical and aphid transmission were performed as previously reported [7, 8].

RT-PCR amplification and molecular cloning

Virus purification was conducted according to methods described previously with some modifications, and viral RNA was extracted from the purified viral particles by the phenol method [7, 8, 12]. The primers for RT-PCR amplifications were designed based on conserved regions of the sequences previously determined for other related potyviruses or by previous sequencing results for this virus. The viral RNA extracted from purified virions was used as template, and oligo (dT) as the primer for reverse transcription. Reverse transcription and PCR were performed in a thermal cycler (Hybaid Sprint)programmed to give 1 cycle at 42°C (60 min); 1 cycle at 94°C (3 min), 5 cycles at 94°C (30 s), 45°C (30 s) and 72°C (3–4 min), 30 cycles at 94°C (30 s), 55°C (30 s) and 72°C (3–4 min); and a final cycle at 72°C (7 min) for amplifying most parts of the viral genome. The primers used for 3′RACE are 5′-GAT CAA CTT TTT TTT TTT TTT-3′ (reverse) and 5′-CCG AGC TCC ACT AGC AGA GAA CGT GTG TAG-3′ (forward), and those for 5′RACE are 5′-TGT GGT CTG CTA CTT TCA CCC A-3′ (reverse) and 5′-CAG GAT CCA AGC GGI GIG GGI GGG-3′ (forward) after the addition of a poly (C) tail to the 3′ terminus of the cDNA using terminal deoxyribonucleotidyl transferase (TdT) (Promega). The procedures for purification of PCR products, molecular cloning and transformation of plasmids to Escherichia coli DH5α cells were as previously reported [7, 8].

Phylogenetic analysis

The software DNAMAN (Lynnon BioSoft, Quebec, ver. 5.2.2) and ScanProsite (http://cn.expasy.org/tools/scanprosite) were used to analyze the nucleotide and amino acid sequences. The genomic sequence of PenMV was assembled and analyzed using the DNAMAN software. A BlastP analysis of the putative polyprotein deduced from the single large ORF was performed on GenBank sequences to obtain the closely related viral sequences. Comparisons were performed using the multiple sequence alignment and two sequence alignment programs (with a gap-opening penalty of 50 and a gap-extension penalty of three for nt comparisons and corresponding values of eight and two, respectively, for aa alignments) [3]. The former program was used to align all the sequences selected and to calculate the percentage identities between two sequences by using optimal full alignment. The previously reported potyviral genomic sequences (listed in the legend to Fig. 1) were used for the nucleotide and amino acid sequence comparisons.

Fig. 1
figure 1

Phylogenetic tree illustrating the relationships of PenMV with other potyviruses. This phylogram was generated by using the neighbour-joining method of DNAMAN ver. 5.2.2 from multiple alignment of genomic sequences. Horizontal distances are proportional to sequence distances (see scale bar), and vertical distances are arbitrary. The number at each branch indicates the percentage of 1,000 bootstraps, which support the grouping at that node. Bootstrap scores exceeding 90% are shown at major nodes. PVY was used as an outgroup in this phylogram. The acronyms present in the phylogram represent Cocksfoot streak virus (CSV, AF499738), Johnsongrass mosaic virus (JGMV, Z26920), Maize dwarf mosaic virus-BG (MDMV-BG, AJ001691), MDMV-SP (AM110758), Potato virus Y (PVY-N, D00441); Sugarcane mosaic virus-A (SCMV-A, AJ278405), SCMV-BJ (AY042184), SCMV-GD (AJ310105), SCMV-HN (AF494510), SCMV-HZ (AJ297628), SCMV-LP (AJ310102), SCMV-SD (AY149118), SCMV-SP (AM110759), SCMV-SX (AY569692), SCMV-XgS (AJ310103), SCMV-YH (AJ310104), Sorghum mosaic virus-H (SrMV-H) U57358, SrMV-XoS ( AJ310197) and SrMV-YH (AJ310198 )

Aphid transmission

Four aphid species Myzus persicae, Macrosiphum avenae, Rhopalosiphum padi and Schizaphis graminum (provided by Prof. Xifeng Wang of the Plant Protection Institute, Chinese Academy of Agricultural Sciences) were used for transmission tests. The aphids were raised on healthy cabbage (for M. persicae) or wheat plants (for the three other aphid species) free of viruses. The aphids were placed onto PenMV-infected whitegrass (and sorghum in another set of tests) leaf pieces for acquisition feeding of the virus isolate C for 1 h in a Petri dish after 2 h fasting, and then five aphids were moved onto each healthy two to three true-leaf seedling of sorghum (cv. Xin-Liang 52) (and maize cv. Ye-Dan 2 in another set of tests) overnight for inoculation feeding before being killed by insecticide. A total of 20 sorghum (or maize) plants were inoculated for each species of aphid. Mosaic symptoms on plants were observed 10 days post-inoculation.

Host range determination

Natural host range: The leaves of some graminaceous plants (maize, sorghum, whitegrass and Setaria viridis) showing mosaic symptoms in the suburban fields of Xinzhou city, Shanxi province, were collected and used as inoculum after being ground in a mortar with 0.05 m phosphate buffer (pH 7.0) to infect sorghum (cv. Xin-Liang 52) seedlings. These samples were also tested by both ELISA [5] and RT-PCR (using a pair of primers specific for the coat protein cistron of PenMV) to confirm the natural infection of field plants by PenMV [7, 10]. Antigen-coated plate ELISA was performed using polyclonal antibodies against the procaryote-expressed coat protein of PenMV (raised in our laboratory). A protein-A-alkaline phosphatase conjugate (Sigma) was used for binding to the antibodies, and p-nitrophenyl-phosphate (Sigma) was used as the substrate in the reaction. The absorbance was measured in a microplate reader (Bio-Rad) at 405 nm. The value for each sample is the average of two independent measurements. The reaction was considered a positive when the I/H value (the average absorbance of a sample minus that of the buffer control) was greater than or equal to three, and considered negative when I/H was less than three.

Experimental host range: Young leaves of a sorghum plant infected by PenMV-C (kept in an insect-proof greenhouse) showing typical mosaic symptom were used as inoculum after ground being ground in a mortar with phosphate buffer to infect the two true-leaf stage seedlings of 26 plant species/cultivars (23 plants each species/cultivar) belong to four families: Gramineae (or Poaceae) (15), Solanaceae (6), Leguminosae (3) and Chenopodiaceae (2) (see Table 4). Mosaic symptoms on plants were observed 10 days after inoculation, and all plants were tested by ELISA to confirm the infection by PenMV.

Assay of in vitro stability

The whitegrass leaves infected by PenMV-C showing mosaic symptom were ground in a mortar in 0.05 m phosphate buffer (pH7.0) and centrifuged briefly, and the supernatant was collected for the determination of the dilution end-point (DEP), thermal inactivation point (TIP) and longevity in vitro (LIV) of this virus isolate. Three replicates were performed for each treatment, and symptom development of inoculated plants was observed 10 days post-inoculation. The plant species used for inoculation after the below-mentioned treatments were maize (cv. Ye-Dan 2) and sorghum (cv. Xin-Liang 7). Five different temperatures in a water bath were used for the determination of DEP: 51, 52, 54, 56 and 58°C, and the crude virus sap samples in thin-walled Eppendorf tubes were kept separately at each of these temperatures for 10 min and then put into ice immediately before inoculating maize and sorghum. The three dilution gradients utilized for DEP determination were: 10−2, 10−3 and 10−4 -fold dilution in 0.05M phosphate buffer (pH 7.0). LIV was determined by inoculating plants with the sap after being kept at room temperature (20–25°C) for 0, 48, 60 and 72 h.

Results

Genome organization

The complete nucleotide sequences of the genome RNA of isolates B and C of PenMV were deposited into GenBank under the accession numbers NC_007147 and DQ977725, respectively. The genomic sequences of the two isolates are both composed of 9,611 nucleotides with 5′- and 3′-UTRs of 172 and 241 nt, respectively, excluding the 3′-terminal poly(A) tail. The base compositions of the genome of two isolates, C/B, are: adenine 35.1/34.8%, cystosine 18.8/18.5%, guanine 21.1/21.3%, and uracil 25.0/25.4%, which are similar to those of other potyviruses. There are three (four for isolate B) in-frame AUG triplets within the first 260 nt of the long single open reading frame (ORF), and the first triplet context CGCAAUGGC (TGTGAUGGC for isolate B) is the closest to the consensus sequence (AACAAUGGC) for translational initiation in plants [13]. Therefore, the first AUG triplet at nucleotide positions 173–175 is most likely to be the start codon of the PenMV polyprotein. The termination condon UGA is located at nt 9,368 to 9,370. Thus, this ORF consists of 9,198 nucleotides and has the capacity of encoding a polyprotein of 3,065 amino acids with a calculated molecular mass (M r) of 348,894 (349,101 for isolate B).

Nine putative protease cleavage sites were identified by comparing the putative coding region of PenMV with the consensus protease recognition motifs in other potyviruses (Table 1) [2]. The polyprotein is therefore highly likely to be processed into 10 smaller proteins by the three viral-encoded proteases P1, HC-Pro and NIa-Pro [2, 4, 15, 17]. This was supported by the presence of conserved motifs in this sequence that are required for the proteolytic processing of the potyviral polyprotein, such as the serine-type protease domain H166-X8-D175-X31-G207-D-S-G210 and the proteolytic domain F220IIRGR225 in P1, C592-X72-H665 in HC-Pro, and H2046-X34-D2081-X69-C2151 in NIa-Pro [2]. Among the nine putative cleavage sites, the HC-Pro/P3 junction (G706–G707) for this potyvirus is identical to those of other potyviruses sequenced to date, and the other cleavage sites were similar to those of MDMV, SCMV and SrMV, except for the junction between P3 and 6K1, which is E/H, a rare dipeptide recognition site for the potyviral protease NIa-Pro (Table 1) [2].

Table 1 Putative proteins obtained after cleavage of the polyprotein of PenMV

The motif M1873YG1875, which contained a tyrosine for NIa-VPg to be linked to the 5′-end of the viral RNA, was also found. Two putative phosphorylation sites T2883RAE2886 and T2930MMD2933 of protein kinase CK2, conforming to the motif T/S-X2-D/E [14], were found in the middle of the CP sequence, one of which is presumably involved in the cell to cell and long-distance movement of this virus as is the case with another potyvirus, Potato virus A [11].

Variation in isolates of PenMV

The genomic (9,611 nt) and polyprotein (3,065 aa) sequences of PenMV-C are most closely related to those of PenMV-B, with 88.7% nt and 96.8% aa identities, respectively. The 3′-UTR sequences of isolates B and C are 99.6 and 96.7% identical, respectively, compared with that of isolate A (AY172336) [7], and the CP amino acid sequences of isolates B and C are both 98.7% identical to that of isolate A. Thus, the three isolates undoubtedly belong to one virus species according to the current classification criteria for potyviruses [4].

Similarity of PenMV with other potyviruses

A BLAST search of GenBank (27 August 2007) using the polyprotein sequence (3,065 aa) of PenMV-C revealed that it was most closely related to SrMV, SCMV and MDMV in addition to PenMV-B. A phylogenetic tree was constructed from an alignment of the complete nucleotide sequences of monocot-infecting potyviral genomes with Potato virus Y (PVY) as an outgroup (Fig. 1), and this showed that PenMV clusters with SCMV, MDMV and SrMV. A similar phylogenetic tree was obtained from an alignment of the polyprotein aa sequences of the same potyviruses (data not shown). These results confirmed a relationship proposed previously on the basis of coat protein and 3′-UTR sequence data that PenMV is a member of the SCMV subgroup [7].

The pairwise sequence comparisons of the genomic nucleotide and polyprotein amino acid sequence identities showed that the PenMV was most closely related to SrMV-H (69.5% whole-genome and polyprotein ORF nt identity, and 77.5% CP and 72.1% polyprotein aa identity) (Table 2, Fig. 1). The current species demarcation criteria in relation to the genome sequence relatedness given by ICTV for the genus Potyvirus include CP aa identity of less than about 80%, complete genomic nt identity of less than 85%, and different polyprotein cleavage sites [4]. Another study showed that a value of 76% nt identity for either the complete ORF or the CP cistron is the most appropriate for separating species in the genus [3]. Therefore, whichever criteria we might employ, PenMV should be classified as a distinct potyvirus.

Table 2 Percentage identity in nucleotide (numbers before /) and the corresponding amino acid (numbers after /) sequences between PenMV isolate C and other monocot-infecting potyviruses.

Aphid transmission

Four to five out of 20 sorghum/maize seedlings developed mosaic symptom 10 days after transmission inoculation by each of the four aphid species M. avenae, M. persicae, R. padi and S. graminum, thus confirming the transmissibility of PenMV by these four aphid species.

Host range

Natural host range: All of the tested plants of maize, sorghum, whitegrass and S. viridis showing mosaic symptoms were confirmed to be infected by PenMV in the field by both ELISA and RT-PCR (Table 3). The band of predicted size of 0.9 kb of the amplified cDNA of the coat protein cistron of PenMV was visualized under UV light in agarose gel after electrophoresis (data not shown). Since only one plant of S. viridis was found to be infected in the field in this survey, the importance of this annual wild grass as a reservoir of the virus may be negligible. In contrast, most whitegrass plants were infected in the suburban fields of Xinzhou city; thus, this perennial wild grass could be the main source of primary infection of maize and sorghum by PenMV in that region.

Table 3 The assay for natural host range of PenMV

Experimental host range: The results of inoculation of the test plants by PenMV are listed in Table 4. These wild grasses infected by PenMV are potentially the natural hosts of the virus in the field and thus could be found to be the reservoirs of this virus in the future.

Table 4 The experimental host range of PenMV

Stability in vitro

The infectivity test of PenMV-C inoculum after various treatments showed that the virus isolate C had a thermal inactivation point of 53°C and a dilution end-point of 10−2, and its infectivity was retained in vitro for up to 48 h. The values of these in vitro properties of this virus are more or less similar to those previously reported for other members of the SCMV subgroup [16], indicating that the stability of this virus isolate is not very different from those of other closely related viruses.

Discussion

In the present study, we found that the homologous gene products have differing degrees of sequence identity and that PenMV is most closely related to MDMV, SrMV and SCMV, members of the SCMV subgroup of potyviruses. The cistron of the CI was remarkably conserved among the nt and aa sequences of PenMV and other members of the SCMV subgroup and was distinct from that of dicot-infecting potyviruses, while the NIb-coding region was the most conserved. Most parts along the genome of PenMV were most closely related to SrMV (such as P1, 6K1, CI, NIa-VPg, NIa-Pro and complete genomic sequences). It was noteworthy that some coding regions/UTRs (or proteins) of PenMV were most closely related to those of SCMV-XgS (such as NIb and 3′-UTR), and some parts were most closely related to the corresponding parts of MDMV-BG (such as P3, CP and 5′-UTR)(Table 2). Since the genomic sequences of PenMV are significantly different from those of all other known potyviruses, it should be assigned to the genus Potyvirus as a distinct species. In view of the similarities of the genomic and polyprotein sequences, the host range, and some other properties of PenMV to the SCMV subgroup, we believe that PenMV is a new member of this subgroup.

Since PenMV is now restricted to a small region (Shanxi province) in China, and the symptoms on its hosts are usually a relatively mild mosaic compared to that of SCMV [8, 10], its importance in agriculture has been relatively limited to date. With the change of cultural practices and continuing monoculture of a few cultivars of maize and sorghum in production in addition to the efficient transmission of the virus by four species of common aphids and the abundance of wild grasses serving as virus reservoirs, PenMV as an emerging virus has the potential to become a major pathogen in maize as SCMV [8, 10, 18] or, in the future, in sorghum.