Introduction

Watermelon mosaic virus (WMV) is a member of the genus Potyvirus and the family Potyviridae with approximately 750 nm long flexuous filamentous particles [20]. WMV appeared to share high sequence similarity with Soybean mosaic virus (SMV) [6]. The genome of WMV is a single, positive-sense RNA of around 10035-nt long. It contains a unique large open reading frame coding for a polyprotein of 3217 amino acids; it contains nine putative cleavage sites, yielding 10 putative functional proteins that have motifs conserved among homologous proteins of other potyviruses [6].

WMV causes economically important diseases on several horticultural crops, mostly cucurbits and some legumes, including pea [23] and orchids, vanilla [28], and Habenaria radiata [13]. It infects more than 170 plant species belonging to 27 families experimentally [26], including many weeds, that can host the virus between crops.

Sequence comparisons of coat protein (CP) genes can be used to identify and differentiate distinct potyviruses and their strains [24, 25]. Several studies have established phylogenetic relationships between strains of potyviruses based on CP gene sequences, e.g., potato virus Y [27], bean common mosaic virus [22], plum pox virus [4], papaya ring spot virus [29], lettuce mosaic virus [31], and zucchini yellow mosaic virus [7]. The sequences of other regions of the potyvirus genome, such as the 3′ nontranslated region (3′NTR), also can be used to discriminate strains within a virus species [11, 22].

Attempts have been made to analyze WMV variation at the nucleotide and amino acid levels. These studies have focused on analysis of the sequence of the P1, CP, and cylindrical inclusion (CI) genes. Moreno et al. [18] characterized 44 WMV isolates from Spain and demonstrated that variability existed in the CP and the cylindrical inclusion (CI) proteins.

Although WMV has long been known to occur in several regions of Iran [9, 16, 19, 21], a 218-nucleotide sequence of the CP N-terminal region of eight isolates from Iran has just recently been reported [8]. The goals of the present study were amplification of CP gene and assessment of the genetic variation among the CP sequences of the representative isolates collected from several regions of Iran. The availability of this geographically diverse range of WMV isolates and our observation that some of these isolates appeared to induce different symptoms in test plants prompted us to generate information on the molecular and biological variability of WMV.

Materials and methods

Collection and maintenance of virus isolates

In 2004, 178 and 127 samples in January, February, and April from Kerman and Hormozgan provinces, respectively, were monitored for virus identification. In 2005, 231, 136, and 116 samples during May to August from Yazd, West Azarbayjane, and Esfahan provinces, respectively, were conducted. Samples were collected from different cucurbit plants including cucumber, summer squash, winter squash, melon and weeds in the fields and greenhouses in major cucurbit growing regions of Iran. The cucurbit producing areas included were the southwest (West Azarbayjane province), center (provinces of Yazd and Esfahan), and southeast (provinces of Kerman and Hormozgan) (Table 1). The southwest areas of Iran are subject to severe winters, with average daily temperatures below freezing, and warm summers. The southeast of Iran has mild winters, with average January temperatures ranging from 7 to 18°C, summers are very humid and hot, with temperatures exceeding 40°C during July. The central part of Iran is marked by hot and dry summers and sporadic rainy winters, which is characterized by dry, warm winds in May to June. Cucurbit is grown as a summer crop in central and southwest of Iran; however, in the interior areas in the subtropical regions of the country in the southeast that is being frost-free in winter, cucumber is also grown as a winter crop.

Table 1 Origins of WMV isolates from Iran, used in this study

A total of 788 samples comprised of 757 cucurbit and 31 weed plants (Table 2) were collected from 64 locations. In each field or greenhouse, cucurbit and plants showing mosaic and other typical virus symptoms were collected. Each sample consisted of the two youngest fully developed symptomatic leaves from cucurbit vines and similar ones from the weed plants. WMV, Cucumber mosaic virus (CMV), Squash mosaic virus (SqMV), Papaya ring spot virus-type W (PRSV-W), and Zucchini yellow mosaic virus (ZYMV) infection in all collected plants was verified by the double antibody sandwich enzyme-linked immunosorbent assay (DAS-ELISA) [5] using the specific polyclonal antibodies (Bioreba, CH-4153 Reinach BL1, Switzerland). From WMV-positive sample set, 18 specimens were chosen according to hosts and agroecological considerations, and WMV isolates infecting these 18 specimens were biologically cloned by two local lesion passages in Chenopodium amaranticolor, followed by a single local lesion selection in C. quinoa. Such biologically cloned viruses are hereafter called isolates [18]. All WMV isolates for biological and molecular studies were maintained on Cucurbita pepo L.cv. Khoy in a temperature regulated insect-proof greenhouse. The isolates were named and numbered according to the abbreviated name of provinces and regions of collection (Table 1).

Table 2 Reaction of selected weed plant species to different viruses that infect Cucurbitaceaea by ELISA

Biological characterization of the WMV isolates

Eighteen isolates were characterized biologically by reaction to inoculation of a variety of range of test plants. These samples were ground in 1% (w/v) solution of K2HPO4 at pH 7.5 containing 0.01% Na2SO3, 2% polyvinylpyrrolidone (PVP), and 0.05% ethylenediaminetetraacetic acid (EDTA). The extracts were then mechanically inoculated onto the following 13 plant species: C. pepo L.cv. Khoy., Citrllus lantanus Thunb.cv. Crimson Sweet, Nicotiana clevelendii Gray., N. tabacum L.cv. Samsun.N, N. glutinosa L., N.debneyi L, N. bentamiana L, Datura metel L, Pisum sativum L, Phaseolus vulgaris L.cv. Red Kidney, C. quinoa Willd, C. amaranticolor Coste & Reyn and Cucumis melo L.cv. Asgrown. Five plants of each species or cultivar were mechanically inoculated. After inoculation, plants were kept in an insect-proof greenhouse at 15–25°C for symptom development. ELISA was used as well as symptomology to check for presence of the respective viruses in inoculated plants.

RNA isolation and RT-PCR

Total RNA from C. pepo L.cv. Khoy experimentally infected with each of the 18 WMV isolates was extracted using High Pure Viral Nucleic Acid Kit (Roche Biochemical, Germany) according to the manufacturer’s instructions. One microgram of total RNA (200 ng/μl) plus 2.5 μl of diethyl pyrocarbonate (DEPC)-treated water were incubated at 65°C for 8 min and chilled on ice for 3 min to denature the RNA. To the denatured RNA extract, 15 μl of RT reaction mixture containing 1× M-MLV RT buffer, 1.5 Mm each dNTP, 20 units of RNasin (Fermentas, Vilnius, Lithuania), 0.1 μg (0.1 μM) of antisense primer WMV-R, 5′-ATT CAC GTC CCT TGC AGT GTG-3′ designed from WMV-Fr sequence, accession number AY437609 corresponding to nts 9744-9724 at the C-terminal region of CP, and 200 units of M-MLV reverse transcriptase were added to provide a final volume of 25 μl. Samples were incubated at 42°C for 1 h followed by 95°C for 3 min to terminate the RT reaction. PCR was carried out using 2.5 μl aliquots of the cDNA preparation in 50 μl reaction mixture containing 5 μl of 10× PCR buffer, 1.5 mM MgCl2, 1 μl dNTP (10 mM each), 0.1 μM of antisense primer WMV-R, 170 ng (0.1 μM) of sense primer WMV-F, 5′-GAA TCA GTG TCT CTG CAA TCA GG-3′ designed from WMV-Fr sequence, accession number AF057533 corresponding to nts 8923-8945 at the N-terminal region of CP, Taq DNA polymerase (5 U/μl) (Fermentas) and deionized H2O. The single-step RT-PCR reaction was carried out in a Techne TC 312 thermal cycler (Techne, Cambridge, UK) using the following program: initial denaturing for 3 min at 94°C, followed by 30 cycles of denaturation for 60 s at 94°C, annealing for 1 min at 60°C, extension for 1 min at 72°C, and a final extension step at 72°C for 10 min. In all tests positive and negative controls were used. The negative control was including distilled water instead of DNA template. Ten microliters of each PCR product was analyzed by electrophoresis in a 1% agarose gel and visualized by ethidium bromide staining.

Cloning, sequencing, and phylogenetic analysis

The PCR products comprised of 804 bp of CP gene and 18 bp of the 3′ of the NIb gene were ligated into a pTZ57R/T vector using InsT/Aclone PCR Product Cloning Kit (Fermentas) according to the instruction of the manufacturer, and transformed into Escherichia coli strain DH5ά. Recombinant plasmids were extracted and nucleotide sequencing reactions were performed by Macrogen Company (South Korea) using dideoxy nucleotide chain termination method by universal primers for the plasmid.

The phylogenetic analysis of the 18 Iranian isolates were conducted by comparing the 778 bp of the CP gene and also the 200 nucleotides corresponding to 8922-9121 nts (accession number AY437609) coding for the N-terminal part of the CP gene with those of the other isolates of WMV obtained from GenBank (Tables 1, 3, and 4). The nucleotide and deduced amino acid sequences were compared with the counterpart sequences of WMV. Multiple sequence alignment was carried out using optimal alignment by DNAMAN software package (Lynnon, Biosoft, Quebec, Canada). Phylogenetic trees for grouping were constructed either by DNAMAN software package using similarity matrix and the neighbor-joining method and max parsimony (MEGA version 2.1) [15]. Tree branches were bootstrapped with 1000 replications. The validity of trees was evaluated by cophentic coefficient based on mantel test [17].

Table 3 GenBank accession number and origin of some of the previously reported WMV isolates used for phylogenetic comparison of a 804 nt long fragment of the genome including CP gene
Table 4 GenBank accession number and origin of some of previously reported WMVisolates/strains used for phylogenetic comparison of the 200 nucleotides long coding sequence of the N-terminal part of the CP gene

Results

WMV detection in the plant material

Of the 757 samples collected from cucumber, summer squash, winter squash and melon, tested by ELISA, 322, 190, 41, and 4 were infected with ZYMV, WMV, CMV, and SqMV, respectively. Double and triple infections involving different combinations of CMV, ZYMV, and WMV-2 were noted in 73 and 1 samples, respectively. WMV infection was detected only in one weed species (Colocynth; Citrullus colocynthis). Eighteen isolates of WMV were chosen as representative for further studies.

Biological characteristics of the WMV isolates

The isolates were compared with regard to symptoms produced on different test plants. All isolates infected N. bentamiana plants, systemically causing very mild mosaic. C. amaranticolor and C. quinoa reacted to all 18 isolates by production of local lesions in the inoculated leaves. Isolate KER.JI.1 caused a systemic mild mosaic on N. debneyii and vein banding and mosaic on D. metel (Fig. 1), whereas the remaining 17 isolates did not infect these two species. C. pepo and C. melo reacted differently to the isolates: ESF.ES.2, ESF.ES.3, ESF.ZA.2, ESF.ZA.3, and YAZ.MO.1 caused severe mosaic, leaf distortion, and vein clearing. Isolates ESF.ES.1, ESF.GA.1, ESF.GA.3, ESF.ZA.1, HOR.HA.1, KER.JI.1, KER.KE.1, URO.OS.1, and YAZ.SH.1 produced mild mosaic symptoms on both plants. The remaining isolates induced varying degrees of mosaic and vein banding (YAZ.MO.2), vein clearing (ESF.GA.2), severe mosaic (URO.NA.1), and vein banding (YAZ.TA.1) only on the inoculated leaves of C. pepo and C. melo. None of the isolates infected Citrllus lantanus, N. clevelendii, N. tabacum cv. Samsun.NN., N. glutinosa, Phaseolus vulgaris and Pisum sativum.

Fig. 1
figure 1

Mild mosaic in Nicotiana debneyii leaves caused by KER.JI.1 isolate (a). Vein banding and mosaic in Datura metel leaves due to infection with isolate KER.JI.1 (b)

Sequence data and identity matrix

The PCR-amplified fragment of approximately 822 bp was obtained by using primers WMV-F and WMV-R, when RNA isolated from 18 isolates was used as the template for the first-strand cDNA synthesis. The RT-PCR products were cloned and sequenced. Sequence information for these 18 WMV isolates has been submitted to NCBI-GenBank with the accession numbers from EU667627 to EU667644. The identity of WMV CP nucleotide sequences of the 18 Iranian isolates ranged from 92.7 to 99.9%. Sequence analysis demonstrated that 12 of the 18 isolates share 96.2–99.9% nucleotide homology with each other. The Iranian isolate KER.JI.1 in group II displayed the lowest (92.7%) and the highest (93.5%) nucleotide sequence identity with the isolates ESF.GA.1 and MAL99.2 in subgroup IA, respectively. The nucleotide sequence of HOR.HA.1 and YAZ.TA.1 isolates in subgroup IB showed the highest (94.9%) and the lowest (93.3%) nucleotide sequence identity with KER.JI.1, respectively. The two isolates, Habenaria from Japan and WMV-HLJ from China in subgroup IC, had the highest (95.5%) and the lowest (92.2%) nucleotide sequence identity with isolate KER.JI.1, respectively.

Identity levels in the amino acid of 14 Iranian isolates ranged between 97.4 and 100% (Table 5). The KER.JI.1 and YAZ.SH.1 isolates showed 91.6–93.1% and 92–93.4% identity at the amino acid sequence level with two other Iranian isolates YAS.TA.1 and YAZ.MO.I, respectively. The DAG box (Asp-Ala-Gly), which is believed to be crucial for potyvirus transmission by aphid, was found in the coat protein.

Table 5 Comparison of the percent nucleotide (top and right) and amino acid (bottom and left, in bold) identities among the 778 nt CP sequences of WMV isolated from Iran

Phylogenetic analysis

To depict the diversity of the sequences, phylogenetic tree was generated using the 18 Iranian WMV CP sequences with representative sequences present in GenBank. WMV CP sequences formed two different groups with a strong bootstrap value (98%) estimated from 1000 resamplings (Fig. 2). Seventeen of the Iranian isolates fell into group II, which can be divided into two subgroups A and B with a bootstrap value of 100%. The subgroup IB is a new subgroup and only four Iranian isolates from two provinces (Yazd and Hormozgan) were classified in this subgroup. The majority of the Iranian isolates fell into the large subgroups IA with bootstrap values of 74–93%. The Iranian isolates YAZ.SH.1, KER.KE.1, and ESF.GA.3 were placed in a distinct but smaller sub-group, 7 out of 13 isolates in subgroup IA were collected from different regions of Esfahan province formed a distinct small group according to the geographical region. The remaining three Iranian isolates, URO.NA.1, URO.OS.1, and ESF.GA.1, were placed in a distinct but larger group that included the majority of Spanish isolates.

Fig. 2
figure 2

Neighbor-joining tree illustrating the Phylogenetic relationships between the Iranian WMV isolates (Table 1) and other published WMV isolates (Table 3). The tree was drawn using the 778 bp CP sequences. Multiple sequence alignments of the nucleotide sequences were generated with DNAMAN program (Version 5.2.2; Lynnon Biosoft, Quebec, Canada). Numbers associated with nodes represent the percentage of 1000 bootstrap interactions supporting the nodes. The Iranian isolates are shown in bold. BZ is SMV and included as outgroup

Group II contained only the KER.JI.1 isolate that was distant from Iranian isolates of group II (92.7–94.9% nucleotide and 92–93.4% amino acid identities). Comparison 778 nt segment comprising the CP gene of the KER.JI.1 isolate with those of two isolates from Spain (BAD95.2 and SG99.1), one from USA (D13913), and one from Tonga (L22907) clustered together in group II showed 96.6, 96.6, 96.2, and 95.5% nucleotide and 94.9, 94.2, 95.99, and 94.5% amino acid sequence identities, respectively. Thirty-six isolates clustered in group I: the sequences of 11 isolates from Spain and the 17 isolates from Iran were 93.9–97.2% identical at the nucleotide and 97.1–98.2% at the amino acid identity levels.

Phylogenetic analysis based on 200 nucleotides coding for the N-terminal part of the CP with other isolates obtained from GenBank showed three distinct groups. Seventeen Iranian isolates clustered in group 1. There was only one Iranian isolate (KER.JI.1) within group 2 (Fig. 3).

Fig. 3
figure 3

Neighbor-joining tree illustrating the phylogenetic relationships between the Iranian WMV isolates (Table 1) and other published WMV isolates (Table 4). The tree was drawn using 200 nucleotides coding for the N-terminal part of the CP. Multiple sequence alignments of the nucleotide sequences were generated with DNAMAN program (Version 5.2.2; Lynnon Biosoft, Quebec, Canada). Numbers associated with nodes represent the percentage of 1000 bootstrap interactions supporting the nodes. The name of Iranian isolates is shown in bold letters. BZ is SMV and included as outgroup

Discussion

WMV is the most prevalent potyvirus infecting cucurbits in Iran [16, 21] and other countries [10, 14]. WMV has a relatively wide host range infecting plant in 23 families of dicotyledonous plants, and occurs naturally in cucurbit and legume crops [20]. In Iran, WMV has not been detected in non-cucurbit plants, except for a few weeds [16], and cucumber, watermelon, squash, and melon are by far the most important crop affected. Hence, the WMV population could be well represented by isolates recovered from cucumber, squash, and melon. Cucurbits in fields and greenhouses are grown in southwest, central, and southeast of Iran. In this work, care was also taken that most of the geographic areas and agroecological conditions where these cucurbits are grown be represented.

Phylogenetic trees were created using neighbor-joining (DNAMAN) and max parsimony methods for either 200 nt including NIb-CP gene fragment of WMV and 778 nucleotide long sequences of the coat protein gene. The cophentic coefficient of Fig. 2 which belong to group of isolates based on 778 bp was 0.97, which means the high validity of this dendenograms.

In agreement with the published data [18], existence of at least two groups of WMV isolates can be demonstrated in some districts of Iran. Group I included three subgroups A, B, and C. Isolates in subgroup IA were the most frequently detected isolates worldwide and included most of the Spanish melon isolates and Iranian cucurbit isolates. Most Iranian isolates of this subgroup were from melon (C. melo), cucumber (C. sativus), pumpkin (Cucurbita moschata), winter squash (C. maxima), and summer squash (C. pepo) (Table 1).

There were several distinct sub-clusters that could be related to geographical areas rather than to the host plants. For example, isolates collected from Esfahan (ESF.ZA.1, ESF.ZA.3, ESF.ES.3, ESF.ES.2, ESF.GA.2, ESF.ES.1, and ESF.ZA.2) clustered in subgroups IA, whereas isolates from Yazd province (YAZ.MO.1, YAZ.MO.2, and YAZ.TA.1) gathered in subgroups IB. Nonetheless, the subgroup IA was comprised of some sub-clusters, which did not strictly follow the geographical origin of the isolates. For instance, isolates ESF.GA.3 from Esfahan fell into a sub-cluster separate from the other isolates from this province. These data would imply movement of WMV isolates between or into provinces.

Subgroup IB also contains three out of four isolates collected from Yazd province. This subgroup, being reported for the first time herein, is comprised of only four Iranian isolates collected from summer squash, cucumber, and melon. According to the results of this study and those of Moreno et al. [18], the isolates of group I are the most widespread isolates of WMV.

Adams et al. [1] propose that 76–77% nt sequence identity at the CP is the optimal species demarcation criterion for potyviruses. Since the CP nt sequence identities between the WMV isolates were in the range of 92.7–99.5% (Table 5), all isolates studied here can confidently be assigned to one virus species, WMV.

Host range and symptoms have been used for the identification and differentiation of strains and pathotypes of viruses [26, 30]. Sap inoculation of 18 isolates of WMV onto a range of plants revealed differences in symptoms and host range. Some isolates showed similar biological properties, suggesting that they are closely related; others appeared to be exceptional strains, and these observations seemed to be consistent with their phylogenetic grouping. Isolates ESF.ZA3 and ESF.ES.2 were not only similar in symptoms and host range but also shared a CP amino acid sequence identity of 100%. YAZ.SH.1, KER.KE.1, and ESF.GA.3 isolates also caused mild symptoms on C. pepo and C. melo, and all are in the same sub-cluster of IIB. Based on reactions to the two test plants C. pepo and C. melo, the Iranian WMV isolates were clustered in three groups. The first group caused severe mosaic, leaf distortion, and vein clearing. The second group produced mild mosaic symptoms on both test plants and the third group depending on WMV isolates induced varying degrees of mosaic and vein banding and vein clearing. But the reaction of WMV isolated that was reported by Moreno et al. [18] on these two test plants was different. All mentioned isolates caused systemic mosaic and leaf lamina deformation in C. melo and systemic mosaic, leaf lamina deformation, and vein clearing in C. pepo. Therefore, the differences on symptoms may be related to cultivars of these two test plants.

Of particular interest is the case of KER.JI.1: although it was collected from the weed Colocynth, it had a varied host range and infected both N. debneyii and D. metel systemically. However, none of the other 17 Iranian isolates could infect these two test plants. In spite of different host range of this isolate, the CP amino acid sequence identity (>92%) of KER.JI.1 with other WMV isolates indicated that is an isolate of WMV, because the CP amino acid sequence identity among potyvirus isolates and between different potyviruses is >90% and <70%, respectively [2, 25]. Therefore, nucleotide sequence identity showed that KER.JI.1 is a unique isolate of WMV. We have been unable to ascertain the exact origin of this isolate but our results clearly show that it is more closely related to the Tonga, two Spanish, and American (US) isolates than to the other Iranian isolates of WMV (Fig. 2). The comparison made in the present work only reflects about one eleventh of the total genome of WMV; presumably, there are other differences in the remaining part of the genome of this isolate. Comparisons involving a full-length genome of this isolate are needed to clarify this. The other point is that no information about the reaction of other isolates placed them in group I is on N. debneyii and D. metel.

Phylogenetic analysis based on previous analysis of the 273 nt including NIb-CP gene fragment of WMV isolates [8] has shown three distinct molecular groups. In this article, phylogenetic analysis based on 200 nucleotides coding for the N-terminal part of the CP showed that all Iranian isolates clustered with previously reported WMV isolates in group I, whereas the KER.JI.1 isolate clustered in group 2 (Fig. 3). As mentioned by Desbiez et al. [8] also these three groups correlated with one motif at the N-terminal extremity of the CP. All Iranian isolates except KER.JI.1 and URO.OS.1 had a “KEA” motif at positions 3–5 in the CP. These two Iranian isolates (KER.JI.1 and URO.OS.1) had the “KET” motif instead of “KEA”.

The presented data provide useful information concerning WMV isolates found in Iran. These findings may be particularly relevant in light of recent reports of the presence of WMV in Spain. A more exhaustive analysis of pathogenicity determinants and vectors present in the same surveyed area should be carried out, considering that all of the Iranian isolates had the motif responsible for aphid transmission, which is similar to other aphid-transmissible potyviruses [3, 12].