Introduction

Cassava mosaic disease (CMD) is widespread in Africa and southern India, characterized by mosaic and deformation of the leaves. A begomovirus (family: Geminiviridae), was first shown to be associated with CMD in 1983 [1]. The genus begomovirus consists of large number of members; all transmitted by whitefly (Bemisia tabaci Genn.) to dicotyledonous hosts [2]. Based on their organization, they could be either mono- (containing only one DNA molecule called DNA-A) or bipartite (containing two DNA molecules: DNA-A and DNA-B). DNA-A codes for the coat protein (CP/AV1) and a protein of uncertain function (AV2) in the viral-sense strand and replication associated protein (Rep/AC1), transcriptional activator (TrAP/AC2), replication enhancer (Ren/AC3) and a possible silencing suppressor (AC4) in the complementary-sense strand. DNA-B codes for nuclear shuttle protein (NSP/BV1) and movement protein (MP/BC1) in the viral-sense and complementary-sense strands, respectively.

In India, two begomoviruses, Indian cassava mosaic virus (ICMV, [5]) and Sri Lankan cassava mosaic virus (SLCMV, [6]) are associated with CMD, according to surveys conducted in southern India, mainly in the states of Kerala and Tamil Nadu in the southwestern coast and in the south of the country. Of the two, SLCMV was found to be more prevalent in the cassava growing areas, while ICMV was found to be isolated in specific pockets [4]. Between DNA-A of ICMV and SLCMV, the most diverse region is the intergenic region (IR), (which does not contain any coding region, but contains the elements controlling replication and transcription) and the most conserved region is the CP. As such, SLCMV is believed to be a product of evolution, in which, from being a potential monopartite Begomovirus, it has acquired the DNA-B of ICMV [3]. Both ICMV and SLCMV are known to contain molecular variants [4].

Here, PCR-RFLP analysis of 38 CMD-infected cassava samples, collected over a period of 5 years (2002–2007) from cassava growing regions of southeastern, southwestern and southern India (Andhra Pradesh, Kerala and Tamil Nadu), was carried out, focusing on three different regions of the viral genome: IR, AC1 gene and AV1 gene, using three different enzymes. We conclude that (i) most samples indicated the presence of viruses which were recombination products of ICMV and SLCMV, (ii) IR region of ICMV and, AC1 and AV1 region of SLCMV is quite diverse and (iii) there exists a possible recombination hot-spot between the IR and AC1 region of the genome of ICMV/SLCMV.

Materials and methods

Source of virus-infected material

Field visits were made to cassava plantations and stem-cuttings were collected from apparently CMD-infected (depending on visual symptoms of leaf curling, crinkling and mosaic) cassava plants. These cuttings were maintained in soil inside an insect-proof greenhouse under a 16/8 h photoperiod, 30 ± 2 °C temperature and 70–80 % humidity.

Isolation of total nucleic acids, polymerase chain reaction and restriction digestion

Total nucleic acids were isolated from leaves sprouted from the cuttings and multiplex PCR was performed for differential detection of ICMV and SLCMV according to Patil et al. [4]. For PCR-amplification of IR, AC1, and AV1 regions (Fig. 1), primers were designed based on nucleotide sequence of ICMV and SLCMV DNA-A (Table 1). PCR amplifications were performed in a volume of 20 μl using Taq-DNA polymerase (New England Biolabs, USA) in a TC-512 thermocycler (Techne, UK) according to the manufacturer’s instruction. dNTPs were procured from Roche diagnostics (USA) and primers were commercially synthesized (Sigma-Aldrich, India). Thirty cycles of amplifications were performed with denaturation at 94 °C for 30 s, annealing at respective temperature for 30 s (Table 1), extension at 72 °C for 1 min; excluding initial denaturation for 5 min at 94 °C and final extension for 10 min at 72 °C. As positive control for ICMV and SLCMV, full length clones AY730035 and AJ579307 for ICMV and SLCMV, respectively were used. As a negative control, leaves from asymptomatic plant, maintained in insect-proof condition were used. Three four-base cutter restriction enzymes: HpaII (Fermentas, Lithuania), RsaI (NEB), and TaqI (Fermentas) were used for digesting the DNA completely according to instructions of the manufacturers.

Fig. 1
figure 1

PCR amplification to detect the begomoviruses from naturally infected cassava leaf samples. M size marker, 100 bp ladder, amplification using cloned DNA of ICMV and SLCMV are indicated as I and S respectively, (−), PCR using genomic DNA from asymptomatic plant

Table 1 ICMV-like (I) or SLCMV-like (S) PCR-RFLP patterns in the three regions of resident begomoviruses of mosaic-affected cassava leaves collected from three states of India

Electrophoresis

The PCR products for the detection of ICMV and SLCMV were resolved by electrophoresis according to [4]. For PCR-RFLP, a 7 % polyacrylamide gel in a vertical gel apparatus (Owl, USA), was used for electrophoresis for 5–6 h at 30 mÅ constant current. As size marker, ~100 ng of 50 bp DNA ladder (NEB) was used. After the electrophoresis was over, the gel was stained with ethidium bromide using standard procedure [7] and photographed in a UV–Vis gel documentation system (Quantity-One software, BioRad, USA).

Data-analysis

RFLP data were analysed using the PAST package in its excel-interface, scoring the bands “1” and “0” for the presence and absence, respectively, and similarity dendrograms were generated using the Jaccard’s similarity matrix in the same package in paired-group method. Visual modifications were done in the generated dendrograms in Microsoft Power-point for better clarity without altering the original results.

Results

Detection of ICMV versus SLCMV

Multiplex PCR for differential detection of ICMV from SLCMV was carried out to differentiate between ICMV and SLCMV, using the total DNA isolated from 38 different cassava samples as templates. All the 38 samples gave an amplicon of ~600 bp, representing the region flanked by 5′ end of AC1 and 5′ end of AV1, indicating the presence of only SLCMV in the samples, as reported earlier [4]. No sample showed a 900 bp amplicon, indicative of ICMV. This clearly shows that all the samples used for the analysis are infected by SLCMV alone (Fig. 1).

Detection of recombination between ICMV and SLCMV

The overall RFLP patterns (a representative shown in Fig. 2) for all the three PCR-amplified regions are summarized in Table 1 and Fig. 3. From this Table, it is clear that in case of samples #3, 5, 6, 9, 10, 11, 12, 15, 16, 30, 33, and 37 (12 out of 38), all the regions amplified gave an RFLP-pattern similar to that of SLCMV; but for the rest 26 samples, the three viral DNA regions showed patterns similar to both ICMV and SLCMV; 25 (#1, 2, 4, 7, 8, 13, 14, 17, 18, 19, 20, 21, 22, 24, 25, 26, 27, 28, 29, 31, 32, 34, 35, 36, and 38) contained IR region similar to ICMV, while the AC1 and AV1regions were similar to SLCMV. One sample (#23, shown in bold in Table 1) contained the IR and AV1 region resembling ICMV, while the AC1 region resembled SLCMV. The data indicate that the first 12 samples (#3, 5, 6, 9, 10, 11, 12, 15, 16, 30, 33, and 37) are SLCMV and the other samples are recombinants of ICMV and SLCMV. It is also clear in case of all the 26 recombinants that while two of the genomic regions analysed [AC1 and AV1; except in one case (sample no #23)] are similar to SLCMV, the IR is similar to ICMV. This indicates a recombination of the IR of ICMV in SLCMV background. The case of sample no #23 is a special one which might be a case of recombination of SLCMV AC1 in ICMV background. The fact that in none of the cases, AC1 resembling SLCMV was found in case of the recombinants, the AC1 gene of SLCMV might be preferred over the same of ICMV during evolution. Therefore, it is also evident that there is likely to be a recombination hotspot between the AC1 gene and the IR.

Fig. 2
figure 2

PCR-RFLP patterns of ICMV-type and SLCMV-type viral DNAs from naturally infected cassava samples (for AV1/RsaI). M size marker, 50 bp ladder, I cloned ICMV DNA control, S cloned SLCMV DNA control, U undigested PCR product. The sizes of the DNA markers are indicated in base pairs

Fig. 3
figure 3

Graphical representation of number of samples showing SLCMV-/ICMV-type PCR-RFLP patterns shown along the map of the viral genome. The numbers on the top of the bars indicate the number of samples showing a particular type of pattern. The ORFs and IR are shown as solid arrows; and the “ori” site is depicted as an open circle

Detection of micro-variants of ICMV and SLCMV

In detailed analysis of the dendrograms generated using the restriction pattern of all the three enzymes (HpaII, RsaI, and TaqI) for the IR, AV1, and AC1 separately (Fig. 4a–c), it was seen that the restriction pattern of IR from 26 samples were closer to the restriction pattern of IR of ICMV with more than 60 % similarity while the pattern in 12 samples was similar to the same of SLCMV. Out of the former 26 samples, 17 samples (#14, 4, 1, 8, 13, 17, 18, 21, 25, 26, 27, 28, 31, 32, 34, 35, and 36) showed 100 % similarity to ICMV. The rest of the samples shared different amounts of similarity to the known ICMV pattern, which ranged from 75 % (#23) to ~64 % (#7, 22, 2 and 20), revealing the presence of multiple micro-variants within the ICMV-like group. Similarly, among the SLCMV-like group consisted of 12 samples (#10, 3, 5, 9, 11, 12, 15, 16, 30, 33, 37, and 6), while 11 samples showed 100 % identity of patterns to SLCMV, one sample (#6) was a micro-variant of SLCMV, showing only ~77 % similarity to the same (Fig. 4).

Fig. 4
figure 4

Dendograms generated from the PCR-RFLP pattern. The known ICMV and SLCMV positive controls are marked in each, a IR, b AV1, and c AC1

Considering the restriction pattern of the AV1 amplicon, 37 samples showed at least 40 % similarity to that of SLCMV while only one sample (#23) showed ~70 % similarity to ICMV. Among the 37 samples showing similarity to SLCMV, none showed 100 % identity to SLCMV; however, five samples (#21, 4, 5, 8, 19; which are 100 % similar to each other) showed ~90 % similarity to SLCMV. Although multiple samples among the rest of them were identical to each other, as in case of the IR, micro-variants were found to be present among them sharing similarity ranging from a maximum of ~65 % to a minimum of ~41 %.

The restriction pattern of the AC1was similar to the pattern of IR of ICMV showing no significant similarity to any of the samples, while 28 samples (#3, 1, 7, 8, 9, 10, 11, 12, 13, 14, 15, 19, 20, 21, 22, 23, 24, 26, 27, 28, 29, 31, 32, 33, 34, 35, 36, and 37) showed 100 % similarity to that of SLCMV. As in case of IR and AV1, micro-variants, however were present in this case also. For example, while two variants (#2, 36) showed a maximum of ~90 % similarity to SLCMV, three samples (#4, 5, 6) showed only ~45 % similarity to SLCMV, the rest showing intermediate values of similarity.

Discussion

It was earlier reported, that although both ICMV and SLCMV were associated with CMD in India, SLCMV is more widespread than ICMV [4, 8]. SLCMV is speculated to have had a previous non-cassava host from which it might have jumped to cassava and “captured” ICMV DNA-B [3]. The speculation is based on the observation that SLCMV DNA-A, on its own, is able to produce symptoms in experimental plants such as Nicotiana benthamiana, indicating a potential monopartite nature. Considering these facts, and the observation here and elsewhere that SLCMV and SLCMV-like isolates predominate in southern India, SLCMV seems to be more successful as a pathogen as compared to ICMV. The reported properties of SLCMV, such as increased aggressiveness, capability of trans-replicating DNA components of other viruses and infecting new hosts with satellite molecules [3] might have important contribution in this regard.

Based on the discriminatory PCR standardized earlier [4] to distinguish between ICMV and SLCMV, it is clear that most of the samples contained SLCMV, not ICMV. However, the PCR-RFLP patterns indicated that IR region had more resemblance to ICMV in the same samples. PCR-RFLP analysis, which covers the region analyzed uniformly and reflects the inherent DNA sequence heterogeneity, has been used to determine micro-heterogeneity in natural populations of viral quasi-species earlier [9]. Hence, it can be concluded that these patterns indicate strongly that the viral DNAs are a product of recombination between ICMV and SLCMV.

Recombination is a major factor, along with mutation and re-assortment, in contributing to the broadening of host-spectrum of viruses [10, 11]. Within geminiviruses, examples of the above phenomenon include the recombinant cassava geminivirus EACMV-Uganada, [12], Maize streak virus strain MSV-A, a recombination product between MSV-B and MSV-G/F [13], Tomato yellow leaf curl virus isolates [14], and Cotton leaf curl Burewala virus [15]. The mechanistic process underlying rolling circle replication of geminiviral DNA is believed to be the prime determinant of recombination [16], which was also implicated in the observation of recombinant defective DNA molecules between ICMV and SLCMV in N. benthamiana [17]. Such processes may be responsible for the large number of recombinant geminiviruses in the recent past [1824].

The present report is the first detailed investigation on recombination among cassava geminiviruses in southern India, using a large sample size, and extends a similar studies conducted earlier [4, 8]. The samples used in this study were collected from State of Andhra Pradesh in the southeastern coastal area of India and represents new areas being brought under cassava cultivation, a region not surveyed extensively for CMD. The widespread occurrence of SLCMV and its recombinogenic tendency pose grave threats to the cassava crop in southern India, especially in the newly cultivated regions, which highlights the urgent need to take effective antiviral measures for this crop.