Introduction

Moringa oleifera is a tropical tree, commonly known as horseradish or drumstick tree, which is diploid with 28 chromosomes. The tree is indigenous to North-western India and is now distributed in Pakistan, the Philippines, Cambodia, Central, North and South America and the Caribbean Islands (Morton 1991). In India and other countries, immature pods, fresh leaves and flowers of this tree are used for culinary purposes (Oduro et al. 2008; D’souza and Kulkarni 1993), with medicinal uses of roots and seeds (Ruckmani et al. 1998; Lalas and Tsaknis 2002). The leaves and pods are known as potential sources of vitamins and minerals being used as raw, cooked and dried (Fuglie 2005). Anti-hypertensive, anti-pyretic and anti-microbial activities of the seeds of M. oleifera have been very well studied (The Wealth of India 1962; Oliveira et al. 1999; Lalas and Tsaknis 2002).

Molecular markers are very useful in taxonomic and genetic analysis. The most commonly used marker are restriction fragment length polymorphisms (Soller and Beckmann 1983), amplified fragment length polymorphisms (AFLPs) (Kim et al. 2002), random amplified polymorphic DNAs (RAPDs) (Jacobson and Hedren 2007), microsatellites or simple sequence repeats (SSRs) (Fang et al. 2012) and inter simple sequence repeats (ISSRs) (Zietkiewicz et al. 1994). These techniques have been extensively used for assessment of genetic diversity in a variety of annual and perennial plants like Zea mays (Garcia et al. 2004), finger millet (Panwar et al. 2010), artemisia (Badr et al. 2012). These markers have advantages and disadvantages, but produce reliable information on genetic diversity in genetically neutral regions (Karp 2002).

Cytochrome P450 genes are widely found in micro-organisms, animals and plants (Schalk et al. 1999) and play important roles in oxidative detoxification and the secondary metabolite production (Ohkawa et al. 1998). Cytochrome P450 gene-analogues have been successfully used as new genetic markers for genetic diversity studies in plants, which reflects both functional and genome-wide regions (Yamanaka et al. 2003b).

Previously, assessment of genetic diversity in M. oleifera has been done using AFLP (Muluvi et al. 1999; Muluvi et al. 2004) and RAPD (Mgendi et al. 2010) in accessions from Kenya and Tanzania, respectively; however, other methods have not been used on Indian Moringa oleifera cultivars. Quantitative variation in nutritionally important phytoconstituents has been found in leaves of M. oleifera cultivars (R. K. Saini et al., our unpublished data), indicating a considerable potential for improvement of this tree by breeding and selection for nutrient-rich food and wide range of adaptation to different climatic and soil conditions. In the present study, we therefore used RAPD, ISSR and cytochrome P450-based markers for detecting the genetic variability in eight cultivars of M. oleifera collected from various states of India.

Materials and methods

Plant material

Commercially grown cultivars of M. oleifera were collected from different states of India. Cultivars PKM-1 and PKM-2 were collected from Kerala Agriculture University, Thrissur (Kerala). Cultivar CO-1 was collected from Tamil Nadu Agriculture University, Coimbatore (Tamil Nadu). Cultivars GKVK-1 and Dhanraj were collected from University of Agriculture Sciences, Bangalore (Karnataka). Cultivars Bhagya and Amar-32 were collected from University of Horticultural Sciences, Bagalkot (Karnataka) and the Amar Seeds Pvt. Ltd, Pune (Maharashtra), respectively. Cultivar PAVM-1 was collected from a progressive farmer in Tamil Nadu (India). The collected seeds were grown in plastic pots (10 l size) containing soil, sand and vermicompost (product of composting using earthworms) (3:1:1 v/v) in a greenhouse at ambient temperature (26–28 °C) (Saini et al. 2012). The soil type was red silt loam with fine-silty characteristics, which is the best for Moringa cultivation. At 2 months interval after germination, 10 g N–P–K (15:15:15) were applied to each pot. Newly opened (unfolded) leaves from 1-year-old M. oleifera plants (cultivars) were used for DNA extraction.

DNA extraction, purification and quantification

All the reagents used for DNA extraction, purification and polymerase chain reaction (PCR) amplification were of molecular biology grade and purchased from Sigma-Aldrich (Bangalore, India). Oligos were synthesised in BioServe Biotechnologies Pvt. Ltd, Hyderabad (India).

Genomic DNA was extracted from leaves of the different M. oleifera cultivars by the CTAB method, according to Murray and Thompson (1980) with some modifications to eliminate phenolics. From each cultivar, 1 g of leaves were ground in liquid nitrogen to a fine powder in a pre-chilled mortar and transferred into a 30 ml centrifuge tube containing 10 ml of DNA extraction buffer containing 0.1 M Tris chloride (pH 8.0), 0.02 M EDTA (pH 8.0), 1.4 M NaCl, 2 % CTAB (w/v), 2 % polyvinylpyrrolidone (w/v) and 0.2 % β-mercaptoethanol (v/v) and incubated in a water bath at 65 °C for 60 min. The tubes were cooled at room temperature and an equal amount of chloroform: isoamyl alcohol (24:1) was added, mixed thoroughly by gentle inversion and finally centrifuged at 10,000g for 15 min at 20 °C. The upper aqueous layer was transferred to a fresh sterile centrifuge tube and 0.7 vol chilled isopropanol was added and incubated overnight at −20 °C. The tubes were centrifuged at 10,000g for 15 min at 4 °C and the pellet was washed with 70 % ethanol, air-dried and dissolved in 200 μl TE buffer and stored at 4 °C. For DNA purification, 5 μl RNase (10 mg/ml) was added to total isolated DNA (200 μl) and incubated at 37 °C for 60 min. An equal volume of phenol:chloroform:isoamyl alcohol (25:24:1) was added and mixed gently. The tubes were centrifuged at 10,000g for 5 min and the aqueous layer was transferred to fresh Eppendorf tube and 1/10 volume of sodium acetate (3 M, pH 5.2) and a double volume of chilled absolute ethanol was added. After 30 min, the mixture was centrifuged at 10,000g for 5 min and finally, the pellet was washed with 70 % ethanol, dried and dissolved in 200 μl TE buffer. For quantification of genomic DNA, the absorbance of the DNA samples was measured at 260 nm in a Nano Drop 1000 (Thermo Scientific). After quantification, the quality of the purified DNA was analysed in a 0.7 % (w/v) agarose gel.

RAPD, ISSR and Cytochrome P450-based marker analyses

A total 25 RAPD primers, 10 ISSR primers and 11 Cytochrome P450-based primers were used for the polymorphism survey. The RAPD assay was carried out in 0.2 ml PCR vials containing 1× reaction buffer, 200 μM dNTPs mix, 0.4 μM primer, 50 ng DNA template, 1 U Taq DNA polymerase and sterile distilled water to a final volume of 25 μl. The content was gently mixed by spinning for a few seconds. The PCR amplification was performed with a thermocycler (Eppendorf, Germany). The standardised amplification was performed at an initial denaturation at 94 °C for 4 min, followed by 40 cycles of denaturation at 94 °C for 30 s; primer annealing based on Tm for 1 min; primer extension at 72 °C for 2 min and final primer extension at 72 °C for 10 min. For ISSR analysis, ISSR primers were used and other conditions was the same as for RAPD. For Cytochrome P450-based markers, PCR amplification was performed using 0.4 μM forward primer and 0.4 μM reverse primer in a volume of 25 μl with same PCR conditions as that of RAPD. The annealing temperature of each primer was calculated online by OligoAnalyzer 3.1 (http://www.idtdna.com/analyzer/Applications/OligoAnalyzer) according to the sequence of the oligos.

Polymerase chain reaction amplified products were analysed by agarose gel electrophoresis using 1.5 % agarose in 1× TAE buffer. The amplicon sizes were measured with a 100–5,000 bp DNA ladder. Respective gels were stained with 10 ppm ethidium bromide followed by image capturing using a gel documentation system (Herolab, Germany). The procedure was repeated twice for each DNA set and reproducible primers were subjected for diversity analysis.

Data analysis

The various sizes of amplified products were scored for presence (1) or absence (0) in the eight cultivars to generate a binary matrix. Binary matrix data were analysed by the software NTSYS-pc, version 2.11w, to calculate the Jaccard’s similarity coefficient (using the simqual programme) and generate a dendrogram based on the unweighted pair-group method using the arithmetic average (UPGMA) method (Rohlf 2001). Genetic similarity between cultivars was calculated according to Jaccard’s similarity coefficient (Jaccard 1980). The computer programme WINBOOT was used to determine the robustness of the dendrogram, with 2,000 replications along with Jaccard’s coefficient (Yap and Nelson 1996).

Percent polymorphisms were calculated for each primer combination according to the formula: % Polymorphism = p/(m + p), where p is total number of polymorphic bands and m is the total number of monomorphic bands of the primer combination used. Multiplex ratio (MR) for each marker was calculated by using the formula: MR = (m + p)/n, where p is the total number of polymorphic bands, m is the total number of monomorphic bands and n is the total number of primer combinations used (Powell et al. 1996). Polymorphic information content (PIC) was calculated using the formula: PIC = 1 − ∑pi2, where ‘pi’ is the frequency of the ith allele. Average heterozygosity (Hav) was obtained by taking the average of PIC values obtained for all the markers. Marker index (MI) was obtained by multiplying the average heterozygosity (Hav) with the multiplex ratio (MR).

Results

RAPD analysis

Randomly amplified polymorphic DNA assays were performed with 25 random primers. Of these, 17 produced polymorphic and reproducible bands and were selected for further screening. In an assay of the 17 RAPD primers on the eight cultivars, 76 bands were scored in the size range 0.3–3.0 kb, with an average of 4.47 bands per primer. Of the 76 bands, 37 were polymorphic (Table 1), with 48.68 % polymorphisms. RAPD-15 was found most efficient with 87.50 % polymorphism out of eight bands. A representative RAPD profile obtained by primer RAPD-10 and 11 in the 8 cultivars of M. oleifera is shown in Fig. 1a and b. The PIC values, a measure of informativeness of a marker, were found in the range from 0.49 (RAPD-17) to 0.817 (RAPD-15) with a mean of 0.726 (average heterozygosity).

Table 1 Summary of genetic diversity obtained by RAPD primers
Fig. 1
figure 1

Profile of 8 Moringa oleifera cultivars 1 PKM-1, 2 PKM-2, 3 GKVK-1, 4 Dhanraj, 5 Amar32, 6 CO-1, 7 Bhagya and 8 PAVM-1 by Cyt P450, ISSR and RAPD markers. a RAPD-10, b RAPD-11, c ISSR-7, d ISSR-12, e Cyt P450 (Cyt 2F & Cyt 2R) and f Cyt P450 (Cyt 3F & Cyt 3R)

ISSR analysis

Inter-Simple Sequence Repeat (ISSR) assays were performed with 10 ISSR primers. Of these, six gave polymorphic and reproducible bands and were selected for further screening. In an assay of the 6 ISSR primers on the 8 cultivars, 35 bands were scored in the size range 0.2–2.5 kb, with an average of 5.83 bands per primer. Of the 35 bands, 17 were polymorphic (Table 2), with 48.57 % polymorphism. ISSR-6 was found most efficient with four polymorphic bands out of six bands (66.66 %). A representative ISSR profile obtained by primer ISSR-3 and six in the eight cultivars of M. oleifera is shown in Fig. 1c and d. The PIC values were found in the range from 0.744 (ISSR-4) to 0.871 (ISSR-12) with a mean of 0.810.

Table 2 Summary of genetic diversity obtained by ISSR primers

Cytochrome P450-based markers analysis

Cytochrome P450-based marker assays were performed with 11 set of Cyt P450-based (forward and reverse) primers. Of these, seven produced polymorphic and reproducible bands and were selected for further screening. Tests of 7 sets of Cyt P450-based primers on the 8 cultivars gave 30 bands in the size range 0.1–1.2 kb, with an average of 4.28 bands per primer. Of the 30 bands, 12 were polymorphic (Table 3), with 40.00 % polymorphisms. CYP1A1 F with CYP1A1 R was found to be most efficient with 2 polymorphic bands out of 2 bands (100.0 %). A representative profile obtained by primer Cyt 02 F with Cyt 02 R and Cyt 03 F with Cyt 03 R in the eight cultivars of M. oleifera is shown in Fig. 1e and f. The PIC values were recorded in the range from 0.444 (CYP1A1F with CYP1A1R) to 0.838 (Cyt 02 F with Cyt 02 R) with a mean of 0.689.

Table 3 Summary of genetic diversity analysis obtained by Cytochrome P450-based primers

Statistical analysis

Jaccard’s similarity coefficient, which reflects the genetic similarity among the cultivars, was calculated using binary data based on the RAPD, ISSR and cytochrome P450-based markers and is shown in Table 4. The Jaccard’s similarity coefficient ranged from 0.689 (between PKM-1 and PAVM-1) to 0.875 (between Dhanraj and Amar-32). The dendrogram, constructed according to the UPGMA method and based on the RAPD, ISSR and Cyt P450-based markers data, grouped the eight cultivars into two separate clusters (Fig. 2). The principal component analysis of RAPD, ISSR and Cyt P450-based markers revealed that the genotypes belonging to a particular sub-cluster were grouped together in the PCA plots (Fig. 3). The coefficient of correlation (Pearson’s correlation coefficients) was determined by a mantel test using matrices generated by these markers. The correlation coefficient between the ISSR and Cyt P450-based markers, between the RAPD and Cyt P450-based markers and between the ISSR and RAPD markers was 0.115, 0.097 and 0.0013, respectively. The correlation coefficient between Cyt P450-based markers and the combined RAPD, ISSR and Cyt P450-based markers, between the ISSR and the combined RAPD and ISSR and between the Cyt P450-based marker, the RAPD markers and the combined RAPD, ISSR and Cyt P450-based markers was 0.557, 0.237 and 0.580, respectively. The three molecular markers, ISSR, RAPD and Cyt P450, were compared on the basis of Percentage polymorphism, Multiplex ratio (MR), Average heterozygosity (Hav) and Marker index (Table 5). In relation to per cent polymorphisms, ISSR markers scored higher (48.57 %) than RAPD (48.68 %) and cytochrome P450-based markers (40 %). The Marker index (MI), which is a measure to evaluate the overall usefulness of a marker system considering all parameters, was highest for ISSR (4.726), compared to RAPD (3.246) and Cyt P450-based markers (2.954), making the former a highly efficient marker for the genetic diversity studies in M. oleifera.

Fig. 2
figure 2

UPGMA cluster analysis showing the relationship and diversity among eight cultivars of Moringa oleifera, produced by RAPD, ISSR and Cyt P450-based markers. Numbers at the node indicate bootstrap values

Fig. 3
figure 3

Representation of principal component analysis (PCA) for the eight cultivars of Moringa oleifera, produced by RAPD, ISSR and Cyt P450-based primers

Table 4 Jaccard similarity coefficients among eight cultivars of Moringa oleifera generated from binary matrix of RAPD, ISSR and Cytochrome P450-based marker

The eight cultivars of M. oleifera were grouped into two major clusters using a Jaccard similarity coefficient of 0.75. Cluster 1 comprises 5 cultivars; PKM-1, PKM-2, GKVK-1, Dhanraj and Amar-32. Cluster 2 contains 3 cultivars; CO-1, Bhagya and PAVM-1. Bootstrap analysis from the binary data was used to determine the robustness of the dendrogram and confidence values as percentages at each node. Very high bootstrap values (94.4 and 82.3) were obtained from the major nodes (Fig. 2), indicating the robustness of these groups.

Table 5 Comparison of RAPD, ISSR and Cyt P450-based molecular markers in evaluating genetic diversity of Moringa oleifera

Discussion

Information on genetic variability of wild and cultivated plants is important for identification, conservation and cultivar development (Demir et al. 2010) and DNA markers are the most appropriate to study the genetic diversity among a set of genotypes or cultivars. In the present study, the combination of non-functional (RAPD and ISSR) and functional (Cyt P450 based) marker systems were used to provide wider genome coverage and, therefore, will be a better indicator of the genetic relationships among the eight cultivars.

Assessment of genetic diversity in M. oleifera accessions has been reported using AFLP (Muluvi et al. 1999) and RAPD markers (Muluvi et al. 2004; Mgendi et al. 2010). Muluvi et al. (1999) analysed the genetic diversity by AFLP among 140 accessions of M. oleifera, collected from south India, Southern Malawi and Kenya and found the highest levels of genetic diversity within the Indian genotypes. They used only one cultivar (PKM-1) from India whereas the other genotypes were from natural populations. In our study, we used eight commercially used Indian cultivars, among these 40.00 % (by ISSR markers) to 48.68 % (by RAPD markers) polymorphisms were recorded. This is the first report on genetic diversity among commercially grown Indian cultivars. Similarly, Mgendi et al. (2010) investigated the genetic diversity by RAPD markers between and within cultivated and non-cultivated provenances of M. oleifera from coastal regions of Tanzania and found that wild provenances were more diverse then cultivated. In cultivated plants, genetic changes are slowed down because of reduced competition and natural selection, due to optimum environmental conditions (Manoko et al. 2008). However, there is lack of information for Indian cultivated genotypes since India is the centre of origin for M. oleifera. Similarly, there is a need to evaluate the efficiency of other molecular markers for proper assessment of genetic diversity in M. oleifera, particularly markers based on Cyt P450 genes to be exploited for assessing the intra-species diversity.

In the present study, we used RAPD, ISSR and Cyt P450-based markers to analyse the genetic diversity between eight Indian cultivars of M. oleifera”. The three molecular markers used detected an average of 46.80 % polymorphisms (genetic diversity) among the cultivars. RAPD and ISSR revealed an average of 48.68 and 48.57 % polymorphism across all cultivars. The Jaccard’s similarity coefficient was calculated based on RAPD, ISSR and cytochrome P450 markers and it was found in the range of 0.689 to 0.875. Similar values of this coefficient were obtained between cultivars of close relatives of M. oleifera, i.e., papaya and Phaseolus vulgaris (Stiles et al. 1993; Maciel et al. 2003).

In the cluster analysis, constructed according to the UPGMA method and based on the three sets of marker data, the cultivars were not clustered according to their geographical origin. This may be due to spread of planting materials and high rates of gene flow through cross-pollination since this plant is highly cross-pollinated and different cultivars were collected from different states of India, i.e., Karnataka, Tamil Nadu, Maharashtra and Kerala. These states are located in close proximity of each other. Several studies have reported similar observations in other plants, e.g., Andrographis paniculata and Olea europaea (Padmesh et al. 1999; Bronzini de Caraffa et al. 2002), whereas some other studies reported the grouping of genotypes in dendrogram on the basis of their geographical collection centres (Das et al. 2007; Panwar et al. 2010). The cluster (dendrogram) was bootstrapped by the WINBOOT program (Yap and Nelson 1996) with 2,000 replications as suggested by Hedges (1992) and the values obtained in terms of percentages were high (94.4 and 82.3) at major nodes, suggesting the robustness of the dendrogram.

The comparative analysis of RAPD, ISSR and Cyt P450-based marker systems revealed ISSR to be the best marker as it generated highest percentage of polymorphisms, marker index (MI), average heterogeneity (Hav) and multiplex ration (MR). In the present study, Cytochrome P450-based markers were not found very efficient, due to low value of PIC compared to RAPD and ISSR markers, but these new markers have been found very efficient in evaluation of diverse species such as rice and various endemic wild plants (Tanaka et al. 2001; Yamanaka et al. 2003a). Similarity Pearson’s correlation coefficients was found very low (R 2 = 0.115, between ISSR and Cyt P450-based markers), showing a very poor fit of similarity matrix between two markers. This indicates that the two sets of markers target different regions of the genome. Similar results of low levels of correlation between RAPD and ISSR markers were observed by Lalhruaitluanga and Prasad (2009) in Melocanna baccifera. However, Panwar et al. (2010) observed the high value of coefficient of correlation between RAPD and Cyt P450 gene-based markers.The dendrogram and PCA plots generated from the binary data matrices of the three marker systems was found highly concordant to each other. In dendrogram, all the eight cultivars were grouped in four major sub-clusters, similar pattern of grouping was recorded in PCA plot. The similar type of agreements between dendrogram and PCA plot was reported by Panwar et al. (2010).

The high genetic diversity (variability) found in the present study is directly linked with higher levels of biodiversity, which is useful for food security, productivity and ecological sustainability. Significant variability can be utilised in breeding programmes to produce high-yielding nutritionally superior cultivars with better adaptations to different climatic conditions. In India, major breeding programmes are on-going for the development of fast growing, disease resistant and high pod yielding cultivars of M. oleifera. Thus, cv. Bhagya (KDM-1) has recently been developed by University of Horticultural Sciences, Bagalkot (Karnataka) and it is gaining popularity among farmers, due to its high yield of pods. Similarly, in future, attention is required to develop cultivars for high foliage yield, since Moringa leaves are a rich and affordable source of major nutritional and nutraceutical phytoconstituents (Oduro et al. 2008).