Introduction

Eleusine coracana, commonly called finger millet and ragi, is an allotetraploid cereal, is widely cultivated in the arid and semiarid regions of the world. It is rich in protein (~6–13%) and calcium (~0.3–0.4%) [1] and serves as an important staple food for rural populations in developing tropical countries where calcium deficiency and anaemia are widespread [2]. On the other hand, some of the health benefits such as hypoglycemic, hypocholestrolemic and anti-ulcerative are associated with the millet [3]. DNA fingerprinting techniques are very useful in correct identification of taxa. The most commonly used marker systems are random amplified polymorphic DNA (RAPD) [4, 5], restriction fragment length polymorphism (RFLP) [6], amplified fragment length polymorphism (AFLP) [7], inter simple sequence repeats (ISSRs) [8] and microsatellites or simple sequence repeats (SSRs) [9]. Randomly amplified polymorphic DNA (RAPDs) have been extensively used for the assessment of genetic diversity in a variety of plants like Zea mays [10], barley [11], rice [12, 13], wheat [14], etc. Random Amplified Polymorphic DNA analysis is quick [15], however, problems with reproducibility in amplification of RAPD markers and with data scoring have been reported. SSRs are multi-allelic; they have high potential for use in evolutionary studies and studies regarding genetic diversity and relationships analysis in a variety of plants like wheat [16], sorghum [17] and Zea mays [18], etc. Cytochrome P450 mono-oxygenases are widely found in animals, plants and microorganisms [19]. Sequence diversity of P450 gene-analogues in different plant species have been studied and it has been reported that P450 gene-analogues can be used as new genetic markers for diversity studies in plants reflecting both functional and genome-wide regions [20, 21]. Data mining on Arabidopsis thaliana has resulted in the development of number of primer-sets derived from Cyt P450 genes, which could be used as universal tools for the assessment of genome-wide diversity in diverse plant species lacking relevant genetic markers [22].

Assessment of genetic diversity in finger millet (Eleusine coracana L.) was done using random amplified polymorphic DNA (RAPD) [2325], restriction fragment length polymorphism (RFLP) [26], and interspersed simple sequence repeat (ISSR) markers [23, 27], isozymes [27] and SSRs [28, 29]. Recently attempts have been made to develop genetic map of tetraploid finger millet (Eleusine coracana subsp. coracana) genome (2n = 4x = 36) using RFLP, AFLP, EST and SSR markers [29]. The availability of genetic map of E. coracana could be used to investigate the diversity of different genes and its possible locations to specific chromosomes in near future.

We have conducted this study to analyze genetic diversity present in the accessions of Eleusine coracana collected from different agro-climatic regions of India and Africa. We have used RAPD, SSR and Cytochrome P450 gene based markers for this purpose and the potential of these molecular markers was also compared.

Materials and methods

Plant material

A total of 83 accessions were used in the present study. Seed samples of 83 accessions of Eleusine coracana (collected from different regions of India and Africa (Map 1) were obtained from, Project Coordination Cell, All India Co-ordinated Small Millets Improvement Project, ICAR, UAS, GKVK, Bangalore. The pass port data was available with the accessions (Table S1).

DNA extraction

The genomic DNA of different accessions of finger millets was isolated by standard method [30] quantified and analyzed on agarose gel electrophoresis [31].

RAPD, SSR and Cytochrome P450 gene based markers analyses

A total of 18 random primers, 10 SSR primers and 10 pairs of cytochrome P450 gene based primers were used for the polymorphism survey. Randomly amplified polymorphic DNA assay was carried out in 25 μl reaction volume containing 50 ng DNA, 5 mM MgCl2, 2 mM each of dNTP, 10 μM random primers, 0.6 U Taq polymerase (Bangalore Genei pvt. Bangalore, India). Amplification was performed with thermal cycler (Eppendorf Germany).The standardized amplification was: Initial denaturation 95°C for 5 min. followed by 40 cycles of denaturation 94°C for 1 min; Primer annealing based on T m value for 1 min; primer extension at 72°C for 2 min; and final primer extension at 72°C for 7 min. For SSR analysis, 50 ng of template DNA was subjected to amplification with microsatellite primers, other components remaining same as that of RAPD. The annealing temperatures of the cycling parameter were readjusted for each microsatellite primers according to their calculated melting temperature (T m) based on the sequence composition [T m = 4° (G+C) + 2° (A+T) − 3°C]. For Cytochrome P450 gene based markers PCR amplification was performed as per the standard protocol using 50 ng of template DNA, 5 mM MgCl2, 2 mM each of dNTP, 10 μM of forward and reverse primers, 1.5 U Taq polymerase (Bangalore Genei pvt. Bangalore, India) in a volume of 25 μl.

PCR amplified products of all the primers were subjected to gel electrophoresis using 1.8% agarose gel in 1× TBE buffer. The fragment sizes, ranged from 0.3 to 4.0 kb were detected by comparing the amplicons with a 100 bp DNA ladder and EcoR1/HindIII double digest λ-DNA as marker (Genei Pvt., Bangalore, India) and the ethidium bromide stained gels were documented using Alpha Imager 1200TM (Alpha Innotech Corporation, USA). Duplicated independent DNA preparations for each sample were done and only major bands consistently amplified were scored.

Data analysis

The amplified products were scored for the presence (1) or absence (0) of bands of various sizes across the genotypes to generate a binary matrix. Data were analyzed using the NTSYS-pc version 2.11w software to calculate the similarity values and generate the phenogram [32]. The SIMQUAL program was used to calculate the Jaccard’s coefficients. Jaccard’s similarity coefficient was utilized for estimating the pairwise similarity between the operational taxonomic units (OTUs) using the formula Jaccard’s coefficient = N AB/(N AB + N A + N B) where, N AB is the number of bands shared by samples, N A represents amplified fragments in sample A, and N B represents fragments in sample B [33]. Similarity matrices were utilized to construct the UPGMA (unweighted pair- group method with arithmetic average) dendrograms, clustering was performed by sequential agglomerative hiearchical nested clustering, a distance based method, where series of successive mergers are used to group individuals with similar characteristics.

The product-moment correlation (r) based on Mantel Z-value [34] was computed to measure the degree of relationship between similarity index matrices produced by any two-marker systems. To determine robustness of the dendrogram, the data were bootstrapped with 1,000 replications along with Jaccard’s coefficient by the computer programme WINBOOT [35].

Polymorphic information content (PIC) was calculated using the formula PIC = 1 − ∑p 2 i , where p i is the frequency of the ith allele. Average heterozygosity (Hav) is obtained by taking the average of PIC values obtained for all the markers. Multiplex ratio (MR) for each assay was estimated by dividing the total number of bands (monomorphic—m, and polymorphic—p) amplified by the total number of assays (primer combinations employed—n) as per [36].

$$ {\text{MR }} = {\text{ m }} + {\text{ p/n}} $$

Marker index (MI) was obtained by multiplying the average heterozygosity (Hav) with MR.

$$ {\text{MI }} = {\text{ Hav }} \times {\text{ MR}} $$

Results

RAPD analysis

Randomly amplified polymorphic DNA assays were performed with a total of 18 random primers. Initial screening of 18 RAPD primers on 83 accessions showed that 7 of these were monomorphic. In total, 90 amplicons were scored in the size range of 0.15–3.5 kb. This gives an average of 8.18 bands per primer. Of the 90 bands, 53 proved to be polymorphic (56.17%) and hence informative (Table S2). RAPD-12 proved to be the most efficient with 81.81% of the 9 products being polymorphic. A representative RAPD profile obtained by primer RAPD-11 is shown in Fig. 1a. Of a total of 11 bands (0.15–3.5 kb), 6 are polymorphic (54.54%). Two distinct categories are visible. The first marked ‘M’ are monomorphic, and the second category marked ‘A’, a 1.2 kb band, is unique to genotypes from Africa (lanes 12–20, Fig. 1a), ‘B’, a 2.0 kb band, is unique to genotypes from Karnataka (lanes 1–4, Fig. 1a) and ‘C’, a 1.8 kb band, shared by accessions from Tamil nadu and Maharashtra (lanes 5–11, Fig. 1a). The PIC values, a reflection of allele diversity and frequency among the varieties, were not uniformly higher for all the RAPD loci tested. The PIC value ranged from 0.139(RAPD-21) to 0.487 (RAPD-011) with a mean of 0.280.

Fig. 1
figure 1

a RAPD profile of 83 finger millet genotypes generated by random primer (RAPD-11). The lanes represent, Lane L: 100 bp ladder; Lane 1–4 Karnataka, Lane 5–8 Uttar Pradesh, Lane 9–11 Tamil Nadu, Lane 12–20 African accessions and Lane M: λ-DNA marker. b SSR profile of 83 finger millet genotypes generated by primer (SSR-01). The lanes represent, Lane L: 100 bp ladder; Lane 1–4 Karnataka, Lane 5–8 Uttar Pradesh, Lane 9–11 Tamil Nadu, Lane 12–20 African accessions and Lane M: λ-DNA marker. c Cytochrome P450 gene based primer profile of 83 finger millet accessions generated by primer combination (Cyt 13(F) and cyt14(R). The lanes represent, Lane L: 100 bp ladder; Lane 1–4 Karnataka, Lane 5–8 Uttar Pradesh, Lane 9–11 Tamil Nadu, Lane 12–20 African accessions and Lane M: λ-DNA marker

Map. 1
figure 2

Geographical map of India and Africa showing sites of sample collection

SSR analysis

Ten SSR primers generated a total of 70 bands. Several polymorphic products specific to a particular geographic region could be observed. A representative fingerprint pattern generated by primer SSR-01 is shown in Fig. 1b. Out of 12 amplicons generated by this primer (size range 0.15–4.0 kb), eight were polymorphic. A 0.4 kb amplicon ‘A’ is present only in genotypes from Africa (lanes 12–20, Fig. 1b), and a 1.25 kb product is unique to genotypes from Karnataka (‘B’, lanes 1–4) and (‘C’, lanes 5–8, Fig. 1b) is shared by all genotypes except those from Uttar Pradesh (1.4 kb; Fig. 1b). Several other bands were shared by genotypes from most geographical regions. Such bands, exemplified by ‘D’, are shared by accessions from Tamil nadu and Maharashtra. The results are summarized in Table S3. As mentioned earlier, 5 SSR primers generated 70 bands (size range 0.15–4.0 kb), an average of 14 amplifications per primer. Fifty-one markers (70.19%) with an average of 10.2 per primer were polymorphic. PIC values for SSR primers were from 0.872 (SSR-02) to 0.922 (SSR-10) with an average 0.890.

Cytochrome P450 gene based markers analysis

All the 10 pairs of primers used in the present study were polymorphic, the number and size of polymorphic fragments revealed by each primer are presented in Table S4. A total of 103 scorable markers were yielded by the 10 pairs of polymorphic primers with an average of 10.3 bands per primer. Fifty-five markers (54.29%) with an average of 5.5 per primer were polymorphic. Cytochrome P450 gene based marker profile produced by the primer Cyt 13(F) and cyt14(R) is shown in Fig. 1c. Out of 11 amplified products, six are polymorphic. A (0.8 kb lanes 12–20, Fig 1c) and B (1.4 kb lanes 5–8, Fig 1c) are rare polymorphic bands as they are present in only genotypes from Africa and Uttar Pradesh, respectively. Another such product is indicated as ‘C’ which is absent from genotypes originating from Tamil Nadu and Karnataka. The PIC value ranged from 0.132 (Cyp2C19F and Heme2C19R) to 0.500 Cyt 015(F) and Cyt 016(R) with a mean of 0.327.

Statistical analysis

The binary matrix generated was used to calculate a similarity matrix based on Jaccard’s coefficient. The phenograms based on RAPD, SSR and cytochrome P450 gene based markers data indicate that accessions from different geographical regions are clearly distinguishable as separate clusters (Fig. S2a–c). All the phenograms confirm consistency of data. The principal correspondence analysis of RAPD, SSR and cyt P450 gene based markers (data not shown) reveal that all the genotypes belonging to a particular geographic region were grouped together in PCA plots except the genotypes from Uttar Pradesh as being most diverse. They are not only different from other genotypes but also from each other, and hence, cannot be grouped together. Mantel test [34] was employed to determine the coefficient of correlation between the similarity matrices generated by these markers. The coefficient of correlation between RAPD and SSR, RAPD and cyt P450 gene based primers and SSR and cyt P450 gene based primers was 0.675, 0.583 and 0.631, respectively. In E. coracana combined analysis of all the three markers (RAPD, SSR and cyt P450 gene based markers) grouped Uttar Pradesh genotypes into one tight cluster (Fig. S2d). Likewise, genotypes from Tamil Nadu and Karnataka formed distinct groups. But, genotypes from Africa appear diverse and showed relatedness to genotypes from Andhra Pradesh and Maharasthra. The coefficient of correlation between RAPD and combined RAPD, SSR and cyt P450 gene based primers, SSR and combined RAPD and SSR and cyt P450 gene based primers and cyt P450 gene based primers and combined RAPD and SSR and cyt P450 gene based primers was 0.607, 0.731 and 0.725, respectively. The three molecular marker systems were compared on the basis of different criteria (Table 1).

Table 1 Comparison of various molecular markers in evaluating genetic diversity of finger millet

Dendrogram (unweighted pair group method analysis) clustered the finger millet accessions into two major groups, one comprised the accessions collected from north or northeast regions of India, and the other comprised five subclusters and the accessions collected from different states of southern India and Africa. In case of polymorphism detection, SSR as a tool scored higher than RAPD and cytochrome P450 gene based markers. Marker index ‘MI’ a measure of overall efficiency of a marker is highest for SSR, making it the most efficient marker for the genetic diversity studies in E. coracana. Mantel test was also employed to analyze the ‘goodness of fit’ for the UPGMA dendrograms generated by each marker system. This was done by making the cophenetic similarity matrices from each UPGMA phenogram and then comparing these cophenetic similarity matrices with the original similarity matrices (generated from binary data) for each marker technique. It revealed values higher than 0.90 for all the markers used [RAPD (r = 0.994, P = 0.01), SSR (r = 0.987, P = 0.01) and cytochrome P450 gene based markers (r = 0.940, P = 0.01)], thus confirming their authenticity and very good fit of clustering. Bootstrap analysis of this data was performed to determine the confidence values as percentages at each node. Very high bootstrap values were obtained from the major nodes.

Discussion

DNA markers have been used to evaluate genetic diversity in different crop species [37]. DNA fingerprinting is a routine method employed to study the extent of genetic diversity across a set of germplasm or cultivars and group them into specific categories. In general, random amplified polymorphic DNA (RAPD) is increasingly being employed in genetic research owing to its speedy process and simplicity [4]. In case of finger millet besides several phenotypic/morphological descriptors various molecular markers have been utilized for characterizing the germplasms. Based on seed colors as phenotypic descriptor, attempt has been made to analyze 36 genotypes of finger millet (Eleusine coracana (L.) Gaertn) and correlate with protein and calcium contents [38]. Several attempts have been made to assess the genetic diversity in finger millets using different DNA based molecular markers namely RAPD, RFLP and SSR. There are no reports of markers being used in characterization of accession of finger millet so far using markers based on Cyt P450 genes based markers. The rapidly evolving gene sequences of mixed function oxidases like cytochrome P450 can be exploited for assessing the intra-species diversity amongst finger millet genotypes.

Geographically isolated populations of plant species tend to accumulate genetic variations during the course of environmental adaptations. Plant species grow in greatly diverse environments. There is not only similarity, but also specificity or diversity as to the mechanisms employed by them in perceiving environment signals and adapting to the growth environments. In the present study distinct genetic identity present between genotypes was highlighted using DNA amplification profile procured from separate geographical regions similarly genetic divergence and phylogenetic analysis of genus Jatropha widely distributed in India has been done based on nuclear ribosomal DNA ITS sequence [39].

All the molecular markers, used in the present study detected polymorphism percentage ranging between 20 and 80%. RAPD revealed an average of 56.17% polymorphism across all accessions. Fakrudin et al. [24] also reported 85.82% of polymorphism in 12 finger millet accessions with RAPDs. Such high percentage polymorphism (91%) is also reported in 32 accessions of E. coracana by Babu et al. [25]. SSR revealed an average of 70.19% polymorphism across all accessions. Similar results were also reported by Dida et al. [28] in E. coracana. Cyt P450 genes based markers revealed an average of 54.29% polymorphism across all accessions. Among the three marker systems employed, 18 RAPD primers produced a total of 90 markers whereas SSR and 10 pairs of cytochrome P450 gene based primers produced 70 and 103 markers, respectively. The level of polymorphism revealed by SSR (70.19%) is higher than RAPD (56.17%) followed by cytochrome P450 gene based markers (54.29%).

In the present investigation, the average number of fragments amplified by RAPD primers among the 83 accessions was 8.18 with a range of 1–15. Fakrudin et al. [24] reported an average of 6 markers amplified by each primer in Eleusine coracana. Such a high variation in the number of fragments produced by these arbitrary primers may be attributed to the differences in the binding sites throughout genome of the genotypes included. Cytochrome P450 gene based primers generated 2–13 markers with average of 10.3 per genotype. Earlier, generation of 7 markers in Cymbopogon species by cytochrome P450 gene based primers were reported by Kumar et al. [40].

Comparison of PIC values for three marker systems (a parameter associated with the discriminating power of markers) indicated that the range of PIC values for RAPD primers was from 0.139(RAPD-21) to 0.487 (RAPD-011) with a mean of 0.280. PIC values for SSR primers were from 0.872 (SSR-02) to 0.922 (SSR-10) with an average 0.890. The comparison of the average PIC values of cytochrome P450 gene based primers revealed that the lowest was with 0.132 (Cyp2C19F and Heme2C19R) and the highest was with 0.500 Cyt 015(F) and Cyt 016(R) with a mean of 0.327.

The coefficient of correlation between RAPD and SSR, RAPD and cyt P450 gene based primers was 0.675, 0.583 and 0.631, respectively. RAPD and combined RAPD,SSR and cyt P450 gene based primers, SSR and combined RAPD and SSR and cyt P450 gene based primers and cyt P450 gene based primers and combined RAPD and SSR and cyt P450 gene based primers was established by means of high value of matrix correlation value of 0.607, 0.731 and 0.725, respectively. Similar results have been obtained with RAPD and ISSR in Vigna umbellate by Muthusamy et al. [41]. Hence, all the marker systems RAPD, SSR and cytochrome P450 gene based markers either individually or combined can be effectively used in determination of genetic relationships among finger millet genotypes.

Cluster analysis was carried out on four sets of marker profiling data based on (1) RAPD, (2) SSR, (3) cyt P450 gene based primers, (4) combination of RAPD, SSR and cyt P450 gene based primers. Dendrogram clustered the finger millet accessions into two major groups, one comprised the accessions collected from north or northeast regions of India, and the other comprised five subclusters and the accessions collected from different states of southern India and Africa. In E. coracana individual and combined analysis of all the three markers grouped Uttar Pradesh genotypes into one tight cluster. Likewise, genotypes from Tamil Nadu and Karnataka formed another two distinct groups. But genotypes from Africa appear diverse but showed relatedness to genotypes from Andhra Pradesh and Maharashtra. Distinct clusters were formed by the genotypes of southern India and northern India. The bootstrap values for the phenogram generated from combination of RAPD, SSR and cyt P450 gene based markers are 100% for the clusters (Africa, Tamil Nadu and Andhra Pradesh). Das et al. [42] detected highest similarity coefficient in the genotypes collected from southern India and the least from northern India. The results of present investigation also suggest the close relationship within the genotypes collected from different states of southern India as they are showing highest similarity coefficient and clustered together along with African genotypes. Several reasons could be attributed to the low intra-zonal diversity detected in the present investigation. First, the genotypes have genetically adapted to local environment and have become homogenous as a result of ‘inbreeding’ or exchange of genetic material within the said population. Thus, in the present investigations, unique DNA amplification profile brought out a distinct genetic identity belonging to genotypes of separate geographical regions.

In the present investigation, various high intensity bands were amplified, like a 0.4 kb band amplified by primer SSR-01, a 1.2 kb band amplified by primer RAPD-011 and a 0.8 kb band amplified by primer (Cyt 13(F) and cyt 14(R) only in African accessions. The region-specific bands detected in our studies can be used for the authentication of this crop. Das et al. [42] have already exploited RAPD for confirming the genuineness of E. coracana. Salimath et al. [23] have detected ISSR and RAPD primers which can discriminate between accessions of E. coracana from Africa and Asia. Microsatellites detect high level of polymorphism, which can be used to discriminate between closely related genotypes [43].

The comparison of the three molecular marker systems revealed SSR to be the best as it generated highest percentage polymorphism and high MI. Phenograms generated from RAPD, SSR and cytochrome P450 gene based markers data showed similarity in relative placement of genotypes. SSR and cytochrome P450 gene based markers provided a more detailed idea on the genetic relationship.

The cophenetic correlation values of Mantel test are above 0.90 for each marker system. This validates the phenograms obtained after UPGMA analysis of binary matrices data. These findings can be used in authentication of the nutritionally important crop and perhaps in recognizing high yielding population in a particular area. Exploration and evaluation of diversity among these germplasm would be of great significance for in situ conservation and finger millet breeding programmes.