Introduction

India being one of the 12 mega biodiversity centers of the world has tremendous natural wealth in the form of over 7,500 medicinal plant species. This accounts for 20% of its total flora, which is much higher than an average of 12.5%, reported of the world flora. The traditional plant based medicines have now become quite popular in both developing and developed countries. The ever-increasing demand of herbal drugs is causing loss of precious biodiversity and also creating shortage of raw material. Since, there have been few studies on genetic diversity on Indian medicinal plants (Singh et al. 2002; Negi et al. 2004; Bahulikar et al. 2004); for efficient conservation and management of genetic resources, it is imperative to analyze the genetic composition of species of different phytogeographical regions. For this purpose, we have selected Tribulus terrestris (Puncture-vine), a herb of Zygophyllaceae, endowed with various medicinal properties. For example, fruits are used in curing urinary discharges, cough, asthma, pain, spermatorrhoea, ophthalmia, anemia, dysentery, skin and heart diseases; leaves purify blood and are used as aphrodisiac; and roots are good stomachic and appetizer. Its purported effects include increased luteinizing hormone release and thus testosterone production, increased sperm production, increased ejaculatory volume and increased libido. The original use for T. terrestris extract was as a ‘tonic’ to treat sexual dysfunction. It is an important constituent of various medicinal preparations like Dashmularishta, Tribestan etc. world-wide. The herb is a C4 summer annual, distributed in the tropics as well as sub-tropics. In India it occurs in the tropics and warm temperate region, up to an altitude of 5,400 m. It possesses solitary yellow flowers and globose fruits with hard and sharp spines and is propagated through seeds. There are several native species in the genus Kallstroemia (K. parviflora and K. californica, family: Zygophyllaceae) that could be confused with T. terrestris, because of their similar morphology; especially when the plants have not begun to flower and fruits are not present. Such plants, often get accidentally mixed with T. terrestris while collecting raw material for herbal drug preparations, making them less efficient or some times toxic. This lack of authenticated raw material is a major problem facing the herbal pharmaceutical industry which have severe consequences, starting from decrease in drug efficacy to cardiac arrhythmias and poisoning. DNA fingerprinting techniques are very useful in correct identification of taxa, and thus preparation of authenticated and effective drugs. Another application of molecular markers is in carrying out genetic diversity studies. Multi-locus markers are more advantageous than single-locus markers, as they have a high mutiplex ratio. Randomly amplified polymorphic DNA (RAPDs) have been extensively used for the assessment of genetic diversity in a variety of plants like Saxifraga cernua (Kjølner et al. 2004), Zea mays (Garcia et al. 2004), Ziziphus spp. (Singh et al. 2006), Saccharum and Erianthus (Selvi et al. 2006), Panax quinquefolius (Lim et al. 2007) etc. Inter-simple sequence repeat (ISSR) markers are much more informative than RAPDs and have been used for the analysis of genetic diversity in Cicer sp. (Souframanien and Gopalakrishna 2004), Morus alba (Awasthi et al. 2004), Pisum sativum (Baranger et al. 2004), Asparagus acutifolius (Sica et al. 2005), Corchorus species (Roy et al. 2006) and others. Amplified fragment length polymorphism (AFLP) has helped unravel genetic diversity in Azadirachta indica (Singh et al. 2002), Brassica nigra (Negi et al. 2004), Ranunculus acris (Odat et al. 2004), Nicotiana attenuate (Bahulikar et al. 2004), Brassica rapa (Zhao et al. 2005), Cicer sp. (Shan et al. 2005), Z. mays (Menkir et al. 2005), Cynodon (Wu et al. 2006), Glycine soja (Ru et al. 2006), Myricaria laxiflora (Liu et al. 2006) Gardenia jasminoides (Han et al. 2007), Chimonanthus spp. (Zhou et al. 2007) and others. Selective amplification of microsatellite polymorphic loci (SAMPL) though a modification of AFLP, detects more polymorphic loci. It is therefore more useful in cases with expected low level of diversity. Inspite of having manifold medicinal uses and world-wide distribution there has been no report on the extent of the genetic diversity prevalent in T. terrestris to the best of our knowledge. Therefore, we have conducted this study to analyze genetic diversity present in this valuable medicinal plant from five different agro-climatic regions of India. We have used AFLP, SAMPL, ISSR and RAPD markers for this purpose. The potential of these molecular markers was also compared.

Materials and methods

Plant material

Leaf samples from each of the single plant were collected from natural populations growing in Karnataka (six samples from Mysore), Uttaranchal (ten samples from Dehradun), Maharashtra (four samples from Amravati) and Rajasthan (four samples from Udaipur). Fifteen plants growing in the herbal garden at Hamdard University (Jamia Hamdard) represent the genotypes from Delhi.

DNA extraction

Genomic DNA was extracted from lyophilized leaf material following the modified protocol of Doyle and Doyle (1990) and stored at −20°C till further use. The DNA was quantified using agarose gel electrophoresis and employing serial dilutions of uncut λ-DNA as marker.

RAPD and ISSR analyses

Randomly amplified polymorphic DNA assay was carried out in 15 μl reaction volume containing 50 ng DNA, 5 mM MgCl2, 2 mM each of dNTP, 10 μM random decamers, 0.6 U Taq polymerase (Invitrogen). Amplification was performed with thermal cycler (Eppendorf Germany) using the cycling parameters of Das et al. (1999). One cycle of 94°C for 2 min, 37°C for 2 min, 72°C for 2 min followed by 30 cycles of 94°C for 1 min, 37°C for 1 min, 72°C for 1 min. The last cycle was followed by 10 min extension at 72°C. The amplified products were resolved in 1.2% agarose gel (0.5 × TBE), stained with ethidium bromide (10 μg/ml) and photographed under UV light. For ISSR analysis, 20 ng of template DNA was subjected to amplification with 7.5 μM of 3′- anchored microsatellite primers, other components remaining same as that of RAPD. Electrophoresis was performed on 1.2% agarose gel. The annealing temperatures of the cycling parameter were re-adjusted for each microsatellite primers according to their calculated melting temperature (T m) based on the sequence composition [T m = 4°(G + C) + 2°(A + T) − 3°C].

AFLP and SAMPL analyses

Amplified fragment length polymorphism analysis was performed as per the technical instructions of the manufacturer (Invitrogen, Life Technologies). Genomic DNA (300 ng) was restricted with EcoRI and MseI (2.5 U each) in a restriction buffer (50 mM Tris–HCl, pH 7.5, 50 mM Mg acetate, 250 mM K-acetate) in total volume of 25 μl. MseI and EcoRI adapters were subsequently ligated to digested DNA fragments. The adapter ligated DNA was preamplified using the following parameters: 20 cycles of 94°C for 30 s, 56°C for 60 s and 72°C for 60 s. The preamplified DNA was diluted 50 folds and used as a template for selective amplification reaction using EcoRI and MseI primers with three selective nucleotides at 3′-end. The EcoRI adapter specific primer was end labeled at 3′ using -P32-ATP using PNK enzyme. Selective amplification was carried out initially at 94°C for 30 s, annealing at 68°C for 30 s and 72°C for 30 s; this was followed by a touch-down cycling protocol where the annealing temperature was reduced by 1°C in each cycle. This continued till the annealing reached 56°C. And finally 23 cycles of 94, 56 and 72°C for 30 s each. The reaction was stopped by addition of gel loading dye (98% formamide dye) and the samples were resolved on 6% denaturing polyacrylamide gel and autoradiographed (Sambrook et al. 2001). For SAMPL analysis, the EcoRI primer was replaced by 3′ anchored microsatellite primers, all other conditions and parameters remaining the same. Products were resolved on 6% denaturing PAGE and autoradiographed.

Data analysis

The amplification products were scored for the presence (1) and absence (0) of bands across the genotypes to generate a binary matrix. Co-migrating bands were assumed to be originating from the same genetic locus.

The binary matrix was analyzed using the NTSYS-pc version 2.11w software to calculate the similarity values and generate the phenogram. Jaccard’s similarity coefficient was utilized for estimating the pairwise similarity between the operational taxonomic units (OTUs) using the formula GSij = a/a + b + c, where GSij represent the genetic similarity between the two individuals ‘i’ and ‘j’. ‘a’ is the number of polymorphic bands shared by ‘i’ and ‘j’, ‘b’ is the number of bands in ‘i’ and absent in ‘j’ and ‘c’ is the number of bands present in ‘j’ and absent in ‘i’ (Jaccard 1908). After obtaining the similarity matrix, clustering was performed by sequential agglomerative hiearchical nested clustering, a distance based method, where series of successive mergers are used to group individuals with similar characteristics. The graphical representation of the cluster (phenogram) was obtained by using the unweighted pair group method of mathematical averages (UPGMA). Mantel test (Mantel 1967) was used to test the correlation of coefficient between pairs of similarity matrices and for determining cophenetic correlation values using NTSYS-pc, version 2.11w.

The degree of confidence at the nodes of the phenogram was evaluated through WINBOOT software (Yap and Nelson 1995). The original binary matrix was bootstrapped between 100 and 1,000 replicates to generate variant matrices, which were then re-analyzed to generate phenograms with the bootstrap values at the nodes.

Polymorphic information content (PIC) or average heterozygosity was calculated as per the formula of Roldan-Ruiz et al. (2000): PIC = 2f i (1 − f i), where f i is the frequency of the amplified allele and 1 − f i is the frequency of null allele. Average heterozygosity (H av) is obtained by taking the average of PIC values obtained for all the markers and is calculated as:

$$ {\text{H}}_{{{\text{av}}}} = {\sum {{{\left[ {2{\text{fi}}{\left( {1 - {\text{fi}}} \right)}} \right]}} \mathord{\left/ {\vphantom {{{\left[ {2{\text{fi}}{\left( {1 - {\text{fi}}} \right)}} \right]}} {\text{N}}}} \right. \kern-\nulldelimiterspace} {\text{N}}} } $$

Multiplex ratio (MR) for each assay was estimated by dividing the total number of bands (monomorphic—m, and polymorphic—p) amplified by the total number of assays (primer combinations employed—n) as per Powell et al. (1996).

$$ {\text{MR}} = {\text{m}} + {\text{p}}/{\text{n}} $$

Marker index (MI) was obtained by multiplying the average heterozygosity (H av) with MR (Powell et al. 1996).

$$ {\text{MI}} = {\text{H}}_{{{\text{av}}}} \times {\text{MR}} $$

Results

AFLP analysis

Six AFLP primer combinations produced 500 scorable products. Table 1 summarizes the number of bands amplified for different primer combinations and percentage polymorphism detected in T. terrestris. The average number of bands per primer combination was 83.3 and percentage polymorphism ranged from a low of 74.7 in primer combination E-AAG + M-CTA to a high of 97.2 in primer combination E-ACT + M-CAG. Figure 1a is a representative AFLP profile generated by primer combination E-ACT + M-CAG. This proved to be the most informative primer combination as it amplified maximum number of products (106) and detected highest polymorphism (97.2%). The monomorphic products are marked ‘M’ (Fig. 1a). The characteristic feature of this AFLP profile is the amplification product unique to the genotypes from Delhi (lanes 1–6, band marked as ‘D’). In addition, some of the genotypes from Mysore (lanes 17–19) lack an amplification product indicated as ‘H’. Additionally, the genotypes from Amravati has an amplification profile distinct from rest of the genotypes due to presence of unique bands (lane 23, bands marked ‘U’). Other rare polymorphic bands are ‘B-1’ in the accessions from Mysore and ‘C-1’ in Dehradun.

Table 1 Tribulus terrestris: AFLP results
Fig. 1
figure 1

AFLP profile showing the genetic polymorphism in Tribulus terrestris (a), detected with primer combination E-ACT + M-CAG. Samples in lanes C control (tomato DNA), 1–6 Delhi, 7–11 Dehradun, 12–15 Rajasthan, 16–20 Mysore, 21–24 Amravati. b SAMPL profile of 24 different accession lines obtained by the primer combination (CT)4C(AC)4A + M-CTG. Lanes C control (tomato DNA), 1–6 Delhi, 7–11 Dehradun, 12–15 Rajasthan, 16–20 Mysore, 21–24 Amravati. c Inter-simple sequence repeat polymorphism using ISSR primer S-13. Lanes—M Marker (mix of λ DNA digested with HindIII and φX 174 DNA digested with HaeIII), 1–4 Mysore, 5–14 Delhi, 15–21 Dehradun, 22–25 Amravati. d RAPD profile using primer A-1. Lanes—M Marker, 1–6 Mysore, 7–16 Delhi, 17–22 Dehradun, M Marker

SAMPL analysis

An average of 81.3 bands were amplified per primer combination and the percent polymorphism ranged between 89.9 [(CT)4C(AC)4A + M-CTG] and 100 [(GACA)4 + MCTA]. Of 488 bands obtained with six primer combinations, 462 were polymorphic (94.7% polymorphism). Figure 1b shows a SAMPL fingerprint obtained with primer combination (CT)4C(AC)4A + M-CTG. Out of 89 amplified products, nine are monomorphic, three are marked ‘M’ in Fig. 1b. A-1, A-2 and A-3 are rare polymorphic bands as they are present in only some of the genotypes from Mysore, Rajasthan and Delhi, respectively. The result is listed in Table 2.

Table 2 Tribulus terrestris: summary of SAMPL assays

ISSR analysis

Twenty-one ISSR primers generated a total of 239 bands. Several polymorphic products specific to a particular geographic region could be observed. A representative fingerprint pattern generated by primer S-13 is shown in Fig. 1c. Out of 13 amplicons generated by this primer (size range 0.45–2.3 kb), five were monomorphic marked as ‘A’. A 0.8 kb amplicon ‘B’ is present only in genotypes from Mysore (lanes 1–4, Fig. 1c), and a 1.25 kb product is unique to genotypes from Delhi (‘C’, lanes 5–14). Several other bands were shared by genotypes from most geographical regions. Such bands, exemplified by ‘C*’, are shared by samples from Dehradun and Amravati, and ‘M*’ is shared by all genotypes except those from Mysore (0.8 kb; Fig. 1c). Another such product is indicated as ‘D’ which is also absent from genotypes originating from Mysore. The bright intensity of amplicon ‘D’ suggests its origin from a repetitive region such as a microsatellite locus. The results are summarized in Table 3.

Table 3 Tribulus terrestris: interzonal genetic diversity through ISSR

As mentioned earlier, 21 ISSR primers generated 239 bands (size range 0.4–2.5 kb), an average of 11.4 amplifications per primer. The detected percent polymorphism was as low as 40 (primer UBC-2) to as high as 100 (UBC-4), an average of 73.6%.

RAPD analysis

Randomly amplified polymorphic DNA assays were performed with a total of 21 random decamers. In total, 276 amplicons were scored in the size range of 0.4–3.0 kb. This gives an average of 13.1 bands per primer. Of the 276 bands, 163 proved to be polymorphic (59.0%) and hence informative (Table 4). G-12 proved to be the most efficient with 90% of the 20 products being polymorphic. In contrast, G-11 primer could detect only 8.3% polymorphism though it generated 12 amplified products. A representative RAPD profile obtained by primer A-1 is shown in Fig. 1d. Of a total of 12 bands (0.4–2.3 kb), 7 are polymorphic (58.33%). Two distinct categories are visible. The first marked ‘A’ are monomorphic, and the second category marked ‘B’, a 0.55 kb band, is unique to genotypes from Dehradun (lanes 17–22, Fig. 1d).

Table 4 Tribulus terrestris: summary of interzonal genetic diversity study using RAPD

Statistical analysis

The presence (1) and absence (0) of bands were scored across all genotypes. The binary matrix generated was used to calculate a similarity matrix based on Jaccard’s coefficient. The phenograms based on genetic similarity data of AFLP and SAMPL are comparable. The major difference is in the clustering of genotypes of Amravati. AFLP presents them as sub-cluster of cluster II with similarity coefficients of 0.95–0.97 (Fig. 2a). In SAMPL analysis they cluster with similarity coefficients ranging from 0.64 to 0.8 (Fig. 2b). This reiterates that SAMPL markers detect more polymorphic loci. The phenograms based on ISSR and RAPD data indicate that genotypes from different geographical regions are clearly distinguishable as separate clusters (Fig. 2c, d).

Fig. 2
figure 2

Phenetic relationships among Tribulus terrestris genotypes as revealed by data from a AFLP, b SAMPL, c ISSR and d RAPD marker systems

All the phenograms confirm consistency of data. The principal correspondence analysis of the AFLP and SAMPL (data not shown) reveal genotypes from Mysore as being most diverse. They are not only different from other genotypes but also from each other, and hence, cannot be grouped together. In the case of ISSR and RAPD (data not shown), however, all the genotypes belonging to a particular geographic region were grouped together in PCA plots. This confirms the ability of AFLP and SAMPL markers to analyze more number of loci and thus differentiate closely related genotypes.

The four molecular marker systems were compared on the basis of four different criteria (Table 5). AFLP revealed highest MR (83.3) followed by SAMPL, RAPD and ISSR. In case of polymorphism detection, SAMPL as a tool scored higher than AFLP, ISSR and RAPD. The fact that microsatellites are hypervariable and when assayed revealed higher heterozygosity, as both SAMPL and ISSR score over AFLP and RAPD in terms of average heterozygosity (H av). Finally, ‘MI’, a measure of overall efficiency of a marker is highest for SAMPL, making it the most efficient marker for the genetic diversity studies in T. terrestris. Mantel test (Mantel 1967) was employed to determine the coefficient of correlation between the similarity matrices generated by these markers. The coefficient of correlation between AFLP and SAMPL was 0.793 (P = 0.01). Mantel test was also employed to analyze the ‘goodness of fit’ for the UPGMA dendrograms generated by each marker system (AFLP, SAMPL, ISSR and RAPD). This was done by making the cophenetic similarity matrices from each UPGMA phenogram and then comparing these cophenetic similarity matrices with the original similarity matrices (generated from binary data) for each marker technique. It revealed values higher than 0.95 for all the markers used [RAPD (r = 0.998, P = 0.01), AFLP (r = 0.993, P = 0.01), ISSR (r = 0.973, P = 0.01), SAMPL (r = 0.957, P = 0.01)], thus confirming their authenticity. Bootstrap analysis of this data was performed to determine the confidence values as percentages at each node. Very high bootstrap values were obtained from the major nodes. Hundred percent support value was obtained for the node at which cluster II separates from cluster III in the phenogram obtained from AFLP data. The support value for the Delhi cluster was also 100% (Fig. 2a). The bootstrap values for the RAPD phenogram are also 100% for each cluster (Mysore, Delhi and Dehradun) as evident from Fig. 2d.

Table 5 Comparison of various molecular markers in evaluating genetic diversity of Tribulus terrestris

Discussion

Geographically isolated populations of plant species tend to accumulate genetic variations during the course of environmental adaptations. The medicinal herb, T. terrestris chosen for the current study is an annual with propagation through seeds. However, the predominant mode of genetic material exchange, whether through selfing or out crossing needs to be unambiguously established. All four molecular markers, that rely on arbitrary and semi-arbitrary nature of amplification detected polymorphism percentage ranging between 60 and 95%. AFLP revealed an average of 82.9% polymorphism across all genotypes. Such high percentage polymorphism (79.5) is also reported in ten populations of two different habitat types of R. acris (Odat et al. 2004). In our studies the low region-wise diversity is in congruence with the study of Fu et al. (2004) on little blue stem (Schizachyrium scoparium), where AFLP revealed high (>91%) interpropulation and low (only 7.9%) intrapopulation genetic diversity. Natural populations propagating through seeds implicitly have higher level of variation. AFLP analysis of six natural populations of Rubus arcticus detected high level of interpopulation genetic variation (0.72–0.94) measured as Simpson index (D). In T. terrestris AFLP grouped Delhi genotypes into one tight cluster. Likewise, genotypes from Dehradun and Amravati formed distinct groups. But, genotypes from Jaipur appear diverse and showed relatedness to genotypes from Delhi, Dehradun and Amravati. Several reasons could be attributed to the low intra-zonal diversity detected in the present investigation. First, the genotypes have genetically adapted to local environment and have become homogenous as a result of ‘inbreeding’ or exchange of genetic material within the said population. Thus, in the present investigations, unique DNA amplification profile brought out a distinct genetic identity belonging to genotypes of separate geographical regions. However, such a specific genetic separateness has not created a hybridization barrier across zones, and therefore provides an opportunity to genotypes from different zones to exchange genetic material. The spines and warty protuberances on T. terrestris fruits enable them to easily attach to animals and humans and to stick onto vehicle tires (cars, farm, airplane), subsequently facilitating long distance dispersion and spread and thus exchange of genetic material between distant populations. Our results are comparable to white clover (Trifolium repens) where RAPD indicated closeness between population of Atlantic Canada to population from Georgia and those from West Virginia to the cultivar Sacramento (Gustine et al. 2002). Similarly, RAPD analysis of Iris setosa showed genotypes from one location sharing higher level of genetic relatedness with other geographic location (Artyukova et al. 2001). This corroborates the fact that geographically and genetically distinct populations can exchange genetic information and that these are not completely isolated. In the present study similar results were obtained with SAMPL, ISSR and RAPD markers as they unraveled higher level of interzonal but a much lower level of intra population genetic diversity. In the present investigation, various high intensity bands were amplified, like a 1.8 kb ISSR band amplified by primer S-13. Artyukova et al. (2001) have also reported RAPD bands of such high intensity in Iris. The high intensity bands could be a consequence of high copy number. The region-specific bands detected in our studies can be used for the authentication of this medicinal herb as the biochemical properties of the pharmaceutically active ingredients is known to be influenced by the geographic region. Wang et al. (2003) have already exploited RAPD for confirming the genuineness of Bupleurum chinense. Na et al. (2004) have detected three RAPD primers which can discriminate between Korean and Chinese Astragali radix. The comparison of the four molecular marker systems revealed SAMPL to be the best as it generated highest percentage polymorphism and high MI. The capability of SAMPL markers lies in the detection of hypervariability in the microsatellite region. Microsatellites detect high level of polymorphism (Teulat et al. 2000), which can be used to discriminate between closely related genotypes. The hypervariability of SSR is due to variable copy number of repeat units at a locus (Tautz and Renz 1984). The measure of overall efficiency of a marker system remains the ‘MI’ which is highest for SAMPL as also reported by Tosti and Negri (2002). Phenograms generated from AFLP, SAMPL, ISSR and RAPD data showed similarity in relative placement of genotypes. SAMPL and AFLP markers provided a more detailed idea on the genetic relationship. The coefficient of correlation between SAMPL and AFLP markers is high (r = 0.793) in T. terrestris. Similar results have been obtained in A. indica by Singh et al. (2002), thereby implying that these marker techniques can be used judiciously and effectively for analyzing genetic diversity. The cophenetic correlation values of Mantel test are above 0.95 for each marker system. This validates the phenograms obtained after UPGMA analysis of binary matrices data. These findings can be used in authentication of the medicinally important herbs and perhaps in recognizing high yielding population in a particular area. Further advantages can be for planning conservation strategies, the population representing the maximum genetic diversity need to be conserved followed by the population which compliments the previous one.