Introduction

Salvia, boasting more than 1890 species from old and new world, holds the distinction of being the largest genus within Lamiaceae family (IPNI).

Iran stands as a significant repository of Salvia L. germplasm, with approximately 61 species of annual and perennial herbaceous plants belonging to Salvia genus. Furthermore, 17 of these species are endemic to the country (Mozaffarian 2004). The Salvia genus has 1893 known species worldwide (IPNI), and the plants may be found thriving in a variety of climates, from temperate to semitropical to arctic, including tropical areas in Iran. Consequently, inter-specific hybridization contributes to a remarkable morphological diversity (Hedge 1982).

Salvia, commonly known as sage or Chia, has garnered attention in recent research in terms of its numerous medicinal properties (Valizadeh et al. 2014a, b; Mahmoudi Kordi et al. 2015). The genus was associated with various biological activities, such as antioxidant, antifungal, cytotoxic, antibacterial, antitumor, antiprotozoal, and HIV inhibitory effects (Ulubelen 2003; Imanshahidi and Hosseinzadeh 2006). Therefore, certain Salvia species hold economic importance as the sources of essential oils and aroma chemicals for the perfume industry (Kamatou et al. 2008). These species produce active natural compounds falling into categories, such as terpenoids, steroids, flavonoids, and polyphenols (Shirota et al. 2006).

The survival and adaptability of plant species depend critically on genetic variation because it enables them to adjust to changing environmental conditions and retain their long-term viability. Understanding the genetic diversity within plant populations is crucial for conservation efforts, and the development of effective management strategies.

Three significant factors contribute to genetic diversity in sage: selection, geographical distribution, and speciation. Alongside open and cross-pollination, these factors create uncertainty in the taxonomy and genetic relationships within Salvia genus (Wang et al. 2011). Species identification based solely on morphological characteristics is challenging, as environmental conditions easily influence such traits.

The basic chromosome numbers of Salvia species were classified into five groups: x = 7, 8, 9, 10, and 11. The ploidy levels vary among different species, ranging from diploid to tetraploid (Hesamzadeh Hejazi 2016; Kursat et al. 2018). The chromosome counts for the species studied in this research are as follows: S. verticillata (2n = 4x = 32), S. glutinosa (2n = 2x = 16), S. reuteriana (2n = 2x = 20), S. hypoleuca (2n = 2x = 22), S. leriifolia (2n = 2x = 22), and S. limbata (2n = 2x = 22) (Hesamzadeh Hejazi and Ziaei Nasab 2023).

Regarding the importance of preserving and using plant genetic resources, understanding the genetic characteristics, and the diversity of plants, particularly in highly diverse genera like Salvia, are essential. In the era of modern biotechnology, DNA-based molecular markers offer cost-effective, stable, abundant, highly polymorphic, and environmentally-independent tools for identifying cultivars and evaluating genetic relationships among plant species.

Various molecular markers were used in Salvia-related research, including random amplified polymorphic DNA (RAPD), inter simple sequence repeats (ISSR), molecular phylogenetics, nrDNA ITS sequences, DAMD-PCR, PCR–RFLP, and simple sequence repeats (SSR) markers (Erbano et al. 2015; Liber et al. 2014; Zhang et al. 2013a, b; Song et al. 2010; Javan et al. 2012; Zhang et al. 2012; Karaca et al. 2008; Radosavljević et al. 2012; Walker et al. 2007).

We used R-ISSR markers, which involve the amplification of DNA regions via multiplex PCR reactions using a suitable combination of RAPD and ISSR primers. The number of amplified R-ISSR products, including novel genomic loci, depended on the combination of RAPD primers employed in the research (Smolik 2012).

During PCR genomic amplifications, using ISSR and RAPD primers in the combination yielded three types of amplified products: RAPD amplicons (obtained by amplifying DNA with a single RAPD primer as forward and reverse primers), ISSR amplicons (obtained by amplifying DNA with a single ISSR primer as forward and reverse primers), and hetero-amplicons (obtained by amplifying DNA with ISSR and RAPD primers simultaneously).

R-ISSR markers have emerged as a significant tool in genetic diversity research owing to their capacity to create high-resolution DNA profiles by amplifying sections of the genome that contain simple sequence repeats. This technique allows for the identification of polymorphic loci within the genome, enabling researchers to assess genetic variability at the population level. Using R-ISSR markers,

the objectives of our research were to investigate the genetic diversity within, and among different populations of two endemic species and four native species in Iran, as well as to demonstrate the potential application of R-ISSR technique as a screening tool to describe genetic variations. We examined a total of 28 populations (92 specimens) from six different Salvia species to establish breeding programs. Considering the potential threats to Iran's natural areas, the sage plant is one of the endangered species requiring urgent evaluation of its diversity, particularly in populations that hold economic value.

Materials and methods

Plant Materials and DNA Extraction

To assess the genetic diversity of six Salvia species (S. verticillata, S. glutinosa, S. reuteriana, S. hypoleuca, S. leriifolia, and S. limbata) in Iran, we collected plant samples from multiple populations across different geographical regions. A total of 28 populations were included, with 92 individuals sampled from all populations. Care was taken to ensure representative sampling across the species distribution range in Iran (Table 1).

Table 1 Geographical characteristics of a sampled population of six species of Salvia genus

Genomic DNA was isolated from the collected fresh young leaves samples using a modified cetyl trimethyl ammonium bromide (CTAB) protocol (Doyle and Doyle 1987). Approximately 0.6 g of leaves were ground into a fine powder in a mortar using liquid nitrogen. The powdered tissue was transferred to a 2 ml Eppendorf tube containing 900 μl of the extraction buffer (100 mM Tris–HCl, 50 mM EDTA, 1.5 M NaCl, 0.2% mercaptoethanol, 2.5% (w/v) CTAB, 0.35 M sorbitol, 1% Triton X-100, and 1.5% PVP). The homogenate was then incubated at 65 °C for 90 min with mixing every 5 min. The homogenate was subsequently extracted with 800 μl of chloroform/isoamyl alcohol (24:1), and centrifuged at 8,000 rpm for 10 min. The supernatant was carefully transferred to a labeled new tube, and 200 μl of NaCl (5 M) was added. Following a 10-min centrifugation at 8,000 rpm, the supernatant was transferred to a fresh tube and 1/4 (v/v) LiCl was added. After a few hours of vigorous mixing by inversion, the mixture was centrifuged for 20 min at 10,000 rpm. The supernatant was transferred to a new tube, and 2/3 (v/v) cold isopropanol (70%) was added. After mixing for a few minutes, the mixture was centrifuged at 11,000 rpm for 11 min. The supernatant was discarded, and the pellet was washed with 400 μl of 70% ethanol. Following centrifugation at 14,000 rpm for 3 min, the supernatant was discarded, and the pellet was dried in an Eppendorf Vacfuge™ at 37 °C for 15 min. The pellet was then re-suspended in 200 μl of TE (10:1) buffer and 5 μl of RNase A solution (100 mg/ml) was added. The mixture was thoroughly mixed, and incubated at 37 °C for 1 h. The homogenate was again extracted with an equal volume of chloroform/isoamyl alcohol (24:1), followed by centrifugation at 12,000 rpm for 10 min. The supernatant was transferred to a new 1.5 ml clear-colored tube, and 2/3 (v/v) cold isopropanol (70%) was added. After mixing for a few minutes, the mixture was centrifuged at 14,000 rpm for 10 min. The supernatant was discarded, and 400 μl of 70% ethanol was added to each tube to wash the pellet. After the centrifugation at 14,000 rpm for 3 min, the supernatant was discarded, and the pellet was dried in an Eppendorf Vacfuge™ at 37 °C. The dried pellet was then re-suspended in 200 μl of TE (10:0.1) buffer and stored at -80 °C until further use.

DNA quantity and quality

Purified DNA samples were quantified using a BioQuestTM CE2502 Spectrophotometer (CECIL Instrument Limited, Milton Technical Center, Cambridge, England). DNA concentration was measured by determining the absorbance ratios (A260/A280), A230, and A320. A pure DNA sample typically has an A260/A280 ratio of 1.8, indicating minimal protein contamination. The purity of the isolated total DNA was subsequently evaluated by 0.8% agarose gel electrophoresis. For further analysis, the DNA samples were diluted to a 5 ng/μl concentration and kept at −20 °C.

Primers and assays

For the genetic analysis, we used R-ISSR molecular marker technique. R-ISSR markers were selected based on their ability to produce clear and reproducible banding patterns.

Ten pairs of primers (Table 2), purchased in lyophilized form from TIB MOLBIOL-Germany, were tested for their ability to amplify DNA. Among them, four pairs of primers that showed significant polymorphism and reproducibility were selected for further analysis.

Table 2 R-ISSR primers with their sequences and annealing temperature (TM), Y = C/T

Polymerase chain reaction (PCR) amplifications were performed in a 50 μl reaction volume containing 1X PCR buffer, 1 mM MgCl2, 0.2 mM each of dNTPs, 0.7 μM of each primer, 2.5 U of Taq DNA polymerase (Qiagen-Germany), 5% formamide, and 12 ng of template DNA.

PCR amplification was carried out using an Eppendorf Mastercycler gradient Thermal Cycler under the following cycling conditions: an initial denaturation step at 95 °C for 5 min, followed by 40 cycles of denaturation at 95 °C for 1 min, annealing at specific temperatures depending on the primers (refer to Table 2) for 1 min, and extension at 72 °C for 1.5 min. A final extension step was performed at 72 °C for 10 min. The resulting amplicons were separated using agarose gels electrophoresis (1%) run at 50 V in 1 × TBE buffer, visualized by ethidium bromide staining, and photographed under ultraviolet light. The molecular weights of amplified fragments were estimated using a 100 bp DNA ladder.

Data analysis

The obtained DNA profiles were visualized, documented, and scored for the presence or absence of bands. The data were then used to construct a binary matrix, representing the genetic variation among individuals and populations.

In order to evaluate each primer's ability to identify polymorphic loci in both populations and individuals, Marker Indices (MI) for the R-ISSR markers were calculated using the technique outlined by Milbourne et al. (1997). The polymorphic information content (PIC) value was calculated using the formula PICi = 1−∑Pi2 where Pi is the frequency of the bands, as proposed by Roldan-Ruiz et al. (2000). The software programs POPGENE 1.31 (Yeh et al. 1997) and GenAlex (Peakall and Smouse 2006) were used to calculate single population gene frequencies and grouped population gene frequencies, as well as Nei's (1972) genetic distances matrix based on the binary data matrix. The distance matrix obtained was used to construct a Weighted Neighbor-Joining (WNJ) phenogram for the 28 populations using DARwin V5.0.158 software (http://darwin.cirad.fr/).

Additional genetic diversity parameters, including the observed number of alleles (na), number of effective alleles (ne), Nei's (1973) genetic diversity index (h), Shannon's Information index (I), total heterozygosity (Ht), expected heterozygosity within subpopulations (Hs), the coefficient of genetic differentiation (Gst), and the estimate of gene flow from Gst (Nm) were analyzed using POP GENE, and GenAlex software. R-ISSR data were subjected to a hierarchical analysis of molecular variance (AMOVA) following the method described by Excoffier et al. (1992). The AMOVA analysis was performed using GenAlex software.

Additionally, GenAlex software was used to perform a Principal Coordinates Analysis (PCOA) to illustrate the connection between the components of the distance matrix based on their first two principal coordinates.

Results

Using R-ISSR as a cost-effective and efficient method for genotype identification, and genetic relationship assessment within and among populations was found to be promising. Among 10, R-ISSR primers surveyed across Salvia species, only four primers showed considerable polymorphism and reproducibility which were selected for further analysis (Table 2 and Fig. 1).

Fig. 1
figure 1

R- ISSR marker profiles of 28 populations (92 individuals) of Salvia species (according to Table 1) generated by just one of the best primer (P9) in 1% agarose gel. Lane (L): 100 bp DNA ladder

A total of 28 populations from six different salvia species were analyzed using R-ISSR primers, resulting in the amplification of 93 scorable bands ranging in size from 100 to 2000 bp. On average, each primer yielded 23.25 bands (Table 3 and Fig. 1). The population with the highest number of scored bands was S. hypoleuca(Pop15), of which 41.94% were polymorphic. This was followed by S. glutinosa (Pop4) with 37.63% polymorphic bands. The population with the least polymorphism (5.38%) was observed in S. limbata (Pop26).

Table 3 The number of total and polymorphic bands amplified in 28 populations with polymorphism information content (PIC) and marker index (MI) values using the best R-ISSR primers in Salvia species

The range of polymorphic information content (PIC) values, which serve as indicators of the markers' ability to discriminate, was found to be between 0.223 and 0.260. Similarly, the marker index (MI) varied from 4.43 to 6.02 per primer. Among the primers, P9 showed the highest PIC value (0.260), and P1 had the highest MI value (6.02) (Table 3). Primer P9 (GT)8C + ACCGCGAAGG stood out as the best primer in terms of the sharpness of its patterns, and the high number of polymorphic markers it provided.

The average values of gene diversity (h) and Shannon's Information index (I) for 92 individuals across different populations, determined based on four R-ISSR markers, were found to be 0.236 ± 0.144 and 0.381 ± 0.187, respectively (Table 4). The highest values for (h) and (I) among the 92 individuals were observed for the P1 marker (0.4996 and 0.6927, respectively). The mean values of effective alleles, gene diversity, Shannon’s index, total Heterozygosity (Ht), Expected Heterozygosity within subpopulations (Hs), coefficient of genetic differentiation (Gst), an estimate of gene flow from Gst (Nm), based on R-ISSR data, were 1.36, 0.24, 0.38, 0.25, 0.074, 0.701, and 0.213, respectively (Table 4).

Table 4 Overall genetic variability across 28 populations and 92 individuals in Salvia species

Among the loci, P1-G showed the highest number of effective alleles (1.99), Shannon’s index (0.693), and gene diversity value (0.499), while P4-E had the highest Total Heterozygosity (0.496) and P8-H had the highest Expected Heterozygosity (0.221) (Table 4). The gene flow, estimated as Nm = 0.5(1—Gst)/Gst, among the 93 scorable bands, ranged from 0.02 to 3.07 (Table 4).

The genetic identity values determined using Nei's (1987) unbiased measures varied from 0.624 between S. hypoleuca (Pop19-20) and S. limbata (Pop27) to 0.995 between S. reuteriana (Pop8—Pop10) and S. reuteriana (Pop9—Pop11).

The mean coefficient of gene differentiation (Gst) was 0.701 and the value of (φST) was 0.522, indicating that 48% of the genetic diversity resided within the populations (Table 5). In addition, the examination of population genetic structure revealed the presence of discrete genetic clusters and subpopulations within every Salvia species. The 28 Salvia populations were categorized into six primary groupings using clustering analysis, which used unweighted neighbor-joining based on Nei's unbiased measurements of genetic distance. The cophenetic correlation of single tree was approximately r = 0.81, indicating a relatively good fit among the genetic distances calculated and the dendrogram. The analysis showed a high similarity among S.verticillata populations, indicating their close relatedness. The sub-clustering pattern of the cluster correctly grouped all populations except for one population, S. reuteriana(AZ), which showed a mismatch with the initial identification of the population. This mismatch may be attributed to the inclusion of genotypes bred for specific objectives and collected from different geographical locations may have contributed to the clustering of most cultivars or populations in a single cluster. Hence, Hedge (1982) has divided Salvia species into five groups, and two species S. glutinosa and S. reuteriana are in the same group—the isolation of certain species such as S. glutinosa (S2), and S. verticillata (va). S. hypoleuca(5) and S. hypolecuca(8) are likely attributed to the presence of a specific genetic band within them. This particular band might exert control over certain characteristics or traits found in these species (Fig. 2).

Table 5 Analysis of molecular variance (AMOVA) for 28 populations of different Salvia species based on R-ISSR markers
Fig. 2
figure 2

Band patterns across populations based on R-ISSR data in Salvia species

The genetic affinities and relatedness of the six taxa considered can be observed among some of the 28 populations (Fig. 3). Six almost distinct clusters can be observed in Fig. 3.

Fig. 3
figure 3

Dendrogram constructed from Nei’s unbiased genetic distance matrix among 28 populations (according to Table 1), clustered with the unweighted neighbor- joining (r = 0.81) based on R-ISSR data

AMOVA results indicated that a significant proportion of the genetic variation was attributed to variation among populations, rather than within populations. These results point to restricted gene flow and possible genetic isolation within some populations, which may be caused by ecological or geographic constraints. The results of the analysis of molecular variance (AMOVA) revealed that 52% and 48% of the genetic diversity resided between and within populations, respectively. Furthermore, the genetic variation among and within populations was found to be significant at a 1% level (Table 5).

Principal Coordinate Analysis (PCOA) of R-ISSR data showed that the first three factors accounted for approximately 65.14% of the total variance. Specifically, the first, second, and third axes contributed approximately 30.36%, 17.69%, and 17.08% of the total variance, respectively. The dendrogram clustering based on Nei's unbiased measures of genetic distance produced similar results, which were supported by PCOA ordination plot (Figs. 3, 4, and Table 6).

Fig. 4
figure 4

Scatter plot of 28 populations (according to Table1) for the first two PCOA analyses in Salvia species

Table 6 The cumulative percentage of population changes in three components

The findings of this study showed that R-ISSR markers can be effectively utilized for genetic differentiation of Salvia accessions. Specifically, the primers P1 and P9 demonstrated a significant degree of polymorphism and might be useful resources for managing Salvia germplasm and conducting further breeding research.

Discussion

The experimental results of this study provide evidence for the reliability, and the usefulness of R-ISSR markers in estimating genetic diversity within and between native Salvia populations. The R-ISSR method effectively differentiated all species that were tested, enabling identification from the genus level to the geographical species level. Additionally, it allowed for the distinction of populations and individuals within the same species based on variations in chemical components (data not shown). The assessment of genetic diversity in the studied Salvia species using R-ISSR markers provides valuable insights into their evolutionary history and population dynamics in Iran. Based on provided results of the R-ISSR analysis for the six Salvia species in Iran, several hypotheses can be generated. (1) Genetic Differentiation: The observed polymorphism and genetic diversity within and among populations of Salvia species suggest that there is significant genetic differentiation among populations. This could be due to factors, such as geographic isolation, habitat differentiation, or reproductive barriers. (2) Population Structure: The clustering analysis, and sub-clustering patterns indicate the presence of distinct genetic clusters and subpopulations within each Salvia species. This suggests that certain populations have unique genetic characteristics which are genetically distinct from others. (3) Limited Gene Flow: The AMOVA results and significant genetic variation among populations suggest limited gene flow among populations, potentially affected by geographical barriers or ecological factors. Restricted gene flow may contribute to the genetic differentiation observed among Salvia populations. (4)Marker Utility: The primers P1 and P9, which exhibited high levels of polymorphism, can serve as valuable tools for future breeding research and germplasm management in Salvia species. These markers can potentially help identify specific genetic traits or characteristics of interest. 5) Genetic Affinities: The genetic affinities and relatedness observed among some populations suggest that certain Salvia species are more closely related to each other than to others. Evolutionary history, hybridization events could influence this pattern, or shared genetic traits within specific groups.

It's important to note that these hypotheses are based on the provided results and the information mentioned in the study. Further analyses and research would be needed to validate and explore these hypotheses in more detail.

Conservation initiatives should consider preserving the genetic integrity of these Salvia species, particularly by targeting populations with unique genetic characteristics or those facing significant threats. Additionally, steps should be implemented to increase gene flow across communities to decrease the hazards associated with genetic isolation and inbreeding depression. The observed heterozygosity and Nei's genetic diversity index values reflected the rich genetic reservoir harbored by these species in Iran. The reproducibility of R-ISSR markers in this study may be attributed to using longer primers and higher annealing temperatures compared to RAPD markers. R-ISSR markers exhibit simplicity and can be used for genome mapping, gene tagging, and marker-assisted selection. Considering their high simplicity and the limited genomic knowledge for certain organisms, ISSR markers should be the primary choice for mapping or gene tagging. In cases where ISSR or RAPD alone fails to detect sufficient genetic loci, R-ISSR can be advantageous by revealing new genomic loci, requiring only the screening of suitable combinations of ISSR and RAPD primers. Comparative research using RAPD and ISSR marker systems in Salvia species has also been limited (Javan et al. 2012; Zhang et al. 2013a, b).

To the best of our knowledge, this study represents the first application of R-ISSR markers to characterize and evaluate genetic diversity within and among Salvia species. This technique offers new and interesting possibilities in the field of Salvia genus characterization. Additional investigation might be conducted to examine the functional consequences of the discovered genetic diversity and investigate possible associations between genetic variation and features that are ecologically or medicinally significant. Ultimately, such research will contribute to the development of sustainable management strategies and the preservation of invaluable genetic variations.

In conclusion, using R-ISSR markers proved to be a valuable tool for assessing genetic diversity and differentiation within and between populations of Salvia species. The markers exhibited high reproducibility and efficiency, making them suitable for genetic research and population analysis. The study successfully identified genetic clusters and relationships among the populations, providing insights into the genetic structure of Salvia species.

The results showed significant genetic diversity within populations, indicating the presence of a rich source of biodiversity in Salvia landraces. The comparison with other marker systems, such as RAPD, SRAP AFLP, SAMPL, RFLP, AFLP (Song et al. 2010; Liber et al. 2014; Rodriguez et al. 1999; Zhang et al. 2013a; Erbano et al. 2015; Bertea et al. 2006; Radosavljevic et al. 2012; Myśków et al. 2001) showed the superiority of R-ISSR markers to survey of Salvia species in terms of the number of markers generated, and the level of polymorphism detected (Chunjiang et al. 2005; Smolik 2012). R-ISSR markers proved to be a cost-effective alternative for molecular characterization and fingerprinting of Salvia species.

In general, this study highlights the importance of genetic diversity assessment in plant breeding and conservation programs. The use of R-ISSR markers offers significant contributions to the understanding of the genetic composition and interconnections across populations of Salvia species. Further research can build upon these findings to expand the study of genetic diversity in additional Salvia species and populations, contributing to broader understanding and conservation of this diverse genus.