Introduction

Sorghum (Sorghum bicolor L. Moench) is the third most important grain crop in India, after rice, wheat, maize, and millet [32]. Maharashtra is the leading producer of sorghum, ranking third after wheat and rice, regardless of the fact that prior to the agricultural revolution, sorghum was ranked second in India in terms of area and production [26]. Globally, it is the fifth most important cereal after rice, wheat, maize, and barley. In the production of alcohol, sorghum is used to produce ethanol, starch, and syrups [2].

Sorghum belongs to the Poaceae family, the genus Andropogoneae, and the species Sorghastrum. Sorghum originated from the Greek word “sorgo” that means to supersede, as in having grown taller than in other crops in the field. The Sorghum family has been arranged into two all-around emblazoned areas by specifically Eu-sorghum and Para-sorghum [8]. Divided the category Sorghum into six sub-genera, namely Eu-sorghum, Chaetero-sorghum, Hetero-sorghum, Para-sorghum, Sorghastrum, and Stipo-sorghum, and officially assigned the specific name Sorghum imlgare, to the detailed outline and its cross-viable wild and weed species members [3]. Despite, Price et al. [12] assumed the title Sorghum bicolor (L.) Moench was correct, and this idea has indeed been endorsed by Kong et al. [31]. Sorghum bicolor (L.) Moench includes yearly consist of varying with 10 sets of chromosomes (2n = 20), which includes all advanced grain sorghum and Sorghum-sudan grass crossovers, as well as their cross-viable wild and weedy relatives.

The primary focus is on genetic analyses of sorghum for its excellent quality. (Sorghum bicolor (L.), 2n = 2x = 20) is the nation’s fifth most important food crop after wheat, rice, maize, and barley, with over 80% of the crop grown in Africa and Asia [7]. The ICRISAT sorghum collection represents the major sorghum diversity centers. The National Plant Genetic Resources Centers house a wide range of sorghum landraces. The ICRISAT gene bank is a major source of diversity for subtropical agricultural crops [4].

SSR markers (simple sequence repeats) are suitable to study genetic variability. Typical landraces, optimized cultivars, and breeder’s cultivars are among the accessions. A further important feature of SSR markers is their convenience of its use [16]. DNA markers are typically nucleotide sequences that conform to a physical in a genome as well as their polymorphisms. SSR fingerprints are often extremely specific, and they are frequently used to differentiate various individuals and identities, as well as to expose polymorphism [14, 24].

Simple sequence repeats (SSR) markers—also called microsatellites—instantly became the DNA markers of choice for plant and animal genomes [5]. SSRs are tandem repeats of di-, tri-, tetra-, penta-, or hexanucleotide units in the DNA of plants [24]. A good number of microsatellite markers have been developed for sorghum that allows so much high rate of sorghum genotype assessment [20].

The analysis of genetic diversity in 20 sorghum accessions is from the ICRISAT gene bank, Hyderabad, India, both for agro-morphological character and SSR markers. Protein and mineral composition vary considerably within and between 20 sorghum accessions from the ICRISAT gene bank in Hyderabad, India [6]. The accessibility of such markers for evaluating genetic variation is a simple way for conventional breeding to offer integrated genetic traits for breeding programs. Predefined kinds of molecular markers used for sorghum diversity evaluations became relevant, as well as they differ in complexity, reliability, and information generating capacity [22]. When genetic diversity was assessed using microsatellite (SSR) markers, a p < 0.05 of genetic variation was evidenced among 20 sorghum accessions from different agro-ecological regions in Marathwada (70% between many accessions and 30% within accessions) [19].

Molecular markers and its value have indeed been demonstrated in studies of genetic variation (Morand et al., 2002; Zeid et al., 2003), breeding systems (Durand et al., 2000), and pollination biology (Durand et al., 2000), including pollination ecology (Durand et al., 2000) (White et al., 2002). Crop diversification has increased crop production such as wheat via domestication, yet this has diminished crop genetic diversity. As a consequence, a strategy to enhance the advancement of a crop is necessary. Advanced species is determined by genetic diversity. For the protection and use of germplasm of particular species diversity, genetic diversity is essential.

To broaden the genetic base for crop improvement, it is essential to reveal genetic variation. The employment of markers serves the purpose of revealing genetic diversity. Variety understanding also assists in the development of methods for integrating relevant variety into breeding plans. Use of markers to describe crops reveals both similarities (shared alleles) and diversity (typical alleles) across cultivars of the same crop. Also, it aids in determining the genetic pool or provenance of the selected crop. The genetic basis should be extended in order to develop elite cultivars. The species whose genetic diversity has indeed been discovered utilizing markers are utilized to generate superior hybrids via germ line exchanges. Markers are used to characterize and construct a genetic profile. Such crop DNA profiles are used to maintain a crop’s gene pool in gene bank.

Materials and Methods

The twenty seed material accessions for the current experiment were obtained from ICRISAT gene bank, originally obtained from various locations of Maharashtra, passport data for seeds listed in Table 1; these cultivars were selected because they had 90 to 100% germination rates and are cultivated in the semi-arid regions of Maharashtra, India. Selected seeds were allowed to germinate by blotter paper method, under controlled and optimum conditions on the Petri dishes lined with 75% humidity by Whatman no. 1 filter papers and incubated at 30 °C. After germination, 16 days old leaf tissues were harvested from 20 different sorghum accessions harvested and stored at − 20 °C for DNA extraction.

Table 1 List of germplasm accessions along with their biological status and geographical location

DNA Extraction

Genomic DNA extracted from 4 days old seedlings were performed by the CTAB extraction method modified by Khan et al. [10]. Three to four seeding from each sample was selected randomly and root and shoot both sampled and used for DNA isolation. Genomic DNA was purified by RNAse treatment. Concentration of purified DNA was measured both spectrophotometrically by using the UV visible spectrophotometer at 260 and 280 nm and was calculated, and purity was also confirmed by gel electrophoresis in electrophoresis system in 0.8% (w/v) agarose gel containing 0.5 µl/ml ethidium bromide at 6 V/Cm in 1X TBE buffer. About 20 to 100-ng total DNA were selected for the next experiment [18].

Polymerase Chain Reaction (PCR) Amplification of SSR

The PCR program was set in a thermal cycler amplification reaction mixture prepared in 0.2-ml thin-walled flat cap PCR tubes, containing the following components. The total volume of each reaction mixture was 20 µl; details of PCR master mixture are given in Table 2. Five pairs of SSR primers were chosen for the current experiment for analysis of diversity among selected accessions were part of commercial SSR primer, i.e., kit29 (http://sorghum.cirad.fr/SSR_kit), which provides reasonable coverage across the sorghum nuclear genome. Details of the primer sequences and type of microsatellite repeats are depicted.

Table 2 PCR master mixture for individual reaction of SSR analysis

In (Table 3) PCR, the PCR program consisted of an initial denaturation for 30 s at 94 °C and then 30 cycles of denaturation for 30 s at 94 °C, annealing at 57 °C (55 or 60 °C) for 60 s, depending on the annealing temperature for the primer, and extension at 72 °C for 60 s. The last PCR cycle was followed by a 5-min extension at 72 °C and then put on hold at 10 °C, at infinity (∞). The amplified products were stored at − 20 °C until they were needed to run gels [1, 15].

Table 3 Details of SSR primer sequence and microsatellite repeats

Gel Alectrophoresis of PCR Products

After amplification amplified product first separated by gel electrophoresis, for this purpose, gel was run using a 2.5% agarose gel. The gel contained 6.25 g of agarose, weighed using a Mettler Toledo electronic balance, dissolved in 250 ml TAE in a conical flask (volume of flask is required) and the mouth covered with cotton wool. The mixture was then heated in a microwave to dissolve finally and allowed to cool to about 60 °C, and 5 µl of ethidium bromide was added [28, 29]. This was swirled gently to avoid bubbling, and when cooled, the solution was poured into a mold with comb placed on a level surface and allowed to cool and solidify. The comb was gently removed, and the gel was transferred into an electrophoretic tank filled with a 1X TAE buffer. PCR products of 10 µl were loaded into each well and run at 90 V for 45 min. The gel was then observed under an UV trans illuminator and Gel Doc. Each set of PCR product data was obtained from each accession and was scored [27].

Scoring SSR Bands

Product from any of the eight primers with different dye labels were pooled into groups based on their respective agarose band length and resolution capacity of dye labels. One microliter from each co-loaded group was added to 8 µl of solution containing 0.108 µl of GSLIZ500 internal size standards to 8 µl master mix and vortexing at 900 RCF for solution mixing. Vortexted samples are denatured and immediately placed on ice for 3 min. Each fragment analyzes separately to estimate particular fragment size and allele sizing, and detection of homozygous and heterozygous was done with Gene Mapper software; the amplified fragments were visualized using the GEL DOCUMENTARY unit. Connected with Image Reader software version 1.01, size of DNA fragments was determined using Pe which could spontaneously recognize the lane and band and measure the fragment size. Genotypes of individual plants were indicated by the allele size at all SSR loci, and the pairwise different method was determined for each haplotype pair using the parameters. Polymorphic information content (PIC) values for each primer set were calculated using the algorithm described by Smith (1997), Varshney et al., and Davila et al. [11, 23].

$$\mathrm{PIC}=1-{\sum }_{\mathrm{i}=0}^{\mathrm{n}}{f}_{\mathrm{i}}^{2}\mathrm{i}=1$$

where \({f}_{\mathrm{i}}^{2}\) is the frequency of ith allele. PIC is the estimated polymorphism of locus by taking into consideration that not only no of allele who expresses but also estimates frequency of alleles. PIC value ranges from 0 (monomorphic) to 1 (polymorphic). Statistical analysis of data was performed using Power Marker ver. 3.25 to estimates genetic distance, locus, hetrozygosity, PIC content, allele size, length, range, and abundance within chromosome. It is also reported for allele size of loci, and product of SSR was used for genotyping [21]. The genetic diversity of the landraces was measured at different levels of population structure. The genetic variation of each locus shows the utility and general information of each SSR tool [13].

Result and Discussion

Five SSR primers located variable regions of chromosomes that we selected for study of in vitro variability among sorghum accession were found highly polymorphic in nature and therefore provided a powerful data for evaluation of molecular diversity within sorghum landraces. These results also find close relation with Wang Min et al. [30] which concludes that wild sorghum and landraces possessed greater allelic diversity than cultivars and might contain alleles not present in the cultivars which can strengthen further conservation and utilization. It means that higher level of polymorphism is directly proportional with unique replication slippage and mutation responsible for generation of SSR allelic diversity [6, 14]. As results indicate that 5 markers involved in this study average 14 alleles were observed with 2.5 alleles per locus, this is a direct and indirect consequence of artificial selection for traits that improved agronomic qualities; the favorable alleles and variation allow to fixation of traits among population during introduction and acclimatization of species (Table 4). Similar results were found by Murray et al. and Ritter et al. [17, 25]. Number of microsatellite in SSR marker correlates with allele no and PIC (polymorphic information content) ratio which was found 0.37–0.70 with average mean of 0.44; SSR markers with di, tri, and tetra repeats show more no of PIC ratio than mono repeats; this also means no of repeats is associates with higher no of microsatellite and higher PIC number; similar results were found by Afolayan et al. [6], in which he also added that PIC cannot be more than 0.50, which is contrary to what is obtainable for multi-allelic markers, but according to Smita Shingane et al. [27], PIC value can give better and more information on variability, such as SSRs, in which the maximum no of repeats. This further confirms the fact that this detects more alleles than RAPD markers [8]. Similar PIC values found by Davila et al. and Afolayan et al. [6, 11] were significant with Akansha et al. [13] and El-Awady et al. [5] but significantly at par with Davila et al. [11]. Thus, the above shows that significant variation is observed in RAPD, SSR, and RFLP markers (Table 5). Findings confirm effective application of SSR markers over other in relation with evaluating allele frequency and PIC content. Average gene diversity is high, and according to Onyango and Ngugi [21], this strong allelic variation is due to di-nucleotide repeats containing SSRs and is the result of high mutation rates among such repeats. Earlier studies evaluate genetic relation between the cultivars and landraces of wild relatives were successfully identified by Ajeigbe et al. [7] and Maugham et al. (1995) on basis of his studies; 20 sorghum accessions can be divided into three different groups; hence, SSR marker analysis is effective tool for quantification of genetic diversity.

Table 4 Analysis of genetic diversity across the sorghum accession
Table 5 Diversity indicators revealed by 5 SSR primer combination used in the study

Out of five SSR markers, MsbCIR223 and XTSEP SSRs usually had more no of SSR loci in 20 accessions than the other; this may be due to the difference in allelic no and position in chromosome. XTSEP markers are reported to be extracted from a small tandem repeat library (Brown et al., 1996; Kong et al., 2000). These loci therefore were more likely to include non-coding regions than the other SSR primers. A neighbor-joining (NJ) tree was constructed based on genetic distance (Fig. 1). The group 1 in NJ tree consists of most of the traditional landraces, whereas the group 2 consists of mostly breeding material. Furthermore, in group 1 and 2, the accessions were classified into sub-sub groups based on their origin and types. Similar observations were arrived by many investigators in different crops.

Fig.1
figure 1

Neighbor-joining (NJ) tree of 20 sorghum accessions constructed using Power Marker software

Our results suggest that the sorghum accessions from different geographic origins or institutions could come from the same gene pool and may not be useful for continual sorghum improvements. Therefore, when planning for a new germplasm collection and conservation, it is imperative to characterize the available germplasm before commencing on a breeding program. Conclusively, our study revealed high genetic diversity within the sorghum germplasm. Hence, these sorghum accessions will be protected from genetic vulnerability to biotic and abiotic stress in environment, and breeder will guide how to utilize and conserve germplasm.