Introduction

Pomegranate (Punica granatum L.) is an economically important perennial crop, with high nutritional, medicinal and ornamental importance. It is believed to have originated from Iran and is widely cultivated in drier parts of Southeast Asia, Iran, China, Japan, the USA (California), West Indies, Tropical America and India (Holland and Bar-Yaakov 2014). Globally, India stands first in pomegranate cultivation with area 2.20 lakh ha, production and productivity of 27.95 lakh tonnes and 12 tonnes/ha, respectively (NHB 2018). Pomegranates are used as fresh fruit since long and continue to serve the food industry for production of juice, ready to serve beverages, alcoholic beverages and seed oil (Holland et al. 2009). The medicinal properties of pomegranate have been well documented. For instance, recent metabolomic analyses in pomegranate have revealed the presence of a wide range of phytochemicals, including gallotannins, ellagic acid, flavonoids, antioxidants, terpenoids and alkaloids in its different parts, such as arils, seed, rind, flower, bark and root (Caliskan and Bayazit 2013; Mayuoni-Kirshinbaum and Porat 2014; Ophir et al. 2014; Ahmed et al. 2014; Aslan et al. 2014; Bellesia et al. 2014). These bioactive compounds have been demonstrated to be beneficial in combating high blood pressure and other serious diseases, such as diabetes and various cancers (Shishodia et al. 2006). Given a growing body of literature reporting multiple health benefits of pomegranate, commercial cultivation has risen tremendously in the recent years. Importantly, pomegranate serves as a valued cash crop for small and marginal farmers owing to its inherent properties to grow with limited resources. Its cultivation remains highly remunerative in tropical and subtropical regions. Therefore, pomegranate cultivation is increasingly replacing subsistence farming to alleviate the livelihood of farmers (Priya et al. 2016).

The breeding efforts aimed at improving pomegranate has led to the development and release of improved varieties in India. These efforts mainly relied on conventional hybridization and selection from natural genetic variants (Jalikop et al. 2005). However, improving operational efficiency of pomegranate breeding programs demands integration of modern genomics tools. Molecular markers are the prerequisite to accelerate breeding program through genomics assisted breeding (GAB). The availability of appropriate DNA marker technology facilitates the identification of genetic determinants (genes/QTLs) underlying various traits of economic significance. Such developments pave the way for marker-assisted selection and help in designing future improvement strategies through guiding selection of parents with favourable alleles (Singh et al. 2015).

To date, a wide range of DNA marker systems like randomly amplified polymorphic DNA (RAPD) (Singh et al. 2013; Orhan et al. 2014), and inter simple sequence repeat (ISSR) (Narzary et al. 2010), amplified fragment length polymorphism (AFLP) (Jbir et al. 2008; Sarkhosh et al. 2011, 2012), sequence-related amplified polymorphism (SRAP) (Soleimani et al. 2012), simple sequence repeat (SSR) (Ravishankar et al. 2015) and single-nucleotide polymorphism (SNP) (Harel-Beja et al. 2015) have been deployed in pomegranate. Among these, SSRs are still preferred due to their genome abundance, high reproducibility, multi-allelic and co-dominant nature, which provides more information per unit assay as compared to other marker systems (Ebrahimi et al. 2017). In pomegranate, SSRs have been employed extensively to study genetic diversity and to understand population structure and association analysis (Curro et al. 2010; Pirseyedi et al. 2010; Singh et al. 2015). However, linkage mapping and QTL analysis based on SSR markers are currently lacking in pomegranate. A possible reason may be limited DNA polymorphism demonstrated in pomegranate by the currently available SSR markers. The length of the repeat motif is of paramount importance while surveying genetic polymorphisms with SSR markers. Temnykh et al. (2001) reported there are two major classes for SSR based on the length of repeat motifs (track length) i.e. SSRs with > 20 nucleotides and < 20 nucleotides are referred to as Class I and Class II, respectively. Further, the subsets, (i) Class I SSR with SSR lengths of > 20 bp and (ii) highly variable SSR with SSR lengths of > 50 bp, have shown significantly higher polymorphism in rice and pigeonpea (Temnykh et al. 2001; Singh et al. 2012; Bohra et al. 2017).

The widespread utility of such hyper or highly variable SSRs has been well established for genetic studies owing to ease of amplicon and scoring (Singh et al. 2012; Dutta et al. 2013; Bohra et al. 2017). Recent development of whole genome sequence of pomegranate by Qin et al. (2017) has offered unprecedented opportunities for genome-wide characterization of informative markers. Therefore, the present study was conducted with the aim to develop a comprehensive set of Class I SSRs (> 20 bp) through in silico analysis using the draft genome sequence of pomegranate cultivar Dabenzi. We then demonstrate the utility of these new SSR markers for genetic applications in pomegranate through analyzing genetic diversity among 46 pomegranate genotypes. The identified genome-wide Class I SSR markers will serve as an important resource for pomegranate genetics and genomics studies.

Materials and methods

Plant materials and DNA extraction

A total of 46 pomegranate genotypes, which are maintained at field gene bank of ICAR-National Research Centre on Pomegranate, India were collected. Details on passport data of genotypes are given in Table 1. Initially, a subset of eight pomegranate genotypes namely Bhagwa, Daru 17, G-137, Ganesh, Arakta, Dholka, Jodhpur Red and Solapur Lal, was used for experimental validation of the developed SSRs followed by diversity study of the 46 genotypes. Fresh leaf samples were collected from the genotypes and washed with sterile distilled water. Genomic DNA was extracted from the leaf samples following the modified CTAB method (Ravishankar et al. 2000). The quality and concentration of genomic DNA were determined on agarose gel electrophoresis (0.8%) by using uncut lambda DNA as standard. Final dilution of 10 ng/μl was made for the entire DNA samples for subsequent polymerase chain reactions (PCR).

Table 1 Details of forty-six pomegranate genotypes used in the study

In silico SSR mining and primer designing

The whole genome shotgun (WGS) sequences of Punica granatum cv. Dabenzi (296.38 Mb) was retrieved from the National Center for Biotechnology (NCBI) Genome Assembly database (GenBank accessions MTKT01000001 to MTKT01017405) (Qin et al. 2017). Total 17,405 shotgun sequences were surveyed for SSR motifs using MIcroSAtellite Identification (MISA) tool (https://pgrc.ipk-gatersleben.de/misa/). Keeping minimum repeat length of 12, the repeat unit was defined as 12, 6, 4, 3, 3 and 2 for mononucleotides, dinucleotides, trinucleotides, tetranucleotides, pentanucleotides and hexanucleotides, respectively. Compound SSRs were defined as two SSRs interrupted with 100 bases. The MISA statistics were further analyzed to draw frequency distribution graphs using Microsoft Excel.

The SSR loci identified from the genome assembly were recorded. Primer designing was performed using Batch Primer 3 v1.0 (https://wheat.pw.usda.Gov/demos/BatchPrimer3) with the criteria of minimum repeat length (≥ 24 bp). Primers were designed to generate amplicons of 100–400 bp in length with the following parameters: primer length (bp) 18–20, with 19 as the optimum; GC content (%) 40–60, with the optimum value being 50%; Tm (°C) 50–60, with 55 as the optimum. The other parameters used were as that of default program values.

SSR screening and genotyping

For experimental validation, a subset of 110 SSRs was selected for PCR amplification on eight pomegranate genotypes using Prime-96™ Thermal Cycler (Himedia, India) and resolved by gel electrophoresis. Subsequently, 13 SSRs were selected based on their clear amplification profile and screened on 46 pomegranate genotypes to evaluate genetic diversity. PCR amplification was carried out in 10 μl reaction volume containing 1.0 μl of 10X PCR buffer, 1 μl (1 mM dNTP mix), 0.5 μl each of forward and reverse primers (10 pmol), 0.2 μl of Taq DNA polymerase 5U/μl (Himedia, India) and 1 μl (10 ng) of template DNA. PCR condition was set as initial denaturation at 94 °C for 5 min, followed by 36 cycles of 94 °C for 1 min, 55 °C for 1 min, 72 °C for 2 min and a final extension at 72 °C for 7 min. PCR products were separated on 3% metaphor agarose gels containing 0.5 μg/ml ethidium bromide and 1X TBE running buffer at 130 V for 4 h, visualized and photographed in gel documentation system (Vilbert Dourmet, France).

Data collection and analysis

The PCR amplicons of each SSR marker obtained on all the samples were scored for allele size (bp). The genotype data were then used for estimating the following parameters using GenAlEx v. 6.5 (Peakall and Smouse 2012): the number of alleles (Na), the effective number of alleles (Ne), observed heterozygosity (Ho), expected heterozygosity (He) and polymorphic information content (PIC).

Population structure and Cluster analysis

A Bayesian clustering analysis implemented in Structure v. 2.3.3 (Pritchard 2000) was used to evaluate population genetic structure. An admixture model and correlated allele frequencies were applied to estimate the ancestry fractions of each cluster attributed to each genotype. For each value of K (range 1–10), five independent runs were performed with a burn-in period of 20,000 followed by 2,00,000 MCMC (Markov Chain Monte Carlo) repetitions. Parameters were set to the default values, and all genotypes were treated as having unknown origins. The delta K method (Evanno et al. 2005) implemented in Structure Harvester (Earl 2012) was used to determine the most probable value of K. UPGMA (Unweighted pair group method with an arithmetic mean), based neighbour-joining tree was constructed using DARwin v. 6.0.13 (Perrier and Jacquemoud-Collet 2006). A principal coordinate analysis (PCoA), based on the standardized covariance of genetic distances was performed using GenAlEx v. 6.5.

Results

SSR identification in the pomegranate genome

The SSR survey of the 17,405 pomegranate genome sequences resulted in the identification of a total of 1,73,633 SSRs, which translates to an average marker density of 527.97 SSRs/Mb (excluding mononucleotide SSRs 25,114). The general information on SSR containing sequences is summarized in Table 2. The total number of contigs with SSRs was 5,524; of which 2,377 contained more than one SSR and 15,483 were present in compound form. A total of 1,58,150 perfect SSRs (excluding compound SSRs 15,483) were identified in the genome. Further, the overall frequency distribution for SSR repeat units di, tri, tetra, penta and hexanucleotides revealed dominance of dinucleotides repeats (31.19%), followed by tetra (20.5%), tri (16.80%) and mono (14.46%) nucleotides (Fig. 1a, b). Interestingly, we found penta- and hexanucleotides in nearly equal proportions, i.e. 8.16% and 8.87%. Based on the hypervariable criteria (> 20 bp, Class I-SSRs), a total of 43,853 (29.53%) motifs were detected in the genome. Dinucleotide repeats (55.84%) were found dominated in this class followed by hexa- (16.87%), tri- (11.8%), tetra- (7.84%) and penta- (7.65%) nucleotides (Fig. 1a).

Table 2 Characterization of microsatellites in the pomegranate genome
Fig. 1
figure 1

Frequency distribution of different SSR repeats types and numbers in pomegranate genome: a, b overall distribution patterns of different SSR repeat in the genome, c, d frequency distribution graphs for dinucleotide and trinucleotide repeats

Frequency distribution graphs for dinucleotide and trinucleotide repeats were examined (Fig. 1c, d). In dinucleotides, maximum frequency of 35,149 (64.90%) was observed for classified motifs AT/AT, followed by AG/CT (28.51%), AC/GT (6.3%) and CG/CG (0.29%). The graph suggested a decline in SSR frequency with rise in the number of repeat units, and the trend was most pronounced for AT/AT followed by AG/CT, AC/GT and CG/CG repeat motifs. Similarly, for trinucleotides maximum frequency was observed for AAT/ATT (34.66%) and AAG/CTT (28.91%), followed by AGG/CCT (10.45%) and ATC/TAG (9%). The graph also outlined a gradual decrease in SSR frequency as the number of repeat units increased.

Development and validation of SSR markers

We designed primers for 2,856 Class I SSRs following survey of the whole genome sequence of pomegranate (Supl Table 2). Three hundred fifty contig sequences (2.01%) were selected randomly for primer designing, with SSRs distributed across the pomegranate genome. The majority of these primers were specific to dinucleotide motifs (primers 2429, 85.05%), followed by trinucleotide repeats (228, 7.98%). Further, we randomly selected a subset of 110 primers, referred to as NRCP_SSRs, for experimental validation on eight genotypes (Supl Table 1). As a result, 24 primers yielded monomorphic profiles while four markers did not show any amplification. Eighty-two primers (77.36%) yielded scorable amplicons with polymorphism (Fig. 2a). The number for alleles in the panel ranged from 1 to 3, with an average of 1.81. Similarly, PIC values ranged from 0 to 0.63, with an average of 0.28 (Supl Table 1). Of the total 82 polymorphic markers, 46 (56.10%) had PIC values between 0.4 and 0.6, whole remaining 36 primers showed PIC values ranging from 0.1 to 0.3.

Fig. 2
figure 2

Gel images showing allelic variations as revealed by SSR markers: a assaying on eight pomegranate genotypes using NRCP_SSR 18, 19, 23 and 24 and b assaying NRCP_SSR 92 and 93 on 46 pomegranate genotypes (where L-50 bp DNA ladder for Lane 18 eight set of pomegranate genotypes as mentioned in material and methods, L-100 bp DNA ladder for Lane 1–46 pomegranate genotypes as listed in Table 1, genotypes in lane 47 and 48 were excluded from analysis)

Assaying informative SSRs on a broad diversified panel

To further demonstrate the utility of newly developed SSR markers, we studied genetic diversity in a broader set of 46 pomegranate genotypes with 13 highly polymorphic SSRs. The SSR analysis generated 30 alleles across 46 genotypes, ranging from 2 to 4 alleles (at SSR 62 and 92), with an average of 2.30 alleles per locus (Table 3). The Ne values ranged from 1.55 alleles at locus SSR 97 to 3.32 at SSR 74, with a mean of 1.92 alleles per locus. The allele sizes varied from 160 bp at locus SSR 91 to 420 bp at locus SSR 74. The maximum allele frequency (0.77) was observed for allele 300 bp at SSR 97. In addition, the mean Ho and He values per locus ranged from 0 at SSR 24 to 0.57 at SSR 74 and from 0.36 at SSR 97 to 0.70 at SSR 74. The average PIC value of 0.46 suggested moderate diversity levels among the genotypes considered here. Four highly polymorphic SSR loci (PIC > 0.5) had PIC values of 0.51 (SSR 24, 92 and 93) and 0.71 (SSR 74). The remaining nine loci exhibited moderate polymorphic trends (0.3 < PIC < 0.5), ranging from 0.36 (SSR 97) to 0.48 (SSR 18) (Table 3). The average value of Shannon information index was 0.70 for the entire population. Representative gel images illustrating the SSR fingerprints of the 46 pomegranate genotypes are shown in Fig. 2a, b.

Table 3 Genetic diversity statistics of 13 SSR as computed from 46 pomegranate genotypes

Population structure and principal coordinate analysis

Population structure analysis of 46 pomegranate genotypes by adopting non-stratified strategy showed clear peak for ΔK at K = 4, where entire genotypes were divided into four major groups, with few admixture among the groups (Fig. 3a). Group 3 contained the highest number of genotypes (16), followed by group 2 (11), group 4 (10) and group 1 (9). Statistical analysis indicated that the percentage of genotypes with a membership coefficient ≥ 90% was 54.34%, total of 23.91% of genotypes exhibited a membership coefficient ≥ 60% and only 10% of the genotypes exhibited a membership coefficient of 5% or less. A principal coordinate analysis (PCoA) also divided 46 genotypes into four clusters (Fig. 3b).

Fig. 3
figure 3

Population structure, dendrogram and PCA plot depicting the genetic relationships among 46 pomegranate genotypes based on 13 SSR markers

Principal coordinates (PCos) 1 and 2 explained 33.31% and 24.29% of the variance in the genotype data, respectively. More than 30% of the accessions were assigned to cluster 3, whose accessions were much more scattered than those in clusters 4, 2 and 1. The dendrogram demonstrated clustering of 46 genotypes into four groups (Fig. 3c).

Discussion

Identification of SSRs in the pomegranate genome

With the rapid developments in next-generation sequencing (NGS) technologies, WGS sequencing of plant genomes is gaining widespread attention. These sequences serve as valuable resources for genome-wide mining and development of genetic markers, such as SSR and SNPs (Bohra and Singh 2015; Simsek et al. 2018). Whole-genome assemblies have enabled the development of genome-wide SSR markers in many plant species, such as cotton (Gossypium sp) (Wang et al. 2015), tea plant (Camellia sinensis) (Liu et al. 2018) and pear (Pyrus spp) (Xue et al. 2018). Although reports on largescale development of EST–SSR markers from transcriptome data using NGS technologies have been reported in pomegranate (Ono et al. 2011; Simsek et al. 2018). However, largescale development of SSR markers from draft genome sequences of pomegranate has not been reported so far in pomegranate. The genomic Class I SSRs have tremendous utility for diversity analysis, varietal fingerprinting and molecular breeding applications in addition to enable efficient management of germplasm resources (Dutta et al. 2013). Therefore, we report development of the first set of genome-wide Class I SSRs in pomegranate and validate them on a set of diverse genotypes to show their immediate utility for genetic analysis. The relevance and importance of Class I SSRs to plant breeding is well described in various crops including rice and pigeonpea (Singh et al. 2010; Narshimulu et al. 2011; Bohra et al. 2017). The availability of such SSR markers spanning entire genome would facilitate genetic studies in pomegranate.

In the present study, we identified SSRs in the pomegranate genome with an average density of 527.97 SSRs/Mb, which is comparable to SSR density reported in other plant species like cucumber (552 SSR/Mb) and Arabidopsis (371 SSR/Mb) (Cavagnaro et al. 2010). Although analyzing the pomegranate WGS data, Ravishankar et al. (2015) have estimated the density of SSRs to be one SSR for every 5.56 Kbp based on the partial pomegranate genome sequence; however, here we found one SSR/1.90 Kbp.

We also analyzed the distribution and frequency of SSR motifs of 2–6 bp in the genome. We observed that the SSR frequency decreased with increase in the number of repeat units, and the change was more conspicuous in case of dinucleotides than the longer repeat motif types as observed in other studies (Liu et al. 2018). Following the definition of Class I SSRs (> 20 bp), we found 43,853 (29.53%) Class I SSRs in the pomegranate genome. Simsek et al. (2018) also reported largescale identification of 1900 SSR motifs in pomegranate genome through RNA seq using NGS technology. In our study, dinucleotide repeats (31.19%) were most abundant followed by tetra- (20.5%), tri- (16.8%) and mono- (14.46%) nucleotide repeats. Among the di-nucleotide repeats, AT/AT motifs were the most abundant (64.90%) followed by AG/CT repeat motifs (28.51%), while motifs AAT/ATT (34.66%) and AAG/CTT (28.91%) were the most frequent among tri-nucleotides. These observations were in close agreement with earlier findings that dinucleotides (73.33%) are the most abundant class of repeats in the pomegranate genome sequences (Ravishankar et al. 2015). Previously, authors have confirmed the abundance of AT/TA, TTA/TAT/ATT types in the pomegranate genome. However, the frequency of a given motif is also known to vary depending on sequence data sets; for instance, genome vs. transcriptome (Bohra et al. 2011; Varshney et al. 2005).

Marker validation and polymorphism survey

We successfully validated a set of randomly selected 110 SSR markers, of which 82 primers (77.36%) could reveal polymorphism among eight pomegranate genotypes with 2 to 3 alleles. The allele sizes (bp) obtained across pomegranate genotypes were similar to the expected sizes of the products for each locus. The PIC values for these markers ranged from 0.12 to 0.63, with a mean value of 0.36. Basaki et al. (2013) studied genetic diversity in 202 pomegranate genotypes belonging to 22 different provinces of Iran using polymorphic SSR as reported by Pirseyedi et al. (2010). The authors observed PIC values for these SSRs in the range of 0.01–0.56 on simple agarose gel detection system. The possible reason for lesser number of alleles and PIC values for SSR markers is limited resolution for agarose and metaphor gels as compared to automated capillary-based systems. Therefore, the SSRs developed here could show a higher level of polymorphism when assayed on polyacrylamide gel and capillary systems. Following the criterion laid by Botstein et al. (1980), 46 primers (56.10%) showed PIC values (> 0.4) in the current study, of which 15 SSRs met the criteria of highly polymorphic nature (PIC of 0.5). The most informative was the locus NRCP_SSR 2 with an average PIC value of 0.63, whereas the lowest average PIC value (0.12) was recorded for three loci-NRCP_SSR38, NRCP_SSR46 and NRCP_SSR 103 (Supl Table 1). Assaying 106 primers on eight genotypes produced 192 alleles with an average of 1.81 alleles per locus, which is comparable to 1.95 alleles per locus for 11 pomegranate genotypes as reported by Soriano et al. (2011).

In order to demonstrate the utility of these new SSR markers, 13 SSRs were selected based on their clear amplification profile with varying polymorphism rates to study the diversity in a broader panel of 46 genotypes. The allelic patterns suggested occurrence of a total of 30 alleles with an average of 2.31 alleles per locus. The PIC values ranged from 0.36 to 0.71 with a mean PIC value of 0.47. Similarly, Jian et al. (2012) reported 2–5 alleles with mean allele of 2.80 and PIC values ranged from 0.091 to 0.656 based on the analysis of 42 pomegranate accessions with 15 SSRs. Similar range of PIC values was obtained with SSR markers in pomegranate earlier by various research groups (Soriano et al. 2011; Zarei and Sahraroo 2018; Basaki et al. 2013). However, Ravishankar et al. (2015) reported a very high polymorphism (97.6%) for SSRs primers having allele range 1–14 with an average PIC value of 0.54 in pomegranate using high-throughput capillary-automated DNA Sequencer. We anticipate that the SSRs developed in this study might be highly informative when assayed on polyacrylamide and capillary-based system. To the best of our knowledge, this study reports for the first-time genome-wide SSR mining and designing of 2,856 primers with the SSR track length of ≥ 24 bp. For instance, a higher level of polymorphism has been revealed by SSRs (track length > 50 bp) on simple gels in different crops (Dutta et al. 2013; Bohra et al. 2015, 2017).

Out of 13 SSRs, four SSR loci were found to be highly polymorphic (PIC > 0.5) on 46 pomegranate genotypes, with PIC values ranging from 0.51 (SSR 24, SSR 92 and SSR 93) to 0.71 (SSR 74). Most of the selected primers confirm their polymorphism efficiency in larger set as per the initial screening results on eight pomegranate genotypes. Shannon information index for thirteen SSR loci ranged from 0.54 to 1.29 with an average value of 0.70, revealing higher genetic diversity among 46 genotypes. Similar observations were reported earlier in pomegranate using SSR markers (Pirseyedi et al. 2010; Raina et al. 2013). Further, relatively higher values of diversity index for these loci indicated the potential usefulness of these new SSR markers to analyze genetic diversity in pomegranate. In this study, we report successful development and validation of genome-wide SSR markers from draft genome sequence of pomegranate variety Dabenzi for genetic applications in pomegranate.

Genetic diversity

Examination of genetic variation in germplasm is key to accelerating genetic improvement of plants. To this end, molecular marker technologies including SSRs have emerged as a promising tool to uncover genetic polymorphism in a given set of genotypes/germplasms. In this context, structure and cluster analyses are effective means for studying genetic relationships related to germplasm resources (Goossens et al. 2002). Structure analysis showed that the grouping was largely consistent with the UPGMA clustering (Fig. 3a, c). The structure analysis revealed that K = 4 was the best value for classification of the 46 pomegranate genotypes, which remains in concordance with other studies. Singh et al. (2015) used 44 publicly available SSRs to study population structure among 88 pomegranate accessions and classified these into four populations. Considering the higher genetic diversity levels, cluster 4 constituted most diverse genotypes like introduced exotic pomegranate lines Tabesta and Alah (Iran), Shrin Anar, GR Pink and Gulesha Red (Russia), Spendander (Exotic cultivars), selections P-26, P-13, P-23 and P-16 of exotic line Muscat made at MPKV Rahuri, Maharashtra, India and few wild accessions IC1205, IC1182 and IC 1198 (Uttaranchal, India), which contributed for making this cluster most diverse. In this group, we find large fruits with either hard or soft-seeded arils.

Cluster 2 exclusively contained wild indigenous collection of Daru types of India. ‘Daru’ pomegranates are mainly grown in the forests on Himalayan slopes as wild pomegranates (Jalikop et al. 2005), and characterized by thorny bushes having small fruits with a sour and hard-seeded arils (Jalikop 2007). Cluster 3 covered highly domesticated pomegranate commercial varieties and cultivars, which are grown in India. This cluster included few introduced exotic breeding lines, such as Kandhari and Kabuli Yellow (Afghanistan) and commercial varieties, such as Nimali and Kalpitiya (Srilanka). The genotypes of this cluster are mainly characterized by large fruits with sweet and soft-seeded arils. Similarly, Singh et al. (2015) also found distinct grouping of cultivated pomegranate types, including commercial varieties, local types and introduced varieties, from wild accessions by screening 44 SSRs markers on 88 accessions. Cluster 1 constituted few cultivars of Tamil Nadu (Yercaud local, Yercaud HRS and Co-white) and few introduced exotic lines, i.e. Bedana Sedana and Kabuli Canoor (Afghanistan) and Spin Saccharin (Exotic Cultivar). The fruits are characterized by either sweet to acidic with hard-seeded arils.

PCoA plot explained 57.6% of the total variation, which is in close agreement with the results as observed for PCA explaining 60% variation among 88 pomegranate genotypes using 44 SSR markers (Singh et al. 2015). In PCoA plot, Axes 1 explained higher proportion of the variance (33.31%) compared with Axes 2 (24.29%). Probability it may be due to Axes 1 that separated CL 2 and CL4, which are representing highly diverse wild indigenous and introduced exotic lines. However, Axes 2 clearly separated CL1 from CL3, which represented commercial pomegranate lines of India and few introduced exotic lines with limited diversity.

The results of cluster analysis revealed grouping patterns of both geographical distributions and pedigree relationships. For instance, all Daru type wild accessions grouped together belong to Himachal regions of North India. The genotypes Yercaud HRS and Yercaud Local and Co-White grouped together are found in Tamil Nadu region of South India. Similarly, Jyoti and KRS belong to Karnataka regions. Proximity of Nimali and Kalpitiya may be due to their cultivation in Srilanka. Similarly, Tabesta and Alah grouped closer are from Iran. Remaining exotic lines dispersed across the clusters. Similarly, Jian et al. (2012) suggested geographical basis of clustering for 42 pomegranate accessions based on the expressed sequenced tag (EST)-SSR markers. With respect to pedigree, G-137, Arakta and Ganesh resided in the same cluster. Similarly, the genotypes P-26, P-13, P-23 and P-16, selected from Muscat, grouped together within the same cluster. The genotypes Yercaud HRS and Yercaud Local are also grouped together. Raina et al. (2013) observed grouping of some pomegranate genotypes on the basis of their parentage and pedigree.

Conclusion

The present work examines the SSR markers (Class I and II) in the draft genome sequence of pomegranate plant cv. Dabenzi providing a snapshot of the differential coverage and density of different SSR repeats in this species. The dinucleotides were most pronounced repeat types, accounting for up to 64.90% (AT/AT) of all the identified SSRs. The AAT/ATT (34.66%) and AAG/CTT (28.91%) were the abundant classes among trinucleotide repeats. A total of 43,853 motifs belonging to Class I SSRs (> 20 bp) were identified, and we designed primer pairs for 2856 SSRs targeting motifs ≥ 24 bp. Forty-six of the 82 primers (56.10%) showed polymorphic patterns, and a subset of 13 informative SSRs revealed genetic relationships among 46 pomegranate genotypes. Thus, the genome-wide development of such SSR markers could be very useful for various research areas in pomegranate, such as identification of the economically important pomegranate cultivars, study genetic diversity and evolutionary origin analysis, molecular fingerprinting, genetic linkage map construction, QTL/gene mapping and marker-assisted selection for breeding.

Author contribution statement

PGP, NVS, SP and JS designed the experiments. PGP, AB and RS performed the experiments and analyzed the data. NG, DM VS, SJ, AI, DB and NVS contributed to the collection of test materials. PGP and AB wrote the original draft.