Introduction

Wheat is one of the most important strategic crops in the world, with a cultivated area of 219 million hectares and a production of 808 million tons (FAOSTAT 2022). Providing 25% of the total proteins and calories in the human diet and food supplies worldwide relies on wheat (Triticum aestivum L.) (Alipour et al. 2021). The wheat crop is subjected to various biotic and abiotic stresses. Most likely, drought is the most significant abiotic stress affecting wheat yield. Hence, improving drought-tolerant genotypes is a promising strategy for dealing with the world’s rapidly diminishing water resources and growing population (Rabieyan et al. 2023). Superior drought-tolerant genotypes could be used as donors in a breeding program (Reddy et al. 2023). The yield and agronomic traits in drought resistance breeding programs are multifaceted characteristics governed by multiple genes (Gupta et al. 2024). Therefore, identifying markers related to these traits can accelerate the development of new wheat cultivars. Morphological markers are valuable indicators of genetic diversity, but they are not the most reliable, as they are highly affected by the environment (Yang et al. 2020). In contrast, DNA markers can pinpoint multiple genes that regulate complex traits without environmental influence (Dagnaw et al. 2023). On the other hand, in drought tolerance, molecular markers can help examine the presence or absence of novel genomic regions associated with drought tolerance and assist in marker-assisted selection (MAS) by allowing focus on the most promising candidates (Ghazy et al. 2021). Simple sequence repeats (SSRs) are high-performance markers primarily used to analyze genetic diversity and population structure and elucidate phylogenetic relationships among plant genetic resources, as such relationships play a crucial role in developing appropriate breeding programs (Belete et al. 2021). SSR markers have many advantages, such as co-dominance, high levels of polymorphism, chromosome specificity, and high reproducibility; they are also excellent for identifying and monitoring target traits within varieties (Alsaleh 2022). Drought tolerance is a multifaceted characteristic governed by multiple genes. Therefore, the breeding challenge for this particular trait is considerable. Identifying of quantitative trait loci (QTLs) associated with drought tolerance marker-assisted is of considerable importance in crop breeding and is a valuable approach to increasing wheat yield and development of drought-tolerant wheat cultivars (Zhao et al. 2023; Gupta et al. 2024). Using novel molecular techniques such as molecular markers and quantitative trait loci (QTL) mapping methods are needed to produce drought tolerance genotypes (Gupta et al. 2024). One of these procedures in plant breeding is association analysis, which has high resolution and accuracy and is used to identify genomic regions for studying agronomic traits (Mehrabi et al. 2020; Pang et al. 2020; Rabieyan et al. 2023), and drought stress tolerance (Reddy et al. 2023; Zhao et al. 2023) has been used in wheat. A review of quantitative trait loci mapping using regression as a robust, efficient, and effective method concluded that regression has always been an essential tool for quantitative geneticists (Knott 2005). Multiple regression analysis (MRA) based on the relationship between molecular markers (as an independent variable) and morphological traits (as a dependent variable) is a suitable method for identifying trait-dependent markers. This analysis determines the coefficient of explanation (R2), which shows the degree of relationship between the morphological trait and the molecular marker (Gomez and Gomez 1984). As the value of R2 increases, the probability of a significant linkage of a trait with the marker also increases. Therefore, it can be hoped that by finding gene loci with high R2, it will be possible to find agronomic traits related to that marker (Amombo et al. 2018). In a study using association analysis based on multivariate regression, 55 wheat lines, 55 SSR markers, 38 SAMPL markers, and 54 AFLP markers were identified that were associated with at least one of the 14 agronomic traits under study such as biological yield and thousand seed weight. They were introduced as positive markers for marker-assisted selection programs of these traits (Roy et al. 2006). In research, association analysis discussed superior alleles for yield-related traits and their distribution in important wheat cultivars. A significant relationship between the number of desirable alleles and yield indicates that multi-QTL pyramidal production with marker-assisted selection can effectively increase wheat yield. This study is helpful for marker-assisted selection for yield improvement and elucidating the genetic mechanism of important cultivars in wheat (Li et al. 2020). In research, molecular, physiological, and biochemical variability in bread wheat and its wild relative species were investigated under water-deficit stress conditions. The association analysis results showed that 28 and 27 significant marker-trait associations (MTAs) were identified under control and water-deficit stress conditions, respectively. Furthermore, ten MTAs showed sufficiently stable expression across both growth conditions. Of these, five SSR markers were associated with several traits (Khodadadi et al. 2023). The research revealed 44 significant marker-trait associations in durum wheat and 26 of the MTAs were novel. In contrast, the remaining 18 were previously reported and confirmed in this study (Mulugeta et al. 2023). In the study to evaluate the genetic diversity of durum wheat germplasm as revealed by morphological and SSR markers, the results showed significant associations between SSR primers with one or more target phenotypic traits, including spikelets/spike, plant height, spike density, thousand-kernel weight, and grain yield (Dagnaw et al. 2023). Abiotic stresses, especially drought, are associated with changes in various plant characteristics, including agronomic traits, and cause yield loss. Identifying and applying primers related to agronomic characteristics in different moisture conditions can be a useful guide in breeding programs to select superior genotypes. Therefore, this research aims the following: (1) identify genotypes with high yield and tolerance to drought; (2) evaluate the genetic diversity in terms of agronomic traits and microsatellite markers under rainfed and irrigated conditions; (3) investigate the relationship between agronomic traits and SSR molecular markers, and also to identify informative markers related to each trait in rainfed and irrigated conditions.

Materials and Methods

Field Experiment

Twenty-five bread wheat genotypes were used, including two cultivars, Pishtaz and Pishgam (check), and 23 accessions of winter bread wheat (Table 1). The genotypes were obtained from Karaj Seedling and Seed Breeding Research Institute. Field experiments in randomized complete block design with three replications were carried out during two consecutive cropping seasons (2018–2019 and 2019–2020) under rainfed and irrigated conditions in the cold Mediterranean climate (34°21′N latitude, 47 9′E Longitude, and 1319 m altitude) in Iran. This climate is classified as semi-arid with mean annual rainfall of 430–460 mm. Each plot consisted of five rows that were 2 m in length. The row distance was 23 cm with a density of 400 seeds per m2. In rainfed and irrigated conditions, initial irrigation was done for both conditions, due to the lack of rain during cultivation. The first irrigation after planting was considered the planting date (14/11/2018 in the first year and 6/11/2019 in the second year), but no irrigation was done for rainfed conditions during the entire growth period. While for irrigated conditions, in the first year, on May 15th, at the 50% spike stage, the second stage of irrigation was done in late May, after the full spike, and the third stage was done on June 14th, during the milking stage of the seeds; in the second year on May 7th, at the 50% spike stage, the second stage of irrigation was done in late May after the full spike, and the third stage was done on June 6th, during the milking stage of the seeds. Five random samples were selected from each plot, respecting the marginal effect. Harvesting was done for the first year in early July 2018 and the second year in early July 2020. Twenty-four agronomic traits studied are presented as follows: grain yield (GY), the number of spike per m2 (NSP), the number of grains per spike (NGPS), 1000-grain weight (TGW), biological yield (BY), plant height (PH), spike weight (SW), spike length (SL), peduncle length (PL), xteragen length (XL), mitt penalty length (PML), peduncle length/plant height (PL/PH), hectoliter weight (HW), spike dry weight (SDW), spike grain weight (SGW), and stem weight (StW), the number of fertile spikelets (NPS), number of infertile spikelets (NNPS), number of spikelets per spike (NSS), harvest index (HI), spike harvest index (SHI), straw yield, other inter nodes length (OIL), and spike density (SD). In spike harvest index (SHI), grain yield is divided by spike yield. Straw Yield (SY) = Biological yield - spike yield. Other Inter nodes Length (OIL) = Plant height - total length of the peduncle, mitt penalty, and spike. Spike Density (SD) was calculated based on five spikes randomly selected for each plot and repetition accession, and calculated by this formula: [10 × (spike length/number of grains per spike)]. Then, the length of each spike and the number of grains per spike were calculated for each accession in each plot, according to the formula (10 × (spike length/number of grains per spike)); the density of each spike was obtained for 10 cm of spike length. The diversity in a population can be evaluated by various methods such as simple measurement of diversity, analysis of variance components, and molecular markers used in this research. To perform statistical analysis, first, the Kolmogorov–Smirnov test was used to test the normality and check the skewness of the data. Variance analysis for traits in rainfed and irrigated conditions from the evaluation of 23 genotypes, and two cultivars to determine the main effects of block and genotype, in such a way that the effect of fixed genotype and random block was considered. Statistical analysis was done using SAS 9,1,3 software. The statistical model of the randomized complete block design is as follows:

Table 1 Cultivars and studied genotypes of bread wheat
$${X}_{ij}=\mu +{\delta }_{i}+{\tau }_{j}+{\varepsilon }_{ij}$$

The value of each observation (\({X}_{ij}\)) is obtained from the sum of block effects (\({\delta }_{i}\)), treatment (\({\tau }_{j}\)), experimental error (\({\varepsilon }_{ij}\)), and the total population mean (\(\mu\)).

To estimate the broad-sense heritability (h2b.s), Kearsey and Pooni’s (1996) and SAS software GLM MANOVA analysis were used. The relationships used to estimate the genetic parameters are as follows:

$${\delta }_{e}^{2}=MSe$$
(1)
$${\delta }_{p}^{2}={\delta }_{g}^{2}+\frac{{\delta }_{e}^{2}}{r}$$
(2)
$${\delta }_{g}^{2}=\frac{MSg-MSe}{r}$$
(3)
$${h}_{b.s}^{2}=\frac{{\delta }_{g}^{2}}{{\delta }_{p}^{2}}$$
(4)
$$PCV=\frac{\sqrt{Vp}}{\overline{X} }.100$$
(5)
$$GCV=\frac{\sqrt{Vg}}{\overline{X} }.100$$
(6)
$$ECV=\frac{\sqrt{Ve}}{\overline{X} }.100$$
(7)
$$GG=\frac{\left(\left(2.06.\frac{{\delta }_{G}^{2}}{\sqrt{{\delta }_{P}^{2}}}\right).100\right)}{\overline{X} }$$
(8)

In the above relationships, Vg and \({\delta }_{g}^{2}\) are the genotypic variance, Ve and \({\delta }_{e}^{2}\) are the environmental variance, Vp and \({\delta }_{p}^{2}\) are the phenotypic variance, MSe is the mean square of the test error, and MSg is the mean square of the genotypes. Also, genetic gain (GG), phenotypic coefficient of variation (PCV), genotypic coefficient of variation (GCV), and environmental coefficient of variation (ECV) were calculated as the ratio of phenotypic and genotypic standard deviation to the mean of traits (Farshadfar 2010). In this research, genetic parameters were estimated for traits that had significant genetic diversity (according to variance analysis) in the studied genotypes.

Molecular Experiment

The 20 pairs of SSR markers were used for molecular evaluation of the studied genotypes. DNA was extracted from 2 to 3-week-old seedlings by the modified CTAB method (Doyle and Doyle 1987). DNA extraction was done as follows: After transferring 50 mg of the crushed sample with liquid nitrogen to the tube, 800 µl of extraction buffer (4 g CTAB, 16.36 g NaCl, 3.15 g Tris–HCl, 1.48 g EDTA, 400 µl β-mercaptoethanol at pH = 8 to a volume of 100 ml) was added to the tubes and for half an hour. The samples were placed in a water bath at 65 °C. Then, 800 µl of chloroform-isoamyl alcohol solution (24:1) was added to each sample and stirred well for 60 min. After that, the samples were centrifuged for 15 min at 13,000 rpm, and the supernatant phase was transferred to a clean tube. Five hundred microliters of cold isopropanol solution was added to each tube and placed in a freezer at − 20 °C for 2 h. Then, the tubes were centrifuged for 15 min at 13,000 rpm, and after that, the supernatant was slowly emptied, and 500 µl of cold 80% ethanol was added to each tube, and then, a short centrifuge was performed, and the liquid phase was slowly emptied (this step was twice repeated). At the end, the tubes were placed at room temperature to dry, and then, 100 µl of sterile double-distilled water was added to each tube.

To determine the quality and quantity of extracted genomic DNA, electrophoresis on 0.8% agarose gel was used. Five microliters of extracted DNA from each sample was mixed with 2 µl of sampling buffer on the adhesive or clean surface and then loaded. Electrophoresis was performed with a voltage of 90 to 120 V until the blue color reached the end of the gel. Then, after washing the gel for a few seconds with distilled water for 15 to 30 min, the gel was placed inside the ethidium bromide solution for staining. At the end, the gel was put in the gel document device, and the condition of the DNA was examined.

PCR (polymerase chain reaction) is carried out in three steps by temperature. For PCR, DNA samples diluted at a concentration of 10 ng µl−1 were used and amplified using 20 pairs of primers (primers were designed from the NCBI site, and the names and sequences of the primers used are given in Table 2). The polymerase chain reaction was performed using a Bio-Rad thermocycler in 20 µl volumes. The volume included reaction buffer 1x, 50 ng DNA, 2 mM MgCl2, 0.05 mM dNTP, 0.2 µM primer (forward and reverse microsatellite primers), and 1 U enzyme of Taq DNA polymerase. The PCR program was 95 °C for 5 min as the primary denaturation, 35 normal PCR cycles, denaturation (95 °C for 30 s), annealing (primer-specific Tm (°C) for 30 s, in this research, ranged from 46 to 60 °C, which was different for each primer), and extension (72 °C for 60 s). The last extension time was 72 °C for 5 min.

Table 2 SSR primers used to assess the genetic diversity of bread wheat genotypes

After completing the cycles and taking the samples out of the device, they should be kept at 4 °C until electrophoresis. The PCR product was electrophoresed on a 3% agarose gel in 1 × TBE buffer. Ten microliters of safe stain was used for staining. At first, 5 µl of sampling buffer was injected into the amplified DNAs. Then, 15 µl of each sample was loaded into the wells created in 3% agarose gel with a voltage of 90–120 V and run for 2 h. At the end, the gel document device quantum model ST4 was used to highlight the bands. Nevertheless, as mentioned, because all the samples were not placed on the same gel, DNA size marker 100 to 1500 Bb was used in this experiment, which produced 11 bands on 3% agarose gel. After recording the information, the obtained gel was scored for each primer. To perform statistical analysis, the information obtained from the gels and the pattern of band separation during electrophoresis, the resulting data were prepared in a matrix format (zero absence of a band and one presence of a band). For each marker, the number of alleles in the studied genotypes were named as a, b, c, and d.

Statistical Analysis of Genomic Information

The distance between genotypes, cluster analysis by the UPGMA method, and dendrogram drawing was done by NTSYS (ver 2.02). In the UPGMA (Unweighted Paired Group Method using Arithmetic Average) method, the initial cluster is formed based on the similarity between two individuals. Then, the similarity (or distance) of each individual with the other members in a cluster is considered the average similarity of that member among the individuals in the cluster. In this method, two groups are merged when the average distance between them is small enough (Farshadfar 2010).

The molecular indices were calculated as follows:

  • Polymorphic percentage: To calculate this index, the number of polymorph bands was divided by the total number of bands (Mohammadi and Prasanna 2003)

  • Polymorphic information content index (PIC): from the relationship of Anderson et al. (1993)

  • Marker index (MI) and effective multiplex ratio index (EMR): from the relationship of Kumar et al. (2009)

  • Resolving power (RP): It was obtained from the relationship of Altintus et al. (2008)

Finally, the association between SSR markers and traits measured in field conditions was investigated using stepwise multiple regression by SPSS23 software, so that each quantitative trait was considered a dependent variable and SSR markers as independent variables.

Results

The results of the analysis of variance (Table 3) showed significant differences between genotypes for most of the traits in both irrigated and rainfed conditions.

Table 3 Mean squares (MS) and coefficient of variation (CV) derived from analysis of variance of different traits in bread wheat genotypes under rainfed and irrigated conditions

Estimation of Genetic Parameters of Studied Traits in 25 Bread Wheat Genotypes Under Rainfed and Irrigated Conditions

The estimation of genetic parameters for agronomic traits in rainfed conditions (Table 4) showed that the highest broad-sense heritability in the first year was respectively related to the TGW (0.766) with low PCV (12.29), PH (0.586), PML (0.575), and PL (0.571) with medium to low PCV and (13.64), (15.66), and (16.5), respectively, NNPS (0.567) with high PCV (38.98) and SL (0.559) with low PCV (12.84). In addition, the contribution of GCV in these traits was higher than that of ECV. In the second year, the estimation of genetic parameters was related to HW (0.765) with high PCV (30.87), SHI (0.765), OIL (0.657), and SGW (0.545) with average PCV and (21.82), (20.51), and (18.05), respectively. In addition, the contribution of GCV in these traits was higher than that of ECV. The TGW (0.392) with low PCV (11.28) and the contribution of ECV in this trait were more than GCV.

Table 4 Estimation of genetic parameters of agronomic traits in 25 bread wheat genotypes under rainfed conditions in two cropping seasons

Based on the more significant genetic gain (Table 4) in the first year, respectively, NNPS (45.54), XL (42.66) with high PCV (39.78), SD (22.39) with average PCV (19.58). In addition, the contribution of GCV in these traits was more than that of ECV. The StW (20.31) and OIL (19.96) have medium to high PCV (23.01) and (26.96), respectively. In addition, the contribution of ECV in these traits was higher than that of GCV. The PL was (19.42), and TGW and PML were 19.39 and 18.56, respectively. In the second year; parameters including HW (48.64), SHI (34.4), OIL (27.76), and SGW (20.27). The StW (18.54) has a medium to high PCV (23.54). The contribution of the ECV in the StW trait was higher than that of GCV. Also, in the first year, for the traits of PH and SD, and in the second year, for the traits of HW and SHI, the contribution of Vg was high. Therefore; these traits are likely to respond to the selection.

The estimation of genetic parameters for agronomic traits in irrigated conditions (Table 5) showed that the highest broad-sense heritability in the first year was respectively related to the TGW (0.872) with low PCV (14.15), OIL (0.638) with medium to high PCV (23.37), NNPS (0.638) with high PCV (41.05), SL (0.623) with low PCV (13.60), GY (0.604) with medium to high PCV (26.68), PML (0.542) with low PCV (13.62), and SD (0.514) with medium PCV (18.64). In addition, the contribution of GCV in these traits was more than that of ECV. The HI (0.5) and SGW (0.49) have medium to high PCV (21.63) and (23.4), respectively. In these traits; the contribution of GCV and ECV were almost equal. In the second year; parameters including OIL (h =0.866) with high PCV (27.03), SHI (0.797) with medium PCV (21.54), HW (0.796) with high PCV (27.4), SGW (0.708) with low PCV (15.08), and HI (0.65) with medium to high PCV (23.33). In addition, the contribution of GCV in these traits was more than that of ECV.

Table 5 Estimation of genetic parameters of agronomic traits in 25 bread wheat genotypes under irrigated conditions in two cropping seasons

Genetic gain (Table 5) for the following traits in the first year consisted of NNPS (53.98), GY (33.2), OIL (30.73), TGW (25.4), SGW (23.51), and HI (22.11). The XL (21.14) has a high PCV (28.66). In addition, the contribution of the ECV in XL trait was more than that of GCV. In the second year, genetic gain for the following traits consisted of OIL (48.23), HW (44.91), SHI (35.35), HI (31.22), GY (22.99), and SGW (21.98). So, in the first year, the Vg for the traits of GY, TGW, and SD, and in the second year for the traits of HW and SHI, was also high. Therefore, it is possible to choose the optimal breeding method for these traits.

Evaluation of the Genetic Diversity of the Wheat Genotypes Using SSR Markers

In this research, 16 out of the 20 primers showed suitable polymorphism and were selected to estimate genetic diversity. Table 6 shows the results of the primer indices. The total number of bands was 35 bands, of which 33 bands were polymorphic, and the percentage of total polymorphism was estimated at 93.75. Most primers had 100% polymorphism, and the lowest polymorphism (50%) belonged to primers XGWM334 and XGWM642. Therefore, all the primers of this research were found to be useful for genetic diversity study in wheat. The average number of bands and average polymorphism were 2.188 and 2.063, respectively. The highest number of alleles was related to primer XGWM136 with five, and the lowest number of alleles was related to primers XGWM155, XGWM410, and XGWM234 with two alleles. The highest index of polymorphism information content was related to primers XGWM155, XGWM234, XCFD168, XGWM577, XGWM642, and XCFD5, so these primers have more power in creating polymorphism among the investigated genotypes, and according to this issue, these primers can be introduced in the program of genetic diversity among bread wheat genotypes. Among the investigated molecular indices, the highest marker index belonged to primers XGWM136, XCFD168, and XGWM350, and the lowest belonged to primers XGWM334 and XGWM642. The highest index of effective multiplex ratio (EMR) was related to primers XGWM136, XGWM350, XCFD168, and XGWM165, and the lowest belonged to primers XGWM334 and XGWM642. The highest resolving power index (RP) was related to primers XGWM4, XCFD168, and XGWM350, and the lowest belonged to XGWM334. In total, XCFD168, XGWM350, and XGWM136 markers had 100% polymorphism. The banding pattern of SSR markers using the XGWM350 primer in bread wheat genotypes is shown in in Fig. 1.

Table 6 Diversity indicators of microsatellite (SSR) markers in the studied wheat genotypes in this experiment
Fig. 1
figure 1

The banding pattern of SSR markers using XGWM350 primer in bread wheat genotypes

Classification of the Bread Wheat Genotypes Based on SSR Markers

Cluster analysis using the UPGMA method based on the Jaccard similarity coefficient and high cophenetic correlation (74%) classified wheat genotypes in four different groups (Fig. 2). The first group includes genotypes number 1, 3, 25, 11, 17, 16, 22, 4, 12, 10, 15, 8, 9, 23, 24, 5, 6, and Pishtaz cultivar. The second group includes genotypes number 2 and 21; the third group consists of genotypes number 7, 13, and Pishgam cultivar, and the fourth group includes genotypes number 14 and 18. The results of the principal coordinates analysis based on the SSR marker also showed a similar pattern with the cluster analysis and the two-dimensional distribution diagram. The genotypes were placed in four groups. Finally, genotypes 1, 3, and 25 have the most significant genetic distance with genotypes 13, 7, and the Pishgam cultivar, and these two groups can be introduced for breeding programs to obtain heterosis achievements.

Fig. 2
figure 2

Grouping of bread wheat genotypes based on Jaccard similarity coefficient and UPGMA method

Evaluating the Association of Studied Traits with Molecular Markers

In this research, the association of studied traits in 25 bread wheat genotypes with SSR markers using stepwise multiple regression analysis to identify genomic regions involved in yield control and drought tolerance in bread wheat by considering marker locations as independent variables and agronomic traits as dependent variables was investigated using SPSS software in Tables 7 and 8. It should be noted that the association with SSR markers was estimated only for significant traits in the analysis of the variance.

Table 7 Markers with significant association with agronomic traits of bread wheat genotypes under rainfed conditions
Table 8 Markers with significant association with agronomic traits of bread wheat genotypes under irrigated conditions

Agronomic Traits Under Rainfed Conditions

The results of stepwise regression analysis to identify informative markers related to agronomic traits in rainfed conditions are presented in Table 7. It was found that the XGWM124(a2) and XGWM410(a1) markers in the first year and XGWM577(a2) marker in the second year have a significant association with the grain yield trait in rainfed conditions, which explained 34% of the total variations in the first year and 17% in the second year. Traits associated with yield, i.e., 1000-grain weight in the first year, have a significant association with XGWM577(a2), XGWM350(a3), and XGWM165(a3) markers, and in the second year with XGWM350(a2), XGWM2(a2), and XGWM642(a1) markers, which could justify 64 and 57% of the total variations, respectively. The number of grains per spike has a significant association with XGWM350(a3), XGWM165(a2), and XGWM136(a1) markers in the first year, which could justify 51% of the total variation. The most significant and positive effect was associated with the locus of XGWM136(a1), the number of grains per spike, and relatively the 1000-grain weight. The results of the rest traits are observed in Table 7. Eventually, it can be expressed that 64 gene loci for agronomic traits in rainfed conditions were identified. It can be concluded that 26 gene loci covered all measured traits. In the first year, the markers XGWM577(a2,a1), XGWM124(a2), XGWM165(a1), XGWM334(a1), XGWM136(a3), and XGWM155(a1) were the most repeated, and in the second year, the markers XGWM577(a2,a1), XGWM70(a2), XGWM136(a3), XGWM334(a1), XGWM642(a1), and XCFD5(a2) were the most repeated, and jointly, in both years, XGWM577(a2,a1), XGWM334(a1), and XGWM136(a3) markers were the most repeated. So, these markers can be used more in breeding programs.

Agronomic Traits Under Irrigated Conditions

The results of stepwise regression analysis to identify informative markers related to agronomic traits in irrigated conditions are presented in Table 8. It was concluded that the grain yield in the first year has a significant correlation with seven amplified loci, which include XGWM577(a2,a1), XGWM136(a3,a4), XGWM265(a1), XGWM410(a1), and XGWM2(a2) markers and in the second year with three amplified loci, which include XGWM124(a1,a2) and XGWM265(a1) markers. They justified 84 and 52% of the total variations, respectively. The most significant and positive effects in the first year were associated with the three loci XGWM136(a3), XGWM577(a2), and XGWM410(a1), and in the second year, they were associated with the locus XGWM265(a1). For associated traits with yield, i.e., 1000-grain weight in the first year, five markers, including XCFD5(a2), XGWM577(a2), XGWM155(a1), XGWM350(a3), and XGWM165(a2) markers, and, in the second year, the XGWM2(a2) marker are informative and explained 82% and 24% of the total variability, respectively. In the case of this trait, all the mentioned markers in the first and second years had a significant and positive effect. The number of grains per spike trait in the first year was significantly associated with two markers XGWM350(a3) and XCFD5(a1) and could justify 44% of the total variation. The number of spikes per m2 in the second year includes markers XGWM234(a1) and XGWM136(a4), which justified 38% of the total variation. The results of the rest traits are observed in Table 8. Eventually, it can be expressed that 51 gene loci for agronomic traits in irrigated conditions were identified. It can be noted that 23 gene loci covered all measured traits. The markers XGWM124(a2), XGWM410(a1), XGWM350(a3), XGWM577(a2), and XCFD5(a2) in the first year, the markers XGWM136(a2), XGWM70(a2), and XGWM124(a2) in the second year, and jointly, in both years, the XGWM124(a2) marker were the most repeated. So, these markers can be used more in breeding programs.

Discussion

High significant differences for most of the traits indicated the genetic diversity among genotypes, and it is possible to select superior genotypes based on agronomic traits in rainfed and irrigated conditions. High significant variation (p < 0.01) among the spring wheat for all morphological traits measured under drought stress was reported (Ahmed et al. 2022). In the research on genome-wide association mapping of genomic regions associated with drought stress tolerance at seedling and reproductive stages in bread wheat, analysis of variance (ANOVA) indicated highly significant differences among the wheat genotypes for all the studied seedling traits, including shoot weight, plant height, thousand-kernel weight, and grain yield under drought stress (Reddy et al. 2023), which was consistent with the results of this research.

Plant breeding programs mainly focus on genetic diversity, inheritance, conservation, and evolution (Peterson et al. 2014). The diversity in a population can be evaluated by various methods such as simple measurement of diversity, analysis of variance components, and molecular markers applied in this research. In this study, the estimation of genetic parameters showed that the phenotypic variation coefficient values were more significant than the genotypic variation coefficient in both conditions, which shows the existence of environmental effect on the expression of traits, according to researchers like Kumar et al. (2023b) and Amare (2023). Also, it is likely evidence of genotype-environment interactions (Kaur et al. 2023). Therefore, selecting these genotypes based on phenotype may help increase grain yield (Kobir et al. 2023). Also, in the case of most studied traits, the genotypic variation was more than the environmental variation. Therefore, if a significant part of the phenotypic diversity is genetic, it indicates that the effect of the environment on the emergence of this trait is small; if the contribution of the genetic variation coefficient is small, it means that the resulting diversity is not only due to genotypes but also due to the effects of the environment. A low broad-sense heritability value indicates more phenotypic variation due to the effect of the environment on the traits, and breeding through selection is problematic because it covers genotypic effects. A high amount of genetic advance or genetic gain indicates that the trait is controlled by additive genes, and selection helps improve such a trait. The low genetic advance or genetic gain indicates that the trait is controlled by non-additive genes and heterosis breeding will be helpful (Farshadfar 2010). In this research, concerning the issue of high heritability along with high genetic gain in traits indicating the additive effects of genes and the selection of early generations may be more favorable for these traits, with the opinions of researchers such as Kumar et al. (2023a) and Kumar et al. (2023b). Therefore, heritability and genetic gain are essential parameters for the selection of breeding methods. In the present research, by comparing the results obtained in each environmental condition, it was observed that stress has led to an increase in heritability and advanced or genetic gain in some traits investigated in this research. Traits with h2b (> 60.0%) and GG (> 20.0%) together indicate that the variation is mainly due to genetic factors, making them reliable candidates for a selection process (Sallam et al. 2024; Kobir et al. 2023), which is similar to the results of this study.

Heritability is a powerful indicator that explains the transfer of traits from parents to their progeny. Heritability estimation helps the plant breeder to select elite genotypes from a genetically diverse and segregating population (Farshadfar 2010). The genetic advance is the mean genotypic value over the parental population, a measure of genetic gain under selection (Kumar et al. 2023b). Therefore, in this research, the estimation of the broad-sense heritability of traits showed that in both years, in rainfed conditions, the TGW trait; in irrigated conditions, the GY, TGW, OIL, SGW, and HI traits; and so on. Common in both environmental conditions, the TGW trait (according to the opinions of Mulugeta et al. (2023) in durum wheat and Krishnappa et al. (2023) in bread wheat) had high broad-sense heritability. Also, based on the genetic gain in both years, in rainfed conditions, OIL and StW traits, in irrigated conditions, in addition to GY, traits of OIL, SGW, and HI, and jointly in both environmental conditions, the OIL trait had the highest genetic gain, which was more suitable for the selection of traits. The results of genetic parameters, in terms of GY, TGW, and HI traits, were consistent with the opinion of Choudhry and Khaliq (1992) concerning bread wheat. In a study, evaluating the genetic diversity of Ethiopian durum wheat landraces under water-stressed and nonstressed conditions, intermediate to high estimates of the phenotypic coefficient of variation (PCV), genotypic coefficient of variation (GCV), heritability in a broad sense (h2b), and genetic advance in percent of the mean (GAPM) were observed for all the studied traits except for 1000 seed weight at stress. The estimation of variability parameters showed that genotypic variation was higher than environmental variation for most traits. Also, the intermediate to high estimate of h2b and relatively high estimate of GAPM were observed in spike length and grain yield at normal conditions and in spike length at stressed conditions (Dukamo et al. 2023), which was consistent with the results of this study. Due to the high heritability and genetic gain of the measured traits, high genetic efficiency will be expected. Selection based on these characteristics leads to the release of new cultivars.

In evaluating the genetic diversity of bread wheat genotypes studied using SSR markers (Table 4), it can be concluded that XCFD168, XGWM350, and XGWM136 markers with 100% polymorphism, the highest number of alleles, high amount of polymorphism information content indices, marker index, effective polymorphism ratio index and resolving power index, and considering the high reproduction of bands and the production of high polymorphism bands to the most appropriate primers for wheat were introduced in subsequent studies. They have the necessary efficiency to investigate the relationships between genotypes and genetic structures. The XCFD168 marker was introduced as the superior marker in the study of Ramya et al. (2010), which was also consistent with the results of this study. Also, the large number of amplified sites in this experiment shows that a small number of SSR primers with high polymorphism information can distinguish many samples and different sample populations. Already, various researchers have reported the high efficiency of SSR markers in studies related to determining genetic diversity and evolutionary relationships in different plant species (Dagnaw et al. 2023; Jabari et al. 2023; Khodadadi et al. 2023). Markers with high polymorphic information content values can reveal variation between alleles and effectively be used for molecular mapping and analysis of genetic variation in populations (Kalivas et al. 2011). In research on morphological characteristics using 12 spike-related traits and analysis of genetic diversity of durum wheat (Triticum turgidum var. durum), high genetic diversity using 10 SSR markers, with polymorphic information content (PIC), was obtained as 0.69 (Ouaja et al. 2020). An evaluation of the genetic diversity of 17 bread wheat hexaploid genotypes using 16 microsatellite markers (SSR) concluded that only 11 markers had the most polymorphism and reproducibility (Kara et al. 2020). In a study to investigate the genetic diversity of Iranian wild relatives of bread wheat using ISSR and SSR markers, the polymorphism information content (PIC), marker index (MI), and resolving power (Rp) in SSR marker were 0.830 to 0.919, 1.326 to 3.167, and 3.169 to 5.692, respectively (Jabari et al. 2023). Therefore, based on the results of molecular markers in this research and the results of other researchers, the application of molecular markers, including SSR as indirect selection criteria, plant genetic improvement, and cultivar release operations, is performed faster and more accurately in breeding programs.

In this research, the high value of the cophenetic correlation coefficient showed that the cluster analysis using Jaccard’s similarity coefficient based on the data obtained from the SSR marker was very suitable in the grouping of the studied wheat genotypes (Fig. 2). The results of the principal coordinate analysis based on the SSR marker also showed a high agreement with the results of the cluster analysis using the UPGMA method based on the Jaccard similarity coefficient. Finally, genotypes 1, 3, and 25 have the most significant genetic distance with genotypes 13, 7, and the Pishgam cultivar. Therefore, it is recommended to use superior genotypes in breeding programs and genotypes with high genetic distance based on molecular studies to use heterosis in genetic diversity programs.

A dendrogram was made based on the similarity matrix to evaluate bread wheat cultivars using SSR markers. The cultivars were divided into three main groups, which showed the similarity between cultivars based on morphological characteristics of a limited range. In contrast, the similarity between cultivars based on the SSR range was comprehensive. Therefore, SSR markers were reported to be more accurate than morphological traits (Feltaous 2019). In a study using SSR markers and structure analysis, 105 bread wheat genotypes were classified into four clusters. This study showed the reliability of SSR markers, which can be helpful for future genetic studies, such as genetic mapping, comparative genomic analysis, genetic variation, and molecular marker-assisted breeding in wheat to select promising genotypes (Ahmed et al. 2020). In another study to assess genetic diversity and population structure based on SSR markers, 63 genotypes were classified into three clusters (Türkoglu et al. 2023).

Identifying QTLs controlling traits compatible with drought tolerance in breeding drought tolerance is necessary to understand how these traits are controlled and expressed (Sallam et al. 2019). In evaluating the association between studied traits in bread wheat genotypes with SSR markers in this research, the results showed a relationship between studied traits and primers. Although mapping based on quantitative trait loci (QTL) is suitable for tracking traits-related genes, this process is time-consuming and laborious (Rakshit et al. 2010). To overcome these limitations, it seems appropriate to identify markers related to traits through regression. Multiple regression analysis determines the explanation coefficient R2, which indicates the degree of relationship between the trait and the molecular marker (Gomez and Gomez 1984).

Therefore, according to the results of this research (Tables 7 and 8), in both environmental conditions, in the first year, the markers XGWM410(a1), XGWM577(a2), XGWM350(a3), XGWM165, XCFD5(a1), XGWM136(a2), XGWM265(a1), and XGWM124 were common, and in the second year, the markers XGWM2(a2), XGWM136 and XCFD5 about agronomic traits were common. Therefore, the presence of a significant association between several markers with one trait shows the existence of a quantitative and multigenic nature. The identification of common markers is essential in plant breeding because they enable the simultaneous selection of several traits (Tuberosa et al. 2002). Also, due to the high association of these markers, scanning in their adjacent chromosomal regions can effectively identify the genes controlling agronomic traits in wheat genotypes. In addition, the most positive markers in the first year, in rainfed conditions, respectively, are related to the traits of thousand grain weight, mitt penalty length, hectoliter weight, xteragen length, and spike grain weight, and in irrigated conditions, respectively, related to the traits of thousand grain weight, grain yield, spike length, and number of infertile spikelets, and in the second year, in rainfed conditions, related to thousand grain weight, spike dry weight, spike grain weight, and mitt penalty length, and in irrigated conditions, related to thousand grain weight trait. In this study, some markers related to traits were different in irrigated and rainfed conditions, indicating environmental conditions’ influence on these traits. For this reason, different QTLs were identified in this experiment. In a study regarding stress tolerance using SSR markers, the XGWM577 marker was introduced as the superior marker (Younis et al. 2020). According to the results obtained from the researchers, there was a significant association between the grain yield trait and XGWM124 marker (Jlassi et al. 2021), between grain yield with the XGWM410 marker and 1000-grain weight and peduncle length with XGWM577 marker (Maccaferri et al. 2011), and between the grain yield with XGWM577 marker (Barakat et al. 2011; Rahmati et al. 2018) that was consistent with the results of this research. According to the opinion of Ali et al. (2011), the marker XGWM155 was related to the agronomic traits, and according to the results of this research, this marker was associated with the agronomic traits of 1000-grain weight, hectoliter weight, and xteragen length and had a significant association. In this research, the significant association between SSR markers and 14 agronomic traits on bread wheat was investigated using stepwise multiple regression analysis. The results showed that, for example, one marker with the trait of plant height, two markers with the trait of peduncle length, four markers with the trait of the number of spikelets per spike, two markers with the trait of the number of spikes, two markers with the trait of the number of seed per spike, one marker with the trait biological yield, three markers with grain yield trait, and four markers with harvest index trait had significant association (Roy et al. 2006), which was consistent with the results of this research in terms of traits used with SSR markers. Sequencing markers produced by primers with high R2 and comparing them with the sequences in the selection programs with the marker-assisted selection and gene matching based on the map increases their ability. Some markers were associated with more than one trait, which can be caused by pleiotropic effects or the continuity of related QTLs in different traits. Multivariate regression analysis identifying the quantitative trait loci (QTL) requires less time and cost (Ruan et al. 2009) and does not require the formation of a population for mapping. In an evaluation, 34 continuous SSR markers were identified with plant height trait, grain yield, and fodder yield in millet through association analysis (Kannan et al. 2014). Stepwise regression analysis was used to evaluate bread wheat genotypes in salt-tolerant conditions (Al-Ashkar et al. 2020). Population structure and analysis of the association between morphological traits in some landraces of Egyptian bread wheat were investigated using SSR, ISSR, and AFLP markers under heat stress. In total, 57 and 60 significant MTAs were obtained at the 1% probability level for ten traits under heat stress. Spike length (SL) had the highest (35) significant MTAs distributed across 23 loci, followed by 14 MTAs for the number of seeds per spike (NKSp) at both loci. SSR locus (Xgwm369) on chromosome 3A is pleiotropic with grain weight per spike (GWP), spike length (SL), and plant height (PH), and common to both loci, which were potentially important targets for selection. The results showed that the identified MTAs were related to the genes controlling important heat stress tolerant traits (Muhammad et al. 2020). In the study related to association analysis for agronomic traits in wheat under terminal heat stress, highly significant MTAs under heat stress conditions were identified for spikelet per spike (xwmc553), grains per spike (cxfa2147, xwmc418 and xwmc121), biomass (xbarc7), and grain yield (xcfa2147 and xwmc671). The identified markers in this study could facilitate MAS and gene pyramiding against heat stress in wheat (Khan et al. 2021). Genetic diversity and phylogenetic relationships of wheat (T. aestivum L.) genotypes using phenological, molecular, and DNA barcoding markers, the 16 wheat genotypes showed significant genetic variation using the markers assayed. Desirable phenological parameters with molecular markers were convenient for evaluating Egyptian wheat genotypes (El-Esawi et al. 2023). Based on the results of this research and other researchers, wheat, drought-tolerant superior genotypes with high grain yield, and yield components are desirable as donors parent in the breeding program. Identifying molecular markers related to agronomic traits can accelerate the screening of drought-tolerant genetic materials in breeding programs. The presence of molecular markers with a high explanatory coefficient for the examined traits allows breeders to select QTLs related to drought tolerance, regardless of the effect of environmental factors. We reached this achievement that marker-trait associations (MTAs) are key elements for identifying genomic regions associated with agronomic traits in wheat under drought stress. The identified MTAs could be applied in marker-assisted breeding, fine mapping, and cloning of the underlying genomic regions in wheat germplasm.

Conclusion

Using molecular markers along with regression analysis is a valuable tool in applying the genetic diversity of crops. It will make it possible to predict the state of plant growth by combining different methods before conducting the field experiment. In addition, some germplasms of a plant species are evaluated in terms of quantitatively important traits to drought tolerance stress using molecular markers. Also, the relationship of these markers with different traits is investigated; this will be a markable contribution to the breeding programs. In this research, the agronomic traits of wheat seedlings in two environments (irrigated and rainfed) were investigated during two consecutive cropping seasons. Significant differences between the genotypes were revealed for most traits, indicating the genetic diversity between the genotypes. The estimation of genetic parameters showed that the phenotypic variation coefficient was greater than the genotypic variation coefficient in both conditions. In the case of most studied traits, the genotypic variation was more than the environmental variation. Besides, almost high broad-sense heritability (h2b.s) and genetic gain were observed for many traits in this study, indicating their potential in wheat breeding. Therefore, due to the high genetic diversity, heritability, and genetic gain, it is possible to produce desirable cultivars through selection and hybridization. The XCFD168-2D, XGWM350-7D, and XGWM136-1A primers were identified as the most suitable primers for wheat in subsequent studies. These microsatellite primers used in this study can provide a high level of polymorphism and separate groups from each other—also, the XCFD168-2D marker in the first year with mitt penalty length and the number of infertile spikelets traits in irrigated conditions, the number of fertile spikelets in rainfed conditions and in the second year with spike dry weight trait in rainfed conditions; the XGWM350-7D marker in the first year with traits of thousand grain weight and number of grains per spike in both environmental conditions, number of infertile spikelets in rainfed conditions, xteragen length, peduncle length, and spike length in irrigated conditions and in the second year with the traits of thousand grain weight and spike grain weight in rainfed conditions; and the XGWM136-1A marker in the first year with spike length in both environmental conditions, grain yield in irrigated conditions, spike dry weight, spike grain weight, and number of grains per spike traits in rainfed conditions and in the second year with hectoliter weight in both environmental conditions, xteragen length in rainfed conditions and harvest index, biological yield, and number of spike per m2 traits in irrigated conditions had a high significant association. Cluster analysis classified 25 bread wheat genotypes into four different groups. Finally, genotypes 1, 3, and 25 have the most significant genetic distance with genotypes 13, 7, and Pishgam, and these two groups can be introduced for breeding programs to release superior cultivars. In general, the results of association analysis by stepwise regression showed that in both years under rainfed conditions, the XGWM350 marker about 1000-grain weight, the XCFD5 marker about spike length trait, and the XGWM165 and XGWM70 markers about spike dry weight trait, and under irrigated conditions, the XGWM265 marker about grain yield trait jointly had a significant association. In addition, the XGWM136 and XCFD5 markers were common in agronomic traits in all environments. Meanwhile, the most common positive markers in both years are related to 1000-grain weight in both conditions, mitt penalty length, and spike grain weight in rainfed conditions. The association of these markers with the traits showed that their location inside the genome is probably the regions adjacent to the genes controlling the desired traits. Drought stress reduces genetic reserves, increases genetic erosion, and finally causes a decrease in yield production and a negative effect on human livelihood. Consequently, there is an urgent need to develop drought-tolerant and high-yielding genotypes to ensure sustainable production and global food security. Therefore, selection based on molecular markers, including SSR, and investigating their relationship with agronomic traits can be precious for accelerating classical breeding programs to select superior genotypes in different moisture conditions, produce high-yielding and drought-tolerant cultivars, and help the global production of wheat and create human food security.