Introduction

Durum wheat (Triticum turgidum L. var. durum) is the crop of preference for premium pasta production worldwide. Yield is the most important trait to many members of the wheat production chain, particularly farmers, distributors, and exporters. As grain yield is a complex trait that normally shows relatively low heritability, it is difficult to obtain high genetic gains in yield during the breeding process.

Durum wheat cultivation area in Argentina reached nearly half a million hectares in the early 1970s but decreased rapidly in the subsequent decades and stabilized at up to ~ 57,000 ha in the period 2001–2011 (http://www.siia.gob.ar/sst_pcias/estima/estima.php). Durum wheat is planted mainly in the southern sector of Buenos Aires province, Argentina. The main reason why durum cultivation decreased was the low average durum wheat yield compared to that of bread wheat and the lower profit margin compared to other agricultural systems although durum was priced higher than bread wheat in Argentina.

Breeding strategies for yield improvement focused mainly on biomass partitioning (harvest index) towards the spike through the pleiotropic effects of genes introduced during the “green revolution” (Rht) (Abbate et al. 1995; Royo et al. 2007). The most common methods to improve wheat yield were through understanding how yield components could be manipulated and could contribute individually to yield potential (Slafer 2007), and through better adaptation to environmental stresses (Tuberosa 2012). In the past 2 decades, new biotechnological techniques helped accelerate improvements in wheat yield via marker assisted selection (MAS) based on quantitative trait locus (QTL) mapping and on specific genes.

QTL analysis of yield components and morpho-phenological traits as well as of overall yield offers the possibility of detecting direct and indirect genetic effects on yield-related traits. Epistatic interactions of yield-related genes/QTL as well as QTL × environment interactions should also be taken into account to better understand the complex genetic basis of wheat yield. Previous studies, particularly on bread wheat, were conducted to identify either QTL or genomic regions with effects on different yield components (Quarrie et al. 2006; Kuchel et al. 2007; Kumar et al. 2007; Hai et al. 2008; Wang et al. 2011; Wu et al. 2012; Rustgi et al. 2013).

Relatively few studies on durum wheat (Maccaferri et al. 2008; Diab et al. 2008; Peleg et al. 2009; Golabadi et al. 2011; Blanco et al. 2012; Patil et al. 2013; Dura et al. 2013, 2014; Graziani et al. 2014) have reported the presence of QTL associated with yield, yield components, and pheno-physiological parameters in all chromosomes. These studies highlight the importance of a major QTL located on chromosome 3BS with effects on yield, plant height, heading date (Maccaferri et al. 2008), canopy reflectance (NDVI index), leaf greenness (SPAD units), peduncle length (Graziani et al. 2014), thousand-kernel weight (Blanco et al. 2012; Graziani et al. 2014), and kernel number/spike and spike weight (Marza et al. 2006). Other key QTL were reported on chromosome 2BL (Maccaferri et al. 2008; Graziani et al. 2014). In addition, QTL on chromosomes 2AS, 2BS and 4BS were associated with thousand-kernel weight, test weight, spikelets per spike, and grain yield (Patil et al. 2013). A high-density consensus map constructed based on SNP and other additional integrated markers has allowed localization of QTL positions more precisely for different traits and was also useful as a framework map to complement linkage disequilibrium analysis and genome-wide association mapping (Maccaferri et al. 2014, 2015).

The aim of this work was to identify the main genomic regions associated with variation in grain yield, yield components, and morpho-phenological related traits using a durum wheat RIL population. To explore the complexity of these traits, we also investigated the existence of pleiotropic effects, QTL × QTL, and QTL × environment interactions in the durum wheat genome.

Materials and methods

Plant material

A durum mapping population consisting of 93 F9 recombinant inbred lines (RIL) derived from a cross between UC1113 and variety Kofa was used (Zhang et al. 2008). Kofa is a Desert Durum® variety with intermediate yield and high quality, selected from a population designated ‘‘DICOCCUM ALPHA POP-85 S-1’’ by the West-Bred Company. UC1113, is a CIMMYT-derived line (KIFS//RSS/BD1419/3/MEXIS-CP/4/WAHAS/5/YAV79) selected by the Wheat Breeding Program of the University of California (Davis). Eight local commercial varieties were used as controls (Buck Platino, Buck Topacio, Buck Esmeralda, Buck Cristal, Buck Ambar, Bonaerense INTA Facón, Bonaerense INTA Cariló, and Bonaerense INTA Cumenay).

Experimental design and planting

Six field trials were carried out over 2 years (2006 and 2007) at different locations in the southern sector of Buenos Aires province. RIL, parental lines (UC1113 and Kofa), and controls were all grown in Cabildo [CA] (39°36′S 61°64′W), Barrow [BW] (38°20′S 60°13W) and Balcarce [BC] (37°45′S 58°18′W), following a gradient of water availability from Cabildo to Balcarce. Rainfalls in Cabildo, Barrow, and Balcarce in the periods from July to December were 280.6, 397.7, and 328.9 mm in 2006, and 248.4, 286.9, and 381.7 mm in 2007. The agronomic management and rainfall conditions are described in detail in Supplementary Tables S4 and S5 of Conti et al. (2011).

Field trials were organized in a randomized complete block design with three replications (plots were 3 m2 with 3 rows of 5 m spaced 0.20 m apart). Planting dates ranged from July to August and harvest dates were in December and January. An additional experiment was carried out at Marcos Juárez [MJ] (32°42′S 62°07′W), but only two phenological traits (heading and flowering times) were considered because a severe drought period caused crop loss.

Yield and yield-related trait measurements

The following data were recorded: (1) heading date (Hd), growth (GS) stage 55 (Zadoks et al. 1974); (2) flowering time (Flt), GS 65; (3) number of spikes per square meter (Ssm) calculated from the number of spikes in a 2 m row section located in the middle of the plot; (4) grain yield (Yld, kg/ha), weight of clean grains from the entire machine-harvested plot, and (5) thousand grain weight (Tgw, g) calculated as the average weight of two 100 grain samples from each plot.

At harvest time, ten plants from the middle row of each plot (replicate × genotype × environment) were collected for analysis of yield-related traits. Average values by plot were also calculated. The following traits were analyzed per plant: (6) plant height (Ph), calculated as the distance from the edge of separation of the stem from the root to the tip of the spike (cm); (7) peduncle length (PdL), measured as the distance from the last internode to the base of the spike and calculated as the average of all tillers per plant (cm); (8) biomass per plant (Bpp, g), obtained as the aerial dry weight of the entire plant; (9) number of spikes per plant (Snp); (10) grain number per ear (Gne); (11) grain number per plant (Gnp), calculated as the sum of the number of grains from all ears per plant; (12) spikelet number per ear (Sne) expressed as the average number of spikelets/ear, obtained by counting the number of spikelets in all ears/plant; (13) number of fertile spikelets per ear (Fse) obtained in the same way as Sne but considering only the number of fertile spikelets; (14) grain weight per ear (Gwe, g), obtained by weighing the grains from each ear of the plant and averaged; (15) grain weight per plant (Gwp, g), obtained as the sum of the weight of grains from all ears per plant; (16) grain number per fertile spikelet (Gnfs), calculated as the average ratio of Gne/Fse; (17) grain number per spikelet (Gnts), calculated as the average ratio of Gne/Sne; (18) harvest index (Hi), calculated as the ratio between grain weight per plant (Gwp) and total above-ground biomass per plant (Bpp), and (19) spike fertility (Sf) calculated as ratio Fse/Sne (%).

Genetic map

The genetic map used in this study consisted of 269 markers, including 230 SSRs, 23 SNPs, 10 RFLPs, three STSs, two proteins, and one morphological marker, arranged on 14 linkage groups covering a total length of 2140 cM with an average 153 cM per chromosome (Zhang et al. 2008).

Statistical analysis

The descriptive statistics (Table 1) of untransformed variables by environment was obtained using PROC MEANS. Normality of residuals was assessed by modified Shapiro–Wilk test in Infostat software (Di Rienzo et al. 2016). The LSMEANS for all traits by environment were calculated following PROC MIXED procedure. Pearson’s correlation coefficients (r) of LSMEANS were estimated for RILs (n = 93) by the PROC CORR procedure implemented in SAS 9.0 software (SAS 9.2 Procedures Guide 2010). For ANOVA within each environment, genotype was considered a fixed effect and replication was considered a random effect. Broad sense heritability (h2) was calculated using the mean square values obtained from PROC MIXED procedure in each environment as \({\text{h}}^{ 2} = \, \sigma_{\text{g}}^{ 2} / \, \sigma_{\text{p}}^{2}\), where genotypic \(\left( {\sigma_{\text{g}}^{2} } \right)\) and phenotypic \(\left( {\sigma_{\text{p}}^{2} } \right)\) variances were calculated as \(\sigma_{\text{g}}^{2} = \, \left( {{\text{MS}}_{\text{RIL}} - {\text{ MS}}_{\text{e}} } \right)/{\text{r and }}\sigma_{\text{p}}^{2} = \, \left( {\sigma_{\text{g}}^{2} + \, \sigma_{\text{e}}^{2} } \right)\), respectively. MSRIL and MSe are the mean sums of square for genotype and residual error, respectively, and r is the number of replications.

Table 1 Phenotypic variation and heritability (\({\text{h}}_{\text{B}}^{2}\)) of yield-related traits in RIL, parental lines, and controls

The criterion proposed by Cruz and Regazzi (1997) to join environments in a combined ANOVA over all environments was used. Residual mean squares (RMSs) from ANOVA of each individual environment were compared and variance was considered homogeneous when the ratio between the larger and the smaller RMS was lower than 7. Combined analysis of variance (ANOVA) was carried out using a mixed lineal model with PROC MIXED procedure in SAS and considering genotype as a fixed effect and environments and interactions as random effects.

QTL mapping

Arithmetic means by environment were used to map QTL. For the combined QTL analyses, mean values across environments were calculated. QTL were mapped using the CIM method with software Windows QTL Cartographer v.2.5 (Wang et al. 2005). Model 6 was implemented with a walking speed of 0.5 cM and a window size of 10 cM to exclude closely linked control markers using a forward and backward regression method. A LOD threshold value of three was used to consider a QTL to be significant. A confidence interval of 95% was calculated as the two-LOD drop from the maximum peak value (Van Ooijen 1992). QTL with LOD values between 2.0 and 3.0 that were mapped either on or closest (into the two-LOD interval) to significant QTL for the same or another trait were considered to be “suggestive” and were included in the results.

QTL were indicated as forming a cluster when they co-localized based on overlapping confidence intervals. In addition, we tested if some QTL could be putatively pleiotropic using the Multiple-trait CIM (MCIM) method (Jiang and Zeng 1995). To run the MCIM, we considered the main clusters detected in this work involving correlated traits. However, to test the hypothesis of pleiotropy, the QTL, which had been observed in a previous study from our group, to affect quality traits on a RIL mapping population, were also taken into account. Parameter settings and threshold values were the same as those for CIM analyses.

An analysis of epistatic (QQ) and environmental (QE and QQE) interaction was carried out with a mixed linear model (Wang et al. 1999) using software QTLNetwork v.2.0 (Yang et al. 2007) (http://ibi.zju.edu.cn/software/qtlnetwork/). The parameter set to select marker intervals was a walk speed of 0.5 cM. A window size of 10 cM was used to consider other marker intervals as cofactors. The critical threshold value of F-statistics was determined by the 1000 permutation test at a significance level of 0.05 (Doerge and Churchill 1996). QTL were indicated following the nomenclature suggested by McIntosh et al. (2003), the letter × preceding DNA marker names was omitted as in a previously published genetic map (Zhang et al. 2008).

Results

Analyses of phenotypic traits

The modified Shapiro–Wilk test showed that most of the phenotypic data sets [79/110] were normally distributed (Table S1). Ten traits [Ph, PdL, Yld, Snp, Gnp, Gwe, Gwp, Gnfs, Gnts, and Tgw] were found to be normally distributed in five environments, three [Gne, Hi, and Bpp] in four environments and only Ssm was normal in all tested environments. Only the test for Flt was rejected in all environments.

RIL mean values, distribution ranges, standard deviations, broad sense heritabilities (\({\text{h}}_{\text{B}}^{2}\)), and control average values for all traits analyzed are shown in Table 1. Transgressive segregant genotypes were identified for all traits in each environment.

Analysis of variance

ANOVA revealed highly significant differences among RIL (G) and environments (E) (Table S2h). The main source of variation in all traits was the environment. GxE was highly significant for all traits, indicating that differences among genotypes should be considered for each environment. Based on these results, ANOVA by environment was performed (Table S2a–g), considering genotype as a fixed effect and replication as a random effect. The genotype effect was significant for most traits and environments. Only Bpp (CA 2006, BC 2006), Snp (CA 2006, BC 2006, CA 2007, BW 2007, BC 2007), Ssm (BC 2006, BC 2007), Gwp (BW 2007), Gnp, Sf, and Gnp (CA 2006) showed no significant effects.

Correlation analysis

Correlations of LSMEANs among the 19 traits considered were analyzed for each environment. Higher correlation values for Ph versus PdL, Hd versus Flt, Bpp versus Gwp, Gnp versus Gwp, Gne versus Gnts, Bpp versus Snp, Gne versus Gwe, and Gnts versus Gnfs were detected in all environments (Table S3a–g). Gwe and Gne were significantly positively correlated across all environments with ten (Gne, Ph, Bpp, Fse, Gnp, PdL Gnfs, Gnts, Hi, and Sf) and eight (Gnts, Gwe, Gnp, Sne, Fse, Gwp, Gnfs, and Sf) different traits, respectively. Grain yield showed positive correlations with Ph, PdL, and Hi across environments but was negatively correlated with Hd and Flt in BW and BC 2006. Yld was significantly positively correlated with Gwp, Gwe, Gnp, and Gnts in five of six environments analyzed. Both crop seasons in Balcarce showed the highest significant correlation values involving Yld, mainly associated with Gwe, Fse, and Gne. In BC 2006, Tgw was highly correlated with Yld (r = 0.64), whereas in 2007 Hi showed a high association with Yld (r = 0.61). The lowest associations between traits in both years were at Barrow. In BW 2006, Yld was significantly correlated with Ssm (r = 0.47), Hi, and Sf. However, in 2007 Yld was slightly associated with grain weight (Gwe, Gwp), grain number (Gne, Gnp), Bpp, and Hi. Yld and Hi were also significantly correlated in Cabildo (2006 and 2007). On the other hand, Flt and Hd were significantly and negatively associated with PdL in all environments and positively correlated with Sne. Only Flt versus Sne in BC 2007 was not significant. Tgw showed the highest number of positive correlations in BC 2006 with 10 traits, and was negatively correlated either with Hd or Flt in three environments (BC 2006, CA 2007, and BW 2007).

Identification of genomic regions associated with yield-related traits

A total of 224 significant QTL (LOD ≥ 3) was identified for the 19 traits considered in the present study. Additionally, 47 suggestive QTL (LOD > 2) were located in the same positions, yielding a total of 271 QTL for all traits and environments (Table S4). These results allowed us to identify 47 genomic regions affecting yield across the complete durum genome. Based on QTL with overlapping confidence intervals, it was possible to define 35 QTL clusters (C1–C35) affecting different related traits (Table 2). The remaining 12 genomic regions were involved in genetic control of only one trait. A summary of the number of genomic regions identified per trait and their locations is shown in Table 3.

Table 2 QTL analyses for yield and yield-related traits in the UC1113 × Kofa RIL population in seven environments in Argentina
Table 3 QTL analyses for 19 traits, co-localization, and stability among environments. (Color figure online)

The number of QTL identified per trait varied from 4 (Snp, Ssm) to 14 (Tgw, Yld) and the percentage of phenotypic variation (R2) explained by those QTL ranged from 6.3 to 55.1%. The highest LOD score (16.5) obtained in this study corresponds to the 3BS QTL (3BS.1) mapped using plant height data (Cluster 13). This was the most important mapped genomic region affecting a total of 11 traits. Another 20 trait-specific QTL were mapped in three or more environments and could be considered environmentally stable QTL, the most important ones being located on 1AS, 1BL.1, 2BS.1, 3AL.1, 3BS.1, 3BS.2, 4AL.3, 5AL.2, 5AL.3, 6AL.1, and 6BS.1. A subset of these QTL (1AS, 1BL.1, 2BS.1, 3AL.1, 3BS.1, 3BS.2, 4AL.3) affected more than five traits (Table 2). Two additional regions, 4AL.2 and the centromeric QTL 4BS/4BL.1 were mapped for 6 and 7 traits, respectively, but no trait was associated with a stable QTL across environments.

The distribution of QTL between the two genomes of durum wheat was approximately similar, with a somewhat larger contribution of B genome (55%). The contribution of favorable alleles from both parents for the 271 QTL detected was approximately equal, where Kofa contributed with 47.2% of favorable alleles, and UC1113 with 52.8%. The corresponding positions and the QTL confidence intervals are shown in Supplementary Table S4.

As stated above, the genomic regions affecting yield and yield-related traits were grouped in 35 clusters (Table 2). The main clusters are described below and shown in Fig. 1. Additional clusters are shown in Supplementary Figure S1.

Fig. 1
figure 1

QTL associated with yield and yield-related traits mapped on chromosomes 1A, 1B, 2B, 3A, and 3B using UC1113 × Kofa RIL mapping population. Bars represent QTL confidence intervals with 2-LOD drop offs; QTL and environment are indicated at the top. Centromeres are indicated by grey squares. Peak positions are indicated by horizontal lines within QTL bars. Major genes Ppd-B1, Rht4, and Rht5 are indicated on the map according to the positions reported by Hanocq et al. (2004), Mohler et al. (2004) and Ellis et al. (2005). New QTL for yield-related traits (QTL-3AL.1) are also indicated. The genetic linkage map was adapted from Zhang et al. (2008)

Cluster 1 (1AS)

This genomic region affected 5 phenotypic traits (Sne, Flt, Fse, Gne, and Gwe), Sne being the main trait detected. QTL for Sne were detected in six environments and explained 29.1% of variation in the mean data from all environments and more than 20% in four environments. The number of spikelets/ear (Sne) ranged from 10.1 to 21.1 across environments. Fse, the fertile portion of spikelets, was mapped in three environments, but was only a suggestive QTL in the mean data from all environments. Flt was mapped only in the additional environment, Marcos Juárez (LOD > 3).

Cluster 3 (1BL.1)

Eight traits were affected by this region (Yld, Bpp, Fse, Sne, Gnp, Gwp, Gwe, and Tgw). The main association was with grain weight measured at different levels, from plant to spike or individual grains. This region explained between 10 and 18% of the phenotypic variation observed in the traits involved. Five of these traits (Yld, Bpp, Fse, Gwp, and Gwe) were mapped using the mean data for the six environments although Gwp and Gwe were not mapped in any individual environment. Most of the QTL from this cluster were mapped in Barrow data from both sampling years. Fse was detected as suggestive in three locations in 2006.

Cluster 7 (2BS.1)

The 2BS.1 genomic region was second in importance after 3BS.1 in terms of QTL stability and number of traits. It was associated with eight traits (Sne, Hd, PdL, Flt, Yld, Gnfs, Hi, and Gnts). Sne, the main trait affected by this cluster, was mapped in four environments with R2 values ranging from 15 to 22% and explaining 32% of variance in the mean data of the six environments. Hd and PdL were mapped in three environments and Yld, Flt, and Gnfs, in two. A delay in the heading/flowering stages was caused by the Kofa allele, which was also associated with a high Sne, whereas the UC1113 allele was responsible for higher yield and increasing Gnfs, Gnts, Hi, and PdL. Three of the traits (Sne, Hd, and PdL) controlled by this region could be considered as stable QTL, and their effects were also significantly detected using the means of the environment dataset. Most of the QTL in 2BS.1 cluster were mapped in CA 2006 and BW 2006.

Cluster 11 (3AL.1)

This cluster had a significant influence on 5 traits (Yld, Hi, Gne, Sf, and Gnts) and the QTL for Yld, which was detected in 3 of 6 environments as well as the mean data, showed the highest stability. Taking into account R2 values, it was the second most important QTL for Yld, after QYld.cerz-3BS.1, explaining 11–18% of the variation. This region was one of three identified as affecting yield in the mean data. Gne, Hi, and Yld QTL were each mapped in two to three environments and in the mean analysis. For Hi, 30% of phenotypic variation was explained by this QTL in BW 2007. The positive allele for all traits mapped in this region was provided by UC1113.

Cluster 13 (3BS.1)

This was the main genomic region controlling the traits tested in this work, with effects on 11 traits (Ph, PdL, Yld, Tgw, Bpp, Sf, Gne, Gnp, Gwe, Gnfs, and Gnts). The 3BS.1 region was mapped mainly in CA 2007, BC 2006, and BW 2006, affecting 8, 7, and 6 traits in each environment, respectively. It explained most of the variation for Ph, PdL, and Yld, accounting for 45, 40, and 20%, respectively, of the variation in the mean data for all environments. The QTL mapped for Ph and PdL in the 3BS.1 region in five and six environments, respectively, showed the highest stability. This region was significantly associated with Yld only in two environments and the mean data. However, it explained 35–38% of variation, corresponding to the highest R2 value for Yld. Effects on three additional traits (Gnts, Gwe, and Gnfs) were detected in this cluster in individual environments (3, 2, and 2, respectively) and in the mean data for all environments, with R2 values of 10–21%. The remaining traits (Tgw, Bpp, Sf, Gne, and Gnp) were mapped on this cluster only in one environment. The favorable allele for the 3BS.1 QTL cluster was from Kofa.

Cluster 14 (3BS.2)

A second QTL (3BS.2) mapped on 3BS was associated with Hd, Flt, Gnp, Gwp, Ph, Pd, and Yld. For Hd, Flt, Gnp, and Gwp; the favorable allele was provided by UC1113, while for PdL, Ph, and Yld, the favorable allele was provided by Kofa, as for the 3BS.1 QTL. The LOD confidence intervals for these linked QTL (3BS.1 and 3BS.2) controlling PdL in BW 2007 and Ph in BC 2006 were not overlapping. In CA 2007, both QTL affected Gnp with opposite alleles, and the corresponding two-LOD confidence intervals were separated by 15 cM. 3BS.2 was the main genomic region mapped for flowering time in the mean analysis (R2 = 16%). Flt and Hd were mapped in two and three environments (with LOD > 3), respectively (Table 1).

QTL with effects on single traits

Twelve QTL with single-trait effects were mapped using our RIL population (Table 2). Three QTL (QSne.cerz-5AS.3/5AL.1, QTgw.cerz-5BL, and QTgw.cerz-6BS.2/6BL.1) were identified in various environments and the mean data. The QSne.cerz-5AS.3/5AL.1 QTL was located in the centromeric region of chromosome 5A and was significantly associated with Sne in BW 2007. It was mapped in CA 2007 and was considered as suggestive in the mean analysis. Another two QTL were associated with Tgw, one of which was mapped on 5BL (QTgw.cerz-5BL) and the other one in the centromeric region of 6B (QTgw.cerz-6BS.2/6BL.1). QTgw.cerz-5BL explained 13–14% of the phenotypic variation in CA 2006, BW 2007, and the mean data. The strongest LOD value for this region occurred in the mean QTL analysis. The second QTL for Tgw (QTgw.cerz-6BS.2/6BL.1) was mapped in the same two environments and the mean across environments, where it explained 16% of variation. The other nine single-effect QTL were not consistent among environments.

QTL with epistatic and environmental effects

The environmental interaction effect on the QTL mapped for all traits was explored and 12 of them (Hi, Hd, Sne, Ssm, Bpp, Yld, Tgw, Gne, Gwe, Gnfs, Gnts, and Sf) showed QE effects (Table 4). The main traits with QE interactions were Hi and Hd, with three and two different QTL involved, respectively. The main clusters with QE interactions were 3BS.1 and 2BS.1, which were also the main QTL affecting yield-related traits in our study. The 3BS.1 QTL had the highest number (13) of environmental interactions. Two of the six traits showing QE interaction in 3BS.1, Yld and Gnfs, involved the highest number (3) of environments. Taking into account the environments involved in QE interactions, Balcarce had the majority of interactions in both crop seasons, followed by CA 2006.

Table 4 QTL × environment interactions detected in the RIL population

Epistatic interactions between QTL (with and without the main effects) were analyzed. Seventeen of the 19 traits analyzed showed QQ effects with at least one interaction. The maximum number of interactions was detected for Sne (four epistatic interactions), followed by Flt and Hd with three QQ interactions. Nine of the 14 remaining traits were involved in two QQ interactions (Tgw, Ph, PdL, Gnfs, Gnts, Snp, Gnp, Gwp, and Bpp) and 5 traits (Yld, Ssm, HI, Fse, and Gwe) showed only one interaction. The epistatic interactions associated with Sne, Gwe, Gnts, Ssm, Snp, Gnp, Gwp, and Bpp traits also had QQE effects (Table 5). The majority of QQ interactions identified per trait involved no main effect QTL.

Table 5 Epistatic and QTL × QTL × environment interactions detected for all traits

Multiple trait mapping (MCIM) to test pleiotropy

We next aimed to distinguish genetic linkage from pleiotropy in the different clusters by MCIM. Our previous results from mapping quality traits using the same population coincided with several of the clusters identified in the present work (Table S5). MCIM was used to formally test pleiotropy in the main clusters detected, considering all traits involved in each cluster, the traits mapped using mean values and the 2 or 3 main traits involved, separately or together with the quality traits analyzed in this population. Both negative and positive significant correlations were detected for the traits involved in each cluster (Tables S3a–g).

Twenty clusters with significant MCIM (LOD values higher than 3.0) involving different traits and/or the joint-trait (Table S6; Figures S2–S8) suggested putative pleiotropic effects for QTL mapping in the same region. For several clusters, the use of MCIM increased the power of QTL detection power on the joint-trait compared to each trait separately. Four clusters showed a MCIM with LOD > 3 involving quality traits. In three of them, the highest LOD value was obtained when these quality traits were included.

Discussion

Genomic analysis of complex traits requires the discovery of genes/QTL and their validation in different genetic backgrounds and environments for further use in MAS. The stability of QTL across years and locations determines their suitability for MAS in a breeding program. It is also necessary to consider the interactions among genomic regions during selection using multiple markers. The RIL mapping population used in this study proved to be highly useful for QTL mapping of traits like the ones analyzed here.

Four major clusters for yield-related traits

Linked clusters 13 and 14 (3BS.1 and 3BS.2)

The main genomic region associated with yield and related traits was represented by cluster 13 (3BS.1 region) involving 11 traits. This finding is in agreement with previous reports of an important QTL on 3BS (Marza et al. 2006; Maccaferri et al. 2008; Blanco et al. 2012; Graziani et al. 2014). This region also showed the highest number of QTL × environment interactions. However, only 1 trait (Gnts) showed epistatic effects (QTL × QTL). No epistatic effects involving Yld, Ph, or Hd/Flt in cluster 13 were observed as was previously detected by Maccaferri et al. (2008). However, cluster 14, linked to cluster 13, showed epistatic interactions for five traits (Gnts, Snp, Gnp, Gwp, and Bpp). Cluster 14 was the second most important QTL affecting Hd and Flt. Our results give support to the presence of two linked genomic regions (clusters 13 and 14) based on the simultaneous detection of these two regions affecting Gnp (3BS.1 and 3BS.2) in the same environment, with alleles of opposite effect and without overlapping LOD intervals (15 cM apart). In agreement with this, Griffiths et al. (2009) reported two QTL for heading date in 3BS linked to wmc500 and wmc540, near the position of cluster 14.

Markers gwm493 and barc147 flanking cluster 13 were associated with Fusarium head blight response in bread and durum wheat populations (Buerstmayr et al. 2009) and grain yield/plant height/heading date (Maccaferri et al. 2008). In our analyses, the QTL for peduncle length and plant height were the main QTL mapped in cluster 13 (explaining 40.7 and 45.0% of phenotypic variation, respectively). Previous studies reported the presence of semi-dwarfing gene Rht5 with a large effect on plant height in the telomeric region of 3BS linked to barc102 (Ellis et al. 2005; Rebetzke et al. 2012). Although this marker was not included in the UC1113 × Kofa linkage map and taking into account two different linkage maps (Maccaferri et al. 2008; Sourdille et al. 2004), SSRs linked to cluster 13 were located close to barc102. As to Rht5, it has not yet been cloned. However, the possibility of natural variation at the Rht5 locus in our mapping population as a basis for the Ph QTL should not be discarded. Further work is needed to determine if this locus is involved.

Cluster 7 (2BS.1)

Heading date and flowering time are important traits related to the adaptability of crops. Phenotypic variation for these traits is controlled by the allelic combination of genes affecting vernalization requirements (Vrn), photoperiod response (Ppd), and earliness per se (Eps). These traits are responsible for variation in grain yield potential, particularly in stress environments. The Vrn-1 and Ppd genes are located in homoeologous groups 5 and 2, respectively (Kamran et al. 2013). Eps genes were reported on all wheat chromosomes (Kamran et al. 2013, 2014). In line with this, the second most important QTL cluster (cluster 7) was detected on chromosome 2BS (2BS.1), and affected eight traits. Previous research reported that SSR gwm148 is linked to Ppd-B1 (Hanocq et al. 2004; Mohler et al. 2004). Based on the bread wheat SSR consensus map, this marker was located in the same region as that of cluster 7 (Somers et al. 2004). We found a QTL in cluster 7 that also affected Sne and PdL and was strongly associated with Hd and Flt, explaining up to 32% of the phenotypic variation. This finding agrees with the presence of Ppd-B1 in this region. Moreover, Maccaferri et al. (2008) found a QTL in the gwm429-gwm148 region in chromosome 2BS with effects on Hd. Using a durum wheat RIL population, Patil et al. (2013) found microsatellite gwm429 linked to Tgw and Gwe QTL. In addition, SSR gwm148 in durum wheat was found to be associated with Hi (Golabadi et al. 2011).

In our study it could also be observed that among the traits involved in cluster 7, Kofa provided alleles responsible for increasing Sne and Hd/Flt whereas UC1113 provided alleles for high Yld and PdL. The association of positive alleles for Sne and Hd/Flt agreed with the correlation coefficients in each environment.

Cluster 3 (1BL.1)

Another important region identified was cluster 3 (1BL.1), which affected eight traits. This cluster was associated with Yld and seven yield components and showed a strong effect on Fse. Blanco et al. (2012) reported a QTL in a similar region associated with grain yield, thousand kernel weight, and kernel number per spike using the durum population Svevo × Ciccio. Graziani et al. (2014) and Maccaferri et al. (2008) mapped QTL in this region for Tgw (TKW), leaf greenness (SPAD units), and Yld. It is important to note that the population used in the present work and in the studies carried out by Maccaferri et al. (2008) and Graziani et al. (2014) shared Kofa variety as a parent. In bread wheat, a QTL associated with six yield-related traits (Gwp, Fse, and Sne in common) was identified in the same region (Wu et al. 2012) based on the positions of SSR markers gwm131 and wmc156 in the consensus map (Somers et al. 2004). Another study also found that this region (barc181-wmc156) was associated with mean and maximum grain filling rate (GFR), Gwe, and Tgw, explaining about 20.2% of variation in the mean value of GFR (Wang et al. 2009). In addition, two linked QTL were mapped for earliness per se in a similar region in bread wheat (Kamran et al. 2013).

Cluster 11 (3AL.1)

This cluster, which was the main stable region affecting Yld, included 4 additional traits (Gne, Sf, Gnts, and Hi). Marker wmc428, located at the peak of the Yld QTL, was found to be near the centromere on chromosome 3A. The simultaneous detection of QTL for Yld and Sf/Gne/Hi/Gnts suggested that the increased yield associated with this region could be due to the high number of florets in the spike. Gne, Gnts, and Hi were found to be highly correlated traits and had a common positive allele from UC1113. Blanco et al. (2012) mapped a QTL for Tgw linked to wmc428 in durum wheat. Several QTL for grain yield and yield components were detected in this region using the same mapping population of bread wheat (Campbell et al. 2003; Ali et al. 2011; Rustgi et al. 2013). Chromosome 3AL was also found to be associated with Flt (Ali et al. 2011; Mengistu et al. 2012), and it could be assumed that there is an Eps gene in this region (Mengistu et al. 2012). In support of this, we found that region 3AL.1 was involved in one of the three epistatic interactions for Hd (QHd.cerz-3AL.1 × QHd.cerz-7AS.1). Previous research detected marker cfa2193 in the peak of the adjacent cluster 12 (3AL.2) though not as a separate region (Mengistu et al. 2012). We found cluster 11 to be far from cluster 12. Based on our results and on previous evidence, we speculate that cluster 12 may correspond to region four reported by Ali et al. (2011).

Minor clusters

Cluster 1 (1AS.1)

The most important and stable QTL for Sne was located in cluster 1 within the interval wmc95barc148, with a peak position at wmc24. In addition, cluster 1 was observed to have effects on Flt, Fse, Gne, and Gwe. Wang et al. (2011) mapped two linked QTL in the wmc24 region, with effects on floret number per spikelet, spike number per plant, spikelet number per spike, and grain number per spike. In our study, two peak positions were mapped, but the presence of two different QTL was not evident due to a wide overlapping confidence interval. The flanking markers of this cluster were also involved in epistatic interactions affecting Yld (QYld.cerz-1AS × QYld.cerz-6AL) and Ssm (QSsm.cerz-1AS × QSsm.cerz-4BL). Dura et al. (2013) reported a Ph QTL linked to wmc24, whereas Börner et al. (2002) detected QTL for Ph, Gwe, and Gne at 38 cM on 1AS. Using an association mapping strategy, Maccaferri et al. (2011) found SSR wmc24 associated with Yld and test weight. In addition, a QTL affecting Fse was mapped on 1AS in a telomeric position (Ma et al. 2007).

The positive relationship between the favorable alleles for Flt and Sne observed in this cluster was in agreement with the relationship previously discussed for Cluster 7. We therefore hypothesize that the increased number of spikelets could be a consequence of prolonged pre-heading time, thus allowing complete development of the terminal spikelets present in the culms. Likewise, Gonzalez et al. (2003) postulated that an increase in the pre-heading stage during stem elongation causes an increase in the number of fertile florets and grains.

Genomic regions previously associated with Rht genes

Several clusters with major or minor effects on Ph and/or PdL identified in the present work were located in genomic regions previously associated with Rht genes (Rht1, Rht4, Rht5, Rht9, Rht12) (Ellis et al. 2005). Our results showed that clusters 9 (2BL), 16 (4AS), 19 (4AL.3), 24 (5AL.2), and 25 (5AL.3) were associated with Ph and/or PdL, as stated above in relation to cluster 13 (3BS.1). Each of these clusters was coincident with known Rht positions (Maccaferri et al. 2008; Rebetzke et al. 2012; Ellis et al. 2002) (for further details see the Supplementary Discussion).

Although cluster 21 (4BS/4BL.1) showed no association with Ph and/or PdL, it was located near the reported position of Rht-B1 (Ellis et al. 2002). Quarrie et al. (2005) detected a QTL for yield and yield components in the centromeric region of chromosome 4B, between markers Rht-B1 and gwm165.1.

For further discussion about the remaining clusters, see the Supplementary Material section. The positions of QTL on the remaining chromosomes are shown in Fig. S1.

Multiple trait mapping for yield and yield-related traits

The location of multiple QTL in the same region affecting different traits could be indicative of the presence of a single locus with pleiotropic effects on several traits. The genetic effects on different traits are not independent if genes/QTL for different traits are either linked or pleiotropic. The MCIM method was used to determine if some of the QTL mapped in the same cluster represent a single gene/QTL.

The MCIM performed using mean data for traits mapped in cluster 13 showed a significant joint-trait QTL (LOD = 11.6) that was also significant for four individual traits (Yld, Ph, PdL, and Gwe; LOD > 3 to 9) (Figure S4 [4]). In addition, the analysis of cluster 7 (2BS.1) yielded a significant result for joint-trait QTL (LOD = 7.4). In this case, a LOD score > 3.5 was found for the individual traits Hd, Flt, and Sne (Fig. S3 [1]). For cluster 11 (3AL.1), Yld, Hi, and the joint-trait confirmed QTL results (LOD > 3) using 2 different trait combinations (Figure S4 [1, 2]).

The MCIM for different trait combinations was tested in 15 clusters and showed higher LOD values for joint-traits than for single traits in all cases. In some cases, results showed only the joint-trait to be significant. However, this test failed to detect QTL for individual traits, as in the case of clusters 26 (Figure S7 [1]), 28 (Figure S7 [4, 5]), 30 (Figure S8 [2]), and 35 (Figure S8 [8]). MCIM showed that clusters 7, 11, 13, and 35, among others, could be explained by a single gene/QTL.

Genomic regions co-localized for quality QTL and pleiotropy

Several regions analyzed in the present work co-localized with QTL affecting quality traits previously identified using the same mapping population (UC1113 × Kofa) (Conti et al. 2011; Roncallo et al. 2012). Cluster 35 was mapped for four traits (Gnfs, Sne, Hd, and Flt) and was involved in QQ (QPdL.cerz-5AL.3 × QPdL.cerz-7BL.4) and QE (QHd.cerz-7BL.4) interactions. This region was the second-most important QTL for grain protein content (Gpc), Ypc, and flour yellow color (Fb) (Conti et al. 2011; Roncallo et al. 2012). The results derived from MCIM including quality traits (Fb, Ypc, and Gpc) for cluster 35 detected a significant joint-trait QTL with a LOD score higher than that for each individual trait (Figure S8 [9,10,11]), suggesting the presence of a single gene acting on these traits.

Cluster 13 was associated with Gpc (Conti et al. 2011). In the present work, MCIM was used to analyze the pleiotropic effect of yield and protein in this cluster. Both traits were significant (LOD > 3) and found at the same position.

Conclusions

Yield and related traits are under complex genetic control. Our analyses allowed us to construct a model for the genetic network involved in yield-related traits, as well as to catalog the number and importance of QTL by trait and epistatic QTL and/or environmental effect. It was possible to dissect four main genomic regions (clusters 13, 7, 3, and 11) located on 3BS.1, 2BS.1, 1BL.1, and 3AL.1. The first three regions were previously reported in durum wheat whereas the fourth was observed in bread wheat. In our study, 3AL.1 was found to affect 6 yield-related traits. Other important regions were also found to be located on 3BS.2, 1AS, 4AL.3, 5AL.2, and 5AL.3. The most stable traits mapped in the present study were plant height, peduncle length, spikelet number/ear, grain yield, heading date, and flowering time. Among these traits, spikelet number/ear, heading date, and flowering time were affected by the highest numbers of epistatic interactions.

The use of MTM allowed us to confirm that the most important pleiotropic regions (3BS.1, 2BS.1) could be explained by single genes/QTL.

As to the usefulness of these regions for MAS in breeding programs, it is interesting to note that the 3AL.1 region increased yield through its effect on Hi and Gne without altering grain protein content, gluten strength, flour yellow color, and crop phenology. 1AS, another important region, was associated with Sne and was consistently mapped in all environments tested as well as in the mean of all environments. No candidate gene(s) produced the effect described. In view of this the 1AS region is a good candidate for future studies.

An additional finding worthy of note was the presence of several QTL coincident with the positions reported for Rht genes, thus suggesting that it could be fruitful for breeders to explore and exploit the effects of these genes in obtaining higher yields.