Introduction

Hybrid breeding is a remarkable success story in several allogamous crop species, but it is still not established on a large scale in autogamous cereals except in rice (Longin et al. 2012). Reasons are a low magnitude of heterosis in autogamous crops, need for high seed densities in small grain cereals coupled with the lack of a low cost hybridization system, and the lack of high yielding heterotic patterns (Edwards 2001; Singh et al. 2010). Recently, large public and commercial projects have been launched to establish hybrid wheat (Triticum aestivum L.) breeding programs in Europe, Mexico, and Australia, mainly driven by the demand of an increase in grain yield per area.

The decision to start a hybrid breeding program depends on the long-term competitiveness of hybrid versus line breeding (Oettler et al. 2005; Longin et al. 2012). A stable yield surplus is required to justify the higher seed production costs of hybrids. The yield surplus is a function of the extent of heterosis and of the selection gain of hybrid versus line breeding. A recent study based on experimental data of the current market leader in hybrid wheat breeding reported 1.86 Mg ha−1 yield advantage of the best hybrid compared to the highest yielding line variety (Gowda et al. 2012). This suggests that hybrid wheat breeding has the potential to be competitive with line breeding, but further data are needed to substantiate these findings.

The genetic variances, which can be exploited in line versus hybrid breeding, and the variances of genotypes-by-environments interactions are further crucial factors determining the competiveness of hybrids (Longin et al. 2012). In particular, the magnitude and the ratio of variance due to general (GCA) and specific combining ability (SCA) are relevant to estimate the expected selection gain in hybrid breeding. Moreover, detailed knowledge of the correlation between line per se and hybrid performance, and between GCA effects and line per se performance is required to optimize hybrid breeding schemes (Longin et al. 2007; Gordillo and Geiger 2008). For wheat, only very limited information is available on these important quantitative genetic parameters in the literature (Borghi and Perenzin 1994; Perenzin et al. 1998; Gowda et al. 2012).

In our current treatise, we evaluated a vast population comprising 1,604 wheat hybrids and their 135 parental European elite wheat inbred lines in intensive field trials. Our objectives were to (1) study the extent of midparent, better-parent and commercial heterosis and (2) discuss the consequences of relevant quantitative parameters for the design of hybrid wheat breeding programs.

Materials and methods

Plant material and field experiments

Four wheat breeding companies provided 68 male inbred lines with known good pollination characters and 275 female lines. We used in preliminary work 24 SSR markers and selected 15 males and 120 females of the 343 elite lines based on genetic distances among the lines eliminating close relatives and maximizing the allelic diversity (>77 % of alleles maintained after selection, data not shown). For detailed molecular characterization of the parental lines, we refer to Würschum et al. (2013) and Zhao et al. (2013). The 120 female lines were crossed with the 15 male lines using a factorial mating design and 1,604 of the 1,800 potential single-cross hybrid combinations could be produced with sufficient amount of seeds. Females were emasculated using chemical hybridizing agents. The sterility was checked bagging 1–3 female plants. Sterility was high with mean values above 95 %. To avoid pollen contamination from other male parents, the separate crossing blocks, each containing one male and several female parental lines, were isolated by at least 16 m distance from adjacent crossing blocks. The 1,604 hybrids, their 120 female and 15 male parental lines, as well as 10 commercial varieties as checks, 8 inbred line cultivars (quality class E: Genius, quality class A: JB Asano, Julius, Tuerkis; quality class B: Colonia, Kredo, Tobak; quality class C: Tabasco) and 2 hybrids (As de Coeur and Hystar), were evaluated in multiple environments for 9 agronomic and disease resistance traits in Germany in the year 2012 (Table 1). The environments were Adenstedt, Bredstedt, Böhnshausen, Hohenheim, Hadmersleben, Harzhof, Rosenthal, and Seligenstadt.

Table 1 Agronomic and disease resistance traits and number of locations assessed in our study

All genotypes were evaluated in yield trials and also in observation plots. In the yield trials, all hybrids and lines were split into three adjacent trials linked with 10 common checks. The experimental designs were partially replicated alpha designs (Williams et al. 2010), where all parents, checks as well as 29 % of the hybrids were used in both replications. Sowing density ranged from 230 to 290 seeds m−2 and plot size ranged from 5 to 7.4 m2. The yield trials were treated with fertilizers, fungicides and herbicides according to farmers practice for intensive wheat production. Data were recorded for natural occurring frost and lodging susceptibility on a ordinal scale of 1–9, where 1 refers to low susceptibility and 9 to high susceptibility, for plant height in cm between soil surface and tip of the ear, for heading time in days between the beginning of the year and heading, and for grain yield in Mg ha−1 with a moisture content of 140 g H2O kg−1.

In the observation plots, all genotypes were tested in an alpha design without replications using micro-plots of 0.5–1.5 m2. The observation plots were treated with fertilizers and herbicides like the yield trials, but fungicides were not applied. Disease susceptibility was recorded for yellow rust, leaf rust, and septoria tritici blotch based on artificial inoculation with a mixture of different races and for powdery mildew based on natural infection. All diseases were scored in an ordinal scale of 1–9, where 1 refers to healthy plants and 9 to fully infected plants (Table 1).

Phenotypic data analyses

Residuals for all traits in each environment were normally distributed except for yellow rust, where data were skewed more toward resistance. Therefore, yellow rust data were transformed by applying arcsin(sqrt(x/100)). In the yield trials, the phenotypic data of each environment were first analyzed separately based on the statistical model

$$y_{ijklm} = \mu + g_{ij} + t_{k} + r_{lk} + b_{mlk} + e_{ijklm} ,$$

where y ijklm is the phenotypic performance for the ijth genotype (parental line i = j, or hybrid i ≠ j) in the mth incomplete block of the lth replication in the kth trial, μ an intercept term, g ij the genetic effect of the ijth genotype, t k the effect of the kth trial, r lk the effect of the lth replication in the kth trial, b mlk the effect of the mth incomplete block in the lth replication of the kth trial, and e ijklm is the residual. Except b mlk , all effects were treated as fixed.

The adjusted means of each genotype and one divided by the squared standard error as weighting factor were used to estimate the genetic variance components across environments (Möhring and Piepho 2009) with the following model:

$$y_{ijn} = \mu + a + l_{n} + p_{ij} + g^{\prime}_{i} + g^{\prime\prime}_{j} + s_{ij} + (pl)_{ijn} + (g^{\prime}l)_{in} + (g^{\prime\prime}l)_{jn} + (sl)_{ijn} + e_{ijn} ,$$

where y ijn is the phenotypic performance of the ijth entry (line i = j, or hybrid i ≠ j) at the nth environment, μ an intercept term, a the group effect for lines, hybrids and checks, l n the effect of the nth environment, p ij the genetic effect of the parental lines, \(g^{\prime}_{i}\) the GCA effect of the ith female line, \(g^{\prime\prime}_{j}\) the GCA effect of the jth male line, s ij the SCA effect of crosses between lines i and j, (pl) ijn the interaction effect of the ijth parental line with the nth environment, \((g^{\prime}l)_{in}\) and \((g^{\prime\prime}l)_{jn} ,\) are GCA × environment effects of female and male lines, (sl) ijn the SCA × environment interaction effect, and e ijn is the residual. Except group effects, all other effects were treated as random.

For the observation plots, statistical analysis was performed assuming that each environment corresponds to one replication (Payne 2006). Thus, the phenotypic data were analyzed with following linear model

$$y_{ijnm} = \mu + a + l_{n} + p_{ij} + g^{\prime}_{i} + g^{\prime\prime}_{j} + s_{ij} + (g^{\prime}l)_{in} + (g^{\prime\prime}l)_{jn} + b_{mn} + e_{ijnm} ,$$

where y ijnm is the phenotypic performance of the ijth entry (line i = j, or hybrid i ≠ j) in the mth incomplete block at the nth environment, μ an intercept term, a the group effect for lines, hybrids and checks, l n the effect of the nth environment, p ij the genetic effect of the parental lines, \(g^{\prime}_{i}\) the GCA effect of the ith female line, \(g^{\prime\prime}_{j}\) the GCA effect of the jth male line, s ij the SCA effect of crosses between lines i and j, \((g^{\prime}l)_{in}\) and \((g^{\prime\prime}l)_{jn} ,\) are GCA × environment interaction effects of female and male lines, respectively, and e ijnm is the residual confounded with SCA × environment interaction effects. Except group effects, all other effects were treated as random.

Significance of the variance components was tested by model comparison with likelihood ratio tests, where the halved P values were used as approximation (Stram and Lee 1994). In addition, we assumed fixed genetic effects and estimated the best linear unbiased estimates (BLUEs) of the 1,749 genotypes. Pair-wise Pearson’s correlation coefficients (r) were calculated among the traits for parental lines and F1 hybrids, and their significance was tested using tabulated values based on Fisher z transformation (Fisher 1921). Broad-sense heritability on an entry-mean basis was calculated as the ratio of genotypic to phenotypic variance, \(h^{2} = \sigma_{\text{G}}^{2} /\left( {\sigma_{\text{G}}^{2} + \sigma_{{{\text{G}} \times {\text{E}}}}^{2} /l + \sigma_{e}^{2} /lr} \right)\), where \(\sigma_{\text{G}}^{2}\) and \(\sigma_{{{\text{G}} \times {\text{E}}}}^{2}\) refer to the total genotypic variance of hybrids (or lines) and their interaction with environments, respectively, l to the number of environments, r to the average number of replications and \(\sigma_{e}^{2}\) to residual variance. For the observation plots, \(\sigma_{{{\text{G}} \times {\text{E}}}}^{2}\) is confounded with \(\sigma_{e}^{2}\).

For each combination of parental lines, midparent value (MP), relative midparent heterosis (MPH), relative better-parent heterosis (BPH) and commercial heterosis (CH) were calculated using hybrid performance (HYB) as follows: MP = (P1 + P2)/2, MPH = [(HYB − MP)/MP] × 100, BPH = [(HYB − P max)/P max] × 100, and CH = [(HYB − best check)/best check] × 100, where P max is the better performing parent. We tested the Pearson’s correlation of HYB with MP, r(MP, HYB), and the GCA effects with per se performance of parents, r(GCA, per se). Further, we also tested the Pearson’s correlation of HYB with the sum of GCA effects of both the parents r(GCA, HYB) using a leave-one-out crossvalidation as explained in detail by Schrag et al. (2009). All analyses were performed using the software ASReml-R 3.0 (Butler et al. 2009).

To combine the single traits in a base index (Baker 1986), we transformed the data by \(y_{ij} = \left( {x_{ij} - x_{k} } \right)/s_{j}\), where x ij is the ith genotypic value for the jth trait, x j and s j the mean, and standard deviation of the jth trait. The sign of the weight of a single trait in the index depended on the aim to increase or decrease the trait.

Results

Genotypic variances \(\sigma_{\text{G}}^{2}\) were significantly (P < 0.05) larger than zero for parents and hybrids for all nine traits (Table 2). Parents and hybrids displayed a similar amount of \(\sigma_{\text{G}}^{2}\) for susceptibility to lodging, leaf rust, yellow rust and powdery mildew. In contrast, we observed larger \(\sigma_{\text{G}}^{2}\) for the parents compared to the hybrids for grain yield, heading time, plant height, and susceptibility to frost as well as septoria tritici blotch. For grain yield, \(\sigma_{{{\text{G}} \times {\text{E}}}}^{2}\) was larger than \(\sigma_{\text{G}}^{2}\), and vice versa for the other traits for both lines and hybrids. Estimates of \(\sigma_{\text{GCA}}^{2}\) were significantly (P < 0.01) larger than zero for all traits, except for lodging susceptibility of the females. For all traits, \(\sigma_{\text{GCA}}^{2}\) was larger for female than that for male lines. Estimates of \(\sigma_{\text{SCA}}^{2}\) were significantly (P < 0.01) larger than zero for all traits, except for septoria tritici blotch susceptibility. Relative to \(\sigma_{\text{G}}^{2}\) of the hybrids, the amount of \(\sigma_{\text{SCA}}^{2}\) was close to zero for plant height, heading time, and susceptibility to frost, leaf rust, powdery mildew, and septoria tritici blotch. In contrast, for grain yield and yellow rust susceptibility, we observed that the proportion of \(\sigma_{\text{SCA}}^{2}\) of the total genotypic variance \(\sigma_{\text{G}}^{2}\) amounted to 33 and 28 %, respectively. Heritabilities were moderate to high, showed only small discrepancies for parents and hybrids, and ranged from 0.33 for lodging susceptibility to 0.89 for frost susceptibility.

Table 2 Estimates of variance components, heritabilities, mean and range of midparent (MPH) and better-parent heterosis (BPH), as well as correlations between midparent values and hybrid performance r(MP, HYB), general combining ability effects and hybrid performance r(GCA, HYB), as well as between general combining ability effects and line per se performance r(GCA, per se)

In contrast to grain yield, negative values for heterosis are preferable for the remaining traits as the goal was to develop early sturdy hybrid varieties with low susceptibility against abiotic and biotic stresses. Thus, lower heading time, plant height and susceptibility to frost, lodging and disease are desired (Table 1). Compared to the midparent performance, hybrids had on average a higher grain yield (10.7 %), plant height (9.16 %) and lodging susceptibility (21.7 %), but lower susceptibility to frost (−7.2 %), leaf rust (−8.42 %), and septoria tritici blotch (−9.27 %; Table 2). Midparent heterosis for grain yield ranged from −7.2 to 28.6 %. Compared to the better-parent, hybrids had on average a 7.66 % higher grain yield, were taller, later heading time, and less susceptibility to frost and all diseases scored.

Midparent performance was significantly (P < 0.01) correlated with hybrid performance for all traits (Table 2). The lowest correlation was observed for lodging susceptibility with a value of r = 0.28 (P < 0.01) and the largest value for frost susceptibility of r = 0.85 (P < 0.01). The correlation between GCA predicted and observed hybrid performance was generally higher than the correlation between midparent and hybrid performance. Correlation between GCA effects and line per se performance was moderate to high ranging from r = 0.49 (P < 0.01) for lodging susceptibility to r = 0.86 (P < 0.01) for frost susceptibility.

The pattern of phenotypic trait correlations was similar for parents and hybrids (Table 3). We observed a significant (P < 0.01) and moderate correlation between grain yield and plant height with values of r = 0.43 for the parents and r = 0.26 for the hybrids. A significant (P < 0.01) and moderate correlation was further detected between plant height and frost susceptibility as well as between plant height and septoria tritici blotch susceptibility.

Table 3 Phenotypic correlation coefficients among 9 traits determined for 135 parental lines (below diagonal) and their 1,604 hybrids (above diagonal)

Distribution of phenotypic values of the hybrids for each trait approximated a normal distribution (Fig. 1). The commercial line cultivar with the highest grain yield in our study was Tobak with an average of 11.30 Mg ha−1. Sixty-nine hybrids had a significantly (P < 0.05) higher grain yield than Tobak and the best hybrid had an average grain yield of 12.79 Mg ha−1. Positive commercial heterosis was further identified for all other traits. Ranking of the commercial checks and hybrids, however, changed substantially depending on the trait of interest. For grain yield, 11 further hybrids were not significantly (P < 0.05) different than the best hybrid (Fig. 2). Nearly all of these 12 hybrids had a female parent with very high GCA effects, while only 7 of the 12 hybrids had a male parent with high GCA value. Interestingly, the 12 hybrids comprised combinations among but also within particular breeding programs (data not shown).

Fig. 1
figure 1

Distribution of the phenotypic values of 1,604 hybrids compared to the 10 commercial checks for 9 traits (line checks: 1 = Tobak, 3 = JB Asano, 4 = Julius, 6 = Colonia, 7 = Kredo, 8 = Tuerkis, 9 = Tabasco, 10 = Genius; hybrid checks: 2 = As de Coeur, 5 = Hystar; H = hybrid mean)

Fig. 2
figure 2

Heat map of the grain yield performance of the 1,604 hybrids evaluated in our study. The columns and rows of the 15 male lines and the 120 female lines were ordered according to their GCA values for grain yield. For each male times female combination, hybrid values were shown in colored boxes (yellow = good, red = poor; black boxes = missing hybrids, blue boxes = hybrids not significantly different from the best hybrid, which is marked in green) (color figure online)

Discussion

The decision to embark on a hybrid breeding program strongly depends on the long-term competitiveness of hybrid versus line breeding. Detailed knowledge of the amount of heterosis and the ratio of genetic variance components are of paramount importance to study the long-term selection gain of hybrid wheat breeding. Only a few experimental surveys investigated quantitative genetic parameters, which are relevant to gain insights into the potential of hybrid wheat breeding. Previous reports have the major shortcomings of either the use of unbalanced and often small data sets or the consideration of a limited set of traits (for review, see Longin et al. 2012). This stimulated us to initiate a joint public–private partnership to study a large population of wheat hybrids and their parental inbred lines.

Heterosis and hybrid performance

We observed an average midparent heterosis for grain yield of 10.7 % (Table 2), which is in the same range as previous findings in wheat (Barbosa-Neto et al. 1996; Perenzin et al. 1998; Oury et al. 2000). Average better-parent heterosis for grain yield was slightly smaller than midparent heterosis, but maximum values amounted up to 23.8 %. This value for better-parent heterosis is higher than any one previously reported in the literature (Borghi and Perenzin 1994; Barbosa-Neto et al. 1996; Perenzin et al. 1998; Corbellini et al. 2002) and might reflect the first success in the search for excellent male parental lines in European hybrid breeding programs.

Schachschneider (1997) as well as Weißmann and Weißmann (2002) estimated the yield surpluses needed to counterbalance the higher seed costs of hybrid versus line varieties. Both studies regarded a commercial heterosis of about 1 Mg ha−1 as sufficient to justify higher seed production costs. Eleven hybrids had a commercial heterosis for grain yield exceeding this threshold of 1 Mg ha−1 with a maximum grain yield advantage of 1.3 Mg ha−1 (Fig. 1). Consequently, our result suggests that a competitive hybrid breeding program seems feasible in wheat.

For heading time, heterosis was close to zero (Table 2), which is also in accordance with the literature (Borghi and Perenzin 1994; Perenzin et al. 1998; Corbellini et al. 2002). For plant height, we observed a substantial amount of heterosis, which was considerably higher than reported in the literature (Borghi and Perenzin 1994; Perenzin et al. 1998; Corbellini et al. 2002). Plant height is often associated with early vigor (Spielmeyer et al. 2007). Increased vigor is desired, especially in the first growing phase, because the plant rapidly covers the soil, thus preventing the growth of herbs and exploiting the photosynthetic potential per area in an efficient manner. During later stages of plant development, this can cause lodging susceptibility as reflected by the moderate but positive correlation between plant height and lodging susceptibility (Table 3). This correlation, however, does not prevent to select hybrids combining lodging tolerance and high grain yield. For instance, of the 20 hybrids with the highest grain yield, only 4 hybrids showed an increased lodging susceptibility (>3, data not shown).

Predicting hybrid performance

In hybrid breeding, a high number of parental lines are available and, therefore, not all potential factorial crosses can be evaluated in field trials. Thus, efficient methods are required to predict the hybrid performance. In allogamous species, new lines were usually selected as hybrid parents based on their line per se performance or their GCA effects (Hallauer et al. 1988). The efficiency of this selection strongly depends on the correlation between hybrid performance and midparent value r(MP, HYB) or the sum of GCA effects of both parents r(GCA, HYB). Further aspects on the implementation of estimation of GCA effects in top cross tests are discussed in detail elsewhere (Gowda et al. 2012). For plant height, heading time and susceptibility to frost and leaf rust, we observed a correlation between midparent and hybrid performance of r > 0.70 (Table 2). Thus, for these traits, line per se performance serves as an accurate predictor of the hybrid performance.

For grain yield, we observed a correlation between midparent and hybrid performance of r = 0.49 (P < 0.01), while the correlation increased to r = 0.60 (P < 0.01) for GCA effects instead of line per se performance (Table 2; Supplementary Figure S2). These findings were in accordance with results reported for triticale (Oettler et al. 2005). One explanation for this moderate correlation is the high extent of \(\sigma_{\text{SCA}}^{2}\) compared to \(\sigma_{\text{GCA}}^{2}\) observed for grain yield (Table 2), which corroborates previous findings in wheat (Gowda et al. 2012). The \({{\sigma_{\text{GCA}}^{2} } \mathord{\left/ {\vphantom {{\sigma_{\text{GCA}}^{2} } {\sigma_{\text{SCA}}^{2} }}} \right. \kern-0pt} {\sigma_{\text{SCA}}^{2} }}\) ratio varies depending on the allele frequencies between parental populations (Reif et al. 2007). Increasing genetic divergence of the two parental populations tends to promote a high ratio of \(\sigma_{\text{GCA}}^{2}\) versus \(\sigma_{\text{SCA}}^{2}\) (Fischer et al. 2008). Consequently, the low \({{\sigma_{\text{GCA}}^{2} } \mathord{\left/ {\vphantom {{\sigma_{\text{GCA}}^{2} } {\sigma_{\text{SCA}}^{2} }}} \right. \kern-0pt} {\sigma_{\text{SCA}}^{2} }}\) ratio observed in wheat may be due to the lack of genetically diverse groups (Miedaner et al. 2010; Würschum et al. 2013). Applying inter-population improvement programs in wheat, keeping the male and female pool strictly separated, possesses a great potential to generate a long-term genetic divergence among heterotic pools and promotes, therefore, also a more favorable \({{\sigma_{\text{GCA}}^{2} } \mathord{\left/ {\vphantom {{\sigma_{\text{GCA}}^{2} } {\sigma_{\text{SCA}}^{2} }}} \right. \kern-0pt} {\sigma_{\text{SCA}}^{2} }}\) ratio (Labate et al. 1997).

The relevance of \(\sigma_{\text{SCA}}^{2}\) makes it favorable to use a high number of tester lines or to use complex testers, like double crosses or line mixtures, as recommended for maize and rye hybrid breeding (Hallauer and Miranda 1981; Tomerius 2001; Longin et al. 2007). This advantage is overestimated due to the unrealistic assumption that tester lines would be unrelated in elite breeding. As the relatedness of additional tester lines increases, there is less reduction of the masking SCA effects (Appendix, Supplementary Figure S1). For instance, the efficiency in reducing SCA is reduced by >50 % for a coefficient of coancestry between tester T1 and T2 of 0.25, e.g., half sibs, compared to non-related testers. Furthermore, for wheat, practical concerns for the use of complex testers arise from the current hybrid seed production system. Due to pollination problems, typically female lines cannot be used as pollinators like in maize. As commercial seed production is only feasible by chemical hybridization agents, homogenous flowering time and a smaller plant height than males is required for females complicating the use of complex testers. Nevertheless, the use of single-cross male tester for first GCA screenings of females is still an interesting option.

Additive genetic variance in populations of lines versus hybrids

If we assume absence of epistasis, \(\sigma_{\text{G}}^{2}\) among inbred lines and \(\sigma_{\text{GCA}}^{2}\) estimated for the hybrids reflect the additive genetic variation, which can be exploited in line or hybrid breeding in a recurrent manner. For grain yield, the magnitude of the \(\sigma_{\text{GCA}}^{2}\) reached only 43 % of the genetic variance observed among the parental lines (Table 2). This is astonishing considering that the substantial magnitude of \(\sigma_{\text{SCA}}^{2}\) observed for grain yield points toward the presence of significant dominance effects. Consequently, a ratio of \(\sigma_{\text{GCA}}^{2}\) versus \(\sigma_{\text{G}}^{2}\) of the lines higher than 0.5 is expected (Longin et al. 2012). A likely explanation for this low ratio is the presence of epistasis. Focusing exclusively on the epistatic variance due to additive × additive interactions (\(\sigma_{\text{AA}}^{2}\)) and an inbreeding coefficient of one for the inbred lines, the expected genetic variance among inbred lines in line breeding is \(\sigma_{\text{G}}^{2} = 2\sigma_{\text{A}}^{2} + 4\sigma_{\text{AA}}^{2} .\) In contrast, expected genetic variance among inbred lines in hybrid breeding is \(\sigma_{\text{GCA}}^{2} = 0.5\sigma_{\text{A}}^{2} + 0.25\sigma_{\text{AA}}^{2}\) and \(\sigma_{\text{SCA}}^{2} = \sigma_{\text{D}}^{2} + 0.5\sigma_{\text{AA}}^{2}\) (Wricke and Weber 1986; Reif et al. 2007). Thus, additive × additive epistasis is a potential explanation for the low variance due to GCA effects compared to the variance among lines for grain yield. Consequently, further experimental studies are required with special field designs to gain more insights into the relative importance of variance components due to additive, dominance, and epistatic effects facilitating the comparison of the long-term efficiency of line versus hybrid breeding. Moreover, detailed knowledge of the relative importance of the different genetic effects is of great value to judge the potential to increase the prediction accuracy of hybrid performance including besides additive also dominance and epistatic effects (Zhao et al. 2013).

Multi-stage selection for hybrid wheat breeding

Breeders are often concerned that intensive selection in early generations for traits with high heritability, such as disease resistances, might reduce largely the genetic variability for grain yield. This fear is even more pronounced in hybrid wheat breeding, where parental lines are intensively pre-selected for high per se performance showing only moderate correlations with the hybrid performance, e.g., r = 0.49 for grain yield. We investigated the potential elimination of high yielding hybrids through early stage selection using an index combining line per se performance of grain yield, plant height, heading time, and susceptibility to frost, lodging, leaf rust, yellow rust, powdery mildew and septoria tritici blotch with equal weights (Supplementary Figure S3, open circles). In contrast to our concerns, no correlation between the index values and GCA for grain yield was determined.

Instead of using an index of all traits, a line might be already eliminated in early generations if it does not surpass a given threshold in one trait regardless of its performance in the other traits, i.e., selection of independent culling levels (Baker 1986). Thus, we eliminated all lines with a frost susceptibility >6.5, disease susceptibility >5 and the 30 % poorest lines regarding grain yield per se performance (Supplementary Figure S3, filled circles). The resulting selected lines still cover the full range of GCA values for grain yield. Consequently, early selection of line per se performance does not result in a drastic reduction of \(\sigma_{\text{GCA}}^{2}\) for grain yield in later stages of selection.

Choice of parents in hybrid wheat breeding programs

Selection of parents for the next breeding cycle represents one of the most crucial steps for maximizing selection gain (Wegenast et al. 2008). One relevant criterion for parental selection is the breeding value of a genotype (Bernardo 2002). Breeding values depend on the allele frequencies of the population, the inbreeding coefficient, as well as on the additive and dominance effects (Falconer and Mackay 1996). According to a companion study, differences in allele frequencies were not pronounced between the 120 female and the 15 male parental lines (Zhao et al. 2013). However, the presence of dominance effects may cause substantial differences in breeding values warranting further research.

Assuming absence of epistasis, either per se performance of a line or its GCA effect can be considered as an estimate of the breeding value. For grain yield, the correlation among GCA effects and line per se performance amounted to r = 0.72 (P < 0.01). This suggests that the decision for parental selection is similar for line and hybrid breeding when focusing exclusively on grain yield. Consequently, hybrid breeding can be conducted successfully as a spin-off from line breeding. Nevertheless, we speculate that with further development of heterotic groups, breeding values will diverge with ongoing reciprocal recurrent selection. In combination with the option to perform disease gene management in a synergistic manner between male and female parent pools, choice of parents will differ between line and hybrid breeding requiring a separate breeding program for hybrids.