Introduction

Since its first introduction in 1998 (Haley and Visscher) and its extension with methods like ridge regression BLUP and Bayesian methods to calculate genomic estimated breeding values (Meuwissen et al. 2001), genomic selection (GS) has become a highly attractive complement or even alternative to conventional selection strategies employed in animal and plant breeding (Desta and Ortiz 2014; Henryon et al. 2014). The advantages of GS, in comparison to phenotypic selection, include the ability to predict breeding values within families (as opposed to only between families in pedigree based approaches), to generate and use genetic information for the planning of crosses, the provision of phenotypic values if they cannot be observed in a given season (e.g., no disease pressure present), and, in particular, the shortening of the breeding cycle (Hickey et al. 2014).

Owing to the limited availability of marker resources in the past, marker-assisted selection in barley was essentially restricted to monitor the inheritance of single QTL alleles with large effects, or of monogenic traits such as disease resistances (e.g., Graner et al. 1999; Hofmann et al. 2013). Regarding the latter, marker-assisted selection represents an instrumental tool for the rapid introgression of monogenic traits from unadapted germplasm into elite lines (König et al. 2012; Werner et al. 2007) and to minimize linkage drag during subsequent backcross cycles (König et al. 2012; Miedaner and Korzun 2012; Werner et al. 2007).

Regarding quantitative traits, which are characterized by a large number of small effect loci, marker-assisted selection can harness only a limited portion of the genetic variation. By contrast, GS captures a much larger spread of genetic variation: because a large number of markers cover the entire genome, all chromosome segments are simultaneously considered for analysis to predict the breeding value of a given progeny line or genotype (Meuwissen et al. 2001). GS captures all QTL effects irrespective of size by estimating allele effects of all markers simultaneously, using phenotypes and marker data of the training set (TS). These effects are then used to estimate the breeding value of selection candidates, based solely on their allelic profile. The accuracy of GS is crucial for its success and effectiveness in a practical breeding scheme. Important factors influencing accuracy include heritability of the trait under consideration, number of markers used, statistical models employed, linkage disequilibrium, effective population size, relationship between TS and selection candidates, population structure and size of the TS (Isidro et al. 2015).

In plant breeding, GS has been employed mostly to yield and yield-related traits, e.g., thousand kernel weight in barley (Schmid and Thorwarth 2014), grain moisture and grain yield in maize (Endelman et al. 2014; Zhao et al. 2012), grain yield, plant height, starch- and total pentosan content in rye (Wang et al. 2014), grain yield, plant height, flowering time in rice (Spindel et al. 2015) or grain yield in wheat (Poland et al. 2012).

Barley is a major crop with wide adaptability grown on all continents except for Antarctica. About 60–70 % of the global production is used for feed, while malt for brewing, distilling and food makes up for 30–40 % (Shewry and Ullrich 2014, pp. 1–9). The use of barley as a substrate of fermentable carbohydrates and a source of enzymes for beer production requires high quality seed as a raw material. Malting quality is a complex trait that is described by a series of trait components which follow complex inheritance. A series of Quantitative Trait Loci (QTL) have been identified over the past decade, shedding light on the genetic architecture of malting quality and functional genomics studies have been performed to identify candidate genes (for review, see Shewry and Ullrich (2014, pp. 293–309).

These trait components are expensive to measure and require relatively large amounts of seed. Therefore, in practical breeding, quality tests are restricted to advanced generations and marker-assisted selection during early generations is limited to few genes and major QTLs, e.g., ß-amylase activity (Bmy1) (Chiapparino et al. 2006) or ß-glucan content (Han et al. 1997). Often the first reliable phenotypic quality results are available just before entering National List Trials. Against this backdrop, genomic-estimated breeding values (GEBVs) have the potential to (a) save costs if used to select candidates with high malting quality without phenotyping and (b) to allow for a (pre)selection of the target quality traits in early stages of the breeding program.

To explore the applicability of GS in barley for the complex trait malting quality, this study aimed at the establishment of training populations for two malting barley breeding programs and to assess the potential of GS to efficiently improve malting quality components in elite germplasm within the framework of a practical barley breeding program.

Materials and methods

Phenotypic data and analysis

Plant material

The plant material used for spring and winter barley consists of elite material, i.e., already registered lines and/or potential candidates for registration. They have been selected for yield, yield stability, resistances and quality scores if available. In winter barley, all genotypes are DH lines. In spring barley, 80 % of the materials are DH lines and 20 % have been derived from an SSD breeding scheme. The developed breeding material, excluding listed varieties, remains the property of KWS LOCHOW GMBH and cannot be provided without specific legal agreement.

Traits measured and structure of data

For spring barley, phenotypic data were available for 4 years (2011–2014) with three to five locations per year. All trials were laid out as alpha lattice designs in two or three replications. Phenotypic trial structure in the spring barley data was not completely orthogonal. The number of lines per location within a year ranged from 7 (BAI, UK) to 279 (WOH, GER) with a mean of 107 lines per location within a year and 116 (2012) to 294 (2014) unique lines per year over the duration of the study. There were 12 check lines and up to overlapping 37 lines between years.

In winter barley, orthogonal sets of lines were grown at four locations in each of the three seasons 2010/2011 (2011), 2011/2012 (2012) and 2012/2013 (2013). In those seasons, different combinations of locations were used. In 2011 and 2012, 64 lines were tested in each location and 54 lines in 2013. Trials were laid out as alpha lattice designs with two replications per location. At least two overlapping locations across all years were present. The overall number of lines, including feed type varieties, in trials was 148 with 12 check lines and up to 17 lines overlapping between different years.

Field trials were conducted by KWS LOCHOW GmbH using standard growing practices including fungicide treatment. Malting quality analyses were carried out by VLB (Versuchs- und Lehranstalt für Brauerei, Berlin, Germany).

For both spring and winter barley, 12 malting traits were analyzed in this study. Descriptions of these traits have been given in Potokina et al. (2004).

AMA = alpha amylase activity (dry extract units/g dry weight), AMB = beta amylase activity (dry extract units/g dry weight), EXT = extract (% dry weight), FAN_L = free amino nitrogen (mg/l), Fin_At = final attenuation (%), FRI = friability (%), GLU = beta glucan content (mg/l), KOL = kolbach index (%) (percentage of denatured proteins), LOSS = malting loss (while malting, added water leads to growing of radicals, these are cleaned and the loss of mass during cleaning is measured, associated with early germination) (%), NIT = soluble nitrogen (mg/100 g dry weight), PRT = protein content (% dry weight), VIS = viscosity (connected to GLU, high molecular carbon hydrates, solubility of carbon hydrates) (8.6 %, mPas*s).

Calculation of BLUEs and heritabilities

Phenotypic data for spring barley were analyzed in a two-stage model. First, a within trial single location analysis was carried out using Plabstat (Utz 2011). The final adjusted means over locations, trials, and years were estimated with a mixed model using ASreml software (Butler 2009). In this model genotype was set as fixed term and all other factors as random terms [main factors being year, location and trial and interaction factors (genotype:year), (genotype:location), (genotype:year:location)]. The connection between the years was accomplished with at least 12 check lines overlapping between years. Broad sense heritabilities were estimated with the same formula as for winter barley (see below) but without replication because single location results provided by the breeder were already adjusted for replication.

The phenotypic data for the winter barley material were analyzed in a single-stage mixed model approach combining the 3 years with ASreml. In the mixed model equation, genotype was treated as fixed effect while year and location as well as all their interactions with genotypes were assumed to be random effects, resulting in BLUEs (best linear unbiased estimators) for the phenotypic means of each genotype and trait. Twelve lines were grown in all 3 years and up to 17 lines connect two consecutive years.

Broad sense heritabilities for winter barley were estimated as:

$$h^{2} = \frac{{\sigma_{G}^{2} }}{{\sigma_{G}^{2} + \frac{{\sigma_{{\text{GL}}}^{2} }}{L} + \frac{{\sigma_{{\text{GY}}}^{2} }}{Y} + \frac{{\sigma_{{\text{GLY}}}^{2} }}{Y*L} + \frac{{\sigma_{\varepsilon }^{2} }}{Y*L*R}}}$$

where σ 2G, σ 2GL, σ 2GY, σ 2GLY and σ 2ϵ are the genotype, genotype x location, genotype × year, genotype × location × year and residual variance, respectively, and L, Y and R are the number of locations, years, and replications. Since numbers of years and locations varied between trials, harmonic means were used as the denominator instead of absolute values.

For some traits (AMA, Fin_At, KOL, Loss, VIS), genotype x environment interaction components could not be separated from the residual. In these cases, a reduced model was used with the main effects genotype, location, year and the residual error (containing all the interaction terms with genotype), which was divided by the harmonic mean of locations, years and two replications in the resulting reduced formula for h 2.

Correlations between phenotypic means in different years were calculated from checks and other lines present across years using adjusted single-year data.

Genotypic data and analysis

Genotyping and data cleaning

An Illumina 9 k array was employed for genotypic analysis (Comadran et al. 2012). Raw data were cleaned by removing markers with a minor allele frequency (maf) ≤0.01, more than 20 % missing data points, and more than 20 % heterozygous allele calls. Lines with ≥20 % missing data points and ≥15 % heterozygous calls were also excluded. Missing values were imputed using Beagle (Browning and Browning 2007) with a confidence level of 0.95, using only information based on neighboring markers, as identified from the genetic map. Where genotypically identical individuals were present, all but one were excluded from the training set. Out of 7916 potential markers, 3958 (50 %) were mapped on the Morex × Barke genetic map from Comadran et al. (2012), with a total chromosome length of 2127.4 cM.

Analysis of population structure

Population structure was estimated with principal component analysis. Genetic distances for population analysis were based on modified Roger’s Distance, as implemented in the R package SelectionTools (Frisch 2015). To estimate the probability for the number of groups for spring and winter barley, the Calinski criterion (Calińskia and Harabasza 1974) from the vegan R-package (Oksanen et al. 2014) was used.

Analysis of linkage disequilibrium

The Synbreed R-package (Wimmer et al. 2012) was used for creating the Linkage Disequilibrium (LD)-plots. LD-decay was estimated using the 95 % percentile of all pairwise LD values from all mapped markers between chromosomes, excluding LD results from markers on the same chromosome. This value was used as cutoff LD or intercept to estimate the average LD size of each chromosome for the LD decay smoothing curve. The average across all chromosomes was used as final LD estimate. This was done separately for spring and winter barley.

Estimation of GEBVs

To estimate the genetic effects of the cleaned SNPs, the following linear model (RR-BLUP) was used, as described by Endelman (2011):

$$y = X\beta + Zu + \varepsilon$$

where y is a numeric vector of the adjusted means of the corresponding trait, β is a vector of fixed effects, X is the design matrix of fixed effects (here all = 1 because all effects are adjusted within the phenotypic analysis), Z is the marker matrix assigning marker genotypes to phenotypes (y) with dimension rows equal to the number of phenotypes and columns equal to the number of markers. Genetic effects are given in vector u and ε is the residual error. The latter were assumed to follow a normal distribution with mean 0 and variance σ 2u and σ 2ε, respectively. The sum of all allele effects is the GEBV of a line. Prediction error variances (PEV) to estimate the reliability of single GEBVs were calculated for all individuals and traits for spring and winter barley.

For estimating unshrunken GEBVs, a linear model (lm) was applied in which centered phenotypic and genotypic means were fitted using the following formula:

$${\text{GEBVus }} = {\bar{\text{x}}}_{\text{Phenotypic means}} + \left( {{\bar{\text{x}}}_{{{\text{Centered GEBV }}}} - {\text{Intercept }}\left( {\text{lm}} \right)} \right)/{\text{Regression coefficient }}\left( {\text{lm}} \right)$$

Calculation of predictive abilities (PA) Predictive abilities were obtained separately for each trait from fivefold cross validation, replicated five times. Within one replication, the data set is randomly split into five sets of equal size, of which four are combined as “training set” to predict the fifth set. This is then repeated such that each of the sets is four times part of the training set and predicted once. The mean correlation of the predicted value to the observed phenotype over all folds and replications is the predictive ability.

Results

Traits and phenotypes

For ten of the twelve malting quality traits studied (AMA, AMB, EXT, FAN_L, Fin_At, FRI, KOL, LOSS, NIT and PRT), the breeding goal is to increase the values in the breeding lines, whereas a reduction of the values is the goal for GLU and VIS. In the spring barley breeding material, the phenotypes were higher than in winter barley for six traits (AMA, EXT, FAN_L, Fin_AT, FRI and KOL) and lower for GLU and VIS where those results are desired. The difference between spring and winter barley ranged from +19 % for FRI to −33 % for AMB. The mean standard deviation of the traits for spring barley was 31.74 and 34.66 for winter barley. In relation to the corresponding mean, spring barley showed a 12 % higher mean standard deviation of all traits on average (Supplementary Table S1).

Broad sense heritabilities for spring barley ranged between 0.50 for AMA and >0.80 for EXT, Fin_At, FRI, GLU, KOL and NIT. For winter barley, heritabilities ranged from 0.71 for KOL to 0.98 for FRI and GLU (Table 1).

Table 1 Heritabilities for the twelve analyzed malting quality traits, with total number of locations (#loc), years (#yr) and harmonized number of locations (#loc_hm) and years (#yr_hm) for spring barley and winter barley

Correlations of phenotypic mean values over years for winter barley, assessed from 12 resp. 17 overlapping lines, ranged between 0.727 and 0.997 (Supplementary Table S3).

Genotypic data

After data cleaning, 4095 markers remained for spring and 4359 for winter barley. For a total chromosome length of 2127.4 cM, this yielded on average 1.9 and 2 markers per cM, respectively.

Assessment of population structure by principal component analysis revealed that spring barley did not show any subgroups (Fig. 1). The first two principal components together explained 15 % of the variance; the Calinsky criterion had a maximum at one group. In winter barley, mild subpopulation structure was detected. The Calinksy criterion had a maximum for three groups and the first two components explained 31 % of the variance.

Fig. 1
figure 1

Principle component analysis of spring barley (left) and winter barley (right). First two principle components explain 15- and 31 % of the variance

The family structure was different between both groups. In spring barley, there were 80 unique parents in the training population. These represent all parents for spring barley in this data set. Each of them was present with a maximum of 10 % of the 260 unique crosses. Sister plants have the same parents and were counted as a unique cross. 50 % of the crosses were single crosses, 42 % three-way crosses with three different parents, and 8 % were four-way crosses or a three-way cross with overlapping parents (AxB)xA which is similar to a back cross. In winter barley, 45 unique parents were used, resulting in 55 unique crosses. Some varieties occurred frequently in the pedigree of the analyzed lines. The most frequent line, Wintmalt, was present in 41 of the 102 pedigrees. From all crosses, 41 % were two-way crosses and 59 % were three-way crosses. The mean genetic distance of the winter barley lines in the trainings set was 0.18 ± 0.09, whereas spring barley showed a mean distance of 0.35 ± 0.07.

Spring barley showed an average LD block size of 60.46 ± 14.24 cM, with a minimum of 42.04 on chromosome 4H and a maximum of 77.68 on 2H. Compared to spring barley, winter barley had a smaller average LD block size of 16.45 ± 7.8 cM, with a minimum of 9.25 on chromosome 4H and a maximum of 31.76 on 7H (Fig. 2a, b). In spring barley, the other chromosomes showed LD block sizes in cM of 63.7 (1H), 50.66 (3H), 46.4 (5H), 75.12 (6H), 68.33 (7H), and in winter barley 11.85 (1H), 20.86 (2H), 14.1 (3H), 16.87 (5H), 10.49 (6H) and 31.76 cM (7H). Minor allele frequencies were unequally distributed between both groups (Supplementary Figure S1). Winter barley showed a higher proportion of extreme allele frequencies with 60 % of markers having a minor allele frequency lower than 0.01 compared to spring barley when compared to 40 % of markers with maf < 0.01.

Fig. 2
figure 2

LD decay on chromosome 1H in spring barley (a, left) and winter barley (b, right), with an LD decay of 63.7 and 11.85 cM, respectively

GEBVs

All predictive abilities (PA) presented were estimated with RR-Blup (Endelman 2011). Other tested methods such as Elastic-Net and Lasso (Friedman et al. 2010) did not yield different GEBVs or predictive abilities (results not shown).

The training sets consisted of between 65 (FANL) and 424 (e.g., EXT) individuals for the different traits in spring barley and of 102 individuals in winter barley for all traits (Table 2).

Table 2 Predictive abilities (PA), standard deviation of the predictive ability after fivefold cross validation and size of the available training sets (TS) for spring and winter barley

In spring barley, the lowest PA was observed for AMB with 0.14 (Table 2). Medium PA of 0.42 to 0.5 were observed for AMA, FANL, Fin_At, GLU, KOL, LOSS and VIS. High PA of above 0.5 up to 0.58 were observed for EXT, FRI and NIT. In winter barley, the lowest PA of 0.4 were observed for PRT, whereas for all other traits medium to high PA were observed with the highest PA of 0.8 for GLU. The average PA for spring barley were 0.47 as compared to 0.63 for winter barley.

When training set size was small (e.g., 65), larger standard deviations were observed in spring barley. This amounted to an increase of the standard deviation in % of the PA value from 18 to 44 % when training set size decreased from 424 to 65, with a maximum of 153 % for AMB (stddev higher than PA, lowest PA of all traits) and a minimum of 12 % for PRT. In winter barley, training set size was constant with 102 individuals for all traits. The standard deviations ranged from 14 % (FRI) to 53 % (AMA). The proportions of the standard deviations, compared to the PA, were slightly higher for spring barley with 33 % as compared to winter barley with 27 %.

Reliabilities based on PEV were medium to high for both spring and winter barley with 0.54–0.74 for spring barley and 0.63–0.77 for winter barley. The prediction error variance showed values in a wide range for both spring and winter barley (Supplementary Table S1). In spring barley, it ranged from very low values of 0–0.17 (VIS, PRT, LOSS and EXT) to extremely high values of 7340.68 (GLU) to 7518.41 (AMB). In winter barley, PEV values ranged from 0.01 to 0.32 (LOSS, EXT, PRT, VIS) and up to 15473.62 (GLU) to 16296.76 (AMB). The same three traits (AMB, GLU and NIT) showed the highest PEV values in both spring and winter barley.

The correlations of GEBVs to phenotypes deliberately assumed as non-observed (Table 3) had a range from 0.23 (AMA) to 0.67 (FRI). The correlation to the PA was 0.35 (excluding auto correlation) and 0.53 (mean correlation if all lines were in the training set to phenotypes in 2011, 2012 and 2013). In consecutive years, correlations increased, e.g., the lowest mean of all tested correlations with 0.36 was observed when 2012 and 2013 were used as training sets. Strong variation was observed within the traits, e.g., VIS had a minimum of 0 (2012 only to predict 2011) and a maximum of 0.87.

Table 3 Correlations of GEBVs to adjusted phenotypes of different training set (TS) combinations for winter barley over different year combinations to a given prediction set (PS)

Discussion

Over the past decade, genome analysis of barley has greatly benefitted from the augmenting sequence information which provided the basis for the establishment of high-throughput marker systems (Close et al. 2009; Comadran et al. 2012) These have paved the way for the development and implementation of novel approaches for trait mapping, such as genome-wide association analysis and, at the applied level, genomic selection (GS). Regarding the latter, we have investigated in the present study the potential and limitations of GS to predict the expression of malting quality in the two major gene pools, spring- and winter barley, which have evolved their genetic distinctness as a result of directed breeding and selection.

The major findings of this study were (1) in narrow genetic material “small” training sets of around 100 individuals were sufficient to achieve PA up to 0.798, depending on the trait and the corresponding heritability, (2) increasing the training population size leads to a higher stability of PA and (3) reliabilities of the GEBVs indicate that for many of the traits, the chances of applying GS as selection tool are good, especially in winter barley.

Phenotypic data

Spring barley heritabilities were on average 0.07 lower than those of winter barley, which can be partly explained by the reduced balance of the phenotypic data and also in some cases by the lower number of locations and years. In general, the heritabilities for all traits exceeded 0.5, which was expected with the available number of years and locations. The observed heritabilities and corresponding PA indicate that GS can be applied. Field trial designs might have to be adjusted for the requirements of calculating BLUEs across different years by, e.g., improving connectivity over years by a sufficient number of checks, increasing the number of plots per entry or increasing the number of entries, as indicated by Endelman et al. (2014). When GS is fully implemented into a breeding scheme, possibilities arise to divide the breeding population into a carefully managed training set that is phenotyped more extensively (more locations) to deliver reliable BLUEs and consequently reliable GEBVs, and the rest of the breeding population for which phenotyping is reduced to a minimum and selection mainly based on GEBV instead.

In comparison to winter barley, spring barley lines were characterized by a 33 % lower Beta amylase activity (AMB). One explanation could be that the priorities of other traits in spring barley were higher in the last selection cycles and that the current level was sufficient for current malting quality requirements.

Relatedness, LD, population structure, genetic distances

The background noise of LD was estimated by the LD of markers between chromosomes, using the 95 % percentile as cutoff to estimate the average length of markers in LD. The cutoff of spring barley (0.032) was 5.5 times smaller than the winter barley cutoff (0.176), resulting in average LD block length difference of 44.1 cM. This mathematical translation into the average length of haplotype blocks may not be exact because the cutoff value is crucial and only estimated by this elite material. If only the dots are compared, the decay between spring- and winter barley is more similar. A recombination analysis may lead to different results. This is thought to be due to longer and more intensive selection of spring barley for malting quality. The concentration of elite breeding material should increase average LD block sizes in comparison to more unselected material, wide crosses, exotic or diversity material. The large difference can partly be explained by the lower background noise of spring barley, which was used as the basis to estimate the cutoff value. Since only inter chromosomal distances were used, the employed map, whose mapping parents (Barke × Morex) were two- and six rowed but both spring barley varieties, should not have an impact. Comparable LD sizes in barley were also observed by Rostoks et al. (2006) who observed LD over distances up to 60 cM in controlled crosses. In conventional breeding programs, crosses are usually genetically narrow, especially if the breeding program has limited genetic resources and traits have limiting thresholds to get registered as official varieties. Both breeding programs were affected by these limitations, avoiding wide crosses and limited introduction of new resources. This in sum can explain the observed extent of LD. The family structure explains the larger mean genetic distances within the spring barley breeding program. In spring barley, more unique parents were used, resulting in more divergent crosses relative to winter barley. The number of markers used should sufficiently capture all chromosome segments, especially in spring- but also winter barley.

Reliability and accuracy of GEBVs

The accuracy of GS usually increases with the relatedness of training and prediction sets (Zhong et al. 2009). Genetic diversity analysis showed that the mean Roger’s distance was considerably lower for winter barley and, therefore, the reduced size of the training sets did not result in a lower PA when compared to spring barley (Table 2). The predictive abilities for spring barley mostly had a lower standard deviation than winter barley, because the size of the training sets was larger for most traits. Over all traits, the PA of winter barley were on average 0.163 above spring barley, e.g., in AMB and GLU the PA of winter barley were 0.465 and 0.319 above spring barley. Only for NIT and PRT, spring barley showed a PA above winter barley (0.032 and 0.122). The pedigree structure of winter barley, where the frequently used parent Wintmalt was included in the training set, and the smaller mean genetic distance in comparison to spring barley are two explanations for increased mean PA. The size of the training sets and the standard deviations of the PA were negatively correlated (Table 2).

Reliabilities of the GEBV indicate that for many of the traits, the prospects of applying GS as selection tool are good, especially in winter barley. The higher the PA and the reliability, the more reliable the GS will be and the sharper GS selection can be.

The knowledge of increased standard deviations of the PA is especially important in small training sets, where few individuals with rare alleles and haplotype blocks could have a greater influence and, therefore, can contribute more to the PA. This can also happen if a frequently used parent or a major progenitor is excluded from a small training set, e.g., Wintmalt in winter barley. Nevertheless, Wintmalt was used in many crosses as parent, the remaining material would still have a good connectivity and, therefore, the PA should not drop.

Correlation of GEBVs and phenotypes

In general, all observed correlations of the traits (phenotypic and GEBVs) corresponded to the expectations (personal communication with breeders). VIS and GLU were negatively correlated to other traits because they were selected to have low values (Supplementary Table S2). The higher phenotypic variance (mean of the standard deviations) of the winter barley material compared to spring barley may explain the higher correlations. The observed correlations to phenotypes deliberately assumed as non-observed were strongly influenced by the training set combination. This can be explained by the small number of individuals in the training set in general. If closely linked markers become monomorphic, the trait cannot be estimated. This can be compensated for if the size and genetic diversity of the training population are increased or by increasing marker density. The reduced size of the training set also affects the variability of phenotypic observations. In trials including around 100 individuals, inclusion of certain individuals in the training set may be important to minimize the occurrence of rare alleles and to maximize phenotypic variance. The distribution of the minor allele frequency for winter barley supports this assumption. As expected, a replacement of a single-year by a two-year training set increased the correlations (by 0.09 on average). This indicates that increasing the training set size increases the reliability to estimate unobserved years. Also relatively small training sets seem to be sufficient in the case of malting barley—in the current study, single-year correlations over the traits were 0.45 on average which is an acceptable figure. Nevertheless, increasing the size of the TS leads to an increased stability of PA and probably to more confident results to future years. For spring barley, the number of lines within the years was not as evenly distributed. In correlation results, the year effect and the size of the training set were thus confounded. Therefore, no correlations to artificially non-observed phenotypes were calculated.

The correlation of overlapping phenotypes over years for winter barley (Supplementary table S3) was at least 0.727, which shows that (1) genotype by environment interaction is generally low for the traits under consideration and (2) the variance between GEBVs and observed phenotypes of different training sets is probably not caused by large genotype by environment interaction but by the small size of the TS. Correlations for spring barley were not derived due to few overlapping phenotypes.

Marker application and molecular breeding in barley

Reports on improvement of quality traits in barley have so far been based on marker-assisted selection (MAS) (Heffner et al. 2011). MAS employs large effect markers and is a powerful tool (Lande and Thompson 1990), but studies indicated that MAS has serious limitations and that GS can be useful in combination with marker-assisted selection [reviewed by Schmid and Thorwarth (2014) or Heslot et al. (2015)]. In several QTL studies, QTL explained a large proportion of genetic variance for malting quality traits. Islamovic et al. (2014) reported that 56 % of the variance for free amino nitrogen was explained by applying 180 SNPs and 165 DArTs markers to an F6:8 bi-parental RIL population. In theory, a bi-parental population is the most effective way to demonstrate the applicability of GS (Lorenzana and Bernardo 2009), but in a bi-parental population QTL may not segregate in some backgrounds or effects may be overestimated (Gutiérrez et al. 2011), where GS models correct for overestimation. Genomic selection for grain quality traits in wheat (Heffner et al. 2011) outperformed marker-assisted selection for all traits using less than 1000 markers, reaching a plateau at 256 markers. This might be due to the fact that two bi-parental populations were used in that study. In the current study, more diverse lines than a bi-parental population, i.e., elite lines from two breeding programs of spring and winter barley, were used as training populations. Therefore, more than 256 markers were required.

Implementation of GS in a breeding scheme

The point where GS is implemented into the breeding scheme is crucial. The implementation of GS in an early stage is most promising because phenotyping for yield and especially quality traits is expensive. Malting trait analysis requires approximately 100–1000 g of grains, and this amount of seed is not available at early stages of breeding. However, half or one seed is sufficient for DNA extraction for genotyping. The implementation of GS at an early stage of the breeding cycle bears the highest potential for a breeding program because (1) the selection intensity can be increased when more candidates are grown initially and GS is used as an additional tool for selection in a two-stage approach, (2) the effect of saving costs of phenotyping is maximized and (3) the potential for shortening the breeding cycle is highest. By GS the selection of further candidates to be phenotyped can be optimized, which saves costs due to the reduced number of new phenotypes in the future. In the breeder’s equation for prediction of the expected selection gain, the cycle length is the denominator, so shortening the breeding cycle has the most impact on selection gain per unit time. Malting quality is a complex of multiple inter-related traits of which 12 were analyzed in this study. For the breeder, this poses the problem of how to handle those individual traits and how to define and select for the overall complex trait “malting quality”. Traits can either be selected for separately or combined in an arbitrary index—which may differ between countries or markets—or both. In practice, often one or several of the individual traits need to be improved while others need to be kept within a certain acceptable range. Very often, the major breeder’s goal is to improve grain yield while maintaining a good malting quality. This problem affects phenotypic and GS in the same way. So currently, the GEBVs are treated in the same manner when applying genomic selection.

Real multivariate genomic selection on multiple traits has rarely been explored in plant breeding to date (Jia and Jannink 2012), but might offer chances for the complex trait “malting quality”, e.g., by making use of genetic correlations between individual quality traits.

Training set optimization

In accordance with the findings of Endelman et al. (2014), increasing training population sizes in this study for spring barley resulted in increased predictive abilities with a lower standard deviation in the cross validation (Table 2). The training population sizes used in this study, especially for winter barley, yielded standard deviations of the cross-validated predicted abilities (Albrecht et al. 2011) and predicted error variances (Rincent et al. 2012) comparable to other published results. Several combinations of training sets were analyzed for this study (data not shown), but these resulted mostly in reduced PA because the total size of the training sets became very small and the standard error of the PA increased accordingly.

Increasing training set size by combining spring and winter barley was of no avail. If both spring- and winter barley were combined, the first principal component explained more than 60 % of the genetic variance (result not shown), which would violate the assumption that the material is related. The correlation between GEBVs and phenotypic results would then be artificially increased owing to the population structure caused by the two subgroups. The minor subpopulation structure in winter barley can partly explain the increased PA but since the total size of the training set was 102, the effect of auto correlation should be minor. In addition, individuals distant to the main group do not show extreme phenotypic values.

Next to increasing training population sizes, predictive abilities can be increased by the selection of particular individuals to maximize the relationship between training and prediction population. This can be done on the basis of the individuals’ prediction error variance (PEV) or its reliability (Isidro et al. 2015; Rincent et al. 2012). These values were also estimated for all individuals forming the training sets in this study. If the data set is split on the basis of PEV or reliability, an optimized training set of the remaining lines can be created to achieve highest correlations in predicting the other set (Akdemir et al. 2015). Optimization within the existing training set does not necessarily improve the correlation of GEBVs to non-phenotyped lines, but it can be used together with genetic distance information to select a stable and genetically representative sample of genotyped lines chosen for phenotyping.

Conclusions

In this study, GS training populations for two malting barley breeding programs were established for which GEBV and predictive abilities for twelve malting quality traits were derived. The results suggest a high potential of genomic selection to improve malting barley breeding: in the early generations of a breeding program, a combined phenotypic selection (for easy to select agronomic traits) and genomic selection (for malting quality traits) could be employed, which would (1) reduce the costs to assess malting quality and (2) shorten the breeding cycle, since selection for quality is brought forward.

Author contribution statement

Schmidt M contributed to phenotypic data analysis, genotypic data analysis and writing; Kollers S contributed to genotypic data analysis and writing; Maasberg-Prelle A contributed to the raw phenotypic data preparation for winter barley; Großer J contributed the winter barley breeding material; Schinkel B contributed the spring barley breeding material; Tomerius A contributed to writing; Graner A contributed to the preparation of concept InnoGRAIN-MALT project; Korzun V contributed conceptualization of the project, initialization and final writing of this manuscript.