Abstract
Genetic diversity, population structure and genome-wide linkage disequilibrium (LD) was estimated in Nordic spring barley (Hordeum vulgare L. subsp. vulgare) by genotyping 180 breeding lines with 48 SSR markers and 7842 high-confidence SNPs using the Illumina Infinium 9K assay. In total 6208 SNPs were polymorphic and selected for further statistical analysis. A Mantel test revealed a strong positive correlation with a Pearson’s correlation coefficient (r) of 0.86, between the estimates of genetic distances based on SSR and SNP data. Population structure analysis identified two groups with a clear ancestry and one group with an admixed ancestry. The groups were primarily separated based on row-type and geographical origin. Average LD for the whole population decayed below a critical level of r2 = 0.20 within a range of 0–4 cM. To avoid confounding effects of the strong population structure, LD decay for the different groups was analysed separately and ranged from 0 to 12 cM. A slower LD decay was found within the two-rowed lines compared to the six-rowed lines and the two-rowed lines originating from the northern part, which could be the result of strong selection for malting quality and yield in the southern part. No large difference in genetic diversity was observed between population sub-groups, but differences at certain chromosomal regions were evident.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Cultivated barley (Hordeum vulgare L. subsp. vulgare) is one of the most important crops in the Nordic countries covering a total area of 1.56 million ha in 2014 (http://faostat3.fao.org). Nordic barley breeding began more than 100 years ago, starting by selections in the landrace gene pool to the present modern elite cultivars developed by crosses between pure lines of advanced material (Kolodinska Brantestam et al. 2004; Fischbeck 1992). Exotic sources are used in modern breeding, but then primarily as donors of single resistance genes against major diseases (Melchinger et al. 1994; Weibull et al. 2003). Today’s cultivation of genetically uniform cultivars is raising concerns about loss of genetic diversity. A previous study of the genetic diversity in spring barley germplasm in the Nordic and Baltic region reported a significant decrease of genetic diversity in the spring barley from southern parts of the investigated region in the middle of the twentieth century, but not in the spring barley from the northern parts (Kolodinska Brantestam et al. 2007). Likewise, no signs of genetic erosion were observed in a recent study of genetic diversity for barleys from the northern European area over a hundred years of barley breeding (Rajala et al. 2016). This highlights the importance of knowledge regarding the level of genetic diversity in breeding material, since it enables the detection of any changes in diversity that might lead to genetic erosion.
Several types of molecular markers such as amplified fragment length polymorphism (AFLP), random amplified polymorphic DNA (RAPD), simple sequence repeats (SSR), diversity array technology (DArT), and single nucleotide polymorphism (SNP) have been used to study genetic diversity and structure in crops (Kesawat and Das Kumar 2009). SSRs have the advantage to be abundant, highly polymorphic and multi-allelic, and therefore often provide more information compared to biallelic markers such as SNPs. On the other hand, the new iSelect genotyping platform, based on the Illumina Infinium assay, allows the simultaneous testing of 7842 gene-derived SNPs (Comadran et al. 2012). The genetic polymorphism of the SNP and SSR marker systems are generated through different mechanisms, thus they could give different views of the structure of a population.
Analysing the genetic structure within a population is a critical step to the way to understand and reveal the complexity within this population (Pritchard et al. 2000). Factors such as human or environmentally driven selection, genetic drift, mating system and growth habit can have an effect on the population structure (Buckler and Thornsberry 2002; Flint-Garcia et al. 2003). Studies of worldwide (Malysheva-Otto et al. 2006), European (Rostoks et al. 2006), American (Hamblin et al. 2010) and Nordic (Rajala et al. 2016) barley germplasm have shown that cultivated barley has a clear level of population structure with major subpopulations caused by differences in ear type, i.e. two-row and six-row, and seasonal growth habit, i.e. winter and spring (Hamblin et al. 2010; Malysheva-Otto et al. 2006; Rostoks et al. 2006). In addition to these major subpopulations, it has been shown that American barley accessions further can be divided into minor sub-populations corresponding to the breeding programs, which might be allocated to limited exchange of material between the breeding programs and/or a result of local adaptation (Hamblin et al. 2010).
Another important factor to consider is linkage disequilibrium (LD), which is the non-random association of alleles between two loci and shows the correlation between genetic polymorphisms, e.g. SNPs, and their history of mutations and recombination (Flint-Garcia et al. 2003). LD is important since the rate of its decay in a given species determines the number and density of the molecular markers needed to perform GWAS (Rafalski 2002). In many self-pollinated species such as barley where LD extends over long chromosomal distances (Malysheva-Otto et al. 2006), fewer markers are needed to cover the whole genome, whereas a higher marker density is needed when LD decays very rapidly in species such as maize where it declines to nominal levels within 1.5 kb (Remington et al. 2001).
The aim of the Public Private Partnership (PPP) for pre-breeding in barley, partly funded by the Nordic Council of Ministers (NMR), is to lay a foundation for barley breeding for disease resistance and yield stability to meet current and future challenges in the Nordic region. This collaboration is between five breeding companies and three governmental organizations in the Nordic region. One of the goals with this program is to identify markers linked with traits of interest via genome-wide association studies (GWAS). The main objective with the present study is to determine the population structure and the LD decay in a Nordic barley panel, in order to estimate the relationships among individuals.
Materials and methods
Plant materials
A total of 134 and 46 spring barley Hordeum vulgare L. subsp. vulgare breeding lines and cultivars, respectively, were included in this study. The selected spring barley breeding lines and cultivars are hereafter referred to as lines. Equal number of lines was selected by breeders from each of Boreal Plant Breeding (Finland), Graminor Breeding AS (Norway), Agricultural University of Iceland (AUI Iceland), Lantmännen Lantbruk (LSW Sweden), Nordic Seed and Sejet Planteforaedling I/S (Denmark). The lines were chosen to represent the available genetic variation in current elite Nordic barley germplasm. Out of the 180 lines, eleven lines were removed from further analyses since they were duplicates, or due to incomplete genotyping. Out of the remaining 169 lines, 124 were two-rowed and 45 six-rowed.
DNA extraction
DNA was extracted from 2-week-old seedlings, using a CTAB (Cetyl Trimethyl Ammonium Bromide) method as described earlier by (Orabi et al. 2014). The DNA was precipitated with isopropanol, washed two times with 75% ethanol, air-dried and finally diluted in TE buffer (pH 8.0).
Microsatellite genotyping
All lines were genotyped using 48 microsatellite markers evenly distributed over all chromosomes. PCR amplifications were performed on a GeneAmp® PCR System 2700 thermal cycler (Applied Biosystems, Foster City, CA, USA) using a single universal touchdown PCR program as previously described in (Orabi et al. 2014). The forward primers were 5′-labeled with fluorescent dyes to achieve the maximum multiplex capacity of the ABI 3130xl sequencer. Direct and M13-labelling were used for the microsatellite fragments, with 6-carboxyfluorescein (6-FAM, blue) or hexachloro-6-carboxyfluorescein (HEX or VIC, green) and 5′-fluorescein phosphoramidite (NED, yellow) for the direct labelling. For the M-13 labelling, 6-FAM (blue), VIC (green) and NED (yellow) were used. For fragment detection the ABI 3130xl DNA analyzer (Applied Biosystems, Foster City, CA, USA) was used and the fragment analysis and genotyping were performed using the GeneMarker genotyping software program, version 1.85 (Soft genetics, State College, PA, USA).
SNP genotyping
All lines were genotyped with the barley iSelect SNP chip based on the Illumina Infinium 9K assay. The genotyping of the lines was outsourced to Trait Genetics. The chip consists of 7842 high-confidence SNPs derived from expressed genes (Comadran et al. 2012).
Data analysis
Gene diversity and marker allele frequency
Genetic distances between genotypes, genetic diversity, allele frequency and private alleles (alleles present only in one group) were calculated using an in-house program written in VBA (Visual Basic for Applications) and implemented in Microsoft Excel 2007 (Microsoft, Redmond, WA, USA). The program utilises R language software v.2.14.2 (R Development Core Team 2012), which includes the Modern Applied Statistics with S-plus (MASS) package (Ripley 2002).
The average number of alleles per locus per group represents how polymorphic a given marker was within each group, and this value was calculated for each SNP marker. The average number of alleles per marker is between 1 and 2, where markers with a number of 1 were considered monomorphic.
Modified Roger’s distances (MRD) based on (Wright 1978) were calculated for the SSR data based on the following equation:
where p ij and q ij are the allele frequencies of the jth allele at the ith marker of the two barley lines in consideration; a i is the number of alleles at the ith marker; and m is the number of SSR loci. The genetic distances (D SM ) based on the SNP data were determined by estimating a simple matching coefficient (S SM ) (Reif et al. 2005):
where v ij refers to the allele in common between two lines, i and j; w ij is the number of alleles present in i and absent in j; x ij is the number of alleles present in j and absent in i; and y ij is the number of alleles absent in both i and j. Correlations between the MRD (SSR) and D SM (SNP) matrices were calculated using Mantel test (Mantel 1967). The polymorphic information content (PIC) of the individual markers was calculated as explained by Botstein et al. (1980):
where p i is the frequency of the ith allele, and n is the number of alleles per marker.
To show genetic diversity and differentiation along the barley chromosomes Shannon’s diversity indices were calculated using GenAlEx v. 6.5.0.1 (Peakall and Smouse 2006, 2012) based on the SNP data.
Population structure analysis
To determine population structure of the barley lines using SNP markers, the software package STRUCTURE v.2.3.4 based on a Bayesian clustering approach, was used (Pritchard et al. 2000). STRUCTURE was run 10 times for each hypothetical number of subpopulations (K) between 1 and 12 with the ploidy level set as 2. The Markov chain Monte Carlo (MCMC) was set to 9999 burn-in phases followed by 9999 iterations. Structure Harvester v.0.6.94 (Earl and von Holdt 2012), was used to estimate the most likely number of groups (K) using the DeltaK method (Evanno et al. 2005). Population structure based on the SSR markers was calculated as described above with the same settings but by using the previously described VBA program in Excel.
Analysis of Molecular Variance (AMOVA), Nei’s unbiased genetic distance and Principal Coordinates Analysis (PCoA) were calculated using GenAlEx v. 6.5.0.1.
Linkage disequilibrium analysis
The TASSEL 3.0 software (http.//www.maizegenetics.net) was used to calculate the LD (allele frequency correlation, r2) estimates between the SNP marker pairs using the full matrix option. Only intra-chromosomal comparisons were included and markers with minor allele frequency (MAF) below 0.05 were excluded. Thus 4884 out of the total 6280 polymorphic markers were subjected to analysis. To estimate the LD decay, the intra-chromosomal r2 values were plotted against the genetic distance with a second-degree smoothed loess curve fitted using the program R (R Development Core Team 2012) and a baseline based on the critical value of r2 was drawn. The critical value of r2, as an evidence of linkage, was calculated based on the method described in Breseghello and Sorrells (2006), by square root transforming the r2-values and taking the 95th percentile of unlinked r2-values. In the analysis markers located more than 50 cM apart were considered unlinked.
Results
Population structure in Nordic spring barley
The STRUCTURE analysis indicated that the barley panel could be divided into two groups with common ancestry (>0.7) K1SSR (n = 109) and K2SSR (n = 50) and one group (n = 10) with an admixed ancestry (<0.7), based on the SSR data. AMOVA analysis revealed that the K1SSR, K2SSR and admixedSSR group were significantly separated (p < 0.001) and explained 35% of total molecular variance (Online Resource 1). This is similar to the result obtained from the PCoA analysis, where the first two principal coordinates combined explained 32.1% (21.7 and 10.4%) of the variation (Fig. 1a). The subdivision of the population along the first principal coordinate (PC1) corresponded to the separation of the barley population into K1SSR and K2SSR. The K1SSR and K2SSR group corresponded to two-rowed lines and six-rowed lines, respectively, with the exception of six two-rowed lines found in K2SSR. In the small admixedSSR group, nine two-rowed lines and one six-rowed line from mainly the northern parts were found.
Also for the SNP data, two groups with a common ancestry (>0.7) (K1SNP: n = 109; K2SNP: n = 47) and one group (n = 13) with an admixed ancestry (<0.7) were inferred by the STRUCTURE analysis. Three clusters were observed in the PCoA analysis, where the first and second principal components explained 24.6 and 5.9% of the variation, respectively, or 30.5% combined (Fig. 1b). AMOVA analysis revealed that the K1SNP, K2SNP and admixedSNP groups were significantly separated (p value <0.001) and explained 42% of the total molecular variance (Online Resource 1). The two-rowed lines were distributed between K1SNP and the admixedSNP group and the six-rowed lines were found in group K2SNP, with the exception of two two-rowed lines that were found in the latter group. The lines with an admixed ancestry in the admixedSNP group were two-rowed and were, just as the lines in the admixedSSR group, mainly from the northern parts. More detailed information regarding the origin of the lines and the inferred population structure groups can be found in Online Resource 2.
Genetic diversity in Nordic spring barley
Marker system comparison
The barley lines were genotyped using 48 SSR markers. The same lines were also genotyped using the iSelect 9K SNP barley chip resulting in a total of 6208 polymorphic SNP markers. The SSR markers produced 234 scorable loci. The number of alleles per marker ranged from 1 to 15 with an average of 4.9 alleles per SSR marker. The average polymorphic information content (PIC) value was 0.46 for the SSR markers and 0.28 for the SNP markers. The genetic diversity index was higher for the SSR markers (0.514) than for the SNP markers (0.359).
Mantel test showed a strong correlation between the SSR-based Modified Roger’s genetic distances and the SNP-based simple matching coefficient, with a Pearson’s value of r2 = 0.86 (p value <0.0000).
Genetic diversity based on ear row type
Allelic richness parameters in the form of average number of alleles per locus, number and proportion of private alleles and genetic diversity for each marker type based on ear row type are presented in Table 1. There were no major differences in the average number of alleles per locus found in the two-rowed lines (SSR, 4.0; SNP: 1.9) compared to the six-rowed lines (SSR: 3.5, SNP: 1.8).). The genetic diversity was higher in the two-rowed lines (SSR: 0.431; SNP: 0.305) compared to the six-rowed lines (SSR: 0.386; SNP: 0.225). The two-rowed lines had also a higher number of private alleles (SSR: 64; SNP: 1145) compared to the six-rowed lines (SSR: 41; SNP: 447). AMOVA analysis revealed that 35 and 40% of the molecular variance between the lines based on the SSRs and SNPs, respectively, could be explained by the row-types (data not shown).
Genetic diversity based on population structure
Allelic richness parameters in the form of average number of alleles per locus, number and proportion of private alleles and genetic diversity for each marker type based on the inferred population structure groups are presented in Table 2. For the SSRs the highest average number of alleles per locus and gene diversity was found in the admixedSSR group (3.7; 0.404). However, this group also had the lowest proportion of private alleles (0.30), whereas the highest proportion of private alleles was found within the six-rowed lines in group K2SSR (0.86).
No major differences in the average number of alleles per locus were found between the three subgroups, based on the SNP data. The highest proportion of private alleles (10.2) was, no different from the results obtained with the SSRs, found within the six-rowed lines in K2SNP whereas the lowest proportion of private alleles (0.54) were found in the admixedSNP group. The highest gene diversity (0.279) was found in K1SNP, whereas the lowest (0.198) was found in the admixedSNP group.
Matrices showing relationships between the structure groups were generated based on Nei’s unbiased genetic distance for SSR and SNP data (Table 3). In the SSR matrix, the smallest distance (0.272) was found between the K1SSR and admixedSSR group (both mainly two-rowed lines from the southern and northern parts, respectively). The largest distance (0.566) was found between groups K2SSR (mainly six-rowed lines) and admixedSSR (mainly two-rowed lines from the southern and northern parts, respectively). Similar results were seen in the SNP matrix, where the smallest distance (0.217) was found between groups K1SNP and admixedSNP (mainly two-rowed lines from the southern and northern parts, respectively) and the largest (0.338) between K2SNP (mainly six-rowed lines) and admixedSNP (mainly two-rowed lines from the northern parts).
When comparing genetic diversity along the barley chromosomes, differences between the population structure groups were evident in several genomic regions (Fig. 2). Both group K2SNP and admixedSNP were low in diversity on chromosome 2H (around 83–113 cM), 3H (around 37–64 cM) and 5H (around 108–127 cM), whereas the K1SNP lines were very diverse in these regions. A large difference in genetic diversity were also observed on chromosome 6H (around 55–58 cM), where the K1SNP and K2SNP lines were much more diverse compared to admixedSNP lines. Right before this region on chromosome 6H (around 49 cM), a region with low diversity was seen for the K1SNP lines, but here the diversity was maintained for the two other groups. On chromosome 7H (around 68–88 cM) the lines in group K2SNP were very diverse in contrast to the low diversity observed for the K1SNP and admixedSNP lines in the same region.
Linkage disequilibrium in Nordic spring barley
LD for the whole population, the population structure groups and the ear row-types were calculated based on the SNP markers. The percentage of unlinked marker pairs ranged between 40 and 42% and no large differences were found between the different groups. The number of intra-chromosomal marker-pairs and the number of unlinked pairs in the total population and in the different groups are presented in Table 4. A total of 1,795,852 intra-chromosomal marker pairs were found in the entire population. The mean r2-value of the entire population, the ear row-types and population structure groups were calculated for the whole genome and for the seven chromosomes separately (Table 5). The mean r2-value for the whole genome of the entire population was found to be 0.10. The highest and lowest mean r2-value for the whole genome was found in the admixedSNP and K1SNP group (r2: 0.19; 0.07), respectively. The interval in which the Loess curve intercepts the critical value (background LD) was considered as the LD decay. The background LD and the LD decay of the entire population, the ear row-types and population structure groups were calculated for the whole genome and for the seven chromosomes separately (Table 6, Online Resource 3). Average LD for the whole population decayed below the critical level (r2 = 0.20) within a range of 0–4.0 cM. However the LD decay for the different chromosomes, ear row-types and population structure groups ranged from 0 to 12, with the most extended LD decay found for 5H in the six-rowed lines and in group K1SNP with an average r2 of 8–12 (Table 6). When comparing the whole genome, the six-rowed lines, group K2SNP and the lines from the northern parts in group admixedSNP had a more rapid LD decay with an average r2 between 0 and 4 compared to the two-rowed lines and group K1SNP, with lines from the southern parts, where the average LD decay were more slow with an average r2 between 4–8 and 8–12, respectively.
Discussion
Comparison of marker systems
It is of interest to compare if the information regarding population structure and genetic diversity is affected by the marker system of choice, since there are different mutational mechanisms behind SSR (replication slippage) and SNP (point mutation) markers. SSRs have been the most commonly used for studies of genetic diversity, mainly due to their abundance in the genome, reproducibility and high level of polymorphism. However, the increased availability of the SNP markers and the fast and highly automated genotyping technologies, have recently moved the attention to the use of the SNPs in studies of genetic diversity and population structure. This study showed that the average PIC and genetic diversity values were higher for the SSRs compared to the SNPs. However, these values have a maximum value of 0.5 for bi-allelic markers such as SNPs when the markers scores are 50% (0) and 50% (1). Taking this into consideration, the SNPs would be just as or even more informative than the SSRs. A higher number of private alleles were found with the SNPs compared to the SSRs (Tables 1, 2). However, considering the different number of markers, the SSRs actually had the highest private allele frequency, which could be expected since the SNPs are bi-allelic and have a lower mutation rate compared to the SSRs (Martinez-Arias et al. 2001; Li et al. 1981; Kruglyak et al. 1998).
It has earlier been reported that there is strong correlation between these two marker systems in barley (Varshney et al. 2008; Varshney et al. 2010). This was also demonstrated here with a strong and positive correlation (r = 0.86) between the modified Roger’s distances based on the SSR data and the simple matching coefficient based on the SNP data. Also the PCoA and structure analysis grouped the lines in a similar way with the two marker systems. However, the PCoA clustering based on the SNP data showed clearer and more distinct groupings than the SSR-based PCoA, but no major difference in the amount of molecular variance explained was found between the two marker systems. Neither did the AMOVA analyses of the structure groups reveal any large differences between the two marker systems (Table S1). However, superimposing the results from the population structure analysis on the results from the PCoA provided a clearer image and higher resolution of the population structure based on the SNPs compared to the SSRs (Fig. 1a, b). That reveals the ability of SNPs to explain the population structure at a more specific level. This was also seen in a study of genetic diversity and population structure of 375 rice varieties (Singh et al. 2013), where a comparison of the SSR and SNP marker systems revealed that at the structure level the SNPs were better at describing genetic relatedness whereas at the diversity level the SSRs showed a better grouping of samples. In contrast, some studies of genetic diversity and population structure in maize report a better estimate of population structure with SSRs compared to SNPs (Yang et al. 2011; Hamblin et al. 2007). The different results between those reports and this study might be due to the different numbers of SNP markers used (<900 vs. 6208) or due to the complexity of the maize genome. According to a theoretical prediction by Laval et al. (2002), (k − 1) times more bi-allelic markers are needed to achieve a comparable accuracy of the genetic distance as a set of SSRs with k alleles. With the average of about 3 alleles per SSR marker in this study, the number of SNPs needed would be [(3 − 1) × 48] = 96, which are about 65 times less compared to the 6208 used. Altogether the results from this study show that the numbers of SNPs used are more than enough to retrieve a comparable accuracy of genetic diversity in barley as the set of SSRs used. In addition, the SNPs seem to provide a higher resolution for the genetic relatedness than obtained with the SSRs.
Diversity and relationships within Nordic spring barley
The average genetic diversity for the Nordic spring barley collection analysed here was 0.514 and 0.359 based on the SSRs and SNPs, respectively. These are similar to the diversity estimates of Nordic breeding lines and cultivars released after 1970 (0.601) reported in an earlier study based on SSRs (Kolodinska Brantestam et al. 2007). In addition a similar result based on SSRs was reported in accessions from Europe (0.593), Eritrea (0.573) and Ethiopia (0.620), whereas the Hordeum vulgare subsp. spontaneum (K. Koch) and H. vulgare accessions from the West Asia North Africa (WANA) region had a higher diversity (0.826 and 0.762, respectively) (Orabi et al. 2007).
To estimate the genetic relatedness within this Nordic spring barley germplasm, the lines were analysed using Bayesian clustering in STRUCTURE, PCoA and AMOVA analyses. The analyses revealed that the population could be differentiated primarily by ear row-type, but also a further separation due to the geographical origin was found. This was also previously reported by Kolodinska Brantestam et al. (2007) in a study of genetic diversity within Nordic barley. Both ear row-type and geographical origin are factors known to cause population structure in barley (Malysheva-Otto et al. 2006; Hamblin et al. 2010; Cockram et al. 2010; Rostoks et al. 2006). The progenitor of cultivated barley is two-rowed and the six-rowed barley was selected shortly after domestication and originated from independent mutations of the Vrs1 allele (Komatsuda et al. 2007). Moreover, there has been a geographic separation of the two ear row-types in the Nordic countries, where two-rowed barley cultivars have been preferred in the southern and earlier, and six-rowed barley cultivars in the northern region. In this study two groups, K1 and K2, with a clear ancestry and a small group with admixed ancestry was inferred from STRUCTURE. The classification of K1 and K2 corresponded to the two-rowed lines from the southern region and the six-rowed lines, respectively, whereas the lines in the admixed group mainly corresponded to the two-rowed lines from the northern region. Structure corresponding to row-type and growth habit, but also due to breeding program, have earlier been reported of in American barley germplasm (Hamblin et al. 2010). Hamblin et al. (2010) reported that the separation observed based on breeding programs appeared to have been generated by recent breeding history and that local adaptation could have contributed to this structure. Local adaptation, due to differences in day length between the northern and southern region could also be a likely explanation of the geographical separation of the lines observed in this study. No large differences in genetic diversity between the population structure groups or row-types was found in the current study. One reason for this could be the difference in the number of lines within each group. The different ear row-types explained 40% of genetic variation in the population, whereas only an additional 2% was explained by the population structure groups. However, even if no large differences in genetic diversity could be found between the different population structure groups of the population, differences in diversity could be identified in specific chromosomal regions. In a previous study of European two-rowed spring barley, a lack of diversity was observed on 2H at the region where the HvCEN locus are positioned (Comadran et al. 2012; Tondelli et al. 2013). The HvCEN locus is associated with late flowering and high yield, and the lack of diversity in this region was explained as a result of fixation of HvCEN in European two-rowed spring barley (Tondelli et al. 2013). Unsurprisingly, since early flowering is an important characteristic in Nordic barley, and more so in the northern part of the region than in the southern, no lack of diversity on 2H was evident in this study, neither was it shown in a previous study of Nordic barley (Rajala et al. 2016). Low diversity was seen for the six-rowed lines in K2SNP and the two-rowed lines from the northern parts in the admixedSNP group at certain regions on 2H (83–113 cM), 3H (37–64 cM) and 5H (108–127 cM). In these chromosomal regions are the Vrs1 allele (2H) determining the row-type and genes related to the photoperiod pathway and circadian clock such as HvGI, HvFT2 (3H) and VRNH1 (5H), all of them characteristics important in barley breeding for the northern regions (Dunford et al. 2005; Sasani et al. 2009; Komatsuda et al. 2007; Distelfeld et al. 2009; Faure et al. 2007; Ramsay et al. 2011). In a previous study of genetic diversity in European two-rowed spring barley, low PIC values were observed at regions on chromosome 1H (47.8–55.4), 6H (30.2–53.6) and 7H (29.8–47.6), which were postulated to be a result of human selection reflecting targets like malting quality traits (Tondelli et al. 2013). Also in the present study the two-rowed lines in group K1SNP showed a low diversity at 6H (45–55 cM), in contrast to the other population structure groups where the diversity was maintained. On the corresponding region on 7H the diversity was maintained for all groups, however, a prominent low diversity was evident within another genomic region (around 62–88 cM) on chromosome 7H in the K1SNP and the admixedSNP group. This region close to the centromere on 7H has previously been described to contain several important QTLs controlling malt extract, α-amylase activity, diastatic power and β-glucanase (Hayes et al. 2003).
Linkage disequilibrium within Nordic spring barley
Since the population in this study has a clear and strong structure the LD was estimated in both the whole population, without considering the structure, as well as in the subpopulations estimated from the STRUCTURE analysis. The background LD (r2) for the whole population was 0.20 (Table 6). For the different row-types as well as the population structure groups the background LD was lower (0.10–0.19), except for the admixedSNP group which had a background LD of 0.31. However, the admixedSNP group is a very small group with only 13 lines and with lines from four of the six breeders, and also with a more complex structure. These results are in accordance with the results from a study of LD in elite barley breeding germplasm from the United States by Zhou et al. (2012), where the background LD decreased when the structure was considered, but was still high in more diverse sub-populations with unexplained structure.
In this study there was a slower LD decay found in the two-rowed lines from the southern part, compared to the six-rowed lines and the two-rowed lines from the northern part that may be a result of the focused breeding for higher malting quality in the southern parts. This result is in accordance with an earlier study were a faster LD decay were found in six-rowed compared to two-rowed barley (Pasam et al. 2012). The LD decay in a whole-genome survey for modern two-rowed spring barley cultivars in Europe has been estimated to be at least 10 cM (Kraakman et al. 2004), whereas another study of LD decay in a world-wide spring barley collection reports of LD decay below a critical value of r2 of 0.2 within a map distance of 7 cM (Pasam et al. (2012), which is in accordance with the results of the two-rowed lines in this study (Table 6).
In the chromosome comparison, a rapid average LD decay was found within all groups on chromosome 4H that might be due to introduction of important traits that occurs during breeding, e.g. mlo. In this study the occurrence of the mlo locus on 4H is about 50% within the population, which may indicate that there is a targeted recombination due to mlo introgression in half of the population that might be contributing to the rapid LD decay.
Conclusion
The results obtained with the two marker systems in this study correlated well and both systems revealed a strong structure based on the row-types, but also on geographical origin. No large differences in genetic diversity between the row-types or population structure groups were observed, however differences in certain chromosomal regions was evident which could reflect different breeding aims for the northern and southern regions. The results from this study help breeders in the Nordic region to understand genetic structure and diversity in their barley breeding material better. This is of importance when planning breeding programs for higher genetic gains.
References
Botstein D, White RL, Skolnick M, Davis RW (1980) Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet 32(3):314
Breseghello F, Sorrells ME (2006) Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics 172(2):1165–1177
Buckler ES, Thornsberry JM (2002) Plant molecular diversity and applications to genomics. Curr Opin Plant Biol 5(2):107–111
Cockram J, White J, Zuluaga DL, Smith D, Comadran J, Macaulay M, Luo Z, Kearsey MJ, Werner P, Harrap D, Tapsell C, Liu H, Hedley PE, Stein N, Schulte D, Steuernagel B, Marshall DF, Thomas WTB, Ramsay L, Mackay I, Balding DJ, The AC, Waugh R, O’Sullivan DM, Members of the AC, Booer C, Pike S, Hamilton G, Jellis G, Davies N, Ross A, Bury P, Habgood R, Klose S, Vequaud D, Christerson T, Brosnan J, Newton A, Russell J, Shaw P, Bayles R, Wang M (2010) Genome-wide association mapping to candidate polymorphism resolution in the unsequenced barley genome. Proc Natl Acad Sci USA 107(50):21611–21616. doi:10.1073/pnas.1010179107
Comadran J, Kilian B, Russell J, Ramsay L, Stein N, Ganal M, Shaw P, Bayer M, Thomas W, Marshall D, Hedley P, Tondelli A, Pecchioni N, Francia E, Korzun V, Walther A, Waugh R (2012) Natural variation in a homolog of Antirrhinum CENTRORADIALIS contributed to spring growth habit and environmental adaptation in cultivated barley. Nat Genet 44(12):1388–1392. doi:10.1038/ng.2447
Distelfeld A, Li C, Dubcovsky J (2009) Regulation of flowering in temperate cereals. Curr Opin Plant Biol 12(2):178–184
Dunford R, Griffiths S, Christodoulou V, Laurie D (2005) Characterisation of a barley (Hordeum vulgare L.) homologue of the Arabidopsis flowering time regulator GIGANTEA. Theor Appl Genet 110(5):925–931
Earl DA, von Holdt BM (2012) STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour 4(2):359–361. doi:10.1007/s12686-011-9548-7
Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol 14(8):2611–2620
Faure S, Higgins J, Turner A, Laurie DA (2007) The FLOWERING LOCUS T-like gene family in barley (Hordeum vulgare). Genetics 176(1):599–609
Fischbeck G (1992) Barley cultivar development in Europe–success in the past and possible changes in the future. Barley Genet IV 2:885–901
Flint-Garcia SA, Thornsberry JM, Buckler ES (2003) Structure of linkage disequilibrium in plants. Annu Rev Plant Biol 54:357–374. doi:10.1146/annurev.arplant.54.031902.134907
Hamblin MT, Warburton ML, Buckler ES (2007) Empirical comparison of simple sequence repeats and single nucleotide polymorphisms in assessment of maize diversity and relatedness. PLoS ONE 2(12):e1367. doi:10.1371/journal.pone.0001367
Hamblin MT, Close TJ, Bhat PR, Chao S, Kling JG, Abraham KJ, Blake T, Brooks WS, Cooper B, Griffey CA, Hayes PM, Holek DJ, Horsley RD, Obert DE, Smith KP, Ullrich SE, Muehlbauer GJ, Jannink J-L (2010) Population structure and linkage disequilibrium in US barley germplasm: implications for association mapping. Crop Sci 50(2):556–566
Hayes PM, Castro A, Marquez-Cedillo L, Corey A, Henson C, Jones BL, Kling J, Mather D, Matus I, Rossi C, Sato K (2003) Genetic diversity for quantitatively inherited agronomic and malting quality traits. Dev Plant Genet Breed 7:201–226
Kesawat MS, Das Kumar B (2009) Molecular markers: it’s application in crop improvement. J Crop Sci Biotechnol 12(4):169–181. doi:10.1007/s12892-009-0124-6
Kolodinska Brantestam A, Von Bothmer R, Dayteg C, Rashal I, Tuvesson S, Weibull J (2004) Inter simple sequence repeat analysis of genetic diversity and relationships in cultivated barley of Nordic and Baltic origin. Hereditas 141(2):186–192. doi:10.1111/j.1601-5223.2004.01867.x
Kolodinska Brantestam A, von Bothmer R, Dayteg C, Rashal I, Tuvesson S, Weibull J (2007) Genetic diversity changes and relationships in spring barley (Hordeum vulgare L.) germplasm of Nordic and Baltic areas as shown by SSR markers. Genet Resour Crop Evol 54(4):749–758. doi:10.1007/s10722-006-9159-4
Komatsuda T, Pourkheirandish M, He C, Azhaguvel P, Kanamori H, Perovic D, Stein N, Graner A, Wicker T, Tagiri A, Lundqvist U, Fujimura T, Matsuoka M, Matsumoto T, Yano M (2007) Six-rowed barley originated from a mutation in a homeodomain-leucine zipper I-class homeobox gene. Proc Natl Acad Sci USA 104(4):1424–1429. doi:10.1073/pnas.0608580104
Kraakman AT, Niks RE, Van den Berg PM, Stam P, Van Eeuwijk FA (2004) Linkage disequilibrium mapping of yield and yield stability in modern spring barley cultivars. Genetics 168(1):435–446. doi:10.1534/genetics.104.026831
Kruglyak S, Durrett RT, Schug MD, Aquadro CF (1998) Equilibrium distributions of microsatellite repeat length resulting from a balance between slippage events and point mutations. Proc Natl Acad Sci 95(18):10774–10778
Laval G, SanCristobal M, Chevalet C (2002) Measuring genetic distances between breeds: use of some distances in various short term evolution models. Genet Sel Evol 34(4):481–507. doi:10.1051/gse:2002019
Li WH, Gojobori T, Nei M (1981) Pseudogenes as a paradigm of neutral evolution. Nature 292(5820):237–239
Malysheva-Otto LV, Ganal MW, Roder MS (2006) Analysis of molecular diversity, population structure and linkage disequilibrium in a worldwide survey of cultivated barley germplasm (Hordeum vulgare L.). BMC Genet 7:6. doi:10.1186/1471-2156-7-6
Mantel N (1967) The detection of disease clustering and a generalized regression approach. Cancer Res 27(2 Part 1):209–220
Martinez-Arias R, Calafell F, Mateu E, Comas D, Andres A, Bertranpetit J (2001) Sequence variability of a human pseudogene. Genome Res 11(6):1071–1085. doi:10.1101/gr.167701
Melchinger AE, Graner A, Singh M, Messmer MM (1994) Relationships among European barley germplasm: I. Genetic diversity among winter and spring cultivars revealed by RFLPs. Crop Sci 34(5):1191–1199
Orabi J, Backes G, Wolday A, Yahyaoui A, Jahoor A (2007) The Horn of Africa as a centre of barley diversification and a potential domestication site. Theor Appl Genet 114(6):1117–1127. doi:10.1007/s00122-007-0505-5
Orabi J, Jahoor A, Backes G (2014) Changes in allelic frequency over time in European bread wheat (Triticum aestivum L.) varieties revealed using DArT and SSR markers. Euphytica 197(3):447–462
Pasam RK, Sharma R, Malosetti M, van Eeuwijk FA, Haseneyer G, Kilian B, Graner A (2012) Genome-wide association studies for agronomical traits in a world wide spring barley collection. BMC Plant Biol 12(1):1–22. doi:10.1186/1471-2229-12-16
Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in Excel. Population genetic software for teaching and research. Mol Ecol Notes 6(1):288–295
Peakall R, Smouse PE (2012) GenAlEx 6.5: genetic analysis in Excel. Population genetic software for teaching and research—an update. Bioinformatics 28(19):2537–2539. doi:10.1093/bioinformatics/bts460
Pritchard JK, Stephens M, Donnelly P (2000) Inference of population structure using multilocus genotype data. Genetics 155(2):945–959
R Development Core Team (2012) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, http://www.r-project.org/
Rafalski A (2002) Applications of single nucleotide polymorphisms in crop genetics. Curr Opin Plant Biol 5(2):94–100. doi:10.1016/S1369-5266(02)00240-6
Rajala A, Peltonen-Sainio P, Jalli M, Jauhiainen L, Hannukkala A, Tenhola-Roininen T, Ramsay L, Manninen O (2016) One century of Nordic barley breeding: nitrogen use efficiency, agronomic traits and genetic diversity. J Agric Sci. doi:10.1017/S002185961600068X
Ramsay L, Comadran J, Druka A, Marshall DF, Thomas WT, Macaulay M, MacKenzie K, Simpson C, Fuller J, Bonar N, Hayes PM, Lundqvist U, Franckowiak JD, Close TJ, Muehlbauer GJ, Waugh R (2011) INTERMEDIUM-C, a modifier of lateral spikelet fertility in barley, is an ortholog of the maize domestication gene TEOSINTE BRANCHED 1. Nat Genet 43(2):169–172
Reif JC, Melchinger AE, Frisch M (2005) Genetical and mathematical properties of similarity and dissimilarity coefficients applied in plant breeding and seed bank management. Crop Sci 45(1):1–7
Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, Doebley J, Kresovich S, Goodman MM, Buckler ES (2001) Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc Natl Acad Sci USA 98(20):11479–11484
Ripley BD (2002) Modern applied statistics with S, 4th edn. Springer, Berlin
Rostoks N, Ramsay L, MacKenzie K, Cardle L, Bhat PR, Roose ML, Svensson JT, Stein N, Varshney RK, Marshall DF, Graner A, Close TJ, Waugh R (2006) Recent history of artificial outcrossing facilitates whole-genome association mapping in elite inbred crop varieties. Proc Natl Acad Sci 103(49):18656–18661
Sasani S, Hemming MN, Oliver SN, Greenup A, Tavakkol-Afshari R, Mahfoozi S, Poustini K, Sharifi H-R, Dennis ES, Peacock WJ, Trevaskis B (2009) The influence of vernalization and daylength on expression of flowering-time genes in the shoot apex and leaves of barley (Hordeum vulgare). J Exp Bot 60(7):2169–2178
Singh N, Choudhury DR, Singh AK, Kumar S, Srinivasan K, Tyagi RK, Singh NK, Singh R (2013) Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of indian rice varieties. PLoS ONE 8(12):e84136. doi:10.1371/journal.pone.0084136
Tondelli A, Xu X, Moragues M, Sharma R, Schnaithmann F, Ingvardsen C, Manninen O, Comadran J, Russell J, Waugh R, Shulman AH, Pillen K, Rasmussen SK, Kilian B, Cattivelli L, Thomas WTB, Flavell AJ (2013) Structural and temporal variation in genetic diversity of European spring two-row barley cultivars and association mapping of quantitative traits. Plant Genome. doi:10.3835/plantgenome2013.03.0007
Varshney RK, Salem KFM, Baum M, Röder MS, Graner A, Börner A (2008) SSR and SNP diversity in a barley germplasm collection. Plant Genet Resour 6(02):167–174. doi:10.1017/S1479262108993187
Varshney R, Baum M, Guo P, Grando S, Ceccarelli S, Graner A (2010) Features of SNP and SSR diversity in a set of ICARDA barley germplasm collection. Mol Breed 26(2):229–242
Weibull J, Walther U, Sato K, Habekuß A, Kopahnke D, Proeseler G (2003) Diversity in resistance to biotic stresses. Dev Plant Genet Breed 7:143–178
Wright S (1978) Evolution and the genetics of populations. Variability within and among natural populations, vol 4. University of Chicago Press, Chicago
Yang X, Xu Y, Shah T, Li H, Han Z, Li J, Yan J (2011) Comparison of SSRs and SNPs in assessment of genetic relatedness in maize. Genetica 139(8):1045–1054. doi:10.1007/s10709-011-9606-9
Zhou H, Muehlbauer G, Steffenson B (2012) Population structure and linkage disequilibrium in elite barley breeding germplasm from the United States. J Zhejiang Univ Sci B 13(6):438–451. doi:10.1631/jzus.B1200003
Acknowledgements
We thank NordGen and the Nordic Council of Ministers (NMR) for financial support of this study that was conducted within a larger project, Public Private Partnership (PPP) for Pre-breeding in barley. We would also like to thank Dr. Nanna Nielsen, Nordic Seed, Denmark, for support with the LD analysis.
Author information
Authors and Affiliations
Consortia
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
The PPP Barley Consortium
Inger Åhman (Department of Plant Breeding, Swedish University of Agricultural Sciences, Box 101, 230 53 Alnarp, Sweden), Therése Bengtsson (Department of Plant Breeding, Swedish University of Agricultural Sciences, Box 101, 230 53 Alnarp, Sweden), Outi Manninen (Boreal Plant Breeding Ltd, Myllytie 10, 31600 Jokioinen, Finland), Merja Veteläinen (Boreal Plant Breeding Ltd, Myllytie 10, 31600 Jokioinen, Finland), Lars Reitan (Graminor A/S, Hommelstadvegen 60, 2322 Ridabu, Norway), Muath Alsheikh (Graminor A/S, Hommelstadvegen 60, 2322 Ridabu, Norway), Bo Gertsson (Lantmännen Lantbruk, von Troils väg, 213 37 Malmö, Sweden), Stine Tuvesson (Lantmännen Lantbruk, von Troils väg, 213 37 Malmö, Sweden), Marja Jalli (Natural Resources Institute Finland (Luke), Viikinkaari 4, 00790 Helsinki, Finland), Ahmed Jahoor (Nordic Seed A/S, Kornmarken 1, 8464 Galten, Denmark) (project leader), Jens Due Jensen (Nordic Seed A/S, Kornmarken 1, 8464 Galten, Denmark), Jihad Orabi (Nordic Seed A/S, Kornmarken 1, 8464 Galten, Denmark), Gunter Backes (Organic Agricultural Sciences, Universität Kassel, Steinstr. 19, 37213 Witzenhausen, Germany), Lene Krusell (Sejet Plant Breeding, Nørremarksvej 67, 8700 Horsens, Denmark), Rasmus Lund Hjortshøj (Sejet Plant Breeding, Nørremarksvej 67, 8700 Horsens, Denmark), Áslaug Helgadóttir (The Agricultural University of Iceland, Faculty of Land and Animal Resources, Hvanneyri, 311 Borgarnes, Iceland), Magnus Göransson (The Agricultural University of Iceland, Faculty of Land and Animal Resources, Hvanneyri, 311 Borgarnes, Iceland), Sæmundur Sveinsson (The Agricultural University of Iceland, Faculty of Land and Animal Resources, Hvanneyri, 311 Borgarnes, Iceland).
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Bengtsson, T., The PPP Barley Consortium., Manninen, O. et al. Genetic diversity, population structure and linkage disequilibrium in Nordic spring barley (Hordeum vulgare L. subsp. vulgare). Genet Resour Crop Evol 64, 2021–2033 (2017). https://doi.org/10.1007/s10722-017-0493-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10722-017-0493-5