Introduction

Sugar beet (Beta vulgaris L., 2n = 18) is one of the most important sugar crops worldwide and accounts for approximately 25% of global sugar production. Root yield and sugar content are two major economic traits of the sugar beet. Sugar beet breeders have strived to improve the root yield and sugar content with an ultimate goal of increasing sugar yield. However, the root yield and sugar content of the sugar beet are negatively correlated with one another. These traits are controlled by multiple genes and influenced by environmental factors. Therefore, it is necessary to construct a high-density genetic map to determine the quantitative trait loci (QTLs) of root yield and sugar content. These data lay a foundation for future beet breeding involving the introduction or pyramiding of high-yield and high-sugar genes.

QTL mapping depends on the method of genetic map construction. The earliest genetic maps of sugar beets were established using morphological and isozyme markers, as well as random-amplified polymorphic DNAs (RAPDs), restriction fragment length polymorphisms (RFLPs), amplified fragment length polymorphisms (AFLPs), and expressed sequence tags (ESTs) (Barzen et al. 1995; Halldén et al. 1996; Schondelmaier et al. 1996; Grimmer et al. 2007; Laurent et al. 2007). Molecular marker-based QTL mapping and analysis were also conducted on the sugar beet. For instance, based on 110 AFLP and 25 RFLP markers, Nilsson et al. (1999) identified five QTLs on linkage groups (LGs) 2, 3, and 9 for Cercospora leaf spot resistance. Grimmer et al. (2007) mapped a rhizomania resistance gene (RZ4), also on chromosome 3, using AFLPs, single nucleotide polymorphisms (SNPs), and RAPDs. Schneider et al. (2002) used ESTs, AFLPs, and RFLPs to map the sugar content, root yield, and quality-related QTLs in the sugar beet. However, the genetic maps of these QTL mapping studies have low resolution and marker saturation; thus, the accuracy and quantity of the QTLs are limited. Reduced representation genome sequencing (RRGS) is a rapid and cost-effective strategy for high-throughput SNP discovery and genotyping. Specific-length amplified fragment sequencing (SLAF-seq) is a recently developed approach based on RRGS and high-throughput pair-end sequencing (Van Geyt et al. 1990; Pillen et al. 1992; Wagner et al. 1992). This method has been used to construct genetic maps for many species. So far, few studies have mapped the QTLs of traits in the sugar beet using specific-locus amplified fragment sequencing (SALF-seq).

This study was the first to generate SNP primers for the sugar beet using a high-throughput technology platform. In addition, we obtained a large number of chromosome fragments using SLAF-seq technology. A high-density genetic map of the sugar beet was constructed, and QTL mapping analysis was performed on the main traits (i.e., root yield and sugar content).

Materials and Methods

Materials

In April 2015, we obtained an F1 population of 219 individuals from a cross of 3a (high-yield, low-sugar, diploid, monogerm, sterile line) and 3b (low-yield, high-sugar, diploid, polyembryonic, pollinated line) parents from the Sugar Beet Institute Trial Site of Hulan District (126°58′E, 45°9′N), Harbin City, Heilongjiang Province, China. In July 2015, the mature F1 seeds were harvested and directly sown every 0.25 m in 10 m long by 0.67 m wide rows at the Gaomi Test Site (119°51′E, 36°30′N), Shandong Province, China. We provided consistent weed and pest control measures. We chose 219 F1 individuals, in addition to the two parents, for leaf sampling, genomic DNA extraction, and genotyping.

Methods

Field Trait Survey and Data Analysis

In mid-December 2015, a total of 219 F1 plants and both parents were surveyed for the following traits separately: root length, root perimeter, root weight, and sugar content per plant. After washing, root length and root perimeter were measured using a tape measure. An electronic crane was used for root weight measurements. A brix meter (ATAGO, Japan) was used for sugar content measurement. We sampled the heart portion of the sugar beet (top of root to the tail) and extracted fluid via a press. The sugar content is equal to the brix percentage multiplied by 0.85. The normal distribution and correlation analysis were performed using SPSS 19.

DNA Extraction and High-Throughput Sequencing

On September 20, 2015, fresh leaf samples were collected from a total of 219 plants. The samples were dipped in liquid nitrogen, and DNA was extracted using the cetyltrimethylammonium bromide (CTAB) method. DNA concentrations were measured and adjusted to equal quantities.

Sequencing was performed using the improved SLAF-seq method (Sun et al. 2013). The number of restriction sites was determined by predicting the different enzyme sites within the sugar beet reference genome (Dohm et al. 2014) using the digestion prediction Perl script (Biomarker Technology Company; Beijing, China). The F1 samples were subjected to a double digestion (HaeIII and Hpy166II; New England Biolabs; Beverly, MA, USA). SLAF products that ranged from 450 to 500 bp (including the adapter sequence indexes and adaptors) were excised. After dilution, the samples were sequenced using HiSeq 2500 PE100 (Illumina, Inc.; San Diego, CA, USA).

SLAF marker identification and genotyping were performed using procedures described by Sun et al. (2013). Briefly, low-quality reads (quality score < 20e) were excluded. Next, the raw reads were sorted individually according to their duplex barcode sequences. After the barcodes and terminal 5 bp positions were trimmed from each high-quality read, the clean reads from the same sample were mapped onto the Beta vulgaris genome sequence using BWA software (Li and Durbin 2009). Only SLAFs with 2–4 alleles were identified as polymorphic and considered potential markers. All polymorphic SLAFs were regarded as having eight segregation patterns, and all patterns except “aa × bb” were used for genetic map construction of the F1 population.

Linkage Map Construction

Marker loci were first partitioned into LGs based on their locations in the Beta vulgaris genome. Next, the modified logarithm of odds (MLOD) scores between markers was calculated to further confirm the robustness of markers for each LG. HighMap software was utilized to order the SLAF markers and correct genotyping errors (Liu et al. 2014). The initial map order was obtained, and error correction was performed using the SMOOTH algorithm (van Os et al. 2005). The k-nearest neighbor algorithm was used to account for null genotypes, and the maximum likelihood method was used to integrate the skewed markers. The genetic distance was estimated by the Kosambi mapping function (Kosambi 1943).

QTL Analysis

In combination with the graphic and fractal data of the nine LGs and four quantitative traits, QTL mapping was conducted using the interval mapping (IM) model in MapQTL V5.0 (Van Ooijen 2004). We performed the permutation test (PT) with a threshold value of 2.5 or 3. The significance thresholds were determined using 1000 permutations. The percentage of phenotypic variance explained (PVE) by each QTL (expl.%) was calculated based on the population variance found within the segregation population. According to the QTL results of the four economic traits and chromosomal markers, we had identified candidate mRNAs in the region. Gene function annotation and enrichment analysis were performed in the region. The QTL name is notated as, “Q + triad abbreviation + LG number”; different QTLs within the same LG are expressed numerically.

Results

Phenotypic Analysis

The statistical analysis of the four economic traits in the parents and F1 population revealed that the root yield and sugar content of the parents were considerably different. The mean values of root length, root perimeter, root weight, and sugar content were 17.75, 21.90 cm, 866.09 g, and 14.55%, respectively. The coefficient of variation (CV) of all traits ranged from 7.90 to 56.79%. The root weight displayed the highest value, whereas sugar content had the lowest value. The absolute values of both skewness and kurtosis for all traits, except root weight, were less than 1. Additionally, the P values from the normality test were larger than the threshold (0.05) for all traits, except for root weight, indicating satisfactory results for QTL detection (Table 1). In general, the F1 population showed a normal distribution for the four main traits, which met the QTL mapping requirements (Supplementary Fig. 1 to Supplementary Fig. 4).

Table 1 Statistics analysis of sugar yield-related traits of parents and F1 population

Data Analysis and QTL Mapping

Based on the high-density genetic map, which spanned a total of 1554.64 cM and had an average inter-marker distance of 0.47 cM (Wang et al. 2018), we identified the QTLs underlying the four sugar yield-related traits. The threshold of the logarithm of odds (LOD) scores for evaluating the statistical significance of QTL effects was determined using 1000 permutations in the MapQTL V5.0 program (Van Ooijen 2004). According to the intervals with LOD scores above 2.5, 32 QTLs were detected for the four traits of interest (Table 2). The root length was mapped to six regions of LG2, most of which were located between 78 and 90 cM. A total of 15 markers were mapped with a PVE of 8.03%. The root perimeter was mapped to LG5 and LG7. The regions with the greatest effect were 40–47 cM in LG5 and 107–111 cM in LG7, where a total of 39 markers were located. The PVE ranged from 6.1 to 7.23%. The root weight was mapped to LG3 and LG7. On LG3, 17 markers were located between 132.1 and 141.97 cM. On LG7, 24 markers were located between 163.58 and 177.73 cM. The PVE ranged from 5.46 to 5.78%. The sugar content was mapped according to the intervals with LOD scores greater than 4.35, which were detected as effective QTLs; these QTLs were mapped to LG5 and LG9. The most effective region was 8–28 cM on LG9 (maximum PVE of 10.13% among all the QTLs), where a total of 39 markers were located (Figs. 1, 2, 3, 4, 5, 6, 7).

Table 2 The characteristics of QTLs associated with sugar yield traits
Fig. 1
figure 1

QTL analysis of the root length trait on LG2. The abscissa axis represents the genetic group; the ordinate axis represents the LOD score

Fig. 2
figure 2

QTL analysis of the root perimeter trait on LG5. The abscissa axis represents the genetic group; the ordinate axis represents the LOD score

Fig. 3
figure 3

QTL analysis of the root perimeter trait on LG7. The abscissa axis represents the genetic group; the ordinate axis represents the LOD score

Fig. 4
figure 4

QTL analysis of the root weight trait on LG3. The abscissa axis represents the genetic group; the ordinate axis represents the LOD score

Fig. 5
figure 5

QTL analysis of the root weight trait on LG7. The abscissa axis represents the genetic group; the ordinate axis represents the LOD score

Fig. 6
figure 6

QTL analysis of the sugar content trait on LG5. The abscissa axis represents the genetic group; the ordinate axis represents the LOD score

Fig. 7
figure 7

QTL analysis of the sugar content trait on LG9. The abscissa axis represents the genetic group; the ordinate axis represents the LOD score

Candidate Gene Annotation

According to the QTL mapping results based on four traits and the distribution of chromosomal markers, we identified 3690 genes in the candidate region, including 191 root length, 918 root perimeter, 409 root weight, and 2172 sugar content genes (Table 3). We found 143 of 191 root length candidate genes with annotation information, among which 49, 89, 16, 96, and 143 were annotated to the COG, GO, KEGG, SwissProt, and Nr databases, respectively. We found 721 of 918 root perimeter candidate genes with annotation information, among which 196, 320, 53, 416, and 721 had annotation information in COG, GO, KEGG, SwissProt, and Nr, respectively. We found 320 of 409 root weight candidate genes with annotation information, among which 126, 164, 19, 243, and 320 were annotated to the COG, GO, KEGG, SwissProt, and Nr databases, respectively. Lastly, we found 1631 of 2172 sugar content candidate genes with annotation information, among which 639, 874, 142, 1115, and 1631 were annotated in the COG, GO, KEGG, SwissProt, and Nr databases, respectively (Table 4).

Table 3 Basic information of mRNA in the trait candidate region
Table 4 mRNA annotation statistics

Discussion

QTL Mapping for the F 1 Beet Population Using SNP Markers

In this study, we constructed a genetic map using an F1 sugar beet population because of the relatively high genomic heterozygosity of the parents and the normally distributed major economic traits. These conditions can be proven by our SLAF-seq data. Notably, only approximately 42.18% (20,449/48,478) of the polymorphic SLAF markers were genotyped as “aa × bb” based on the parental SLAF-seq results, which were only suitable for genetic map construction for the homozygous parents. Thus, the F1 population was more suitable than the F2 population for the sequencing and mapping of the sugar beet, which is a highly out-crossed crop. In fact, many crops use F1 populations for genetic and QTL mapping. Zhu et al. (2015) constructed a high-density genetic map using SLAF-seq and identified a QTL for anthracnose resistance in walnut F1 populations. Wang et al. (2015) constructed cherry linkage maps and identified a QTL for trunk diameter using F1 populations.

Genetic Map Construction and QTL Mapping

The density and quality of genetic maps are important because they allow the detection of reliable and concise QTL confidence intervals, as well as the identification of functional genes within those intervals.

Previous genetic map construction and QTL mapping studies were not conclusive due to an insufficient number of markers, an inadequate density, large map distances, and few mapped QTLs. For example, Barzen et al. (1992) used RFLP markers to construct a genetic linkage map of the sugar beet to segregate populations of this crop; the map covered 540 cM and contained 9 LGs with 92 genomic markers. In 1996, Halldén et al. published a high-density sugar beet RFLP map covering 621 cM including 413 markers with an average distance of 1.5 cM between markers. In our study, QTL mapping was based on high-throughput genetic mapping (Wang et al. 2017), which spanned a total of 1554.64 cM, with an average inter-marker distance of 0.47 cM. The size of this map is larger than previous maps, which ranged from 446 cM (Schneider et al. 2002) to 1399.88 cM (Wang et al. 2014). The sugar beet genetic map has the highest molecular marker density reported to date.

Previous QTL mapping analysis in sugar beets focused on disease resistance genes. For instance, Setiawan et al. (2000) performed QTL mapping and detected loci on chromosomes 3, 4, 7, and 9 for resistance to Cercospora leaf spot. Janssen et al. (2003) mapped QTLs for powdery mildew resistance genes in the sugar beet and identified loci for monogenic resistance. Gidner et al. (2005) conducted QTL mapping using WB41 (a rhizomania-resistant sugar beet strain) and found a new rhizomania resistance gene, RZ3, on chromosome 3. Grimmer et al. (2008) identified the rhizomania resistance gene, RZ5, from WB258 (a rhizomania-resistant sugar beet strain). Several studies have mapped QTLs associated with sugar beet root yield and sugar content. Weber et al. (1999, 2000) performed a QTL analysis with yield data from two segregating sugar beet populations. Wang et al. (2017) mapped 17 QTLs of root yield and sugar content on to six linkage groups in the sugar beet. In this study, 32 QTLs were detected for four important traits (root length, root perimeter, root weight, and sugar content) and 3690 genes were identified in the candidate region. Wang et al.’s map had a lower number of QTLs than our map. Thus, our improved map is more informative and accurate.

Our paper focused on mining the QTLs responsible for underlying traits, and we believe that certain unknown factors may exert adverse effects on QTL mapping. Therefore, it is our opinion that with uniform field management and phenotyping, satisfactory QTLs can be identified using our methods.

SLAF-seq Strategy for QTL Development

The SLAF-seq strategy combines locus-specific amplification and high-throughput sequencing for large-scale SNP discovery and genotyping (Sun et al. 2013). SLAF-seq has been successfully applied in various species and used for genetic and QTL mapping in other crops. For example, Zhang et al. (2016) constructed a high-density genetic map by SLAF-seq and analyzed QTLs for boll weight in upland cotton. The high-density genetic map harbored 5521 SNP markers, which covered a total distance of 3259.37 cM; boll weight had 18 stable QTLs identified. Zhu et al. (2016) constructed a high-density genetic map of the cucumber (3057 SLAFs), which included 4475 SNP markers and detected 15 QTLs for two fruit traits. Using SLAF-seq, Jiang et al. (2015) constructed a high-density genetic map of the wax gourd from a population of 140 F2 individuals and identified a single locus on chromosome 5 that controlled pericarp color. Mao et al. (2015), through a combination of interval mapping and single locus analysis in two genetic populations, identified 13 QTLs for seedling cold tolerance in Dongxiang (DX) wild rice. Thus, the SLAF-seq strategy proves to be an effective new tool for QTL mapping.

Conclusion

Using high-throughput sequencing technology, the present study selected candidate markers in the F1 population (219 individuals) of the sugar beet. Based on the high-density genetic map, which spanned a total of 1554.64 cM with an average marker distance of 0.47 cM, QTLs underlying the four economic traits were identified. According to the intervals with LOD scores above 2.5, 32 QTLs were detected for these four traits. The root length was mapped to six regions of LG2, most of which were located between 78 and 90 cM; a total of 15 markers were mapped with a PVE of 8.03%. The root perimeter was mapped to LG5 and LG7, and the PVE ranged from 6.1 to 7.23%. The root weight was mapped to LG3 and LG7, and the PVE ranged from 5.46 to 5.78%. The sugar content was mapped to LG5 and LG9. The most effective region was 8–28 cM on LG9 (maximum PVE of 10.13% among all the QTLs), where a total of 39 markers were located. According to the QTL mapping results based on four traits and the distribution of chromosomal markers, 3690 genes were identified in the candidate region, including 191 root length, 918 root perimeter, 409 root weight, and 2172 sugar content genes. Thus, QTL mapping of sugar yield-related traits may advance cloning of sugar beet genes and improve future sugar beet yields.