Introduction

Oilseed rape (Brassica napus L) is an allotetraploid species formed about 7500 years ago by natural hybridization and genome doubling between turnip rape (Brassica rapa) and cabbage (Brassica oleracea) (Chalhoub et al. 2014). It is one of the most important sources of plant oil and is cultivated worldwide. Rapeseed provides a healthy vegetable oil for human consumption and a high-protein forage for livestock. It is also a source of renewable energy and can be transformed into biofuel for industrial applications. As the population grows and the need for renewable energy increases, the demand for rapeseed also continues to rise. Therefore, the breeding and cultivation of high-yield rapeseed varieties have become top priorities.

Seed yield is a complex trait that is influenced by various factors (Diepenbrock 2000). Silique length (SL), seed weight (SW), seed number per silique (SS), and siliques per plant (SP) are closely associated with seed yield improvement (Chen et al. 2007; Li et al. 2014). Rapeseed siliques continually supply nutrients from photosynthesis, transport carbohydrates from the vegetative organs to the seeds and ensure normal seed development (Bennett et al. 2011; King et al. 1997; Rossato et al. 2001). Siliques should have large photosynthetic areas, furnish sufficient nutrients, and provide adequate space for the development of the encoded seeds (Chay and Thurling 1989a). SL is closely associated with SW and seed yield, and numerous reports have proved that the incensement of SL could effectively improve seed yield (Chen et al. 2007; Khan et al. 2006; Wang et al. 2016; Sadat et al. 2010; Samizadeh et al. 2007; Udall et al. 2006; Zhang et al. 2011). Chay and Thurling (1989a) discovered that SP varied independently of SL and SW tended to increase with the increasing of SL. Through introgression of long-pod genes into cultivars with a suitable genetic background would increase seed weight per silique with little or no reduction in SP, there might still be some benefit to select SL in different genetic backgrounds (Chay and Thurling 1989b). The analysis of 112 full-sib families variated in SL indicated that long siliques had more seeds and a greater seed yield per plant with little differences in SP (Diepenbrock 2000). SL and SW showed similar high heritability, and the variations in SL were more likely to be the cause of the distinction in SW (Li et al. 2014). Thus, long silique is an appropriate feature in rapeseed breeding programs. Previous studies have confirmed that SL and SW are controlled by multiple gene loci. Several quantitative trait loci (QTLs) have been identified in B. napus through traditional genetic mapping and association studies (Chen et al. 2007; Dong et al. 2018; Fan et al. 2010; Fu et al. 2015; Li et al. 2014; Shi et al. 2009; Wang et al. 2016; Yang et al. 2012; Zhang et al. 2011). Several SL and SW QTLs have been found to be overlapped or linked on chromosomes A04, A09, C06, and C08 (Dong et al. 2018; Fu et al. 2015; Li et al. 2014).

Analysis of the loci related to SL and SW can elucidate the genetic mechanism of silique development and the improvement of rapeseed yield. A recent study revealed that the nuclear gene BnaA.ARF18.a on chromosome A09 affects both SL and SW development in rapeseed (Liu et al. 2015). Functional markers for this gene were developed to distinguish the optimal alleles governing SL and SW in A09 (Dong et al. 2018). However, SW selection in the field is difficult since SW can only be calculated after harvest and the recognized QTLs cannot yet be applied in high-yield rapeseed breeding. In this study, a major locus in B. napus acting as a single Mendelian SL-controlling factor was narrowed down to a 98.47 kb interval on A09 using a Brassica 60 K Illumina Infinium™ SNP array and a map-based cloning strategy. The SW could also be improved in individuals expressing high levels of this candidate gene locus. The relationship between SL and SW was also demonstrated in the 186 rapeseed accessions identified by the associated markers. These results provide a strategy for oil rapeseed improvement by selecting for appropriately long siliques. The corresponding locus and its functional markers may then be used in the cultivation of high-yield rapeseed varieties.

Materials and methods

Plant materials, population development, and trait measurements

A B. napus doubled haploid line ‘DH46’ achieved from the doubled haploid lines of the recurrent population (Zhao et al. 2016) and the conventional cultivar ‘ZS11’ were used as parents in this study, and they were provided by the Rapeseed Laboratory of Huazhong Agricultural University. ‘DH46’ exhibited short siliques and low SS‚ was crossed with ‘ZS11’ to produce a F1 generation. The F1 plants were backcrossed with the recurrent parent ‘DH46’ (as the male) three times without any selections to construct a mapping population, BC3F1. A BC3F1 subpopulation obtained from one BC2F1 plant was selected, which consisted of 236 individuals having SL variations with a 1:1 segregation ratio, and was used to locate the SL-related gene. To generate fine mapping of the BnSLA09 gene, the long-silique BC3F1 plants were self-pollinated to produce a BC3F2 population. Since the SS could influence silique-related traits, only individuals with no fertility problems and heterozygous locus of BnSLA09 in BC3F2 population were self-pollinated to develop 4824 lines of a BC3F3 population. Plants with a homozygous locus of BnSLA09 originating from ‘ZS11’ and ‘DH46’ were identified from the BC3F3 population and named L-NIL and S-NIL, respectively. Both the SL and SS for each plant among the BC3F3 generation were estimated based on the average length of ten well-developed siliques (not including the pod beak) from the middle of the main inflorescence. The weight of 1000 fully filled open-pollinated seeds was measured and represented the SW of each individual.

Histological observation

After 20 days of flowering, long and short siliques from the BC3F3 plants were sampled and fixed in FAA solution (50% ethanol, 5% glacial acetic acid, and 5% formaldehyde) for 24–48 h. The fixed materials were dehydrated in an ethanol series (60, 70, 80, 90, and 100%), vacuum-dried, and coated with gold under vacuum. The outer surfaces of the siliques were observed under a JSM-6390LV scanning electron microscope (JEOL Ltd., Akishima, Tokyo, Japan) at an accelerating voltage of 10 kV.

DNA extraction and bulked segregant analysis (BSA)

Genomic DNA of all accessions was extracted from fresh leaf tissues using a modified cetyltrimethylammonium bromide (CTAB) method (Murray and Thompson 1980). For the Brassica 60 K Illumina Infinium™ SNP array analyses, BC3F1 plants were randomly selected to construct DNA bulks. The DNA from forty long- and forty short-silique plants was extracted using a DNA Secure Plant Kit (DP320, Tiangen Biotech, Beijing, China) then randomly pooled with an equivalent amount of DNA to construct four long- and four short-silique pools (L-pools and S-pools, respectively). All DNA pools were adjusted to 100 ng/μl before the SNP array analysis.

SNP analysis

A Brassica 60 K Illumina Infinium™ SNP array contained 52,157 markers suitable for the allotetraploid Brassica napus (Clarke et al. 2016). The SNP analysis was performed according to the methodology detailed in Xu et al. (2016). Scanned SNPs differing between the four L-pools and the four S-pools were considered candidate gene regions associated with SL. The retained SNPs were used in a BLAST search against the B. napus ‘ZS11’ genome database (https://www.ncbi.nlm.nih.gov/assembly/GCF_000686985.2) to locate the chromosome positions (E value ≤ ‘1e−12’). SNPs matched to multiple loci in the genome were removed, and only the top blast hits were retained for further analysis.

Development of single sequence repeat (SSR) and InDel markers

According to the results of the SNP analysis, partial sequences of B. napus ‘ZS11’ A09 chromosome were selected to develop SSR markers in WebSat (http://wsmartins.net/websat/). Multiple insertions/deletions among the candidate regions were found by comparing the resequencing data of ‘DH46’ (unpublished) with B. napus ‘ZS11’ genome sequences. InDel markers were designed with Primer 3 (http://primer3.ut.ee/). All primers were used to amplify genomic DNA from the two parents and the L-pools and S-pools via a standard PCR as described by Xia et al. (2012). The PCR products were detected on 6% denaturing polyacrylamide gels, and only polymorphic markers were selected to narrow down the interval in the BC3F3 population.

Genetic mapping of the BnSLA09 locus

The 236 BC3F1 and 4824 BC3F3 individuals were used to construct a rough initial linkage map and a fine map of the BnSLA09 locus, respectively. Segregation data associated with polymorphic SSR and InDel markers linked to the gene were analyzed with JoinMap® 4 and MapDraw (Liu and Meng 2003; Van Ooijen 2006). All genetic distances (cM) were counted with Kosambi’s mapping function (Kosambi 1944).

Analysis of the genes in the mapping interval of the BnSLA09 locus

All putative genes with their annotation information in the target interval were obtained from the B. napus ‘ZS11’ (https://www.ncbi.nlm.nih.gov/assembly/GCF_000686985.2) and B. napus ‘Darmor-bzh’ (http://www.genoscope.cns.fr/brassicanapus/data/) genome databases. The orthologous genes were identified by BLAST analysis in the online Arabidopsis databases (TAIR, http://www.arabidopsis.org/), and only the top blast hits were considered. A bioinformatics analysis of all genes within the BnSLA09 locus region was performed for candidate gene predication. Promoter- and full-length genomic sequences of the candidate gene were amplified and sequenced between the two parents and the BC3F3 individuals.

RNA extraction and quantitative real-time PCR (qRT-PCR)

At the flowering stage, four different samples (L1: 0–2 mm buds; L2–L4: 2–4, 4–6, and 6–8 mm pistils of buds) were taken from BC3F3 individuals with long and short siliques and frozen in liquid nitrogen, and three biological replicates were used. Total RNA was isolated with a Universal Plant Total RNA Extraction Kit (RP3302; BioTeke Corporation, Beijing, China) according to the manufacturer’s instructions. A total of 2 μg RNA was reverse transcribed into cDNA with TransScript One-Step gDNA Removal and cDNA Synthesis SuperMix (TransGen Biotech, Beijing, China) and Oligo(dT)18 primer following the manufacturer’s instructions. Real-time PCR was performed in triplicate for each sample using the TransStart Tip Green qPCR SuperMix (TransGen Biotech, Beijing, China) on a CFX96 Real-Time System (Bio-Rad Laboratories, Inc., Hercules, CA, USA). The gene-specific primers used in the amplification are listed in Table S2, and the reference gene primers used for relative quantification and the analytical method are described in Zhou et al. (2012).

Results

Phenotypic description of ‘DH46’

The ‘DH46’ cultivar used in the present study was obtained from the doubled haploid lines of the recurrent population (Zhao et al. 2016) with short siliques and low SS. In the flowering stage, there were no obvious differences between ‘DH46’ and the conventional cultivar ‘ZS11’ in terms of the main inflorescence and the pollen, but their silique traits substantially differed (Fig. 1, Fig. S1). Most ‘DH46’ siliques had no seeds and dropped at the initial flowering stage; several siliques yielded a few seeds at maturity (Fig. 1b). For ‘DH46,’ the average SL was 1.50 ± 0.14 cm and the average SS was 1.13 ± 0.06; these values were significantly lower (P < 0.001) than those for ‘ZS11’ (8.07 ± 1.06 cm and 23.96 ± 0.10, respectively) (Fig. 1c).

Fig. 1
figure 1

Phenotypes of flowers and siliques in B. napus. a Main inflorescence of ‘ZS11’ (left) and ‘DH46’ (right) at flowering stage. b, c Main inflorescence and siliques of ‘ZS11,’ F1 hybrid of ‘ZS11’ × ‘DH46’ and ‘DH46’ at mature stage (from left to right). Bars, 2 cm

Genetic characterization

The F1 generation was constructed by crossing ‘ZS11’ with the male parent ‘DH46,’ as ‘DH46’ produced few seeds but normal pollen (Fig. S1). The F1 plants exhibited an intermediate phenotype with average SL and SS of 5.39 ± 0.23 cm and 6.31 ± 0.04, respectively. The SS can influence SL, but it was still easy to distinguish between long and short siliques within the mapping population because the former had longer and sharper pod beaks than the latter. The siliques of all F1 plants were considered long because their pod beaks were long and pointed (Fig. 1c). Compared to SL, SS was found to be a more complex trait in the mapping population (data not shown); thus, we focused on variations in SL in the next analysis.

The F1 individuals were backcrossed with the recurrent parent ‘DH46’ (as the male) to generate the advanced backcross lines. The short silique of ‘DH46’ was mainly due to the less SS, and SL variations among the mapping populations were ignored. However, in the BC3F1 generation, except the SS difference, the variations in SL were noticed in many BC3F1 progenies. A progeny of one BC2F1 plant comprising 236 BC3F1 individuals was analyzed and found to consist of 120 long-silique plants and 116 short-silique plants (according to their pod beaks). A Chi-square test indicated that the segregation ratio was nearly 1:1 (Table 1). To exclude the effects of SS on SL, only plants in the BC3F2 generation with no seed development defects and heterozygous locus of BnSLA09 were self-pollinated to construct a BC3F3 population (Fig. S2a). The BC3F3 progeny segregated in a 3:1 ratio (Table 1); therefore, SL is controlled by a single nuclear gene locus.

Table 1 Genetic analyses of the silique trait in F1, BC3F1 and BC3F3 progenies

SNP analysis

A Brassica 60 K Illumina Infinium™ SNP array was used to screen the L- and S-pools from the BC3F1 population, and 32 different SNPs were found after filtering. A BLAST search against the B. napus ‘ZS11’ genome database was performed, and only 16 SNPs remained. Fourteen SNPs were localized to the A09 chromosome, and one each was located in A03 and C06 (Table 2). The SNPs in A09 were localized mainly in a concentrated region between 36.32 and 38.53 Mb. This finding revealed that the target gene controlling the SL phenotype is located in ~ a 2.21 Mb interval within A09 and named BnSLA09.

Table 2 Distribution of retained SNPs against the B. napus cv. ‘ZS11’ genome

Fine mapping of the BnSLA09 gene

SSR and InDel markers were designed to fine map the BnSLA09 gene based on the 2.21 Mb candidate region of the B. napus ‘ZS11’ genome. Among the 19 polymorphic SSR markers, only five primer pairs with legible and steady bands were selected to analyze the 236 individuals from the BC3F1 population. SSR1253 and SSR1270 were closely linked to the BnSLA09 gene locus at genetic distances of 2.12 and 1.70 cM, respectively. To narrow the region of the BnSLA09 gene, several InDel markers between the SSR1253 and SSR1270 intervals were designed and 4824 plants of the BC3F3 generation were screened. Recombinant numbers indicated that the SSR1253, SSR1259, In1-18, and In1-19 markers were located on one side of BnSLA09, whereas the In2-52, In1-27, In1-34, and SSR1270 markers were located on the other side. In2-37 and In1-22 were co-segregated markers linked with BnSLA09 (Fig. 2). BnSLA09 was eventually localized between In1-19 and In2-52 which corresponded to ~ 98.47 kb on chromosome A09 in the B. napus ‘ZS11’ genome.

Fig. 2
figure 2

Mapping of the silique length locus in B. napus. a Physical map of candidate region on A09 chromosome in B. napus. b Partial genetic map of the BnSLA09 interval, developed from the BC3F1 generation. c Particle linkage map of the BnSLA09 region from the BC3F3 population. d Candidate region of BnSLA09 gene locus and the annotated genes in the reference genome of ‘ZS11.’ Sequences of molecular makers are listed in Table S1. More detailed information of these genes is listed in Table 4

Biological roles of BnSLA09

To elucidate the functions of BnSLA09, silique-related traits and the outer cells of young siliques were investigated in L-NIL and S-NIL plants of the BC3F3 generation. SL and SW both significantly differed between the L-NIL and S-NIL plants in the two environments and were on average 61.3% and 24.5% larger in the long-silique plants than in the short-silique plants, respectively (Table 3; Fig. 3). Consistent with the SW data, the seeds in the L-NIL plants were much larger than those in the S-NIL plants (Fig. 3e, f). Scanning electron microscopy (SEM) revealed that the outer cells of the L-NIL siliques were much longer than those of the S-NIL siliques (Fig. 3c, d). Long siliques have a comparatively large photosynthetic area and can provide more nutrients and space for encoded seed development. Therefore, the variation in SL controlled by BnSLA09 is influenced by silique cell wall elongation. The relatively larger silique surface area in L-NIL plants can provide much more photosynthetic product and increase SW to a greater extent than S-NIL plants.

Table 3 Statistical data of silique-related traits of BC3F3 from two environments
Fig. 3
figure 3

Phenotypes of silique-related traits in L-NIL and S-NIL plants in BC3F3 generation. a, b Comparison of the silique morphology in L-NIL (a) and S-NIL (b) plants. Bars, 2 cm. c, d Scanning electron microscope images of the outer cells of siliques in L-NIL (c) and S-NIL (d) plants, respectively. Bars, 50 μm. e, f Comparison of seeds in L-NIL (e) and S-NIL (f) plants. Bar, 0.5 cm

Candidate gene identification

The target region of BnSLA09 was analyzed based on the B. napus ‘ZS11’ and ‘Darmor-bzh’ genomes. The putative functions of these genes were adopted from their orthologs in A. thaliana. The candidate interval contained 14 predicted genes in the B. napus ‘ZS11’ genome and corresponding to 13 genes in the B. napus ‘Darmor-bzh’ genome (Fig. 2; Table 4). An annotation analysis of these genes indicated that BnA09g0377760.1/BnaA09g55530D was highly orthologous to AT3G61880 in Arabidopsis. AT3G61880, also known as CYP78A9, encodes a cytochrome P450 monooxygenase. Compared to the wild type, mutants of this gene exhibit short pods and small seeds, whereas overexpressing lines have long siliques and large seeds (Ito and Meyerowitz 2000; Sotelo-Silveira et al. 2013). BnA09g0377780.1/BnaA09g39470D is orthologous to AT3G61900, which is also known as the SAMLL AUXIN UP RNA33 (SAUR33) gene. It belongs to the SAUR-like auxin-responsive protein family which is one of three primary auxin-induced gene families in Arabidopsis. The function of SAUR33 is unclear, but it was reported that SAUR41 regulates cell expansion by auxin modulation and SAUR63 promotes hypocotyl and stamen filament elongation (Chae et al. 2012; Kong et al. 2013). Both of these genes in the candidate region may contribute to the variations in SL, and so we focused on them in the subsequent investigations.

Table 4 Prediction and annotation of candidate genes within the mapping region

To determine which gene is responsible for SL differentiation, the relative expression levels of both genes were evaluated during early silique development in BC3F3 L-NIL and S-NIL plants. The expression levels of BnaA09g39470D did not significantly differ between the L-NIL and S-NIL individuals at the L1, L2, and L4 stages, and the only difference was at the L3 stage (Fig. 4a). However, the transcript levels of BnaA09g55530D were significantly higher in L-NIL than S-NIL plants during early silique development, especially at the L3 and L4 stages (Fig. 4b). The high expression level of BnaA09g55530D produced long siliques and large seeds in L-NIL matched the phenotype of its orthology in Arabidopsis (Ito and Meyerowitz 2000; Sotelo-Silveira et al. 2013). Therefore, we concluded that BnaA09g55530D was the most likely candidate gene for BnSLA09 and named it ‘BnaA.CYP78A9.a’ based on the standard nomenclature of Østergaard and King (2008).

Fig. 4
figure 4

Relative expression of candidate genes BnaA09g39470D (a) and BnaA09g55530D (b) at different silique development stages between L-NIL and S-NIL plants. L1: 0–2 mm buds; L2–L4: 2–4, 4–6, and 6–8 mm pistils of buds. ***Significantly different at P < 0.001

Since B. napus is an allotetraploid species, homologs of BnaA.CYP78A9.a were identified by BLAST analysis in the reference genome of B. napus ‘Darmor-bzh.’ In brief, BnaA04g00510D on chromosome A04, BnaC08g31760D on chromosome C08, and BnaCnng73170D on chromosome Cnn were recognized as homoeologs (Glover et al. 2016) and their expression levels were measured in the early silique development stages. The expression levels of these three genes in both parents were extremely low compared with that of BnaA.CYP78A9.a in L-NIL plants (Fig. S3). Therefore, the long-silique phenotype in L-NIL plants was the result of the overexpression of BnaA.CYP78A9.a.

To characterize the relationship between the candidate gene and the variations in the silique-related traits, the promoter sequences, full-length genomic DNA, and coding sequences of BnaA.CYP78A9.a in ‘ZS11’ and ‘DH46’ were amplified. DNA sequencing revealed that the entire gene and coding sequences of BnaA.CYP78A9.a (GenBank accession number: MK224591 and MK224590) in ‘ZS11’ and ‘DH46’ were consistent with the reference sequences in the B. napus ‘ZS11’ genome. This gene consisted of two exons with an encoded protein composed of 532 amino acids, and four SNPs in the promoter interval (~ 3.0 kb before the start codon) between the two parents were found (Fig. 5). Moreover, a ~ 12.4 kb DNA sequence, ~ 3.9 kb upstream of the transcription initiation codon of BnaA.CYP78A9.a in ‘DH46,’ was replaced by a ~ 3.6 kb fragment in ‘ZS11’ (Fig. 5). In the corresponding regions, the 12.4 kb DNA sequence in ‘DH46’ resembled the reference genome sequence of B. napus ‘Darmor-bzh,’ whereas the 3.6 kb fragment in ‘ZS11’ was similar to the B. napus ‘ZS11’ genome sequence. The 3.6 kb fragment had a CACTA sequence in its reverse 3’ end which might have been a CACTA transposable element (TE). Therefore, we performed a sequence analysis in the public database CENSOR (www.girinst.org/censor/index.php). The replaced fragment was probably part of the BRENSPM1 element belonging to the CACTA transposon family, but its 5’ end was missing mainly because of the imprecise excision of an entire transposon (Fig. 5). The 12.4 kb DNA fragment in ‘DH46’ contained the putative gene BnaA09g55520D which is orthologous to AT3G61890 (ATHB12) in Arabidopsis and may positively control cell enlargement and expand leaves (Hur et al. 2015).

Fig. 5
figure 5

Gene structure, SNP variations, and fragment replacement between ‘ZS11’ and ‘DH46’; horizontal black arrows represent co-segregation analysis primers

Development of functional markers to identify different SL in B. napus

Based on the fragment replacements in ‘ZS11’ and ‘DH46,’ two pairs of functional markers were designed to identify long- and short-silique plants (Fig. 5). The CL1.1 and TE-R primer pair yielded a specific amplification product of 801 bp in ‘ZS11’ (Fig. 6a), while the CL02 and TE-R primer pair generated a unique 1024 bp product in ‘DH46’ (Fig. 6b). Amplification of the BC3F3 plants with these primers resulted in consistent silique phenotypes (Fig. 6). Therefore, these primers were highly effective at distinguishing the variations in SL between the two parents and their progenies.

Fig. 6
figure 6

Co-segregation analysis of long- and short-silique plants and silique traits in the identified accessions. a Primer CL1.1 and TE-R, the product is 801 bp. b Primer CL02 and TE-R, the product is 1024 bp. c, d Box plots for silique length (c) and seed weight (d) of identified accessions in the 521 natural population. The middle line, the median, the outer dots, the mean, the box, the range of the 25th to 75th percentiles of the total data, the cross, the max and min data, and the vertical lines, the interquartile range. Average SL (e) and SW (f) of three haplotypes of BnaA.ARF18.a in the accessions identified by primers CL1.1 and TE-R. ***Significantly different at P < 0.001

To verify the relationship of these co-segregation markers with SL and SW in B. napus, the primers were used to screen a diversity panel as described by Liu et al. (2016). Among 521 B. napus cultivars and inbred lines, 39 accessions were distinguished by the CL1.1 and TE-R primer pair and 147 accessions were identified by the CL02 and TE-R primer pair (Table S3). Analyses of the SL and SW measured in three environments of these accessions showed that SL and SW identified by the CL1.1 and TE-R primer pair were significantly greater than those distinguished by the CL02 and TE-R primer pair (Fig. 6c, d). Therefore, the functional markers designed in our study could be used to discriminate B. napus plants with different SL and SW.

The effect of BnaA.CYP78A9.a on SL and SW variations in B. napus

The candidate gene BnaA.CYP78A9.a and the reported gene BnaA.ARF18.a (Liu et al. 2015) are very close on chromosome A09 according to the reference genome of B. napus ‘ZS11.’ In order to figure out the effect of these two genes on SL and SW variations, the haplotypes (Hap A, Hap B, Hap C) of BnaA.ARF18.a in our natural populations were analyzed based on the results of Dong et al. (2018). A pair of specific primers (Hap-F, Hap-R) was designed to amplify the critical part of BnaA.ARF18.a to distinguish the haplotypes of them. The sequenced results revealed that only three accessions (accession name: 880101, 2012-9323 and Zhongshuang5) were Hap A, ten accessions were Hap B and other 25 accessions were belonged to Hap C in the 39 accessions identified by the CL1.1 and TE-R primer pair accessions. Moreover, more than 50 accessions classified into Hap A were recognized among the 147 accessions identified by the CL02 and TE-R primer pair. These results indicated that the truncated TE in front of BnaA.CYP78A9.a and the Hap C of BnaA.ARF18.a, as well as the disappeared TE in front of BnaA.CYP78A9.a and the Hap A of BnaA.ARF18.a, are closely linked to control SL and SW in most rapeseed varieties.

In addition, the average SL of the three Hap A accessions among the 39 accessions was 7.82 ± 0.59 cm and the average SW was 4.06 ± 0.83 g, which had no significant difference compared to the Hap B or the Hap C accessions among them (average SL was 7.12 ± 1.17 cm and 7.18 ± 0.90 cm; average SW was 4.35 ± 0.44 g and 4.25 ± 0.48 g, respectively) (Fig. 6e, f). Although only a small number of these special accessions were discovered, we could conclude from the existing results that the strong effect of BnaA.CYP78A9.a on SL and SW was independent of the haplotypes of BnaA.ARF18.a in the 39 accessions identified by the functional markers, which revealed that only the functional copy of BnaA.CYP78A9.a was enough to improve the SL and SW in Brassica napus.

Discussion

In oilseed rape, SL and SW are important agronomic traits related to crop yield. Only a few QTLs have been identified, and the functional genes controlling these traits are largely unknown. In this study, a DH line ‘DH46’ with few seeds and short siliques was investigated to map the gene associated with SL. A genetic analysis revealed that variation in the SL was controlled by a single Mendelian factor among the mapping populations, which was narrowed down to a 98.47 kb interval on chromosome A09 by microarray analysis and a map-based cloning strategy. Expression analyses and DNA sequencing showed that BnaA09g55530D, a ortholog of the Arabidopsis CYP78A9 gene, may control both SL and SW.

Previous studies have shown that several QTLs affecting SL and SW are linked or overlapped, and the major QTLs on chromosome A09 controlling both traits have been identified (Fu et al. 2015; Yang et al. 2012). The candidate gene recognized in the present study is located in this interval; thus, our mapping results are reliable. Moreover, the candidate gene BnaA.CYP78A9.a is close to the reported gene BnaA.ARF18.a (Liu et al. 2015) and can also affect both SL and SW development in rapeseed. So in the breeding process, we can select plants with this functional locus to achieve suitably long siliques and improve crop yield. CYP78A9 overexpressing lines in Arabidopsis have long siliques, large seeds, and few seeds (Ito and Meyerowitz 2000; Sotelo-Silveira et al. 2013), indicating that high CYP78A9 expression levels may affect silique-related traits. These findings are consistent with the expression levels of this candidate gene and phenotypes of SL and SW in L-NIL and S-NIL plants. However, the SS in L-NIL plants with overexpressed CYP78A9 levels is normal. This is mainly due to the expression of BnaA.CYP78A9.a homoeologs are at normal levels, and only the gene on chromosome A09 is overexpressed (Fig. 4b, Fig. S3). A recent transcriptome analysis of large and small seeds in rapeseed showed that BnaA09g55530D is upregulated in long-silique plants with large seeds (Li et al. 2018). Moreover, SL and SW are overlapped in the 186 rapeseed accessions identified by the co-segregation markers. This discovery confirms that BnaA09g55530D is the candidate gene responsible for the SL and SW variations observed in this study.

Regulatory elements in the upstream regions of functional genes play important roles in the regulation of gene expression (Kolovos et al. 2012; Porto et al. 2014). Several functional enhancers, silencers, and SNPs have been identified in plants such as a 12 bp enhancer of NOG1, an 18 bp silencer of FZP in rice (Bai et al. 2017; Huo et al. 2017), a SNP33 located in the promoter region of bsr-d1 in rice (Li et al. 2017), a Tourist-like MITE insertion in the upstream region of BnFLC.A10 in rapeseed (Hou et al. 2012), and a truncated TE in the upstream region of HaCYC2c in sunflower (Chapman et al. 2012). The genomic region of BnaA09g55530D is identical in both parents, and there are only four SNPs and a fragment replacement in the upstream region of the candidate gene (Fig. 5). Among the four CYP78A9 homoeologs in L-NIL and S-NIL plants, only the BnaA.CYP78A9.a in L-NIL plants is overexpressed. Moreover, the functional markers used in the natural population were designed based on the exist of the truncated TE fragment. The accessions identified by primer CL1.1 and TE-R had longer SL and heavier SW than the ones recognized by the primer CL02 and TE-R. Therefore, the truncated TE in the upstream region of BnaA.CYP78A9.a in L-NIL plants may upregulate BnaA.CYP78A9.a and produce longer siliques and heavier seeds than those of the S-NIL plants.

Silique-related traits have complex relationships, and SS and SW are positively controlled by SL (Chen et al. 2007; Fu et al. 2015; Zhang et al. 2011). Differences in SL between the parental ‘DH46’ and the S-NIL plants in the BC3F3 population are caused mainly by variations in the SS. Since SL and SS are significantly correlated, it is necessary to eliminate the influence of the latter when fine mapping the SL gene locus in the BC3F3 population.

With the rapid development of microarray and high-throughput sequencing technologies, there has been a continuous increase in the numbers of rapeseed SNP arrays and reference genomes being exploited (Chalhoub et al. 2014; Clarke et al. 2016; Sun et al. 2017). To locate the candidate region associated with variations in SL, we performed a microarray analysis between the two DNA pools of the BC3F1 generation. Compared to the traditional mapping methods using amplified fragment-length polymorphism analysis, this technique is more efficient and cost-effective. Since rapeseed is an allotetraploid species with numerous homoeologous exchanges, tandemly duplicated repeats, and transposable elements, genomic gaps remain in the reference genomes (Chalhoub et al. 2014; Sun et al. 2017). In the mapping region of BnSLA09, a 27.9 kb genomic gap exists in B. napus ‘Darmor-bzh’ reference genome, whereas this gap is absent in B. napus ‘ZS11’ reference genome. To fine map and locate the candidate interval rapidly and accurately by the map-based cloning method, it is necessary to consider all reference genome sequences.

Map-based cloning in this study identified a major gene controlling both SL and SW, and sequence and expression analyses indicated that BnaA09g55530D is the putative gene responsible for variations in SL. Fragment replacement and SNP variations found in the upstream region of the candidate gene may alter the expression of this gene and cause variations in SL. The candidate gene may positively regulate SW as well as in the 186 rapeseed accessions identified by the associated markers. Therefore, identification of the silique-related gene and the development of functional markers will enhance the molecular breeding of high-yield rapeseed varieties.

Author contribution statement

WS, PQ, and MY performed the experiments; BL carried out the SNP experiment, and ZW designed the SSR and InDel primers. WS analyzed the data and wrote the manuscript. JW, BY, CM, JS, and TF contributed the materials, experiment tools and supervised the study. JT conceived and supervised the writing. All the authors read and approved the final manuscript.