Introduction

Oilseed rape (Brassica napus, genome AACC, 2n = 38) is one of most important oilseed crops for edible oil (canola) worldwide. It is also suitable for biofuel and bioproduct applications, and is being used in European biodiesel production (Tahir et al. 2012). The amino acid composition of the meal is well suited for animal feed. However, the inclusion rate of rapeseed meal in animal feed is still limited due to the high fibre content, in particular for feeding monogastrics (pigs and poultry) (Rakow et al. 2007). The fibre fraction includes neutral detergent fibre (NDF), acid detergent fibre (ADF) and acid detergent lignin (ADL). Acid detergent lignin, consisting largely of lignin-related phenolic compounds, is the most nutritionally relevant fibre fraction (Wittkop et al. 2012). It has been reported that meal from yellow-seeded lines has much lower fibre content than meal from black-seeded lines due to their thinner seed (Rakow et al. 2007). Reducing the seed coat thickness also resulted in an increased proportion of oil and protein in the seed, leading to an improvement in overall seed value (Slominski et al. 1999).

To date, substantial efforts have been made to reduce fibre content in oilseed rape meal by using the yellow seed trait as a selection marker for reduced seed coat thickness (Wittkop et al. 2012). However, seed colour is known to be environmentally sensitive (Boesewinkel and Bouman 1995). Due to the complexity of inheritance and environmental instability of the trait, various major effect QTL in addition to minor QTL were detected in different genetic backgrounds of Brassica species tested in different environments (Liu et al. 2005; Badani et al. 2006; Liu et al. 2006; Fu et al. 2007; Snowdon et al. 2010; Liu et al. 2012; Xiao et al. 2012). Badani et al. (2006) observed a highly positive correlation between seed colour and ADF, and detected a major QTL on N18 with a large effect on both traits in multiple environments. A major QTL influencing seed colour, fibre content and phenolic compounds was mapped to the same position on B. napus chromosome A9 (N9) in populations derived from two different yellow- by black-seeded B. napus crosses (Snowdon et al. 2010). Furthermore, a strong positive correlation was observed between ADL content and seed colour (Liu et al. 2012).

Seed oil content is the most important agronomic trait in oilseed crops, with fatty acid composition being one of the most critical determinants of oil quality (Zhao et al. 2008). Seed oil content and fatty acid composition are typical quantitative traits controlled by multiple QTL and influenced by genotype–environment interactions. Several studies have detected QTL associated with oil content in different rapeseed populations (Ecke et al. 1995; Burns et al. 2003; Qiu et al. 2006; Delourme et al. 2006; Zhao et al. 2006; Chen et al. 2010). QTL controlling fatty acid composition and a possible relationship between those QTL and oil content QTL have been investigated in B. napus (Ecke et al. 1995; Qiu et al. 2006; Zhao et al. 2008; Cao et al. 2010; Smooker et al. 2011). Multiple QTL were detected, and QTL specificity was inferred for particular combinations of substrate and product of fatty acid synthesis and modification pathway in B. oleracea (Barker et al. 2007). Furthermore, allele-specific markers were developed for FAD2 and FAD3, major genes for the control of C18:1 and C18:3 content in B. napus (Yang et al. 2012).

Genetic improvement of seed yield has been the primary objective of canola breeders for many years. Seed yield is a complex quantitative trait, which is largely influenced by the environment. The seed yield per plant is directly determined by three yield-component traits, specifically, siliques per plant, seeds per silique and seed weight. Therefore, yield-associated traits (yield-components and yield-related traits such as plant height) are known as important factors for improving rapeseed yield. Chen et al. (2010) detected 18 QTL associated with seed yield, but only one of them was reproducible in at least two environments. A number of QTL detected were associated with silique traits (silique length and seeds per silique) (Chen et al. 2007; Zhang et al. 2011a, b, 2012).

The increasing demands for high-quality vegetable oil and biodiesel have resulted in a dramatic increase in production of rapeseed worldwide. Developing varieties with high seed yield and desired seed composition have been targets of rapeseed breeding, and a genetic interrelationship has been noted among the desired traits. A strong negative correlation between oil and protein content was reported (Zhao et al. 2006), and a co-localized QTL was identified for lower oil content and higher seed yield alleles (Chen et al. 2010). Although some studies detected QTL responsible for individual composition (fibre, protein, oil and fatty acids) and yield-associated traits, and a few studies have looked at a combination of a small number of these traits, no study has investigated and mapped QTL for all these traits in one population. The main objectives of this study are to unravel a more complete genetic architecture of seed colour and seed quality-related traits such as fibre content and fatty acid composition. In the current study, QTL were mapped for all these traits in a B. napus doubled haploid (DH) population (BnaYB) derived from a cross between a black- (DH12075) and a yellow-seeded (YN01-429) line, assessed across five environments. DH12075 and YN01-429 have distinct seed colour and fibre content, which might lead to the identification of novel loci/alleles for these traits. Furthermore, multi-trait and multi-environment QTL analysis were performed to dissect the genetic correlation between traits and QTL × environment interaction. This established an integrated map of QTL for seed-related traits in B. napus and refined the genomic regions conferring these seed features, which will accelerate breeding of cultivars carrying not only one desirable trait, but a combination of various improved traits.

Materials and methods

Plant material

A segregating doubled haploid (DH) population developed by Dr. G. Séguin-Schwartz (Agriculture and Agri-Food Canada, Saskatoon Research Centre) was derived from reciprocal crosses between two canola-quality spring-type B. napus lines. One parent (YN01-429) is yellow-seeded, has low fibre, and has high oil content. YN01-429 originates from complex crosses involving the black-seeded, canola-quality B. napus variety Regent and yellow-seeded B. rapa yellow sarson, B. carinata and B. juncea (Rashid et al. 1994). YN01-429 is an F8 inbred selection that displays normal meiotic behaviour and genetically stable yellow-seeded trait (Rakow and Relf-Eckstein, 2005). The second parent (DH12075) is black-seeded, has high fibre, and has relatively low oil content. It is a DH from the F1 progeny of a cross between the French variety Cresor and the Canadian variety Westar (Dr. G. Séguin-Schwartz and Dr. G. Rakow, Agriculture and Agri-Food Canada, Saskatoon Research Centre). The resulting population, called BnaYB, consists of >500 DH lines and has been evaluated for seed traits under field conditions. One hundred and ninety-two of these lines were genotyped using SSR markers to identify the most diverse individuals, from which the core set of 96 lines used in this study was selected, with the exception that additional selections were required when seed was limiting.

Field experiments

The DH lines together with parents and two yellow seed check lines YN97-262 and YN90-1016 were evaluated in the field over 4 years (2008, 2009, 2010, 2011) at two locations near Saskatoon, SK [AAFC farms in Saskatoon (S) and Llewellyn (L)]. A randomized complete block design was used for field trial with two replicates in 2008 and 2009, and three replicates in 2010 and 2011. The parental lines and two yellow seed check lines were present at least once in every range, as well as being replicated throughout the trial. The DH lines and checks were planted in ten-foot rows with 24-inch row spacing for all years. The granular pre-emergent herbicide Edge was applied to the site prior to seeding. Corn grits treated with Helix insecticide were side-banded with the seed for flea beetle control. Fertilizer was applied prior to seeding. Due to inclement weather, insufficient materials could be harvested from trials at Llewellyn in 2009–2011 to produce reliable data and were not analysed.

Once combined, seed samples were dried to bring the moisture content to approximately 10 %. Excess chaff was cleaned from the samples using a Clipper Seed Separator. Subsamples weighing approximately 65 grams were removed from the bulk sample and further cleaned using a ASC-3 Seed Cleaner (Agriculex, Guelph, ON) prior to analysis. The subsamples were dried at 40 °C for 48 h, before analysis.

Trait measurement

Oil content, iodine value, protein and fibre content were determined by near-infrared reflectance (NIR Systems Model 6500, FOSS, Eden Prairie, MN) according to AOCS standard procedure Am 1-92: determination of oil, moisture and volatile matter, and protein by near-infrared reflectance. For oil content, the system was calibrated with oilseed samples extracted with hexane as described by Raney et al. (1987). Results are reported as a percentage of whole seed dry matter (zero moisture). For iodine value (IV), a measurement of fatty acid unsaturation, the system was calibrated with oilseed samples whose iodine values were determined according to AOCS Recommended Practice Cd 1c-85 reapproved (1997): calculated iodine value.

Fatty acid composition was analysed using an Agilent 6890 GC-FID (Agilent Technologies, Santa Clara, CA) following preparation of fatty acid methyl esters by base-catalysed methanolysis (Thies 1971), and individual fatty acids were reported as a per cent of total fatty acids by mass. Very long chain fatty acid (VLCFA) is the sum of fatty acid molecules with carbon chains of 20 or longer. C16:018:0 is the sum of palmitic acid (C16:0) and stearic acid (C18:0).

For protein, the system was calibrated with oilseed samples whose protein contents were determined by the AOCS Official Method Ba 4e-93, revised 2003: generic combustion method for determination of crude protein using a LECO FP-528 protein analyser (LECO, St. Joseph, MI). Results were reported as a percentage on a whole seed dry matter (zero moisture) basis.

For fibre, the system was calibrated with oilseed samples whose total fibre contents were determined by a modified AOCS Approved Procedure Ba 6a-05, approved 2005: crude fibre analysis in feeds by filter bag technique.

For seed colour, whiteness index (WIE) was determined by Hunter Lab reflectance (Hunter Associates Laboratory, Reston, VA) (Rakow et al. 1999) where high negative values indicate light-coloured seed.

Thousand seed weight (TSW) was the weight of 1000 randomly chosen seeds from one line.

Molecular markers and genetic map construction

Three classes of molecular markers, 265 simple sequence repeat (SSR), 17 single-nucleotide polymorphism (SNP) (prefixed with “BSNP”) and 38 gene expression markers (GEM) (prefixed with “pp” and fibre), were used for genotyping the BnaYB DH population. The SSR primer sequences were developed at the Saskatoon Research Centre, Agriculture and Agri-Food Canada (http://aafc-aac.usask.ca/BrassicaMAST/). These were analysed using fluorescently labelled tail PCR amplifications in 384-well format using the 5-dye system on an Applied Biosystems 3730xl. Genographer (available at http://www.genographer.com/) was used to produce a gel-like image of the genotype data, and polymorphic loci were scored visually. Polymorphic loci were named according to the marker used and markers detecting multiple loci were affixed with alphabetic letters.

Since the two parents of the BnaYB DH population share a common ancestry, a limitation of polymorphic markers and consequent gaps in the genetic map was expected. To reduce gaps in the SSR genetic map, SNP markers were identified using a 5509 Illumina Infinium SNP array (http://aafc-aac.usask.ca/ASSYST/) with a subset of the BnaYB DH lines. SNPs were chosen if they appeared polymorphic in the BnaYB subpopulation and potentially mapped to regions containing large gaps. SNP genotyping was performed using KBiosciences (http://www.kbioscience.co.uk/) fluorescence-based allele-specific competitive PCR (KASPar).

In addition, GEMs characterized by large parental differences in transcript levels causing a distinctly bimodal distribution of expression phenotypes in the population (West et al. 2006) were added to the map. GEMs selection was performed as described in (Potokina et al. 2008) based on gene expression analysis performed with Agilent two-colour 4 × 44 K B. napus arrays (unpublished data; https://www.ebi.ac.uk/arrayexpress/arrays). Normalized expression values of each probe from all DH lines and parents were divided into two clusters using k-means clustering, and the probes with two non-overlapping clusters were identified.

Genetic maps were constructed using Mapmaker v3.0b (Lincoln et al. 1992) with a total of 320 markers using a minimum logarithm of odds ratio (LOD) linkage threshold of 3.0 and a maximum likelihood distance of 40 cM. SSR marker order was determined based on initial screening of a population of 192 BnaYB individuals, and these markers were used as a framework for ordering the remaining markers. Genetic distances in cM were estimated using Kosambi’s mapping function (Kosambi 1943). The markers were distributed over 21 linkage groups (LGs) that were named according to the Brassica chromosome nomenclature with N1–N10 representing the A genome (B. rapa) and N11–N19 the C genome (B. oleracea) of B. napus, respectively (Parkin et al. 1995). Each Brassica chromosome is represented by a single LG except for N12 and N14 which were divided into top and bottom (bot) fragments.

Although most lines were tested in all environments, a few were only represented in a subset of environments. Therefore, genetic maps were constructed for each year individually; these have slightly different marker distances, and MergeMap (Wu et al. 2011) was used to construct a consensus map that was used for all QTL mapping.

Statistical analysis

Estimates of components of variance and covariance were conducted using the R package lme4 (Bates et al. 2011) (http://lme4.r-forge.r-project.org/). The environments corresponded to compounded effects of location and year. Environments S8, S9, S10 and S11 corresponded to field trials conducted in Saskatoon [S] between 2008 and 2011, and L8 corresponded to the field trial conducted in Llewellyn [L] in 2008. Environment (Env), genotype (DH lines), genotype by environment (GE) interactions and replication (Rep) within each environment (block effect) were treated as random effects (http://articles.extension.org/pages/61006/estimating-heritability-and-blups-for-traits-using-tomato-phenotypic-data). The components of variance were estimated using a linear mixed model fit by REML in R package lme4 (http://cran.r-project.org/web/packages/lme4/index.html) (lmer(Trait ~ (1|DH) + (1|Env) + (1|DH:Env) + (1|Rep%in%Env))) and used for broad-sense heritability (H 2) estimate for each trait (Xu et al. 2013).

ANOVA was performed in R (R Development Core Team 2012, http://www.R-project.org/). Pairwise phenotypic correlations between traits were calculated using the Pearson correlation coefficient in the R package Hmisc (http://cran.r-project.org/web/packages/Hmisc/index.html). The correlations were grouped into three classes based on the magnitude of absolute value of correlation coefficient: (1) strong: correlation coefficient ≥0.5; (2) moderate: correlation coefficient between 0.3 and 0.5; and (3) weak: correlation coefficient <0.3 (Hill et al. 2015).

QTL mapping

Mean values for each trait from two or three replicates in each environment were used for the detection of QTL. Outliers of trait values were detected and removed using a Z-score transformation with a threshold of 3. QTL detection was performed using composite interval mapping (CIM) in WinQTL Cartographer v.2.5 software (http://statgen.ncsu.edu/qtlcart/WQTLCart.htm; Wang et al. 2012). A walking speed of 2 cM was used. The forward and backward stepwise regression was used for the selection of the markers to control the genetic background (control markers or cofactors) with up to five control markers. A window size of 10 cM was used to exclude closely linked control markers at the testing site. The logarithm of odds (LOD) thresholds at a significance level of 0.05 for identifying statistical significant QTL for each trait in each environment was calculated by 1000 permutations. The additive effect (a) and phenotypic variance explained by each QTL (R 2) were estimated by CIM for each individual trait–environment combination. QTL detected in different environments for each trait were considered to be the same if the confidence intervals overlapped and the additive effect was contributed by the same parent.

The digenic epistatic interactions among all pairwise combinations of QTL were analysed with multiple interval mapping (MIM) in WinQTL Cartographer v.2.5 software. The initial QTL model was set using the CIM results obtained for each trait in each environment. The QTL model was progressively refined by searching and testing QTLs or epistasis, and re-estimating. Both main additive effects of QTL and their epistatic interactions were tested for significance using the Bayesian information criterion (BIC). Not only main QTL (QTL with statistically significant main effect) and interactions among main QTL, but epistatic QTL (QTL that has no or small main effect but statistically significant interaction effect with another QTL position) interacting with main QTL were searched.

Multiple-trait composite interval mapping (Mt-CIM) implemented in WinQTLCart was used to test for the presence of QTL × environment (Q × E) interaction at the main chromosome regions affecting the target traits (Maccaferri et al. 2008; Wang et al. 2012). The value of the trait in each environment was treated as a separate trait for the common genotypes. The G × E (H4) hypothesis was tested. Mt-CIM was also used for joint analysis of various traits in each environment.

QTL nomenclature

QTL nomenclature followed the one used in McCouch et al. (1997) for rice, with modifications. Each QTL starts with a lowercase “q” followed by 2–5 letters in capital abbreviated from the trait name to designate the trait measured then an underscore followed by the chromosome where the QTL is found and an additional underscore and final numerical code make it possible to uniquely identify QTLs for the same trait on the same chromosome. For example, qNDF_N16_1 stands for QTL of trait NDF identified on chromosome N16 and is the first QTL mapped on this chromosome.

Results

Phenotypic variation and correlation between traits

The two parents, DH12075 and YN01-429, showed significant variation for most traits investigated in the five environments (Supplementary Table S1, S2), making them suitable for uncovering the underlying genetic basis for seed quality through a bi-parental cross. The BnaYB population displayed continuous variation at all environments for each trait, especially obvious for fibre traits (ADF, ADL and NDF), as well as wide transgressive segregation. Individual DH lines displayed bidirectional transgressive segregation for most traits, as shown by the maximum and minimum values in Supplementary Table S2. However, the mean BnaYB DH population values of most traits were closer to the parent DH12075 value than to the parent YN01-429. High broad-sense heritability was observed for most traits across five environments (Table 1), varying from 0.44 (Protein) to 0.91(ADL), indicating the phenotypic variation was attributable mainly to genetic variation.

Table 1 Pairwise Pearson correlations between traits across environments and broad-sense heritability of traits

Pairwise correlations between traits across environments were examined (Table 1). In general, both positive and negative correlations were observed. Three fibre traits (ADF, ADL and NDF) and seed colour whiteness index (WIE) had strong significant positive correlations. These four traits showed moderate to strong significant negative correlations with the fatty acids linoleic acid (C18:2) and linolenic acid (C18:3) and with iodine value (IV). There was a strong negative correlation between protein and oil, which is suggestive of competition among these sinks for assimilates. Oil content also displayed a significant negative correlation (weak to moderate) with VLCFA, C16:018:0 (sum of C16:0 and C18:0), C18:3 and fibre traits. Oil content had a significant but weak positive correlation with oleic acid (C18:1) and C18:2. Oleic acid, a precursor for desaturation, showed a highly negative correlation with C18:2 and C18:3 and a moderate positive correlation with C16:0C18:0, implying that there could be common genetic factors affecting these components. A moderate but significant positive correlation was observed between thousand seed weight (TSW) and protein.

QTL mapping of single traits in single environments

Due to significant genotype × environment interactions revealed by ANOVA (Supplementary Table S3), QTL analysis was first performed for individual trait in each environment. Variable numbers of significant QTL, from 2 to 8, were detected for each trait over the five environments (Table 2; Supplementary Fig. S1). Globally, the largest number of QTL (8) was detected for VLCFA and oil. The highest LOD scores were seen for QTL associated with fibre traits (ADF, ADL and NDF) and WIE in all environments. Clustering of QTL was observed on N9. A QTL mapping to the same interval was found on N9 for all traits investigated, except C183, protein and TSW. The additive effect in relation to parental contribution was also estimated. The relative contribution of the two parents for most of the traits followed expectations, with QTL having a positive contribution from the predicted donor parent. In the case of fibre traits, inheritance of the DH12075 alleles at the identified QTL increased fibre traits (ADF, ADL and NDF) and WIE in most instances, but decreased the oil content and IV, coinciding with the parental performance for these traits (Supplementary Table S1, S2).

Table 2 Overview of QTL identified for all traits across five environments

Seed colour is a prominent phenotype segregating in the BnaYB population. A major QTL for WIE was mapped on chromosome N9 with R 2 (phenotypic variance explained by QTL) varying from 56.3 to 74.8 % in each environment. This region on N9 is rich in genetic variation, with detected QTL for several other traits, as described in the previous paragraph. All fibre traits (ADF, ADL and NDF) had a major QTL mapped on N9 with intervals overlapping with qWIE_N9 across all five environments, coinciding with the strong positive correlations between these traits (Table 1). This major QTL had trait value increasing allele from DH12075. A minor QTL, qADL_N19, detected for ADL has an allele from YN01-429 contributing to higher ADL content. Three minor QTL for NDF identified in a single environment, qNDF_N13, qNDF_N16_2 and qNDF_N4, have the alleles from YN01-429 contributing to higher NDF content.

A total of eight QTL were identified for oil, with 8.4–16.1 % phenotypic variance explained by each QTL. For all QTL except qOIL_N8, the alleles from YN01-429 contributed higher oil content. Five QTL were detected for IV, with one QTL on N9 co-localized with fibre QTL and repeatedly mapped in five environments. A total of 30 QTL associated with fatty acid components were detected, with one QTL on N9 co-localized with the major fibre QTL identified for all fatty acid components except C18:3. YN01-429 was the source of alleles on N9 for increasing oil, IV and C18:2, while DH12075 contributed to a higher value of C16:018:0, C18:1 and VLCFA. Two QTL identified for oil on N2 (qOil_N2_1) and N16 (qOil_N16), respectively, coincided with QTL for C16:018:0 (qC160180_N2 and qC160180_N16), with alleles from YN01-429 increasing oil and alleles from DH12075 increasing C16:018:0. Another QTL for C18:1 on N2 (qC181_N2_2) was also associated with C18:2 (qC182_N2), with the trait-enhancing allele for C18:1 inherited from YN01-429 and a higher proportion of C18:2 conferred by the allele from DH12075. Furthermore, this QTL had an opposite effect for these traits relative to the QTL on N9. The QTL on N19 for C18:1 (qC181_N19) and C18:3 (qC183_N19) had overlapping intervals. The co-localization and opposite additive effect observed for QTL associated with these fatty acids could explain the strong negative correlation between C18:1 and C18:2/C18:3 (Table 1). These results suggest that enhancing alleles for various traits originated from the different parents. Iodine value (IV), used to determine the amount of unsaturation of fats and oils, had one QTL on N19 (qIV_N19_1) co-localized with qC181_N19 as well. For all QTL of VLCFA except two on N4 and N18, respectively, the alleles from DH12075 contributed to higher VLCFA levels. The QTL qVLCFA_N7_1 and qVLCFA_N8 was repeatedly detected in 3 and 4 environments, respectively.

Three QTL were identified for protein with increasing alleles from DH12075. However, no QTL was repeatedly detected in two or more environments. In terms of QTL for TSW, an equal number of trait-enhancing alleles were inherited from both parents. Co-localization of QTL on N16 underlying protein (qProtein_N16) and TSW (qTSW_N16_2) accorded with the significant positive correlation between these two traits (Table 1).

Identification of epistatic interaction of QTL

The epistatic interactions detected by MIM together with their average effects are reported in Table 3. A total of seven pairwise QTL–QTL interactions were detected for six traits at a significance level α = 0.05. Compared to the QTL effects, the epistatic interactions showed less consistency across environments. All of the interactions were detected in a single environment. Most of the epistatic effects detected with the MIM procedure and retained in the MIM models were due to the interaction between a major QTL on N9 and QTL on other chromosomes. For WIE, C16:018:0 and IV, both loci involved in epistatic interactions were detected as significant QTL in CIM scans. For ADL, C18:1 and C18:3, the additive effect of the second locus was too small to reach genome-wide significance in CIM scans. However, the QTL–QTL interactions between these loci were significant.

Table 3 Epistatic interaction between QTL

Multi-trait QTL analysis in single environments

Multi-trait QTL was performed on fibre traits and WIE in each environment (Fig. 1). A significant QTL on N9 was detected for all traits, and joint analysis was significant as well. For this QTL, all individual traits had a similar trend as well as the joint trait along the chromosome. In addition, in four environments L8, S8, S10 and S11, individual traits had a similar pattern as to the influence on the joint profile. The trait most influencing the joint profile was ADL, which had a profile very similar to the joint profile. It was observed that the additive effects of QTL located on other chromosomes for the individual traits were relatively high, such as qADF_N16, qADL_N19 and qNDF_N13, and the additive effects were too small to be detected as significant for the joint analysis. In all environments, trait ADL had the most influence on the joint trait. The least influencing trait was WIE while NDF in S9. Notably, in all environments except S9 (the peak position of NDF was slightly shifted from one of other traits), the QTL peak of all individual traits was located at the same position as the one for the joint trait.

Fig. 1
figure 1

Multi-trait QTL analysis for fibre traits and WIE across five environments. The horizontal lines indicate LOD significance threshold

Multi-trait analysis was also performed on all traits except protein and TSW to investigate the genetic correlation of seed colour with oil and fatty acid compositions (Supplementary Fig. S2). In general, fibre traits and WIE are more influencing for joint profile. In all environments except S10 where no fatty acid compositions were measured, no significant QTL was detected for joint trait on any chromosomes. A QTL on chromosome N9 was significant for fibre traits, WIE, IV, C18:1 and C18:2 in all environments tested, which was in agreement with the results of single-trait CIM analysis. Other linked or pleiotropic QTL were inconsistent across environments, such as the QTL on N2 detected for both IV and C18:1 (Supplementary Fig. S2d). Notably, a significant QTL on N10 (close to marker sR12317) was detected for oil in the joint analysis (Supplementary Fig. S2b) in environment L8. However, this QTL was not identified in environment L8 using single-trait CIM analysis, but detected in environment S8 and S10. A QTL on N7 was detected for C18:2, although not detected using single-trait CIM analysis in any environment (Supplementary Fig. S2c). On N2, a significant QTL was identified for IV and C18:1 which coincided with the results of single-trait CIM analysis (Supplementary Fig. S2d).

Surprisingly, qVLCFA_N9 detected on N9, using single-trait CIM analysis in environment S8 (Supplementary Fig. S2e), was not significant in the joint analysis. In environment S9, no QTL on N9 was detected for oil, C16:018:0 and VLCFA while identified using single-trait CIM analysis (Supplementary Fig. S2f). In environment S10, where no fatty acid compositions were measured, joint analysis was performed on fibre traits, WIE, oil and IV. The QTL on N9 was significant for all individual traits except oil and joint trait (Supplementary Fig. S2g). A significant QTL was detected for oil and joint trait on N10 which agreed with single-trait analysis (Supplementary Fig. S2h). A significant QTL was detected on N11 and N19 for joint trait, respectively, while not for individual trait (Supplementary Fig. S2i, j). In environment S11, the QTL identified for oil on N9 (close to marker pP078731) was not significant in joint analysis (Supplementary Fig. S2k).

Multi-environment QTL analysis

Multi-environment QTL analysis was performed to detect the significant QTL across environments and genetic (QTL) × environment effect (Fig. 2). In general, environment S9 influenced joint analysis the most with highest additive effect for traits ADF and WIE, while environment S10 influenced joint analysis the most for traits ADL and NDF (Fig. 2a–d). The environment L8 had the least influence with the lowest additive effect for all fibre traits and WIE. A significant QTL on N9 was detected for all fibre traits and WIE across all environments. This QTL was also statistically significant for multi-environment joint analysis. However, the Q × E effect was significant. The QTL mapped on other chromosomes in only one or two environments did not reach the significance threshold value in the multi-environment QTL analysis, such as qADF_N16, qADL_N19 and qWIE_N11.

Fig. 2
figure 2figure 2

Multi-environment QTL analysis. The horizontal lines indicate LOD significance threshold

For the oil QTL detected on N10, environment S8 was more influential on the joint profile. However, environment S9 showed the most influence on oil QTL on N5 which was not identified in single-trait QTL mapping (Fig. 2e, f). For the QTL on N9 identified for IV, environment S10 influenced joint analysis the most, while L8 was not significant (Fig. 2g). The QTL on N9 detected for C18:1 was significant for joint analysis and all environments except L8 (Fig. 2h). Surprisingly, no significant QTL was detected for C16:018:0, C18:2, C18:3 and VLCFA.

Discussion

In the present study, a major QTL qWIE_N9 located on N9 was stably detected over five environments, indicating it could harbour a causal gene/gene cluster for regulating seed pigment biosynthesis and/or storage. In other studies, a major QTL with a large effect on seed colour was mapped to the two homoeologous B. napus chromosomes A9 and C8 along with other contributing QTL (Badani et al. 2006; Fu et al. 2007; Snowdon et al. 2010; Zhang et al. 2011a, b; Liu et al. 2012). It has also been reported that a major QTL on N9 (A9) for seed colour explained 64 % of phenotypic variance in B. rapa (Kebede et al. 2012). The loss in function of a bHLH transcription factor BrTT8 on A9, which regulates the late biosynthetic genes of flavonoid pathway, has been reported to result in yellow seed coat in B. rapa (Li et al. 2012). In this study, parental line YN01-429 has the genetic background of yellow-seeded B. rapa, B. carinata and B. juncea. Further investigation is required to determine whether BrTT8 is causal gene underling the QTL qWIE_N9.

Due to the strong positive correlation between seed colour and fibre content observed in this study and previous research (Badani et al. 2006; Liu et al. 2012; Wittkop et al. 2012), it is not unexpected that the major QTL on N9 influencing both traits was mapped to the same position. Our study not only detected the most reported chromosome (N9/A9) associated with fibre content and seed colour, but also identified new significant QTL on other chromosomes. Snowdon et al. (2010) reported a major QTL influencing both seed colour and fibre content mapped on A9 (N9). A major QTL on A9 for ADL content was explained by a loss-of-function allele of the Bna.CCR1 locus on chromosome C8 caused by a homoeologous non-reciprocal translocation (Liu et al. 2012). Whether the fibre-related QTL on N9 (A9) reported in the present study is related to mutation of the CCR1 locus is still under investigation. Another major QTL with large effect on both seed colour and ADF was reported on chromosome N18 in three B. napus populations across multiple environments (Badani et al. 2006).

Depending on the population and environments studied, different QTL for oil content have been reported (Burns et al. 2003; Zhao et al. 2005; Delourme et al. 2006; Qiu et al. 2006; Zhao et al. 2006; Chen et al. 2010; Zhao et al. 2012). Delourme et al. (2006) consistently detected one common QTL for oil content on N3 in two B. napus populations and in addition, located QTL on N2, N4, N9, N10, N16 and N17 in the same two populations. However, in the present study, no oil content QTL was identified on N3, but QTL were identified on chromosomes N2, N4, N8, N9, N10, N16 and N19. These QTL were more environment specific than the fibre QTL. Only four QTL (on N2, N9, N10 and N16) were repeatedly identified in two out of five environments in the present study. It has been reported that additive × environment interactions was a major factor contributing to variation of oil content in B. napus (Zhao et al. 2005). Likewise, Chen et al. (2010) identified 23 environment-specific QTL associated with oil content and only four reproducible QTL in at least two environments. Notably, Zhao et al. (2008) detected QTL for oil content on N2, N9 and N19 in a B. napus DH population derived from a cross between parents both having a high erucic acid (C22:1) and a very high oil content. QTL were also identified on these chromosomes in the present study. Similarly, pleiotropic QTL for oil content and fatty acid composition were reported by Zhao et al. (2008).

The fatty acid composition is the major determinant of edible and processing quality of rapeseed oil (Yang et al. 2012). A number of independent and pleiotropic QTL were identified for different fatty acid species in this study, indicating that complex interactions exist between individual fatty acid biochemical pathways. Acting in parallel with desaturation of 18-carbon fatty acid, elongation of the fatty acid chain produces VLCFA (Millar and Kunst 1997). Since these two pathways compete for the same precursor, it is not unexpected to see coincident QTL for C18:1 and C18:2 on N9. Similarly, the presence of coincident QTL for C18:1 and C18:2 with the major QTL for C22:1 on A8 and C3 was reported by Smooker et al. (2011). Interestingly, one locus associated with C18:2 at the top of N11 in this study is most likely the same as the one identified for C18:1 and C18:2 on C1 by Smooker et al. (2011) which coincided with an orthologue of FAD2. Six copies of the FAD3 gene are distributed on the A and C genomes (A3, A4, A5, C3 and two copies on C4) in B. napus (Yang et al. 2012). Two major QTL for C18:3 identified on A4 and C4 corresponded with two FAD3 copies in a B. napus DH population of high oleic acid × low oleic acid (Hu et al. 2006). Surprisingly, no QTL was identified for C18:3 on the chromosomes harbouring FAD3 gene in this study. In this study, the co-localization of QTL with contrasting parental effects for C18:1 and C18:2 on chromosome N19 would be indicative of one copy of the FAD2 gene.

Of the traits analysed in the present study, seed protein content appears to be the most affected by QTL × E, since no QTL were reproducible in two or more environments. The co-localization of one QTL on N16 for both oil and protein with opposite additive effects agrees with the strong negative correlation between these two traits in the present and previous studies (Zhao et al. 2006 and references therein), as a result of the competition for the common basic substrates during the biosynthesis of oil and protein. The strong negative correlation between oil and protein contents and their linked QTL with opposite source of favourable alleles represents a potential challenge for improving both oil and protein simultaneously in B. napus. However, apart from the linked QTL for seed oil and protein, independent QTL (seven for oil and two for protein) identified in this study are suitable for simultaneous improvement of these two traits.

Epistatic interaction between QTL have been shown to be important for unravelling the genetic basis for complex traits in plants (Xing et al. 2002; Malmberg et al. 2005; Xu and Jia 2007; Yang et al. 2010). The digenic epistatic interactions observed in this study are environment specific. No significant epistatic interaction was observed for the QTL of oil content in the present study. However, a number of pairs of loci with additive × additive epistatic interactions associated with oil content were reported in a B. napus population derived from a cross of parents both with high erucic acid, glucosinolate and oil contents (Zhao et al. 2005, 2006). This discrepancy, at least in part, could result from the different genetic background of the populations studied. Another possible reason could be the use of different statistical models, as demonstrated by previous studies performed in B. napus (Delourme et al. 2006).

In several cases, the QTL intervals for multiple traits overlapped. The most noticeable one is the co-localization of individual QTL on N9 for various traits. Likewise, a seed-related QTL cluster on A9 was reported in B. rapa (Bagheri et al. 2013). One major QTL for seed colour was co-localized with other seed-related QTL for seed size, seed weight, seed oil content, number of siliques and number of seeds per silique. These common chromosome regions with QTL for different traits indicate either linkage or pleiotropic effects. There could be pleiotropic effects of single genes or certain tightly linked trait-specific genes affecting multiple traits. Some of the pleiotropy could be the consequence of correlation between traits, such as fatty acid components, which are subject to autocorrelation since fatty acid components are expressed as a percentage of the total identified fatty acids (Smooker et al. 2011). There are reports of QTL having pleiotropic effects in B. napus and other crops (Shi et al. 2009 and references therein).

To dissect the genetic correlation between traits studied, multi-trait joint analysis was performed. Multiple-trait analysis takes into account the structure of genetic correlations between traits. It can test the nature of genetic correlations between traits, such as pleiotropy versus close linkage. Furthermore, multiple-trait analysis allows the estimation of genetic variance–covariance matrix between traits and its decomposition in terms of individual QTL (Da Costa et al. 2012). The change in QTL effect size of the individual trait was observed using multi-trait mapping compared to single-trait analysis. Some QTL profiles from single-trait and multi-trait analyses were very similar, such as joint analysis of fibre traits and WIE. However, inconsistency of QTL profiles from single-trait and multi-trait analysis were also observed. For instance, a significant QTL on N10 (close to marker sR12317) was detected for oil in the multi-trait analysis in environment L8 while not identified using single-trait CIM analysis. The opposite behaviour of inconsistency of QTL profiles from single-trait and multi-trait analysis was evidenced by some QTL such as qVLCFA_N9, which was detected as significant on N9 using single-trait CIM analysis in environment S8, but not significant in the joint analysis. The similar inconsistency of QTL profiles was also observed in other multi-trait QTL analysis (Singh et al. 2012). The joint effect of multi-trait analysis on the additive effect of each trait reflects the correlation structure between traits. Multi-trait analysis should provide more useful information than individual QTL mapping analyses due to the systematic nature of physiological processes in plants (Singh et al. 2012). Multi-trait analysis showed that the QTL on N9 was significant for individual and joint trait when WIE and fibre traits were combined, which was indicative of pleiotropy of this QTL. Similarly, pleiotropy of this QTL was also observed for IV, C18:1 and C18:2. The strong correlations between these traits were mostly caused by this pleiotropic QTL. For other traits, the QTL on N9 might be closely linked rather than pleiotropic.

Multi-environment QTL analysis is more powerful in detecting significant QTL for complex traits controlled by a number of QTL of smaller effects that might not be detectable with single-environment analysis (Alimi et al. 2013), which accounts for G (QTL) × E and tests all detected QTL in all environments. This was demonstrated by the detection of additional QTL for oil in the present study. QTL × E is significant if the QTL effects are strongly expressed in one environment but weakly in another (MacMillan et al. 2006). Of the QTL examined here, QTL × E was found for all of them. QTL × E observed was due to the difference of magnitude and not the direction of QTL. This was also evidenced by the QTL analysis in single environments. No crossover interaction was observed as QTL effects were consistently positive or negative across environments. The major QTL on N9 pleiotropic for WIE and fibre traits was the most stable across environments but with different magnitude of effect. The marker linked to this QTL is valuable for selection in breeding. Environment-specific QTL for seed colour was also observed in the present study and other study (Fu et al. 2007).

As the two parents in this study have some common ancestry, it is not surprising that some regions of the genome lack polymorphic markers. Therefore, given our marker density and population size, this study has sufficient power to detect QTL of large effect that are common across environments and environment specific. A larger population size coupled with higher density of markers would empower us to identify more QTL of small effects. Nevertheless, this study has led to the identification of numerous genomic regions involved in the control of seed-related traits across multiple environments with a few novel QTL being identified. To the best of our knowledge, this was the first study of multi-trait QTL analysis by taking into account the correlated structure of seed colour and seed composition in B. napus. These results identified the critical genomic regions governing seed features and will facilitate the development of strategies for integration of favourable alleles to produce B. napus cultivars with desirable seed quality characteristics.