Introduction

Seed weight is one of the most important traits in crops in which seeds are the direct product to be used, like cereals or oilseeds. An understanding of genetics of seed weight is of particular importance regarding to not only the economic and nutritional aspects, but also physiological, biochemical and molecular aspects. Since the beginning of plant domestication, plant breeders have been selecting varieties with bold seeds and increased seed weight. Genetic improvement of Indian mustard (Brassica juncea) for high seed weight has led to the development of bold-seeded varieties using the natural variation in the crop. It has generated a preference among the Indian farmers for the bold-seeded varieties.

Brassica juncea, an allotetraploid (AABB) with diploid genomes of B. rapa (AA) and B. nigra (BB), is an important oilseed crop of the Indian subcontinent, having two distinct gene pools—Indian and east European (Pradhan et al. 1993; Srivastava et al. 2001). The major breeding objectives in the B. juncea target yield enhancement, oil quality and resistance against pathogens and tolerance for abiotic stresses (Pradhan and Pental 2011). It has been shown that F1 hybrids between the lines of these two gene pools are more productive than those involving genotypes within a gene pool (Pradhan et al. 1993). In B. juncea, the first commercial F1 hybrid DMH-1 released in India has been developed from a cross between an Indian gene pool line Pusa bold and an east European gene pool line EH-2 that shows about 30 % higher yield than the best check varieties (Sodhi et al. 2006). However, the farmers’ preference for the hybrid DMH-1 is relatively low because the seed weight of the hybrid is less than 4.0 g per thousand seeds due to the smaller seed size of the pollinator parent, EH-2 (thousand seed weight ~2.5 g).

The current challenge in the improvement of seed size, however, lies in the identification of genomic regions quantitative trait locus/quantitative trait loci (QTL) directly influencing seed size and yields. QTL mapping studies identifying seed weight QTL have been carried out in various plant species like Arabidopsis (Alonso-Blanco et al. 1999; Herridge et al. 2011), tomato (Doganlar et al. 2000), chickpea (Hossain et al. 2010), soybean (Sun et al. 2012) and various Brassica species (Ramchiary et al. 2007; Fan et al. 2010; Yang et al. 2012; Yadava et al. 2012; Li et al. 2014; Qi et al. 2014). Seed weight, a highly heritable trait, has been reported to be governed by a relatively large number of QTL with small effects and a few QTL with large effects controlling the trait.

Seed weight in dicots is determined by regulating the growth of the embryo, endosperm, integuments and cell number/cell size or both in the female gametophyte (Sundaresan 2005; Alonso-Blanco et al. 1999). Different studies on factors influencing seed size in Arabidopsis have implicated several important genes. HAIKU (IKU), a transcription factor, was the first to be shown to affect seed size by influencing endosperm development (Garcia et al. 2005). IKU1, IKU2 and MINI3 and SHB1 act in the same developmental pathway affecting endosperm size (Luo et al. 2005; Zhou et al. 2009). Genes involved in hormone signaling such as the Brassinosteroid hormone play a central role by regulating endosperm and embryo development (acting upstream to SHB1, IKU1, MINI3 and IKU2) and growth of integuments [acting upstream of AP2, MNT/ARF2 and cytochrome P450 KLUH (KLU)/CYP78A5] (Jiang et al. 2013; Jiang and Lin 2013). Cytokinins and ethylene-mediated signaling also play a major role in seed size regulation (Li et al. 2013; Bennett et al. 2014). Flavonoids communicate signals from seed coat to endosperm, thus affecting its proliferation (Doughty et al. 2014). The ubiquitin pathway controls the seed size by targeting proteins for degradation (Li and Li 2015). A polycomb (PcG) group of genes and other epigenetic regulators also affect seed size (Li and Li 2015).

In this article, we have made concerted attempts on genetic analysis of seed weight in the oilseed crop B. juncea. By encapsulating maximum genetic variability from four different bi-parental mapping populations and QTL analyses, we have defined major QTL for seed weight. Using meta-analysis, consensus QTL for seed weight have been identified highlighting the underlying allelic variations. The candidate genes for seed weight from various Brassica species and the model crucifer Arabidopsis overlapping the consensus QTL have also been identified and their relevance in breeding programs for increasing seed weight in B. juncea is described.

Materials and methods

Plant materials

Four bi-parental microspore-derived doubled-haploid (DH) mapping populations covering maximum variability for seed weight in B. juncea were used in the present study. These four populations have been developed from six parents including three Indian (Pusajaikisan, Varuna and TM-4) and three east European lines (Heera, Donskaja-IV and EH-2). Two of these populations, namely, VH (derived from F1 between Varuna and Heera) and TD (derived from F1 between TM-4 and Donskaja-IV) used in the study have been earlier reported by Ramchiary et al. (2007) and Yadava et al. (2012), respectively. Two new DH populations, DE (derived from F1 of Donskaja-IV and EH-2) and EPJ (derived from F1 of EH-2 and Pusajaikisan), were developed in the present study. Detailed phenotypic contrasts for seed weight among the parental lines of these four populations are shown in Table 1.

Table 1 Mean thousand seed weight (Tsw) of the parental and doubled-haploid lines of the four B. juncea mapping populations

The EPJ and DE map used for the QTL analysis of seed weight consisted of 182 and 184 DH lines, respectively. The EPJ map was constructed with a total of 860 markers consisting of 241 intron polymorphisms (IP), 462 genic SSRs and 157 BAC-derived SSRs with a total genetic length of 2073.6 cM (Dhaka 2015). The DE linkage map with 653 markers has 344 IPs, 278 SSRs (both genic and BAC-derived), 26 SNPs and 5 Indels that cover a total genetic length of 1598.5 cM (Rout 2014). Details of these two linkage maps will be reported elsewhere.

Field experimental design and trait measurements

The EPJ population along with the parents, EH-2 and Pusajaikisan, was planted in three environments of northwest India in Delhi, Bharatpur and Alwar during the winter growing season. The field trials of the DE population along with the parents Donskaja-IV and EH-2 was conducted at Delhi for 3 years. The experiments were conducted in a randomized block design with three replications. Each line was planted in a single row with a row-to-row distance of 40 cm and a plant-to-plant distance of 10 cm. Five representative plants from the middle of the row were used for phenotyping. Seed weight trait was phenotyped as thousand seed weight (Tsw) measured in grams. The mean weight of thousand seeds from the mixed seeds of 15 plants, with 5 plants from each of the three replications was used as the trait value for statistical and QTL analyses.

QTL analysis for seed weight

Marker genotyping data of the four DH populations were scored as ‘a’ (female parent allele) and ‘b’ (male parent allele) with an exception to EPJ population where male parent allele (Pusajaikisan) was scored as ‘a’ and female parent allele as ‘b’ (EH-2). All the statistical analyses were performed using SPAR 2.0 (2005). Composite interval mapping (CIM) using Windows QTL Cartographer 2.5 (Wang et al. 2006) was used for analyzing QTL for seed weight (Tsw). A window size of 10 cM and a walk speed of 1 cM with five control markers were used. For declaring the presence of a QTL, genome-wide threshold values (P = 0.05) were estimated from 1000 permutations of trait data across all genetic intervals (Churchill and Doerge 1994; Doerge and Churchill 1996). QTL detected with more than 10 % of the phenotypic variance (R 2) were considered as major QTL.

For identification of consensus QTL by meta-analysis, QTL maps of the four bi-parental populations were used. In addition to QTL mapping of seed weight trait of EPJ and DE populations in the present study, a QTL re-analysis of the trait was also undertaken for VH and TD populations, using the multi-site trait data of VH (Ramchiary et al. 2007) and TD (Yadava et al. 2012) (Table 1). The VH and the TD genetic maps used for the purpose consisted of 2205 and 970 markers, respectively. Prior to meta-analysis, a consensus map was developed using the genetic maps of TD, EPJ and DE projected on to the VH map using the ‘Map projection’ command of BioMercator 2.1 (Arcade et al. 2004; Goffinet and Gerber 2000). Arabidopsis genomic blocks (Schranz et al. 2006) were assigned to the consensus map based on the corresponding IDs of the intron polymorphisms and the genic-SSR markers based on synteny with B. rapa available in the Brassica database.

The Tsw QTL identified were then integrated into consensus QTL using the meta-analysis method followed by Yadava et al. (2012). QTL obtained from different populations which merged into a single QTL upon meta-analysis were regarded as ‘consensus’ QTL (C-QTL), while those which did not merge by meta-analysis were designated as ‘population-specific’ QTL. The population-specific QTL were further classified into either ‘stable’ QTL (S-QTL) when at least two QTL from different environments merged into a single QTL or ‘environment’ QTL (E-QTL) which did not merge. The allelic effects at these QTL merging into a consensus QTL were estimated in each of the component bi-parental populations by calculating the phenotypic difference between the two parental genotypes at the marker closest to the consensus QTL following the procedure of Chardon et al. (2014). Similar exercise of allelic effects estimation was also undertaken for ‘population-specific’ QTL using the respective bi-parental QTL maps.

The epistasis and QTL × environment interactions influencing seed weight (Tsw) were identified by QTL Network 2.0 (Yang et al. 2007). It simultaneously investigates the additive QTL, QTL × environment, QTL × QTL (epistasis) as well as epistasis × environment interactions. The analyses were based on a mixed linear model approach with 1 cM walk speed and a testing window of 10 cM. To detect epistatic effects among loci with or without individual additive effects, 2D genome scans were carried out. 1000 permutations were performed for generating thresholds for the presence of QTL.

Results

Phenotypic variation in seed weight

The phenotypic variation for the trait among the six parents of the four bi-parental populations ranged from 2.0 to 7.6 g in different environments (Table 1). The maximum parental contrast was recorded for EPJ population ranging from 2.4 g in EH-2 to 7.6 g in Pusajaikisan. The parents of the VH population also showed high contrast in seed weight ranging from 2.3 g in Heera to 6.3 g in Varuna. The parents of the other two bi-parental populations (DE and TD) were not contrasting for seed weight trait. The frequency distributions of the trait in all the four populations over different environments showed continuous variation indicating the quantitative mode of inheritance for Tsw (Fig. 1). Transgressive segregants were observed in all of the populations with Tsw beyond the parental values, wherein DE and TD populations showed a higher frequency of transgressive segregants as compared to the EPJ and VH populations (Fig. 1). The trait showed high broad-sense heritability (ranging from 73.0 to 94.0 %) in all the four populations (Online Resource 1).

Fig. 1
figure 1

Frequency distributions of thousand seed weight in four mapping populations of Brassica juncea. a DE, b EPJ, c TD and d VH in multiple environments

QTL mapping of seed weight (Tsw) in four bi-parental populations

QTL analysis of the trait in the EPJ population showing very high parental contrast for the phenotype identified a total of 25 QTL with LOD scores greater than 2.5 from the three environments (Table 2). These QTL were detected over 12 linkage groups, wherein 12 QTL mapped to A and 13 to the B subgenome. Pusajaikisan contributed the positive alleles for all the 25 QTL. The additive effects of these QTL ranged from 0.1 to 0.4 g and the phenotypic variance explained (R 2) by each QTL ranged from 3.3 to 16.5 %. The analysis identified four major QTL with R 2 greater than 10 % located on LGs A3, B3 and B8.

Table 2 Tsw QTL data of DE, EPJ, TD and VH populations used for meta-QTL analysis

The DE population derived from the parents Donskaja-IV and EH-2 lacking much variability in the phenotype identified a total of ten QTL from 3 years of trials. The QTL were distributed over six linkage groups, three each in the A and B subgenomes (Table 2). Unlike the EPJ population, where all the positive alleles were contributed by the high parent (Pusajaikisan), the DE population identified positive alleles from both the parents. While Donskaja-IV contributed positive alleles for seven QTL, EH-2 contributed positive alleles for three QTL. Except two major QTL in LG A3, all others were detected as minor QTL.

The pattern of QTL re-analysis of the TD population developed from TM-4 and Donskaja-IV with no phenotypic contrast for seed weight was also similar to that of the DE population with both the parents contributing the positive alleles. Of the 14 Tsw QTL detected in the population, 8 had trait enhancing alleles from TM-4 and 6 had trait enhancing alleles from Donskaja-IV. Of these, five QTL in the LGs A7, A8, A9 and B7 were detected as major QTL with R 2 greater than 10 % (Table 2).

The QTL re-analysis of the VH population identified a total of 16 QTL for seed weight. In all the 16 detected QTL, the parent Varuna with high trait phenotype contributed positive alleles to all the QTL showing a similar genetic architecture as that of the EPJ population. The QTL were distributed over eight LGs with five major QTL in LGs A3, A10 and B3 (Table 2).

The QTL analysis in the four bi-parental populations identified a total of 65 Tsw QTL, of which 39 and 26 QTL were detected in the A and B subgenomes, respectively. Of these 65 QTL, 25 % of the QTL (=16) were detected as major QTL explaining more than 10 % of the phenotypic variance.

Identification of consensus Tsw QTL

The 65 QTL for seed weight identified in the four mapping populations phenotyped in different environments were projected onto the consensus genetic map of B. juncea for the identification of ‘consensus’ QTL (C-QTL). A meta-analysis was performed with the hypothesis that the Tsw QTL identified in different environments and located in a given region of a chromosome could be different estimations of the same QTL. Meta-analysis of 65 QTL resulted in six consensus Tsw (Tsw-C) QTL distributed on LGs A3, A7, A10 and B3 (Fig. 2; Table 3). These six Tsw-C QTL were formed by integrating 33 component Tsw QTL, while the remaining 32 Tsw QTL remained as population-specific QTL. The number of component QTL that merged into each of these six Tsw-C QTL ranged from three to nine. Of the six Tsw-C QTL, LGs A3 and B3 included two C-QTL each and LGs A7 and A10 harbored one C-QTL each. One C-QTL each in LGs A3 (Tsw-C-A3-1) and B3 (Tsw-C-B3-1) were formed by merging the maximum number of major component QTL, while the second C-QTL in these LGs (Tsw-C-A3-2 and Tsw-C-B3-2) were formed by merging minor component QTL. The remaining two C-QTL in LGs A7 and A10 were formed by merging the maximum number of minor component QTL.

Fig. 2
figure 2

Positions of six consensus Tsw (C-Tsw) QTL distributed on LGs A3, A7, A10 and B3 in Brassica juncea consensus map. Marker names are on the right of the linkage group bar and the positions in centiMorgans (cM) are on the left. The component QTL identified in EPJ, DE, TD and VH populations integrating into the consensus QTL are shown on the left side of the linkage group bar

Table 3 Characteristics of consensus Tsw (C-Tsw) QTL identified by meta-analysis through merging four component QTL maps (VH, TD, DE and EPJ)

Of the 32 Tsw QTL from the four bi-parental populations that did not merge to a consensus QTL, 15 QTL merged into population-specific stable QTL (S-QTL) as they were detected in more than one environment. The remaining 17 QTL which did not merge were detected as environment QTL (E-QTL) (Online Resource 2). The maximum number of three S-QTL were detected in the TD population wherein one S-QTL was formed by merging the component QTL from three environments (Tsw-S-A9-1-TD), while the other two S-QTL were formed by merging the component QTL from two environments. In the other three populations, thec EPJ population identified two S-QTL (Tsw-S-A1-1-EPJ and Tsw-S-B8-1-EPJ) and the VH population identified one S-QTL (Tsw-S-A4-1-VH). All the population-specific QTL in the DE population were detected as E-QTL.

Tsw-C-A3-1 in LG A3 was formed by merging six component QTL and all included positive alleles from both Indian and east European gene pool lines in the ‘J’ Brassica genomic block. This C-QTL that merged the QTL from three populations (DE, EPJ and VH) showed the highest allelic effects from Pusajaikisan, showing an increase in seed weight by 0.6 g followed by Donskaja-IV by 0.3 g and Varuna by 0.2 g in the respective populations (Table 4). In the second C-QTL, Tsw-C-A3-2 in F block, formed by merging minor QTL from two populations (DE and EPJ), revealed the maximum allelic effects in the EPJ population, wherein the Pusajaikisan allele increased the trait phenotype by 0.4 g as compared to Donskaja-IV allele in the DE population showing an increase by 0.2 g (Table 4).

Table 4 Estimation of allelic effects (measured in grams) of six consensus QTL for seed weight trait involving four bi-parental populations of B. juncea and determination of allelic status of six parents

The Tsw-C-A7-1 that merged QTL from all the four populations and located at the junction of E and N blocks showed contribution of positive alleles from both Indian (Pusajaikisan, TM-4 and Varuna) and east European (Donskaja-IV) lines. Among the Indian parents, Varuna increased the phenotype by 0.5 g followed by Pusajaikisan (0.3 g) and TM-4 (0.2 g) in the respective populations. In the DE population, the parent Donskaja-IV increased the phenotype by 0.2 g (Table 4).

Tsw-C-A10-1 also merged QTL from all the four populations. The contribution of trait enhancing alleles to this C-QTL was observed to be from both female (TD and VH populations) and male parents (EPJ and DE populations). The allelic effect of Varuna allele was found to be maximum, as it increased the phenotype by 0.6 g while the other three parents, Pusajaikisan, EH-2 and TM-4, increased the trait phenotype by 0.2 g each in the respective mapping populations (Table 4).

The two C-QTL in LG B3, Tsw-C-B3-1 and Tsw-C-B3-2 were formed by merging QTL from three and two populations, respectively. In Tsw-C-B3-1, Varuna allele increased the phenotype by 0.6 g followed by Pusajaikisan (0.5 g) and EH-2 (0.1 g). In Tsw-C-B3-2, both Varuna and Pusajaikisan increased the phenotype by 0.4 g each in the respective populations (Table 4).

On the basis of the diversity in allelic effects of the six parents utilized in the development of the four bi-parental populations, a comparative allelic status has been deciphered for these parental lines. For the purpose, it has been hypothesized that Heera/EH-2 is the common parent, as EH-2 is the EMS-induced early mutant of Heera (Rout et al. 2015) and would therefore possess the same alleles since both these lines have the same phenotype for the trait (Table 1). The exercise revealed that Indian parents—Pusajaikisan or Varuna—possess the most positive allele in all the six C-QTL among the B. juncea lines studied (Table 4). In three C-QTL (Tsw-C-A3-1, Tsw-C-A3-2 and Tsw-C-B3-2), Puasjaikisan had the most positive alleles and in the other three C-QTL (Tsw-C-A7-1, Tsw-C-A10-1 and Tsw-C-B3-1) Varuna had the most positive alleles.

Epistatic and QTL × environment interactions

Genome-wide analysis for epistatic (additive × additive) and QTL × environment interactions for Tsw trait in the four bi-parental populations revealed no significant QTL × environment interactions in any of the bi-parental populations (data not shown). Analysis of digenic interactions (A × A) in these four populations revealed five interactions each in the EPJ and VH populations and one interaction each in the DE and TD populations (Table 5). It was also observed that in the EPJ and VH populations, where the populations were developed with high phenotypic contrasts between the parents, a greater number of Type 1 digenic interactions (four out of five interactions in EPJ and three out of five in VH) were detected compared to that in the populations with low phenotypic contrasts between the parents (DE and TD).

Table 5 Epistatic (additive × additive) effects for seed weight in four bi-parental populations (EPJ, DE, TD and VH)

In silico co-localization of candidate genes

An in silico search for candidate genes for seed weight trait from the heterologous systems identified a total of 77 genes from Arabidopsis and 11 from other agricultural crops (rice, maize and tomato) (Online Resource 3 and Online Resource 4). The orthologs of 11 crop genes in Arabidopsis were established using BLASTX tool of TAIR database (http://www.arabidopsis.org/Blast/index.jsp). The probable positions of these candidate genes were predicted in B. juncea on the basis of the syntenic blocks of C-QTL. B. rapa paralogs of these genes were identified by search for the ‘syntenic gene’ in BRAD (http://brassicadb.org/brad/searchSyntenytPCK.php).

In silico co-localization of candidate genes with C-QTL was carried out based on the arrangement of Brassica building blocks on the B. juncea consensus genetic map. There were ten candidate genes which showed in silico co-localization with four A subgenome C-QTL (Table 6). In the absence of B genome sequence of Brassica species, no candidate genes could be co-localized to the B subgenome C-QTL. However, the candidate gene GASA4 (At5g15230) mapped in the EPJ and VH populations due to a polymorphism in the gene and was found to co-map to the Tsw-C-B3-2 region on LG B3. Among the ten candidate genes co-localizing to four A subgenome C-QTL, four candidate genes (ATGRF3, CKX1, HAIKU1 and TTG2) could be located in Tsw-C-A3-1, two genes (ARR1 and SGT) in Tsw-C-A3-2, two (BZR1 and DA2) in Tsw-C-A7-1 and two (MYB56 and TT4) in Tsw-C-A10-1. While all the six candidate genes in Tsw-C-A3-1 and Tsw-C-A3-2 QTL correspond to the MF2 (most fractionated 2) region in the ‘J’ block, other candidate genes co-locating to the QTL regions of LG A7 (Tsw-C-A7-1) and A10 (Tsw-C-A10-1) correspond to the LF (least fractionated) regions in the ‘E’ and ‘R’ blocks of B. rapa, respectively. Comparisons of identified C-QTL in the present study with QTL data for seed weight in B. rapa (Li et al. 2013) revealed that C-QTL in LG A3 (Tsw-C-A3-1) (J block) and in A10 (Tsw-C-A10-1) (R block) could be common between the two species (Table 6).

Table 6 Candidate genes for seed weight showing in silico co-localization with C-Tsw QTL and the corresponding A subgenome seed weight QTL identified in B. rapa

Discussion

Seed weight is an important yield component trait in oilseed Brassica species and is quantitative in nature. Among the cultivated Brassicas, B. juncea holds the maximum genetic variation for the trait. While extensive breeding efforts in Indian B. juncea gene pool lines have resulted in the development of varieties with higher seed weight, the east European gene pool lines are small seeded due to lack of breeding efforts in this direction (Ramchiary et al. 2007). Although there are several studies on genetic analysis of seed weight in B. rapa (Li et al. 2013; Basnet et al. 2015), B. napus (Yang et al. 2012; Li et al. 2014; Fu et al. 2015; Geng et al. 2016) and in B. juncea (Ramchiary et al. 2007; Yadava et al. 2012), no systematic study on the trait harvesting maximum available genetic variation has been undertaken in any of these Brassica species. The present study, therefore, has been attempted at understanding the genetics of this important quantitatively inherited trait in B. juncea describing the maximum genetic variation in the species. As the B. juncea germplasm is highly structured (Srivastava et al. 2001) and might run the risks of association mapping, the genetic dissection of the trait was undertaken using four bi-parental populations involving six lines from both Indian and east European gene pools. The seeds of the six parents involved in the present study showed a wide range in weight varying from 2.0 to 7.6 g per 1000 seeds in different environments (Table 1).

We observed higher frequencies of transgressive segregants in the populations derived from parents with low contrast (DE and TD) than those derived from the parents with high contrasts (EPJ and VH) in seed weight (Fig. 1). It clearly indicates that the positive alleles for seed size are still dispersed among parental lines with low contrast and the transgressive segregants have inherited combinations of alleles acting in the same direction contributed by both the parents. This was substantiated by the QTL analysis of the DE and TD populations wherein both the parents contributed the positive alleles for increased seed weight (Table 2). This observation also indicates that breeding efforts are lacking for improving seed weight in these lines. On the contrary, the occurrence of very few transgressive segregants in populations derived from the parents with high phenotypic contrasts (EPJ and VH) could be due to accumulation of all positive alleles in the high parents. QTL analysis in these two populations confirmed this hypothesis, as no positive allele was identified from the phenotypically inferior parents in EPJ and VH populations (Table 2). It reaffirms the strong breeding efforts made in the Indian germplasm lines for higher seed size.

The present study identified 65 Tsw QTL from four bi-parental populations distributed over 13 of 18 LGs, of which 75 % were minor QTL (explaining <10 % of the phenotypic variance). It indicates that in B. juncea also a sizable portion of the genetic variation of seed weight is controlled by a large number of loci with minor effects as has been reported in B. napus (Li et al. 2014). The study also identified around 50 % of the population-specific QTL as environmental QTL (E-QTL). The occurrence of environment-specific QTL is also a characteristic of quantitative traits. It has been proposed that they may be exhibiting ‘conditional neutrality’ (they affect the trait in one environment but not in the other) or ‘pleiotropy’ (they affect Tsw in one environment, but affect another trait in a different environment because of pleiotropic nature of the underlying genes). The appearance of QTL which show conditional neutrality and antagonistic pleiotropic effects and their importance in adaptation to different environments have been shown in Boechera stricta (Anderson et al. 2013), Arabidopsis and other species (EI-Soda et al. 2014).

Quantitative trait variation is also influenced by epistatic and QTL × environment interactions, which contribute to the observable phenotypic variation (Mackay 2014). Our results did not indicate significant QTL × environment interactions. It could primarily be due to the fact that seed weight is a highly heritable trait in B. juncea and has a strong additive component. However, the observation of a higher number of epistatic interactions (additive × additive) that were primarily of Type 1 in populations derived from parents with high phenotypic contrasts (EPJ and VH) than the populations with lesser parental contrasts (DE and TD) (Table 5) is suggestive of the fact that the Type 1 interactions have been fixed during the selection and fixation of additive loci for higher seed weight in the Indian gene pool lines. The effects attributed to epistasis are deemed important while conceiving marker-assisted breeding programs for QTL introgression.

Meta-QTL analysis that merges overlapping QTL of a trait detected in multiple populations is a synthetic genetic model based on the hypothesis that different QTL that are common between the populations are controlled by the same gene(s) (Chardon et al. 2014). The analysis results in the identification of ‘consensus’ QTL which could have broader application in marker-assisted breeding as compared to a QTL detected in a single bi-parental population. Meta-analysis also precisely quantifies the allelic effects at the QTL participating in the formation of consensus QTL. The estimated allelic effects could then be further used to establish the diversity among the parental alleles contributing to a consensus QTL in a particular case where all the bi-parental populations have been derived using common parents.

The meta-QTL analysis undertaken in the study identified a total of six consensus QTL (Table 3) by merging 33 of the 65 component QTL. The estimated allelic effects on seed weight showed maximum effects from Indian gene pool lines, Pusajaikisan or Varuna, in all the six consensus QTL. However, in the DE population (derived from two east European lines), Donskaja-IV showed maximum allelic effects (Table 4). Further comparative analysis of allelic diversity using EH-2/Heera as the common parent revealed that either Pusajaikisan or Varuna hold the most positive allele among the six parents (Table 4). Hence, the allelic diversity identified in the present study from two gene pool lines reveals the scope for genetic improvement of seed size by allele replacement between and/or within the two gene pools of B. juncea. However, the decision on inter-gene pool introgression of QTL for a component trait related to yield should only be made based on the genetic analysis of other yield component traits, as the majority of these traits are significantly correlated due to pleiotropy and/or close linkage (Yadava et al. 2012; Yang et al. 2012; Li et al. 2014).

Due to allelic diversity, QTL identified in a population may not be useful in other populations without marker validation and/or fine mapping (Xu and Crouch 2008; Nicholas 2006). Candidate gene analysis in the QTL regions that are shared across populations would permit the development of polymorphic markers, speeding up marker-assisted selection. In the present study, the synteny of candidate genes with the Tsw QTL localized 11 genes mapping to the regions harboring C-Tsw QTL (Table 6). These candidate genes could be validated and the polymorphism detected in the candidate genes be used for the precise transfer of QTL.

The comparative QTL map alignment for seed weight between the A subgenome of B. juncea (present study) with B. rapa (Li et al. 2013) identified two conserved QTL blocks, one in the J block of LG A3 (Tsw-C-A3-1) and the other in the R block of LG A10 (Tsw-C-A10-1). It indicates that these QTL blocks are evolutionarily conserved in the A subgenome of B. juncea. The in silico co-mapping study also identified four candidate genes, ATGRF3, CKX1, HAIKU1 and TTG2, locating to the J block of QTL Tsw-C-A3-1 whose paralogs were detected in the J block in LG A3 of B. rapa (Table 6). The J block in LG A3 in B. rapa has been shown to be a most fractionated block (MF2) which is more ancient than the other two blocks LF and MF1 (LF: least fractionated and MF1: medium fractionated) (Wang et al. 2011). We, therefore, believe that one or more of these genes might be functionally conserved across the Brassica species and play an important role in determining seed size.

In conclusion, this study addresses a very important aspect of genetic improvement of B. juncea by identifying consensus seed weight QTL based on data from multiple segregating populations developed by utilizing the maximum genetic variation existing in the two gene pools of B. juncea. Meta-analysis simultaneously revealed six consensus QTL and uncovered the allelic diversity of the trait in B. juncea. The candidate genes which show in silico synteny to the genomic blocks containing consensus Tsw QTL in B. juncea would be physically mapped and functionally validated in future. In the context of heterosis breeding in B. juncea, the Indian parents having positive alleles for seed weight trait may find application in improving the east European gene pool lines.

Author contribution statement

The experiment was conceived and designed by ND, DP and AKP and was performed by ND, KR, SKY and YSS. The data were analyzed by ND, KR, SKY, VG and AKP, and the manuscript was written and edited by ND, SKY, DP and AKP.