Introduction

Pod shattering is an important way of seed dispersal. It is necessary for the reproductive competence of plants under natural conditions but, premature or uncontrolled pod shattering in cultivated crops is undesirable. It restricts the temporal window for crop harvesting and may even lead to crop losses. So, selection for pod indehiscence may have been among the targets of selection during crop domestication. Pod shattering is a complex and intricately controlled process. It involves specific morphological and anatomical features like pod wall valves, a centrally placed replum and valve margins. Pod wall is photosynthetically active fruiting body, which arguably represent a modified leaf [1]. The valve margins include cells which later differentiate to become dehiscence zone (DZ). At maturity, valves separate from replum along the DZ and cause pods to shatter. DZ in shattering resistant species is made up thick walled and lignified parenchyma cells in comparison to thin walled parenchymatous cells present in pod shattering prone species [2]. Brassicas crops differ for intensity of pod shattering. B. juncea, B. carinata and B. rapa are less prone to pod shattering as compared to easy to shatter B. napus. These variations are attributed to the differences in the pod length [3], size of vascular bundles [4], water content [5], cellulose and lignification in the pod walls [6,7,8]. Role other anatomical features is also considered important [9, 10]. A suite of regulatory genes, SHATTERPROOF1 (SHP1), SHATTERPROOF2 (SHP2), NAC secondary wall thickening promoting factors (NST1, NST2 and NST3), INDEHISCENT (IND) and ALCATRAZ (ALC) are known to regulate pod shattering [11,12,13,14,15,16,17,18,19]. These are known as valve margin identity genes. REPLUMLESS (RPL) and the FRUITFULL (FUL) control the expression of valve-margin identity genes [20]. This regulatory network also includes genes associated with leaf development, establishment of dorsoventral axes of the lateral organs (e.g., FILAMENTOUS FLOWER, YABBY3, ASYMMETRIC LEAVES1/2) and the meristematic potential maintenance (BREVIPEDICELLUS) [21, 22]. IND, PG (polygalacturonase) and FUL are important for regulating pod shattering resistance in B. oleracea, B. napus and B. juncea [15, 18, 23]. Multiple QTLs were found to regulate variation for pod shattering in B. napus [24] and B. carinata [25]. Resistance to pod shattering and the increase in the seed size have been the foci of human selection in B. juncea [26].

This article focuses on the variations and genetics of pod shatter resistance in Indian mustard (Brassica juncea L. Czern & Coss) and its diploid progenitor species B. rapa and B. nigra. Indian mustard is a major oilseed and condiment crop in different parts of the world. It is important for India, where the crop is cultivated on over 6.5 million hectares. Majority of the mustard varieties in India are resistant to pod shattering. Although the data for the actual crop losses because of seeds shattered at harvest are not available, yield losses have been reported for many of the newly developed varieties or hybrids in India. Increased propensity for premature shattering of pods in many of these may have resulted from the increased use of exotic germplasm as donors for the novel traits, especially for oil quality. Many exotic germplasm lines and most wild species have easy to shatter pods. Improved resistance to pod shattering is relevant as voluntary seed shattering during traditional hand harvesting leads to crop losses and volunteer pressure during later crop season. Natural variation for shatter resistance was never documented in any global germplasm collection of B. juncea or its progenitor species. Information regarding trait genetics is also sketch. In this communication, we report the phenotypic variations for pod shatter resistance in the global germplasm assemblages of B. juncea and its progenitor species. We also conducted genome wide and candidate gene-based association mapping to unravel genetic factors underlying the observed trait variation. SHP1 and NST2 may have important roles in regulating the variation for all pod traits in the studied crops. We also noted the footprints of selection in SHP1, SHP2 (B. rapa, B. nigra and B. juncea), RPL (B. rapa) and NAC (B. juncea).

Materials and methods

Plant materials

Large sets of germplasm lines of B. juncea (124) and its progenitor species B. rapa (90), B. nigra (21) were used for the present studies. These comprised collections from Australia, Canada, China, central Asia, east Europe, Pakistan and India, while B. rapa lines were sourced from Canada, Central Asia, Pakistan and India. Details about the germplasm collections are available elsewhere [27]. Experiments were conducted during 2013–2014 (Y1) and 2014–2015 (Y2) cropping seasons at Punjab Agricultural University, Ludhiana. Test genotypes of each species were raised as separate paired rows and replicated twice. Experiments were conducted under timely sown irrigated conditions. Standard agronomic practices were followed throughout the crop season. Germplasm lines under the study were assessed for three pod shattering associated traits. These included: pod strength measured as rupture energy (RE), pod length (PL) and seeds/pod (SPP).

Pod traits

Pod strength was tested with the help of a pendulum machine fitted with an optical encoder [28, 29]. The instrument provides an estimate of RE (mJ) by measuring the loss of movement of the pendulum upon striking and rupturing a pod. The pod is clamped in front of a measuring scale at the bottom dead centre of the pendulum swing. The data were recorded from 25 pods/genotype. The moisture content of all the detached pods was first equilibrated by storing pods at room temperature in plastic tubes containing self-indicating coarse silica gel granules. These were later oven dried at 70 °C for 24 h before measuring their rupture energy. The pod length (mm) was measured on 5 mature pods from 10 random plants and averaged. Seeds from 25 pods from each genotype were counted and averaged to get an assessment of seeds/pod.

Statistical analysis

Mean values of rupture energy, pod length and seeds/pod were calculated for each germplasm and used for further statistical analysis. Analysis of variance (ANOVA) was conducted to study variation due to genotypes, environments (years) and genotype × environment interactions using Minitab Statistical software. ANOVA was conducted independently for each of the three species, e.g. B. rapa, B. nigra and B. juncea.

Genomic DNA extraction and genotyping

Modified CTAB extraction procedure [30] was used for DNA extraction. The quantity and quality of DNA was assessed by agarose gel electrophoreses (0.8%) and nanodrop spectrophotometer. A random set comprising 102 germplasm lines of B. juncea were picked for SNP genotyping. B. juncea lines were genotyped by sequencing (GBS) [31], while sequencing based DArT was used to genotype B. rapa (78). We also identified six shattering related genes for the candidate gene-based association mapping of the entire germplasm collection (235), comprising B. nigra. Gene sequences from SHP1, SHP2, NAC, IND, FUL and RPL were used to develop 21 STS primers. For candidate genes SHP1 and SHP2, specific primers were identified using software PRIMER 3. For NAC and IND, we used primer sequences as reported by Raman et al. [24] were used. Sequences for FUL and RPL were retrieved from the Arabidopsis database and Blastn with B. rapa in the Brassica database (BRAD). Orthologus sequences were selected and used for designing primers. For polymorphism studies, test DNA (5 μl of 5 ng/μl) was added to 5.8 μl of master mix that contained 1.0 μl reaction buffer (10 ×), 2.0 μl of 2.0 mM dNTPs, 1.0 μl of 1 mM forward primer, 1.0 μl of 1 mM reverse primer and 0.8 μl Taq polymerase. In vitro amplifications were performed in 96 welled PCR plate in Eppendorf AG (Model 6325). Standard SSR protocol (1 cycle of 4 min at 94 °C; 35 cycles of 1 min at 94 °C, 30 s at TA, 30 s at 72 °C; 1 cycle of 7 m at 72 °C and a final hold at 4 °C) was followed for PCR analyses. 2.5% Agarose gel (2.5%) was used for electrophoretic separation of the PCR products.

Genome wide association studies

NGSEP-GBS pipeline [32] was used for SNP data calling, based on the reference genome of B. juncea v1.5. DArT analytical pipelines were used to handle sequencing data and identification of SNPs in B. rapa. Imputation of SNPs was performed with ‘Fcgene v1.0.7’ [33] and ‘Beagle’ [34]. Phenotypic data for 102 genotypes of B. juncea and 78 genotypes of B. rapa was used for GWAS. The data were first normalized by using Johnson transformation available in the software Minitab v16.0. MVP (A Memory-efficient Visualization-enhanced and Parallel-accelerated Tool) [35] and GAPIT [36] were used to measure significance of trait-SNP association, using various GWAS models. An ideal model was expected to show uniformity between the observed and expected p-values in the plot. We matched the observed and expected p values through quantile–quantile (QQ) plots to test the precision of applied GWAS models. PCA + K models were ultimately selected to run with the MLM algorithm as implemented in the MVP software. A threshold of p-value > 3.0 was used for initial detection of significant MTAs. Genomic regions around the identified SNPs were used for annotation. The predicted gene and its orthologous sequences were then annotated by BLAST analysis against the Arabidopsis thaliana database using Blast2GO v5.0 tool [37]. Functions of the predicted candidate genes were reviewed in the literature to establish their importance for the traits of interest.

Candidate gene-based association mapping

The full set of 235 germplasm lines {B. juncea (124), B. rapa (90), B. nigra (21)} was used for candidate gene-based association mapping. Population structure study was conducted as implemented in the software STRUCTURE [38]. Candidate gene-based association mapping was carried out with the software package TASSEL (Trait Analysis by Association, Evolution and Linkage) [39].

Population genetic analysis

These estimations were based on polymorphisms generated by the primers designed from sequence information of six pod shattering related candidate genes. Expected heterozygosity and deviations from the Hardy–Weinberg equilibrium were worked out with the help of software FSTAT 2.9.3.2 [40]. Other population genetic parameters were measured with the help of the software package ARLEQUIN [41,42,43].

Results

Analysis of variance (ANOVA) showed significant genotypic differences for three pod traits all three species. Variances due to environment were significant for RE and seeds/pod in B. rapa and B. juncea and pod length in B. juncea. Genotypes × environment interactions (G × E) were significant for RE in B. rapa and B. juncea; seeds/pod and pod length for the test species (Table 1). Environment effects were non-significant for rupture energy in B. nigra, but G × E interactions variances were significant for pod length and seeds/pod. Estimates for pod strength, pod length and seeds/pod are presented in Table 2.

Table 1 Analysis of variance (ANOVA) for different pod related traits in evaluated three Brassica species
Table 2 Mean values for rupture energy, pod length and seeds/pod in B. rapa, B. nigra and B. juncea

Pod shattering (rupture energy)

RE values in B. rapa ranged from 2.3 (Leega) to 9.0 mJ (Mitra) during Y1 and 2.7 (Leega) to 7.7 mJ (Texi) during Y2. Variation was high for rupture energy in the B. rapa collection from central Asia, it was low for brown sarson germplasm from India (Fig. S1). B. nigra appeared very prone to shattering with little genotypic variation. RE values ranged from 1.3 mJ (Assam) to 2.6 mJ (Gujrat) during Y1, average being 1.9 mJ. Y2 values ranged from 1.4 mJ (Assam, FRG1) to 2.4 mJ (R 4136) with an average of 1.9 mJ (Fig. S2). B. juncea genotypes showed the largest estimates for RE; these varied from 2.9 mJ (EC 564645) to 11.5 mJ (MCP 633), with an average of 6.5 mJ during Y1, and from 3.4 mJ (RM 51) to 10.7 mJ (MCP 633), with an average of 6.3 mJ, during Y2. Australian B. juncea germplasm lines were more sensitive to shattering and also had a narrow range of variation. Distribution of variation in Indian and east European genotypes was skewed towards high RE values (Fig. S3).

Pod length

Maximum variation for PL was recorded in B. rapa. These varied from 17.0 mm (PAK 85530) to 48.0 mm (Sanya) for Y1 and 24.0 mm (R 436) to 42.8 mm (PAK 85910) during Y2. PL average was 34.2 mm. Based on geographic groupings, maximum trait variation was recorded from Pakistan, but a narrow range of variation was observed for Indian brown sarson (Fig. S1). PL was much smaller in B. nigra as compared to B. rapa and B. juncea. These fell within a range of 9.0 mm (R 704) to 15.0 mm (CN 113784) during Y1 because these varied between 10.0 mm (R 4130) to 13.0 mm (R 4132) for Y2. The corresponding averages were 11.7 mm and 11.3 mm. Canadian germplasm lines showed a wider range for the pod length, while Indian B. nigra lines had a particularly narrow range (Fig. S2). Pod length ranged from 18.5 (AJ 1) mm to 44.0 mm (Rohini) in B. juncea with an average of 30.6 mm during Y1. This was compared to 21.0 mm (EC 564,649) to 47.8 mm (MCP 633) recorded for Y2, the average being 31.6 mm. At the population level, Australian genotypes varied the least (Fig. S3). Variation in the Indian germplasm lines was skewed towards longer pods.

Seeds/pod

Brassica rapa showed a wide range for the seeds/pod trait. During Y1, the seeds/pod ranged from 5.8 (R 436) to 28.5 (Qianxi Hei) with an average of 15.2 seeds/pod. The range for Y2 was narrower, 5.8 (R 436) to 23.6 (German 56), average being 13.9 seeds/pod. Maximum variation was recorded in the germplasm lines from Central Asia while a narrow range of variation range was recorded in Indian brown sarson (Fig. S1). B. nigra showed low variation, with values of 4.0 (UP) to 8.0 (R 4136) seeds/pod during Y1 and 4.0 (UP) to 7.5 (R 4134) for Y2 (Fig. S2). The number of seeds/pod in B. juncea ranged from 7.0 (AJ 2) to 20.1 (IC 597895) seeds/pod with an average of 13.8 seeds/pod, during Y1. For Y2, these values ranged between 8.6 (RE 8) to 18.5 (IC 597875) seeds/pod. Australian B. juncea had least variation (Fig. S3). Pod shatter energy was associated with pod length in both B. rapa (0.251) and B. juncea (0.489). It was negatively correlated with seed number (− 0.273) and pod length (− 0.176) in B. nigra.

Association mapping for pod shattering and related traits

GWAS was conducted to find marker trait associations (MTAs) to analyze genes linked to the rupture energy, pod length and seeds/pod (Table 3) (Figs. 1, 2). Twenty-three MTAs associated with RE were envisaged on chromosomes A02, A03, A05, A09 and B05 of B. juncea. Phenotypic variation explained ranged between 20.67 and 25.46%. Three SNPs A02_15096881, A02_15096938, A02_15097017 was present near NAC domain transcriptional regulator superfamily protein, AT1G76420-CUC3/NAC368 (CUP SHAPED COTYLEDON 3). We could also observe the AT3G61910-NST2 along the SNPs, A09_46982925 and A09_46982955. A03_13393564 was associated with AT2G45190-AFO (ABNORMAL FLORAL ORGANS)/FIL (FILAMENTOUS FLOWER). AT5G60910-FUL1/AGL8 (FRUITFULL 1/AGAMOUS-LIKE 8) was called close to SNP A03_27065868. Two copies each of ethylene responsive genes, ERF4 (ETHYLENE RESPONSIVE ELEMENT BINDING FACTOR 4) and RAP2.4 (ETHYLENE-RESPONSIVE TRANSCRIPTION FACTOR RAP2-4) were seen close to SNPs A05_12315809, A05_12315841 and B05_60868573, B05_60868600, B05_60868609, B05_60868738, B05_60868793, B05_60868821, B05_60868825, respectively. Another important gene AT5G66460-MAN7 (ENDO-BETA-MANNASE7) was along the A09_6549794. It encodes glycosyl hydrolase superfamily protein that serves in the development and dehiscence of Arabidopsis pods. We anticipated no gene for RE in B. rapa. We could predict the importance of AT2G46770-NST1 in describing variation for pod length in B.juncea. The gene was envisaged near seven closely placed SNPs in the genomic region (11,913,579–11,913,680) on the chromosome B06. We also envisaged two important genes, AT3G58780-SHP1 (SHATTERPROOF1) and AT3G61910-NST2 close to the SNP A09_45505210 in B. rapa. Both these genes were also showed for seed per pod, where these genes were predicted close to A09_45507062. For seeds/pod in B.juncea,AT2G20610-SUR1 (SUPERROOT1) was envisaged near a cluster of nine SNPs in the genomic region 4,867,097–4,867,166 on the chromosome B02. It controls auxin biosynthesis. Candidate gene based association mapping did not lead to identification of any significant marker-trait associations in B. rapa and B. nigra for rupture energy (Table 4). However, marker NAC_3 explained 20–26% of variation for pod length in B. rapa. NAC_1, SHP 1-2_1 and SHP 1-3_1 in B. rapa and SHP 1-7_1 in B. nigra showed association with the seeds/pod. In B. juncea, SHP 1-3_1 and SHP 2-2_1 were associated with rupture energy, whereas markers SHP 1-3_1 and SHP 2-9_1 were associated with pod length. SHP 1-9_1 and NAC_1 were associated with the seeds/pod.

Table 3 GWAS for traits associated with pod shatter resistance
Fig. 1
figure 1

Manhattan plots depicting the MTAs for rupture energy in B. juncea

Fig. 2
figure 2

Manhattan plots depicting the MTAs for seeds per pod in B. rapa

Table 4 Marker-trait association in B. rapa, B. nigra and B. juncea

Genetic diversity, population structure and linkage disequilibrium

Allelic variation for six pod shattering associated candidate genes was investigated. Observed heterozygosity (HO) was low in B. rapa (Table 5). In contrast, observed heterozygosity was higher than the expected heterozygosity (HE) in B. nigra. Number of alleles per locus (NA) ranged from 1.3 to 2.1 in B. rapa, maximum being in rapa PB. Lowest value was recorded for rapa IB. The mean effective estimate of alleles per locus (NE) ranged from 1.2 to 1.6, rapa CA had the maximum value. Canadian genotypes of B. nigra recorded maximum values for NA and NE. Differences in mean values of NA, NE, HO, HE and µHE (unbiased expected heterozygosity) were not significant (P ≤ 0.05) among studied populations. Structure analysis helped to group B. rapa germplasm into three subpopulations at ΔK = 3 (Fig. 3). Significant admixing was showed; exception was the group (portrayed in red colour) comprising germplasm from central Asia. Indian brown sarson, Indian toria and Canadian rapa were present as admixtures. Group with blue colour was made up primarily of brown sarson germplasm from Pakistan and few brown sarson and toria lines of Indian origin. Group with yellow colour had genotypes from Canada, Central Asia and few toria genotypes from Pakistan and India. The distribution for all the three traits was normal in structure defined groups (Fig. S4). The genotypes included in the blue group varied in a narrow range for rupture energy and pod length as compared to other groups. Population structure was absent in B. nigra with complete admixing (Fig. 4). B. juncea accessions formed five subpopulations at ΔK = 5 (Fig. 5). East European lines predominated the first group (brown colour). Blue group included eleven lines from east Europe, seven lines from India and two lines each from Canada and Australia. Group III (green colour) included 10, 13 and 1 accession(s) from east Europe, India and Canada. Group with red colour included three Canadian, four Australian, eleven east European and five Indian germplasm lines. Indian accessions (27 out of 51) predominated the yellow group. Despite few exceptions, the distribution spread for rupture energy in structure defined groups showed a normal and overlapping distribution (Fig. S4). For pod length, four out of five structure defined populations had a normal distribution. Bulk of Indian genotypes included in a yellow group showed a distribution pattern that was skewed towards higher values (Fig. S4). Only green group had distribution that was skewed towards a higher number of seeds/pod as compared to other groups, where the variation pattern was largely normal (Fig. S4). The average LD varied between species and subpopulations within the species (Fig. 6). LD blocks differed across B. rapa subpopulations. Two significant LD blocks were recorded in Indian brown sarson, spanning SHP1 and FUL. Weak LD blocks, involving SHP1 and NAC, were observed in Canadian rapa and Indian toria. LD block was also seen for SHP1 in Pakistani brown sarson. NAC figured in a small LD block (Fig. S5). LD blocks also differed in B. nigra. Canadian B. nigra had a weak LD blocks involving SHP2, whereas LD block in Indian types spanned both SHP1 and SHP2 (Fig. S6). Weak LD blocks were noticed for SHP1 in Australian and Chinese B. juncea. Indian B. juncea germplasm showed a big LD block for SHP1 and a smaller one for SHP2. East European B. juncea also showed significant LD blocks for SHP1 (Fig. S7).

Table 5 Genetic diversity estimates (mean + SE) at the population levels based on candidate gene loci
Fig. 3
figure 3

Population structure showing three populations of B. rapa based on software STRUCTURE

Fig. 4
figure 4

Population structure of B. nigra based on software STRUCTURE

Fig. 5
figure 5

Population structure showing five subpopulations of B. juncea based on software STRUCTURE

Fig. 6
figure 6

Decay of linkage disequilibrium (LD) in B. rapa, B. nigra and B. juncea

Detection of loci under selection

We compared values for fixation indices (FST) with observed heterozygosity to show loci under selection (Table 6, Fig. S8). Loci with low FST values were under selection for less than the expected heterozygosity. Different loci under selection were identified in three Brassica species. These were: SHP1, SHP2, RPL in B. rapa; SHP1, SHP2 in B. nigra and SHP1, SHP2 and NAC in B. juncea.

Table 6 Analysis of molecular variance

Discussion

QTL mapping and GWAS have been widely used to identify genes controlling pod dehiscence in many crop plants [1, 24, 44]. However, little is known about the variation and genetics of this trait in B. juncea. We used multiple approaches to investigate variation, population structure and effect of selection on genetic diversity for pod traits in B. juncea, relative to its progenitor species. We recorded a wide range and near normal distribution of variation for RE, pod length and seeds/pod in B. juncea and B. rapa. B. nigra varied the least. Distribution of variations was almost similar in B. juncea and B. rapa germplasms. This may indicate that these two species have adapted to similar selection pressures for pod traits. The estimates of observed heterozygosity were lower than expected heterozygosity in B. juncea and B. rapa. Reverse was true for B. nigra, where observed heterozygosity was higher than expected heterozygosity. This may be a consequence of admixing of isolated populations or lack of selection pressure for the trait in this species. Prevention or reduction of natural seed dispersal was possibly among the first traits targeted for selection by ancient domesticators. This coupled with strong selection pressure exerted by modern plant breeders may have caused partial or quantitative loss of seed dispersal and shifts in gene frequencies. We could identify many germplasm lines of B. rapa and B. juncea with very hard to shatter pods. Low shattering B. rapa germplasm may be useful as trait donor for improving pod shatter resistance in B. napus. Indian and east European accessions of B. juncea were less prone to pod shattering as compared to Australian genotypes. Pod rupture energy was correlated with pod length in B. rapa and B. juncea, also reported earlier in B. napus [45].

Candidate genes associated with pod traits

GWAS led to the identification of 23 MTAs for pod shatter resistance in B. juncea, with high phenotypic variation explained. MTAs were identified on different chromosomes of both A (A02, A03, A05, A09) and B (B02, B05 and B06) genomes. Annotations of the nearby genomic regions allowed us to identify important pod development and related genes. NAC domain transcriptional regulator superfamily protein, AT1G76420-CUC3/NAC3 was identified on chromosome A02. Plant-specific transcription factors of NAC family includes, secondary wall thickening promoting factors 1, 2 and 3 (NST1, NST2 and NST 3) [46]. These regulate development of the pods in Arabidopsis. AT3G61910-NST2 was recognized on chromosome A09 and NST1 was annotated close to the SNPs identified as significant for pod length on chromosome B06. NST1 regulates the development of pods in Arabidopsis. AT2G45190-AFO/FIL, plant-specific transcription factor of YABBY family protein, was predicted on A03. Two YABBY family transcription factors FILAMENTOUS FLOWER (FIL) and YABBY3 (YAB3) are critical for the valve margin development and they act in tandem to promote expression of FUL in the valves, and SHP1,2 in the valve margins [47]. FIL also controls the spatial activity of APETALA2-like transcription factor which impacts the ease of threshing [48]. We envisaged AT5G60910-FUL1/AGL8 on chromosome A03. This gene encodes a MADS-box transcription factor [49] that mediates valve development by inhibiting expression of the valve margin identity genes (SHP1 and SHP2) [19, 20]. FUL is required for pod morphogenesis, and its expression is integral to SHP genes [49, 50]. Studies have shown that the ectopic expression of the FUL may lead to shatter proof fruit in B. juncea [13]. Ethylene responsive genes, ERF4 and RAP2.4 were envisaged on chromosomes A05 and B05. Both these involve one APETALA2 domain which mediates light and ethylene signaling [51]. Up-regulation of ABA signaling and down-regulation of ethylene and jasmonate signaling also reduced pod shattering [6, 52]. APETALA2 has been reported to specify abscission zone (AZ) development and seed shattering in rice [53]. AT5G66460-MAN7 was envisaged on the chromosome A09. It is a glycosyl hydrolase superfamily protein that functions in the development and dehiscence of Arabidopsis pods. Its expression in the pod is depended on the IND and ALC transcription factors. MAN7 is also perceived to promote cell degeneration in the separation layer in mature pods [54]. We also envisaged two important genes, AT3G58780-SHP1 and AT3G61910-NST2 on chromosome A09 of B. rapa. NST2 regulates the pod formation. AT3G58780-SHP1 along with SHP2 controls DZ differentiation and also promotes the lignification of adjacent cells [14]. These are fundamental for the coordination of cell divisions in ovule, seed coat development and endosperm formation in Arabidopsis [55]. AT2G20610-SUR1, controlling the auxin biosynthesis, was predicted close to a cluster of SNPs on B02. Low auxin level is essential for pod dehiscence as it may trigger cell wall degrading enzymes [17]. Our candidate gene-based association studies validated some of the gene predictions from GWAS. Both SHP1 and SHP2 were associated with rupture energy in B. juncea. SHP1 (all three species) and NAC1 (B. rapa and B. juncea) were associated with seeds/pod. We also predicted more than one copy of shattering related gene(s), either on the same chromosome or on different chromosomes. Multiple copies of shatter related genes are expected as Brassicas are ancient polyploids. Further, an expansion in gene families was also possible because of the direct or indirect selection pressure.

Linkage disequilibrium and footprints of selection

Analysis of allelic diversity in the candidate genes emphasized existence of population structure in B. rapa and B. juncea. Gene exchange/admixing, observed in B. rapa, was expected in a cross-pollinated crop. We also identified LD blocks in B. rapa. However, these differed across subpopulations of the species. These spanned SHP1 and FUL in brown sarson, while Canadian rapa and Indian toria showed weak LD blocks involving SHP1 and NAC. In contrast, B. nigra showed weak LD blocks involving SHP2 or both SHP1&2. Indian and east European B. juncea formed large LD blocks for SHP1. In contrast, Australian and Chinese B. juncea revealed weak LD blocks for the gene. Australian and Chinese genotypes were more susceptible to pod shattering as compared to Indian and east European germplasm. It is likely that diverse sets of genes were selected in different selection events and ecogeographic groups. LD is also influenced by extent of genetic diversity present in a germplasm resource. High genetic variation is associated with a rapid LD decay; a direct consequence of the wide-ranging historical recombination’s. Random genetic drift in finite populations can also generate regions of high LD [56]. Intensive selection reduces natural variation and heterozygosity inherent in the species. SHP1, SHP2 (B. rapa, B. nigra and B. juncea), RPL (B. rapa) and NAC (B. juncea) carried footprints of selection. These genes were also predicted from our GWAS studies for their role in regulating rupture energy, pod length and seeds/pod.

Conclusions

SHP1 and NAC appears useful for developing breeder friendly KASP (Kompetitive Allele-Specific PCR) assays for marker aided improvement of pod traits in mustard. Mustard genotypes MCP 633, NR 3350 and SKM518 can be used as trait donors for pod length, seed number and resistance to pod shattering.