Introduction

Rapeseed (Brassica napus L.) is an important oil crop worldwide, which provide both edible oil for human consumption and industrial materials such as livestock meal, lubricants, and biodiesel (Rondanini et al. 2012). Utilization of heterosis has become a major strategy to increase crop productivity, and hybrid rapeseed accounts for about 70% of the total planted area in China (Fu 2019). So far, genic male sterility (GMS), cytoplasmic male sterility (CMS), ecological male sterility (EMS), self-incompatibility (SI), and chemical hybridization agent (CHA) are the main pollination control systems of heterosis utilization in rapeseed. Of them, EMS, or referred as photoperiod and/or temperature (thermo-) sensitive genic male sterility (P/TGMS), is regarded as an efficient system that can produce two-line hybrids. This system has the following advantages: the firstly, almost all conventional inbred lines can restore the fertility and can serve as male parents; secondly, no negative effects associated with sterility-inducing cytoplasm have been observed, and the thirdly, the genes of this system can be easily transferred to other genetic backgrounds (Yu et al. 2015). P/TGMS germplasms have been explored and characterized in several crops, including wheat (Tang et al. 2011, 2012; Ru et al. 2015; Yuan et al. 2018; Wang et al. 2019; Yang et al. 2020), maize (Tang et al. 2006), cotton (Zhang et al. 2020b), and soybean (Frasch et al. 2011). In rice, the genes underlying the P/TGMS trait have been successfully cloned and characterized (Ding et al. 2012; Zhou et al. 2014; Fan et al. 2016; Sun et al. 2022; Wu et al. 2022). More recently, using the Arabidopsis system, slowing of development is proposed as a general mechanism applicable to the sterility-fertility conversion of P/TGMS lines (Zhang et al. 2020a; Zhu et al. 2020). Rapeseed P/TGMS germplasms has been exploited and applied to hybrid breeding. Until now, several two-line hybrid varieties of rapeseed, including Xiangzayou No. 5, Xiangzayou No. 7, Liangyou 586, Ganliangyou No. 2, Ganliangyou No. 3 and Ganliangyou No. 5 (Liu et al. 2000, 2007, 2013; Gao et al. 2010), have been successfully developed and approved in China, indicating a promising future of two-line hybrid in rapeseed hybrid breeding.

Male sterility exists widely in plant kingdom because pollen development can be hindered at various developmental stages, which starts at the anther cell division and differentiation, and ends at the dehiscence of anthers when the mature pollen grains are released (Sanders et al. 1999; Quilichini et al. 2014). Since the first male sterile gene MS2 was mapped-based cloned in plants (Aarts et al. 1993), several dozens of genes have been identified to play vital roles in anther and pollen development in Arabidopsis (Ma 2005; Ariizumi and Toriyama 2011; Zhu et al. 2011; Jiang et al. 2013; Liu and Fan 2013; Shi et al. 2015; Ma et al. 2021). These genes could be classified into two types, regulatory genes and structural genes. The first type included various protein kinases and transcription factors (TFs), such as SPL/NZZ, EMS1, TPD1, DYT1, TDF1 , AMS / TDR , AtMYB103/MS188/MYB80, and MS1. The second type included genes involved in different pathways related to tapetum and pollen development, and the formation of primexine. The genes and their functions in anther and pollen development are summarized in Table 1. These TFs and their regulated downstream genes form genetic regulatory networks controlling anther and pollen development (Jiang et al. 2013; Liu and Fan 2013; Shi et al. 2015; Lu et al. 2020; Ma et al. 2021).

Table 1 The identified genes which play vital roles in anther and pollen development in plants

RNA-sequencing (RNA-seq) is a powerful and cost efficient high-throughput technology for transcriptomic profiling (Wang et al. 2009; McGettigan 2013; Wan and Li 2019). This technology has been used to identify differential expressed genes and important pathways related to male sterility in different kinds of male sterile lines in rapeseed, which include recessive GMS (Qu et al. 2015; Jiang et al. 2022), dominant GMS (Jiang et al. 2022), dominant TGMS (Yan et al. 2016), the recessive TGMS (Liu et al. 2017), pol TCMS (Xiao et al. 2021), and Nsa CMS (Xing et al. 2022) in B. napus. The findings of these studies have laid foundation for understanding of the complex molecular mechanism of male sterility in Brassica crops.

Previously, we have developed a TGMS line named 373S from a spontaneous male sterile mutant in B. napus (Yu et al. 2007). Recently, studies of different temperature and photoperiod treatments in natural conditions and controlled environments have shown that the fertility alteration of 373S are mainly attributed to temperature changes, and the critical temperature range leading to fertility alteration is from 10°C (15°C/5°C) to 12°C (17°C /7°C), and the temperature-responding stage coincides with anther development from pollen mother cell (PMC) formation to meiosis stages (Sun et al. 2020). Genetic analysis indicated that the TGMS trait in 373S was controlled by one pair of genes, with male sterility as the recessive (Sun et al. 2020). However, the genome-wide transcriptional analysis of TGMS-related genes and pathways of the TGMS line 373S need to be investigated. In the present study, we compared anther ultrastructure between the line 373S and the fertile control line Zhongshuang No. 9 (ZS9) and found various abnormalities in meiotic behavior and organelle degradation. To gain a comprehensive understanding of molecular mechanism of its male sterility, we further performed RNA-Seq to explore DEGs in fertile and sterile flower buds and anthers between fertile and sterile parents, and between fertile and sterile offsprings in BC1 population derived from the TGMS line 373S. Based on bioinformatics analysis, we finally identified candidate genes and TFs involved in pollen development, and further analyzed by quantitative real-time polymerase chain reaction (qRT-PCR). Our results will contribute to an understanding of TGMS molecular mechanisms and provide useful information for heterosis breeding in B. napus.

Materials and methods

Plant materials

The plant materials used in this study included 5A214 (the male sterile parent, 373S), 5C95 (the fertile parent), 5A219 (the BC1, 5A214// 5A214/5C95), and ZS9. We previously revealed that the TGMS trait in 373S was controlled by one recessive allele Bnmst1 (Sun et al. 2020). So, in the BC1 population of 5A219, the fertile and sterile individuals only differ in the locus BnMst1 /Bnmst1. All the materials were provided by Rapeseed Research Center, College of Agronomy, Northwest A&F University, China. These plants were cultivated in crop season from September 2015 to June 2016 in the experimental field of Northwest A&F University in Yangling (34°16′N, 108°4′E, altitude 530 m), Shaanxi, China. Cultural practices including soil preparation, fertilizer, and irrigations were applied equally to all the entries/experiments.

Cytological study

In April 2016, when the male fertility of the first opened flowers of 373S plants was visually detectable for male sterility, flower buds of 373S and ZS9 plants were collected and divided into groups according to their size on the same day (Sun et al. 2020). Acetocarmine staining was performed to examine the correlation of the microspore developmental stages with the bud lengths. Subsequently, DAPI (4′,6-diamidino-2-phenylindole) and aniline blue staining were performed. For transmission electron microscopy, anthers at different microspore developmental stages were fixed and embedded in LR-White resin. The ultrathin sections were obtained using a diamond knife on the Leica EM UC7 ultramicrotome (Leica, Nussloch, Germany), and picked up on the formvar-coated grids and floated on 3% aqueous solution of UA for 10–15 min, and then re-floated by 4% Pb solution for 8–10 min. All sections were examined under JEM-1230 transmission electron microscope (JEOL, Tokyo, Japan) at 80 kV, and all the images were recorded with side-inserted BioScan Camera Model 792 (Gatan, Pleasanton, Calif., USA).

RNA extraction and Illumina sequencing

For transcriptome sequencing, young flower buds (length ≤ 1mm) and anthers (isolated from > 1mm buds) of 5A214, 5C95, sterile, and fertile plants from 5A219 were pooled separately, and immediately frozen in liquid nitrogen, and stored at −80°C for RNA extraction. As a result, 16 samples (eight specimens with two biological replications, each replication prepared from five plants) were used for RNA-seq analysis (Table S1). For example, 5A219F-A1, means the first replication of anthers from fertile plants of 5A219.

Total RNA was extracted from the 16 samples with plant RNA extraction kit (E.Z.N.A. ®Plant RNA Kit, R6827-01, OMEGA), according to the manufacturer’s instructions. The quality and purity of RNA samples were monitored on 1% agarose gels, and the RNA integrity number (RIN) of samples was assessed using Agilent 2100 Bioanalyzer. Following purification, the mRNA samples were fragmented into small pieces using divalent cations under an elevated temperature in Reaction Buffer (5X) and the cleaved RNA fragments were used for first-strand cDNA synthesis using reverse transcriptase and random primers. Second strand cDNA synthesis was then performed using DNA Polymerase I and RNase H. Short fragments were purified and resolved with Ethidium bromide buffer for end reparation and single nucleotide A (adenine) addition. After that, the short fragments were connected with adapters and the suitable fragments were selected for the PCR amplification. Agilent 2100 Bioanaylzer and ABI StepOnePlus Real-Time PCR System were used in quantification and qualification of the cDNA libraries. Finally, the prepared libraries were sequenced using a paired-end read protocol with 100 bp of data collected per run on the Illumina Hiseq 4000.

Sequencing data analysis and identification of DEGs

Clean reads were obtained from raw data by filtering the adaptor sequences, the low-quality sequences and unknown nucleotides N, and stored in FASTQ format (Cock et al. 2010). Then, the clean reads were mapped to the reference genome sequence of B. napus (http://www.brassica.info/resource/genome.php) (Chalhoub et al. 2014) using Bowtie2 v2.2.6 (http://bowtie-bio.sourceforge.net/bowtie2/index.shtml), with only a 1-bp mismatch allowed. After alignment, the count of mapped reads from each sample was derived and normalized to FPKM (Fragments Per Kilobase of transcript per million mapped reads). Based on the expression level, statistical significance in gene expression between sterile and fertile samples was assessed using the NOIseq program (Tarazona et al. 2011). The DEGs were identified using log2 (Fold change ratio of sterile/fertile) ≥ 1 and probability ≥ 0.8.

Bioinformatics analysis of DEGs

Functional annotation of DEGs was performed using BLASTX searches against the NR and Swiss-Prot databases (E-value < 1e−5). Venn diagram, Hierarchical Clustering, Gene ontology (GO), and Kyoto encyclopedia of genes and genomes (KEGG) pathway analysis were performed using the OmicShare tools, a free online platform for data analysis (www.omicshare.com/tools), to identify the putative biological functions and biochemical pathways for DEGs and find statistically overrepresented GO terms or KEGG pathways. BiNGO plugin (Maere et al. 2005) of Cytoscape software (http://www.cytoscape.org/) was used to draw networks of downregulated and upregulated genes (using Arabidopsis data). The known and predicted interactions among the DEGs were proved by STRING (https://string-db.org/) based on Arabidopsis data. Analysis of Cis-regulatory element of the 430 DEGs was performed using the online software Plant CARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/). Differentially expressed TFs were identified by online software Plant TFDB 4.0 (http://planttfdb.cbi.pku.edu.cn/) from the 430 DEGs.

qRT-PCR verification

To validate the Illumina sequencing results and characterize genes that are differentially expressed between the sterile and fertile samples, small buds (length ≤ 1mm), and anthers (from buds length > 1mm), the 19 genes identified to be related to the pollen development were selected for analysis using qRT-PCR. RNAs were extracted as mentioned above, and cDNAs were synthesized according to the manufacturer’s protocol of GoScript TM Reverse Transcription System (A5001, Promega). A housekeeping gene Ubiquitin-conjugating enzyme 21 (UBC21) was used as the reference gene, and all primer sequences used in this study are shown in Table S2. The qRT-PCR assays were performed in triplicate using GoTaq®qPCR master mix (A6001, Promega) on a QuantStudio™ 7 Flex Real-Time PCR System (Applied Biosystems). Statistical analysis was performed using the 2-ΔΔCT method (Livak and Schmittgen 2001). All data are expressed as mean ± standard deviation.

Verification of DEGs in 373S under restrictive condition (high temperature) and permissive condition (low temperature)

To verify the 430 DEGs identified above, the following experiment was further conducted. The seeds of 373S line were sown in the experimental field of Northwest A&F University in Yangling, Shaanxi, China in middle of September 2021. The seedlings were transplanted into pots to greenhouse after vernalization in the middle of January 2022. These plants were cultivated in a greenhouse for 2 weeks at 28 °C/14 °C, 14-h/10-h day/night cycle, light intensity: 4000–6000 Lux. According to our previous experimental results (Sun et al. 2020), the plants were then transferred to growth cabinets for two different temperature regime treatments (13 °C/3 °C (mean = 8 °C), permissive condition, and 17 °C/7 °C (12 °C), restrictive condition) under 14-h day/10-h night, light intensity: 4000 Lux, for 10 days and then returned to the greenhouse. The two treatments had three biological replications with three pots (two seedlings per pot) each. When the male fertility of the first opened flowers of the treated 373S plants was visually detectable for male fertility under permissive condition, or male sterility under restrictive condition, young flower buds (length ≤ 3 mm) of 373S plants were collected to perform RNA-seq analysis to verify the DEGs identified above. Totally, six samples (two treatments × three biological replications) were used for RNA-seq analysis following the same procedure as described above. Each replication was prepared from five plants.

Results

Phenotypic performance and cytological analysis of the TGMS line 373S

During the flowering period, the flowers of control line ZS9 plants had long filaments, full yellow anthers with sufficient pollen grains, whereas the flowers of 373S plants opened wide and flat, the filaments were shortened, and the stamens withered without pollens production under higher temperature (Sun et al. 2020).

DAPI staining results indicated that, at PMC stage, the PMCs both in the control plants (Fig. 1A) and 373S plants (Fig. 1F) exhibited their chromosomes clearly. At tetrad stage, four bright DAPI signals were detected in control tetrads (Fig. 1B). By contrast, the tetrads of 373S plants showed very weak DAPI signals (Fig. 1G), indicating that their nuclei degenerated at this stage. At mature pollen stage, DAPI signals disappeared in 373S pollens (Fig. 1H), which indicated that their nuclei degraded completely compared to the control (Fig. 1C). In order to reveal the callose deposition pattern in 373S plants, we examined intact PMCs and tetrads isolated from anthers in PMC and tetrad stages. At PMC stage, a large percentage of PMCs were enclosed within thick callose wall in fertile anthers of control plants (Fig. 1D); however, PMCs of 373S plants were deformed or burst (Fig. 1I), and their callose walls were significantly thinner than their fertile counterparts (Table S3). In tetrad stage, tetrads from anthers of 373S plants were smaller than those in fertile counterparts, and callose walls of tetrads of 373S plants were also thinner than those in fertile counterparts (Figs. 1E, J; Table S3).

Fig. 1
figure 1

Abnormal chromosome and callose layer in 373S male sterile plants. A, F 4′,6-diamidino-2-phenylindole (DAPI) staining of pollen mother cells from male fertile and male sterile anthers; B, G DAPI staining of tetrads from male fertile and male sterile anthers; C, H DAPI staining of pollen grains from male fertile and male sterile anthers; D, I aniline blue staining of pollen mother cells from male fertile and male sterile anthers; E, J aniline blue staining of tetrads from male fertile and male sterile anthers

To gain insights into the cellular events during pollen development in 373S line, the anthers of 373S and fertile control plants at different developmental stages were examined by transmission electron microscopy. In the PMC stage, nuclear membranes, mitochondria, and vacuoles could be clearly observed in PMCs of fertile plants (Fig. 2A); however, these organelles could not be observed in PMCs of 373S plants (Fig. 2F). In the tetrad stage, PMCs of fertile control plants underwent meiosis to form tetrads surrounded by the characteristic callose wall, and numerous organelles were observed, including mitochondria and plastids in the tetrad cytoplasm (Fig. 2B). However, plasmolysis occurred in tetrads of 373S plants, and only a few tetrad cells had intact nuclei (Fig. 2G). In the early uninucleate stage, after the callose wall was dissolved, the microspores were released from the tetrads and covered with the exine wall in fertile control plants (Fig. 2C), whereas the microspores of 373S plants changed their shape, their pollen exine (tectum and baculum) did not form normally, and their cytoplasm was clearly concentrated (Fig. 2H). Thereafter, the microspores of 373S plants completely lost their cytoplasm and nuclei, leaving only irregular exine (Fig. 2I, J) compared with those of fertile control plants (Fig. 2D, E). The tryphine was not fully deposited in the pollen wall of sterile pollens of 373S plants during the middle uninucleate (Fig. 2I, V) and bicellular pollen stages (Fig. 2J, X) compared with fertile controls (Fig. 2D, U, E, W). In addition, the pollen intine of 373S plants could not be observed (Fig. 2V, X) compared with controls (Fig. 2U, W).

Fig. 2
figure 2

Transmission electron microscope micrographs of the anthers from control and 373S male sterile plants. AE, U, W pollens from control plants; FJ, V, X pollens from sterile plants; KO tapetal cells from control plants; PT tapetal cells from sterile plants. A, F, K, P pollen mother cell stage; B, G, L, Q tetrad stage; C, H, M, R early microspore stage; D, I, N, S, U, V vacuolate microspore stage; E, J, O, T, W, and X bicellular pollen stage; UX magnifications of the section indicated by the rectangle in D, I, E, and J, respectively. Ca, callose; Nu, nucleus; P, plastid; M, mitochondrion; G, Golgi body; Msp, microspore; Ex, exine; V, vacuole; L, lipid droplet; Epl, elaioplast; Ts, tapetosome; ER, endoplasmic reticulum; Ba, baculum; Te, tectum; In, intine; Ne, nexine; Try, tryphine

The observation of tapetum development indicated that there was no obvious difference between tapetal cells of 373S plants (Fig. 2P, Q) and the fertile control plants (Fig. 2K, L) before tetrad stage. In the tapetal cells, numerous organelles, including endoplasmic reticulum (ER), plastids, Golgi bodies, and vacuoles, could be observed. In the early uninucleate stage, abundant elaioplasts and tapetosomes were observed in fertile control plants (Fig. 2M). However, the tapetum of anthers from 373S plants showed abnormal developmental process, with only a large number of vacuoles formed (Fig. 2R). In the middle uninucleate stage, when the tapetal cells in the control plants just started to degrade (Fig. 2N), the tapetal cells in 373S plants were greatly degraded (Fig. 2S). In bicellular pollen stage, the tapetal cells of 373S plants were completely degraded (Fig. 2T) compared with those of the control plants (Fig. 2O). The results indicated that the tapetal cells in control underwent the normal programmed cell death (PCD) process, while the tapetal cells in 373S plants did not. Finally, the fertile control plants produced mature pollens, while 373S plants did not.

The transcriptome of the buds and anthers from fertile and sterile parents and their offspring

We sampled small buds (length ≤ 1mm) and anthers (dissected from > 1mm buds) from two parents (male sterile parent 5A214 and fertile parent 5C95) as well as male sterile and fertile plants from 5A219 (BC1 population, 5A214//5A214/5C95) for RNA-seq analysis (Table S1). Except for one sample (5C95-B2, Table S1) failed to be sequenced, all other samples were sequenced and contained 6.56-Gb raw reads. After filtering out reads containing unknown base N, low-quality data, and only adaptor reads, a total of 306,684,766 and 305,848,672 clean reads were obtained from fertile (the fertile parent and fertile plants of BC1 population) and sterile (the sterile parent and sterile plants of BC1 population) samples, respectively (Table S4). We mapped these clean reads to the B. napus reference sequences (http://www.brassica.info/resource/genome.php) and obtained average 70.06% and 69.70% reads matched (including perfect match and 1-bp or 2-bp mismatch) for fertile and sterile samples, respectively. Out of the reads mapped to the reference sequences, average 65.49% and 65.44% reads were uniquely matched. By sequence alignment, we found that a total of 427,988 and 427,713 genes were hit by the unique reads from fertile and sterile samples, respectively, which accounted for >94% of the known gene models. An overview of the Illumina sequencing results and the distribution of distinct clean reads is shown in Table S4.

The FPKM value was calculated to test the expression levels of the unigenes. The heatmap of sample correlations, which based on FPKM of all genes in the 15 samples tested (Additional file 1), showed that the two biological replicates of the same sample have higher correlation coefficience, which indicating good reproducibility between (Fig. S1A). Principal component analysis (PCA) showed that, the first four principal components (PC) cumulatively explained 88.99% of the total variance (Fig. S1C), with PC1 accounting for 34.411% and PC2 accounting for 30.077% of total variance, respectively. PCA analysis classified the tested samples into two groups (anther and bud samples) corresponding to their developmental stages, suggesting similar expression patterns in similar developmental stages (Fig. S1B).

Identification and validation of DEGs between fertile and sterile samples

We employed a two-step comparison strategy to identify the genes associated with male sterility in the TGMS line 373S. In the first step (Fig. S2), four pairwise comparisons were conducted between male fertile (5C95) and sterile (5A214) parents, and between male fertile (5A219F) and sterile (5A219S) plants of BC1 population at two stages (bud and anther). In the second step, we compared the DEGs from the comparisons of both parents with those from the comparison of male sterile and fertile plants of BC1, their intersect was DEGs of interest to us for further analysis (Fig. 3).

Fig. 3
figure 3

Strategy for identification of differentially expressed genes (DEGs) between fertile and sterile samples. Venn diagrams showing the resultant DEGs between fertile and sterile samples. The DEGs resulted from the comparisons between both parents as control, to remove the DEGs that showed differentially expressed between male sterile and fertile plants of BC1, but did not between both parents. Upregulated genes involved in four pairs of comparisons were shown in the left, and downregulated genes in the right

As a result, the pairwise comparisons in the first step revealed 4536 DEGs (2069 upregulated, 2467 downregulated) between 5A214-B and 5C95-B (5A214-B/5C95-B), 6005 DEGs (3140 upregulated, 2,865 downregulated) between 5A214-A and 5C95-A, 505 DEGs (228 upregulated, 277 downregulated) between 5A219S-B and 5A219F-B, and 648 DEGs (546 upregulated, 102 downregulated) between 5A219S-A and 5A219F-A (Fig. S2, Additional file 1). In total, 10,092 DEGs were identified. The second step comparison resulted in 430 DEGs (298 upregulated, 132 downregulated), which were common in pairwise comparisons between both parents and fertile and sterile plants of BC1 population (Fig. 3). These 430 DEGs included 242 in anthers, 90 in buds, and 98 common in both anthers and buds (Fig. S3, Additional file 2). Therefore, they were annotated to the TAIR database (http://www.arabidopsis.org/) (Additional file 3) and selected for further analysis.

Hierarchical clustering and functional enrichment analysis of the 430 DEGs

The selected 430 DEGs were subjected to a hierarchical clustering analysis. The results showed that 5A219F-A and 5C95-A, 5A219S-B and 5A214-B, 5A219F-B and 5C95-B, 5A219S-A and 5A214-A were first clustered together, and the group (5A219S-A and 5A214-A) was relatively far away from the other three groups, which indicated that samples were clustered according to their developmental stages and their fertility. Based on their expression patterns, the 430 DEGs were classified into six groups (Fig. 4). GO analysis of the overrepresented biological processes in each group indicated that each group encompassed several significant GO terms (Fig. 4, Additional file 4). The group 1 included 50 genes which were preferentially upregulated in 5A219F-A and 5C95-A, while downregulated in other samples, the most enriched GO term in this group was associated with regulation of multicellular organism growth. The group 2 included 72 genes which were preferentially upregulated in 5A219F-B and 5C95-B, the most enriched GO term in this group was related to anatomical structure formation involved in morphogenesis. The group 3 contained 92 genes which were upregulated in 5A214-A, the most enriched GO term was associated with response to organic substance. The group 4 included 128 genes which were preferentially upregulated in 5A219S-A, and the most enrichment GO term was related with regulation of lateral root development. The group 5 included 25 genes which were preferentially upregulated in 5A219S-B, the most enriched GO term was associated with protein targeting to chloroplast. The group 6 contained 63 genes which were preferentially upregulated in 5A219S-B, 5A214-B, 5A219F-B, and 5C95-B, and the most enriched GO term in this group was associated with pollen exine formation.

Fig. 4
figure 4

Cluster analysis of the 430 differentially expressed genes between the sterile and fertile samples. The color key represents log10 (FPKM+1). Color intensities (from green to red shading) increased with elevated expression level, as indicated at upper left. The red dotted line is used as reference line for grouping. The most significant GO (Gene ontology) term is indicated in each group

According to their GO annotation, these 430 DEGs were divided into three categories of biological process, cellular component, and molecular function, and further divided into 45 different sub-categories (Fig. S4, Additional file 5). For the biological process category, genes involved in cellular process were the predominant (208/430, 48.4%), followed by those involved in metabolic process (191/430, 44.4%), single-organism process (187/430, 43.5%), response to stimulus (138/430, 32.1%) and biological regulation (108/430, 25.1%). In the cellular component category, 65.6% (282/430) of DEGs targeted cell and cell part, 45.3% (195/430) were directed toward organelle, 29.3% (126/430) were directed toward membrane, 17.2% (74/430) were directed toward extracellular region, and 16.7% (72/430) to membrane part. In the molecular function category, 36.5% (157/430) of DEGs were assigned to binding, 34.4% (148/430) were assigned to catalytic activity, 6.5% (28/430) were assigned to nucleic acid binding transcription factor activity, 6.0% (26/430) were assigned to transporter activity, and 2.6% (11/430) to molecular function regulator.

In order to further analyze the important biological functions of the 430 DEGs, a detailed GO network of these DEGs was drawn by using the BiNGO plugin of Cytoscape software (Fig. S5). Based on the results of BiNGO analysis (Fig. S5) and GO enrichment analysis (Fig. S6), the most highly represented biological processes included sporopollenin biosynthetic process, pollen exine formation, extracellular matrix assembly, post-embryonic plant organ development, cellular component assembly and anatomical structure formation involved in morphogenesis, response to organic substance, asparagine biosynthetic process and glycine biosynthetic process. It is worth mentioned that, DEGs were mostly enriched in pollen exine formation in both GO enrichment analysis and BiNGO analysis.

We further investigated the known and predicted interactions among the DEGs by STRING. In the derived interaction network, these 430 DEGs were divided into 13 pathways (Additional file 6, Fig. S7). The genes in the pathway involved in response to stress were the predominant activity (47/430, 10.9%), followed by those involved in response to abiotic stimulus (32/430, 7.4%), multi-organism process (31/430, 7.2%), response to other organism (29/430, 6.7%), and response to oxygen-containing compound (27/430, 6.3%). The most interesting pathways included pollen wall assembly (including genes CYP703A2, FAR2, SHT, SWEET8, TKPR1, and TKPR2), carbohydrate transmembrane transport (NIP3;1, NIP7;1, STP1, STP14, STP2, SWEET1, SWEET8), pollen exine formation (CYP703A2, FAR2, SHT, TKPR1, TKPR2), and sporopollenin biosynthetic process (CYP703A2, TKPR1, TKPR2). The genes involved in these pathways are very important to microspore and pollen development.

Genes usually play a role in special biological functions by interacting with each other. Pathway-based analysis helps to further elucidate the biological functions of the DEGs. In the present study, KEGG analysis indicated that amino acid metabolism, carbohydrate metabolism, lipid metabolism, translation, signal transduction, transport, and catabolism were overrepresented metabolic activity, while pathways involved in global and overview, folding, sorting and degradation, metabolism of terpenoids and polyketides were also present (Fig. S8A). The KEGG enrichment analysis revealed that the top 20 of significantly overrepresented pathways related to fertility included photosynthesis-antenna proteins, amino acid metabolism, cutin, suberine and wax biosynthesis, flavoniod biosynthesis, terpenoid backbone biosynthesis, fatty acid elongation, pentose, and glucoronate interconversions (Fig. S8B). Further studies of these genes involved in certain pathways would help to elucidate the underlying mechanisms of male sterility in 373S.

TFs involved in anther and pollen development

In the anther and pollen development processes, TFs are generally thought to be important regulators. In this study, 26 TFs (23 upregulated, 3 downregulated) were identified from the 430 DEGs with online software Plant TFDB 4.0 (http://planttfdb.cbi.pku.edu.cn/) (Table 2, Additional file 3). Among these TFs, 20 genes were specifically upregulated in sterile anthers, and three genes upregulated in both sterile anthers and buds; two genes were specifically downregulated in sterile buds, and one gene was downregulated in both sterile anthers and buds. The TFs included HD-ZIP family (seven genes), NAC family (four genes), bZIP family (three genes), RAV family (two genes), C3H family (two genes), WRKY family (one gene), LBD family (one gene), bHLH family (one gene), B3 family (one gene), ERF family (one gene), TALE family (one gene), C2H2 family (one gene), and MYB family (one gene). Among these TFs, AMS is the key regulator in pollen development (Liu and Fan 2013; Shi et al. 2015; Ma et al. 2021). The other TFs identified may provide good candidate for investigation the pollen and anther functions.

Table 2 The transcript factors identified from the 430 differentially expressed genes

In addition, the 430 DEGs were analyzed for the cis-regulatory elements in their promoter sequences. The identified cis-regulatory elements included temperature responsive elements (CAAT-box, MYC, TATA-box, MYB, G-box, DRE, and ABRE), light responsive elements (GT1-motif, G-box, GATA-box, and I-box), drought responsive elements (CAAT-box, GT1-motif, MYC, ABRE, MYB, G-box, DRE, and W-box), hormone responsive elements (ABRE, DRE, TATA-box, CAAT-box, W-box, G-box, and GT1-motif), and so on (Additional file 7). The presence of above-mentioned regulatory elements indicated that the expression of the 430 DEGs may be influenced by temperature, drought stress, hormone and light. Furthermore, in the promoter sequences of the 19 pollen development-related DEGs identified below (Table 3), the most cis-regulatory element was TATA-box (617/1272, 48.5%), followed by AT~ TATA-box (126/1272, 9.9%), MYB (104/1272, 8.2%), CAAT-box (74/1272, 5.8%), and MYC (67/1272, 5.3%), indicating that the expression of the 19 genes may be induced mostly by temperature, drought stress and hormone. Furthermore, the TFs of bHLHs, NACs, WRKYs, ERFs, and bZIPs may be bound to G-box, MYC-like, W-box, DRE, and ABRE, respectively, and they were responded to temperature, drought stress and light factors.

Table 3 Nineteen genes associated with pollen development identified from the 430 differentially expressed genes

Genes associated with pollen development in 373S

In combination of TAIR database annotation, clustering analysis, GO, and KEGG analysis, we finally identified 19 genes which are involved in pollen development and pollen wall formation (Table 3, Additional file 3). The 19 DEGs were clustered into six groups according to their expression patterns in the four pair samples tested (Fig. S9). The group 1 comprised two genes, BnaA03g23380D and BnaC03g27700D, both were annotated to CEP1. The group 2 comprised four genes, BnaA10g04790D (STP2), BnaCnng08630D (CYP703A2), BnaC01g03140D (TKPR1), and BnaA10g19830D (TKPR2). The group 3 comprised two genes, BnaA03g41380D and BnaC07g32300D, both were annotated to Lipid Transfer Protein 12 (LTP12). The group 4 comprised five genes, BnaC05g00670D (CYP703A2), BnaA02g17470D (SKS18), BnaC02g23450D (SKS18), BnaA03g41660D (KCS15), and BnaC07g32730D (KCS15). The group 5 comprised two genes, BnaC07g41240D and BnaA03g49210D, both were annotated to ANTHER 7 (A7). The group 6 comprised four genes, BnaAnng22360D (SHT), BnaC09g09810D (SHT), BnaA02g17600D (ATA27), and BnaC03g46740D (AMS). Generally, genes annotated to the same orthologous gene in Arabidopsis were clustered together, except two genes annotated to CYP703A2, one (BnaCnng08630D) in group 2, another (BnaC05g00670D) in group 4.

qRT-PCR verification

The 19 genes related to pollen development were chosen to verify the accuracy of the RNA-Seq result by qRT-PCR. The results indicated that 78.95% (15/19) of the chosen genes showed significantly positive correlations (r=0.7392–0.9997) between the two results. This showed that the majority of these 19 genes had similar expression pattern by two approaches, indicating that the RNA-Seq results in this study are reliable (Fig. 5), which enhanced the reliability of our data in a relatively wide range.

Fig. 5
figure 5

qRT-PCR verification of the 19 selected differentially expressed genes involved in pollen development. Values were estimated from 2-△△Ct values in qRT-PCR and the standard deviation for each sample with three replicates was indicated by error bars. r, Pearson correlation coefficient between the results of qRT-PCR and RNA-Seq. * and ** mean significant at 0.05 and 0.01 level, respectively

Verification of DEGs in 373S under restrictive condition (high temperature) and permissive condition (low temperature)

To verify the 430 DEGs identified above, the field grown 373S seedlings were transplanted to greenhouse after vernalization and treated with two different temperature regimes (13 °C/3 °C (mean = 8 °C, permissive condition) and 17 °C/7 °C (12 °C, restrictive condition) for 10 days, according to our previous experimental results (Sun et al. 2020). When the male fertility of the first opened flowers of the treated 373S plants was visually detectable for male fertility under permissive condition, or male sterility under restrictive condition, young flower buds (length ≤ 3 mm) of 373S plants under two treatments were collected to perform RNA-seq analysis. As a results, 127 of the 430 genes (29.53%) were detected as differentially expressed when cutoff value as log2 (fold change) ≥ 1 and p =0.05 (Additional file 8, Table S5). Among the 127 differentially expressed genes in common, only 36 genes (28.35%) showed the same change trend (Table S5). These results indicated that DEGs identified between sterile and fertile samples, and those identified in 373S under permissive and restrictive conditions were largely different (Table S5, Additional file 3).

Discussion

Photoperiod and/or temperature sensitive genic male sterility (P/TGMS) is one of the most important pollination control systems for rapeseed hybrid production. Some valuable rapeseed P/TGMS germplasms have been identified and used for development of several two-line hybrid varieties of rapeseed (Liu et al. 2000, 2007, 2013; Gao et al. 2010). However, the molecular mechanism underlying the male sterility in rapeseed P/TGMS is unclear, which therefore limits the utilization of such system in hybrid breeding. In the present study, a combined study of cytological observation and RNA-seq based transcriptome profiling analysis was conducted to uncover the key cellular events and molecular regulation mechanism in the TGMS line 373S (B. napus). Many novel abnormalities were observed during 373S anther abortion, including microspore nucleus degradation, abnormal tapetum degradation, less elaioplast and tapetosome formation in tapetal cells, and irregular pollen wall formation (Fig. 1 and Fig. 2). Comparative transcriptome analysis identified 430 DEGs between the fertile and sterile samples, with 298 being upregulated and 132 downregulated (Fig. 3). GO analysis identified the most highly represented biological processes including sporopollenin biosynthetic process, pollen exine formation, extracellular matrix assembly, post-embryonic plant organ development, cellular component assembly, anatomical structure formation involved in morphogenesis, response to organic substance, asparagine biosynthetic process and glycine biosynthetic process (Fig. S5). KEGG analysis indicated that the most enriched pathways included amino acid metabolism, carbohydrate metabolism, lipid metabolism, translation, signal transduction, transport and catabolism (Fig. S8B). The 26 differentially expressed TFs were identified, which may be involved in abnormal formation of pollen wall, abnormal degeneration of tapetum, and ultimately lead to pollen abortion (Table 2, Additional file 3). These results suggested that the degradation of microspore nuclei due to abnormal development of chromosomes and callose wall, and abnormal pollen wall formation due to the abnormal development of tapetum are the main causes of pollen abortion, and the aberrant gene expression regulation revealed by RNA-seq analysis corresponded well with the observed cytological abnormalities.

Cytological characteristics of 373S anther abortion

In flowering plants, successful male reproductive development requires a sequence of developmental events such as cell differentiation, cell-to-cell communication, meiosis, and mitosis (McCormick 2004; Scott et al. 2004). Several studies have been conducted to examine ultrastructure changes in anthers of rapeseed EMS lines. The abortion of microspores in the EMS line H50S in B. napus started after the microspores were released from the tetrads. No obvious vacuoles were observed in the aborted microspores, and nuclei of the aborted microspores collapsed (Sun et al. 2009). For the EMS line Huiyou 50S, the microspore abortion occurred at the uninucleate stage. The microspore cytoplasm was disintegrated, and microspores became an empty cell at later time. Meanwhile, the tapetum degraded rapidly at late uninucleate stage (Xu et al. 2014b). Cytological observation of the TGMS line SP2S revealed many abnormal cytological phenomena, such as excessive vacuolation of tapetal cells at the microsporocyte stage, delayed tetrad wall degradation, vacuolization of uninucleate microspores, later deposition of exine, and absence of nexine (Yu et al. 2015; Liu et al. 2017). Ultrastructure analysis of the thermo-sensitive DGMS line TE5A in B. napus showed that its male gamete development was stopped at meiosis prophase I. Homologous chromosomes could not pair, synapse, condense, and form bivalents in TE5A (Yan et al. 2016). Compared with the results of anther development in these EMS lines, our cytological observations revealed some novel and noteworthy features during the anther development of the line 373S (Fig. 1 and Fig. 2). Our results suggested that abnormal anther development of 373S first appeared at the PMC stage. The PMCs of 373S plants had less cytoplasm, early degradation of nuclei, and deformation of the callose wall. However, some anthers in 373S could develop to tetrad and uninucleate microspore stages. Finally, the microspores in 373S deformed and were only surrounded by an irregular exine, and their nuclei, cytoplasm and intine were completely degraded. These findings enriched our previous results of semi-thin section and scanning electron microscope observation (Sun et al. 2020). Pollen wall consists of three layers including an inner pectocellulosic intine, an outer sporopollenin based exine and a lipid and protein-rich pollen coat in the crevices of exine, and its formation is an essential metabolic process throughout pollen development (Liu and Fan 2013; Shi et al. 2015; Ma et al. 2021). For tapetum development, previous studies have reported that elaioplasts are specialized plastids derived from proplastids which contain few internal membranes that were packed with globules of steryl esters enclosed by structural proteins, and they are abundant in tapetal cells during the active stage of pollen maturation in Brassicaceae species, including Arabidopsis (Platt et al. 1998; Ting et al. 1998; Wu et al. 1999; Kim et al. 2001). After degradation of tapetal cells, elaioplasts were deposited onto the maturing pollens to form the pollen coat, which is the outermost layer of the pollen wall (Ting et al. 1998). Tapetosomes, another specific organelle in cruciferous plants, act as transporters for the tryphine components in Arabidopsis and Brassicaceae plants, and they also accumulate ER-derived flavonoids, alkanes, oleosins for the formation of the pollen wall, as well as regulate tapetum programmed cell death (Xu et al. 2014a; Shi et al. 2015). Our study revealed that in the tapetal cells of 373S line, elaioplasts and tapetosomes disappeared, the rest of subcellular structures were invisible, and only a large number of vacuoles remained. All these phenomena we observed in 373S were not reported previously in other rapeseed EMS lines (Dong et al. 2004; Sun et al. 2009; Xu et al. 2014b; Yu et al. 2015; Yan et al. 2016; Liu et al. 2017). The novel results may be due to the different genetic backgrounds of the plant materials and the different environments in which the plant materials were grown.

Pathways and genes involved in anther abortion of 373S

Comparative transcriptomic analysis facilitates the systematic identification of DEGs between fertile and sterile samples, and enhances exploration of molecular mechanism involved in anther and pollen development (Wan and Li 2019). Although some valuable rapeseed P/TGMS germplasms have been discovered and utilized in rapeseed hybrid breeding (Sun et al. 2020), a few studies combining cytological and transcriptomic analysis were carried out to reveal the underlying molecular mechanism of P/TGMS in rapeseed (Yan et al. 2016; Liu et al. 2017). To reveal the molecular mechanism of the TGMS TE5A in B. napus, Yan et al. (2016) performed RNA-seq analysis to identify DEGs participating in the control of fertility. As a result, they obtained a total of 3841 DEGs between TE5A and its fertile NILs. Some of the DEGs were associated with homologous chromosome behavior and cycle control, which corresponded well with the abnormal meiosis prophase I observed in TE5A. Liu et al. (2017) conducted transcriptomic comparisons between young flower buds of the TGMS line SP2S and its NIL SP2F plants, and identified 465 differentially expressed transcripts. They suggested that the TGMS genes in SP2S may affect key proteins that are required for microsporocyte meiosis and tapetum development. The aberrant genetic regulation corresponded well with the observed cytological abnormalities. To reveal the underlying fertility conversion mechanism that controls fertility restoration of the pol TCMS in rapeseed, Xiao et al. (2021) identified 1637 DEGs in the fertile flowers of the TGMS 596 line under low temperature (16°C) when compared to the sterile flower of the temperature-stable pol CMS 1318 line and the TGMS 596 line under high temperature (25°C) by RNA-seq. These DEGs were involved in temperature response, ROS accumulation, anther development, and mitochondrial function. They also found that alternative oxidase, type II NAD(P)H dehydrogenases, and transcription factor Hsfs might play an important role in male fertility under the low-temperature condition. In the present study, we identified 430 DEGs (298 upregulated and 132 downregulated) between male sterile parent (373S line) and fertile parent, and between male sterile and fertile plants of the BC1 population derived from 373S line (Fig. 3, Additional files 2 and 3). These DEGs are mainly involved in sporopollenin biosynthetic process, pollen exine formation, extracellular matrix assembly, post-embryonic plant organ development, cellular component assembly and anatomical structure formation involved in morphogenesis, response to organic substance, asparagine biosynthetic process and glycine biosynthetic process. Among them, the process of pollen exine formation is the only pathway shared to another rapeseed TGMS line SP2S (Liu et al. 2017), indicating that this pathway is important for anther and pollen formation.

Anther and pollen development is a complex process governed by genetic regulatory networks consisting of many TFs and their downstream genes (Jiang et al. 2013; Liu and Fan 2013; Shi et al. 2015; Ma et al. 2021). Previous studies indicated that the genetic regulatory network DYT1-TDF1-AMS-MS188/MS103-MS1, and their downstream genes, including CYP703A2, ACOS5, TKPR1/2, controls the formation of the sexine of pollen wall (Jiang et al. 2013; Liu and Fan 2013; Shi et al. 2015; Lu et al. 2020; Ma et al. 2021). In addition, another pathway DYT1-TDF1-AMS-TEK-APGs is responsible for synthesizing the nexine of the pollen wall. So, AMS plays a central role in tapetum development and pollen wall formation in Arabidopsis (Sorensen et al. 2003; Xu et al. 2010; Xu et al. 2014a; Shi et al. 2015; Ma et al. 2021). Lipid synthesis-related genes CYP703A2 (Morant et al. 2007), CYP704B1 (Dobritsa et al. 2009), and MS2 (MALE STERILE2) (Chen et al. 2011) in Arabidopsis and the corresponding orthologs CYP704B2 (Li et al. 2010), CYP703A3 (Yang et al. 2014), and DPW (DEFECTIVE POLLEN WALL) (Xu et al. 2017) in rice are essential for the pollen wall formation. MS2, ACOS5, CYP703A, and CYP704B work in cascade to synthesize the main components of the sporopollenin precursor-fatty acid monomers, and their expressions are regulated by the TF AMS (Xu et al. 2010; Xu et al. 2014a). In the present study, we identified 26 TFs from the 430 DEGs (Table 2), including the members of HD-ZIP, NAC, bZIP, RAV, C3H, WRKY, LBD, bHLH, B3, ERF, TALE, C2H2, and MYB family. Through clustering, GO, and KEGG analysis of the 430 DEGs, we excavated 19 genes related to pollen development in the 373S line (Table 3). These genes included the TF AMS and some of its target genes reported previously, including CYP703A2KCS15, TKPR1/DRL1, and LTP12 (Xu et al. 2010;Xu et al. 2014a). Furthermore, we have also identified some other TFs and DEGs not involved in the above AMS pathways in the present study, which are worthy of further study.

In the present study, in addition to genes that regulate the exine formation of the pollen wall synthesized in the tapetum, we have found some other genes, including ATA27 and STP2, which influence the exine formation of the pollen wall by regulating callose. Callose is the main component of the tetrad wall and forms at the early stage of meiosis. The synthesis and degradation of callose are important for the formation of pollen exine. ATA27 encodes a pollen coat protein similar to the BGL4 -glucosidase in B. napus. It only expressed in both tapetum and pollen during microspore and pollen development (Rubinelli et al. 1998; Lu et al. 2020). The STP2 gene of A. thaliana encodes a high affinity, low specificity monosaccharide carrier that can transport a number of hexoses and pentoses at similar rates. Its expression is confined to the early stages of gametophyte development from the beginning of callose degradation to microspore release from the tetrads, suggesting its role in the uptake of glucose units resulting from callose degradation during pollen maturation (Truernit et al. 1999). The aberrant expression of these genes may be related to deformation of the callose wall and intine observed in the present study (Fig. 1 and Fig. 2). Together, these abnormal gene expressions and metabolism caused cytoplasmic condensation, abnormal callose, irregular exine, and abnormal degradation of microspore nuclei and tapetum, ultimately leading to pollen abortion in the TGMS 373S. Of course, many genes, especially those mentioned above, are involved in this process. In the future, the cloning and functional characterization of the TGMS gene (s) underlying the TGMS trait in 373S line will help to reveal the genetic mechanism of thermo-sensitive male sterility in rapeseed.

Conclusions

In this study, cytological observation and comparative transcriptome analysis were conducted to reveal the mechanism underlying temperature inducing male sterility in the rapeseed line 373S. Cytological studies indicated that the condensed cytoplasm, nucleus degradation of microspore, irregular pollen wall formation, and abnormal degradation of tapetum were related to microspore abortion in 373S. The comparative transcriptome analysis identified 430 DEGs in buds and anthers between male sterile and fertile samples. Of them, 298 were upregulated and 132 downregulated. The most highly represented biological processes included sporopollenin biosynthetic process and pollen exine formation. Nineteen pollen development-related genes, 26 TFs, and lots of cis-regulatory elements were identified. Furthermore, the TFs of bHLHs, NACs, WRKYs, ERFs, and bZIPs probably bound to the cis-regulatory elements of G-box, MYC-like, W-box, DRE, and ABRE of the 430 DEGs. The findings will provide insights into the complex gene regulation network during pollen development and contribute to understanding of the molecular mechanism of male sterility in TGMS rapeseed.