Introduction

Cotton (Gossypium hirsutum L.) is not only the world’s leading textile fiber and oilseed crop, but also a crop that is of significance for foil energy and bioengergy production, and model system for studies of many biological processes. However, the growth and productivity of cotton are adversely affected by salt stress, especially at germination and young seedling stage. The salt-tolerant mechanism of cotton has been investigated physiologically, including ion homeostasis, accumulation of osmolytes and antioxidants (Ashraf 2001). Consistently, independent studies revealed some molecular adaptation of cotton to salt stress. For instance, the ethylene-responsive element-binding factors, GhERF2, GhERF3, GhERF6 (Jin et al. 2010), GhERF4 (Jin and Liu 2008) and GhERF5 (Champion et al. 2009) were induced by ethylene, abscisic acid (ABA), salt, cold and drought stresses. A cotton DRE-binding transcription factor gene (GhDREB) conferred enhanced tolerance to drought, high salt and freezing stresses in transgenic wheat (Gao et al. 2009). We characterized the salt, drought, cold and fungi-induced genes of cotton, including a cotton tonoplast Na+/H+ antiporter (GhNHX1) (Wu et al. 2004), a novel CCCH-type zinc finger (GhZFP1) (Guo et al. 2009) and a cotton metallothionein GhMT3a (Xue et al. 2009). Large-scale approaches including microarrays (Walia et al. 2006; Buchanan et al. 2005; Ueda et al. 2004; Rabbani et al. 2003) and whole-genome tiling arrays (Zeller et al. 2009) have been employed to identify genes responding to salinity in plants. One motivation for expression profiling in a reference plant such as Arabidopsis is to exploit the understanding of conserved stress-signaling networks to improve the stress tolerance of crops (Denby and Gehring 2005; Valliyodan and Nguyen 2006). Therefore, it is interesting to determine whether the cotton salt response shares the same regulatory mechanisms with that in other plants. To obtain comprehensive and unbiased transcript profiles during salt stress, we performed deep sequencing analysis using Solexa digital gene expression system. This system is an improved tag-based method that can sequence in parallel millions of total cDNA for the derivation of an accurate measure of gene expression and the discovery of novel transcriptional regions, dramatically changing the way that the functional complexity of transcriptome can be studied (Ford et al. 2008; Li et al. 2011).

In the present study, an overall impression of gene profiles during the salt stress of cotton seedlings was acquired by deep sequencing. We have comprehensively characterized the molecular basis of the physiological processes during cotton salt stress and provided useful information for further research.

Materials and methods

Plant materials and RNA extraction

A salt-tolerant upland cotton cultivar ZM3 was mainly used in the present study, except for an experiment to verify the expression of 12 selected genes in a salt-sensitive cultivar LM6. Seeds of ZM3 and LM6 were pre-germinated in distilled deionized water in a plant growth chamber at 60% humidity with 300 μmol m−2 s−1 light intensity and day/night temperature of 28/20°C for 48 h. Then germinated seeds were planted into soil and cultured in the same chamber with the same conditions for 20 days. The 20 uniformly developed seedlings were transferred into mediums containing 250 mM NaCl for indicated times. The true leaves and roots were harvested directly into liquid nitrogen and stored at −80°C. Then, total RNA was prepared by RNeasy Plant Mini Kit (QIAGEN, USA) in accordance with the manufacturer’s instruction.

Determination of total antioxidant capacity and MDA content

The total antioxidant capacity (T-AOC) of the protective system and MDA content was measured using commercial assay kits (supplied by Nanjing Jian Cheng Bioengineering Institute, China) according to the manufacturer’s instructions. Briefly, for analysis of T-AOC, 0.5 g of samples was ground in 4.5 ml of phosphate buffer (pH 7.0) and centrifuged at 4°C and 12,000g for 15 min. The supernatant was used for T-AOC measurement according to the manufacturer’s instruction. The method was based on the reduction of Fe3+ to Fe2+ by antioxidant, and then Fe2+ reacted with the phenanthroline to form a complex. The mixture was measured at 520 nm. Increase of 0.01 unit at A520 in 1 min/mg protein was considered as one unit of antioxidant capacity, and T-AOC was expressed as unit/mg protein. For MDA content measurement, 0.5 g of samples was ground in 4.5 ml of 10% thiobarbituric acid and centrifuged at 22°C and 12,000g for 15 min. The supernatant was used for MDA measurement according to the manufacturer’s instruction. The absorbance of the mixture was measured at 532 nm. MDA was expressed as nmol/mg protein.

Tag library construction and sequencing

Sequence tag preparation and sequencing were done with the Digital Gene Expression Tag Profiling Kit (Illumina Inc., San Diego, CA, USA) according to the manufacturer’s protocol (version 2.1B). Twenty micrograms of total RNA were used for mRNA purification using biotin-Oligo (dT) magnetic bead adsorption. Then first- and second-strand cDNA was synthesized. While on the beads, double-strand cDNA was digested with NlaIII endonuclease to produce a bead-bound cDNA fragment containing sequence from the 3′-most CATG to the poly (A)-tail. These 3′ cDNA fragments were purified using magnetic bead precipitation, and the Solexa adapter 1 was added to the new 5′ end. The junction of Solexa adapter 1 and CATG site was recognized by MmeI. The enzyme cuts and produces 17 bp cDNA sequence tags with adapter 1. Then, the Solexa adapter 2 was ligated to the 3′ end of the cDNA tag. The adapter-ligated cDNA tags were enriched using PCR primers that anneal to the adaptor ends. The resulting 85 base PCR fragments were purified from a 6% acrylamide gel and subjected to the Solexa sequencing system (Illumina Inc., San Diego, CA, USA).

Tag-to-gene assignment and functional categorization

Sequencing quality evaluation and data summarization were performed using Solexa pipeline software after sequencing. “Clean Tags” were obtained by filtering off adaptor-only tags and low-quality tags (containing ambiguous bases). Comparison of the sequences by BLASTN against the nucleotide sequences downloaded from the Institute for Genome Research (TIGR) (http://plantta.jcvi.org/) was carried out. All clean tags were annotated based on the reference genes. For conservative and precise annotation, only sequences with perfect homology or 1 nt mismatch were considered further. The number of annotated clean tags for each gene was calculated and then normalized to TPM (number of transcripts per million clean tags) (Audic and Claverie 1997; Poole et al. 2008). Sequences were manually assigned to functional categories based on the analysis of scientific literature.

Reverse transcriptase PCR (RT-PCR) analysis

Total RNA isolated above was treated with DNase I to remove genomic DNA contamination. The first-strand cDNA synthesis and the qRT-PCR were carried out using the PrimeScript™ RT reagent Kit (perfect Real Time) and SYBR PrimeScript™ RT-PCR Kit (perfect Real Time), respectively. The qRT-PCR was performed on an Opticon II system (MJ Research/Bio-Rad) with the following cycling parameters: 94°C for 30 s, indicated cycles of 94°C for 12 s, 58°C for 30 s, 72°C for 30 s, and 1 s at 80°C for plate reading. The ubiquitin (GenBank: EU604080) gene was used for the normalization of reactions. The expression levels of the genes were calculated from the threshold cycle using the delta–delta C t method (Livak and Schmittgen 2001). Semiquantitative RT-PCR was also used for ten unknown function genes with the cycling parameters: 94°C for 30 s, indicated cycles of 94°C for 30 s, 55°C for 30 s, 72°C for 30 s, and 1 min at 72°C for plate reading. All reactions were performed with at least three replicates.

Results

Effects of salinity on T-AOC and MDA content of ZM3

To select a suitable time point of 250 mM NaCl treatment, we measured the total antioxidant capacity (T-AOC) of the protective system and the malondialdehyde (MDA) content, which is the decomposition product of polyunsaturated fatty acids of biomembrane. The changes in T-AOC and MDA are shown in Fig. 1. T-AOC increased to a higher level at 24 h of NaCl treatment and then decreased at 48 and 72 h of salt treatments (Fig. 1a). In contrast, the content of MDA increased slowly before 48 h of NaCl treatment, but increased rapidly at 72 h (Fig. 1b). These data indicated that the imbalance of ROS production and scavenging occurred at 12 h of NaCl treatment. However, a stronger ROS scavenging was built up at 24 h of NaCl treatment. Therefore, we carried out the next experiment by using 250-mM NaCl treatment within 24 h.

Fig. 1
figure 1

Effects of salt stress on the total activities of antioxidant enzymes (T-AOC) a and MDA contents b in the true leaves of 20-day cotton seedlings at different time points

Characterization of the sequenced Solexa libraries

To identify genes involved in salinity, two cotton Solexa libraries were constructed from 20-day-old seedlings treated without or with 250 mM NaCl for 24 h. Sequencing depths of 3,583,646 and 3,694,330 tags were achieved in the two libraries, including 395,874 and 389,356 distinct tags, respectively. To make the libraries meaningful, tags recorded only once were wiped off, leaving 145,794 and 138,518 clean tags in each library that were detected multiple times. The frequency of these tags are shown in Table 1, in which the majority of clean tags (about 86% from each) were present at low copy numbers (<10 copies), and approximately 9% tags from each library were counted between 11 and 100 times. Only approximately 1.4% tags were detected more than 100 times.

Table 1 Distribution of the tags sequenced from the two Solexa libraries and summary of distinct tag-to-gene mapping data

To identify the genes corresponding to the 145,794 and 138,518 meaningful tags in each library, an essential dataset containing 70,667 reference genes expressed in the cotton from the http://plantta.jcvi.org/plant TA search was used. Altogether, 62,029 genes (87.8%) have the CATG sites, resulting in a total number of 121,721 (79.5%) unambiguous reference tags. By assigning the eperimental Solexa tags to the virtual reference ones (Table1), we observed that 54,937 (37.7%) and 50,493 36.5%) tags were perfectly matched to the reference genes n control and salinity libraries, respectively. Out of the tags, approximately 20 and 34% tags in control and salinity libraries were mapped to multiple locations. Further sequence analysis revealed that some of them were mapped to highly conserved domains shared by different genes. For instance, CATGTGATCACCGTAGCAGAG could be matched to CO499433, CD486297, TA21047_3635, DN801579, CD486180, TA21046_3635, DT050874, TA21044_3635, TA21045_3635, DT054118, DT050861, DV850220, TA21048_3635 and CD485873 with 95–100% identities to late embryogenesis abundant protein LEA5-A of cotton. In addition, approximately 18% tags in the two libraries were mapped to the antisense strands, demonstrating that those regions might be bidirectionally transcribed. However, for the discrepancy between the reference tags and experimental tags [25], approximately 14% of 1-bp mismatched tags were present in the two libraries. Overall, 75,500 (51.8%) and 72,077 (52.0%) tags in the Solexa libraries treated without or with NaCl were matched to the reference genes, respectively. As a result of the significant sequencing depth of Solexa technology and incomplete genomic data of the cotton, however, approximately 14% unmatched tags in each library were observed.

Identification of differentially expressed transcripts

By comparing two Solexa libraries, a great number of differentially expressed transcripts were identified. The distribution of fold changes in tag number between the two libraries is shown in Appendix S1. The great majority of transcripts were expressed at similar levels in the two libraries: 98.9% tags showed a <5-fold difference in expression regulated by salinity, while tags with expressional changes in the range 5- to 200-fold only accounted for 1.1%. Scatter plot analysis also presented a broader scope of differentially expressed tags than annotated genes, demonstrating that a great number of unknown transcripts were revealed (Appendix S2). To study a subset of genes that were associated with salinity, we analyzed the most differentially regulated tags with a log2ratio >2 or <−2 using a greater statistically significant value (P < 0.001) as well as false discovery rates (FDR < 0.01), representing 223 up- and 317 down-regulated transcripts. Apart from the unknown transcripts (45%), 125 up- and 171 down-regulated genes were classified into ten categories according to their putative or known functions (Fig. 2, Appendix S3 and S4). Of these, the most enriched functional categories are those of metabolism (35.5%), signaling pathway (15.2%), environmental response (13.9%) and transcription (10.8%). Gene categories showed an obvious increase in transcript abundance for genes involved in environmental response. In contrast, transcriptional abundance for genes participating in protein synthesis, metabolism and cytoskeleton were reduced.

Fig. 2
figure 2

Functional classification of genes differentially expressed during salt stress of the cotton plants

In detail, we characterized some significant regulated pathways (Appendix S3 and S4). For example, out of the 15 differentially expressed genes involved in cell growth and cytoskeleton, 9 transcripts encoding for expansins and actins were down-regulated significantly. Many auxin and gibberellin-associated genes were also reduced in the present study, such as SAUR protein and GASA2-like protein, indicating their negative regulation in salt stress. Moreover, a total of 25 phosphorylation/de phosphorylation cascades and 32 transcription factors (TFs) were detected with different changes. Of these, 6 protein phosphatase genes were up-regulated, while 12 protein kinase genes were repressed, indicating lower phosphorylation regulation during salt stress. Among the TFs, NAC (petunia NAM and Arabidopsis ATAF1, ATAF2 and CUC2) (4 genes), zinc finger (4 genes), homodomain leucine zipper (HD zip) (4 genes) and AREB (ABA responsive element-binding protein) (2 genes) TFs were mostly up-regulated, while MYB (3 genes) and ERF (ethylene-responsive element-binding) (3 genes) TFs were mainly down-regulated. This implied the importance of signal transduction during the salt adaptation process. In addition, many transcripts encoding for heat shock proteins (HSPs), late embryogenesis abundant proteins (LEAs) and compatible solutes were greatly induced in the salt library. Similarly, genes encoding for organic molecule transporters were also significantly induced. On the contrary, genes encoding for cation/metal transporters were significantly repressed. The over representation of these categories strongly suggested their regulating roles in the salinity adaptation in plants.

Furthermore, with the benefit of Solexa sequencing, numerous novel transcripts with unclear functions were also detected (Appendix S5).

Quantitative real-time PCR (qRT-PCR) confirmation

To evaluate the validity of Solexa analysis and to further asses the patterns of differential gene expression, seven known and five unknown transcripts with high abundances were selected and detected by real-time RT-PCR (qRT-PCR) (Table 2) with gene-specific primers (Appendix S6), including four LEAs and two trehalose-6-phosphate synthases (TPSs). The log2ratios for the four LEA genes of M19406 (D-113), M19379 (D-11), X15086 (D-7) and TA24166_3635 (LEA14-A) were 12.1, 9.7, 4.9 and 2.7, and those for the two TPS genes of TA26597_3635 and TA30732_3635 were 2.5 and 2.9, respectively. Different from the Solexa data, the highest up-regulation of M19379 was observed with almost 2,000-fold in salinity, while the transcript abundance of M19406 was induced by approximately 520-fold, lower than that of M19379 (Fig. 3). The induction of the transcript abundance of the two TPS genes was much lower, while the three significantly down-regulated genes of CD486436, DW231112 and DT567567 were also detected with lower changes by qRT-PCR than by Solexa sequencing. Furthermore, another ten unknown transcripts were selected and detected by relative RT-PCR (Table 2) with gene-specific primers (Appendix S6). As shown in Fig. 4, nine unknown transcripts were induced in root or leaves or both at 6 and 12 h of salinity treatment, while one transcript was repressed during salt stress. Importantly, the transcription levels of 12 of the genes mentioned above in a salt-sensitive cotton cultivar LM6 showed almost no differences or slightly inducible patterns after salt treatment than the control (Fig. 5). The mRNA levels of TA22164_3635 and TA32043_3635 were repressed in the salt-sensitive cotton cultivar LM6 (Fig. 5). These data further indicated the important role of these genes in cotton salt tolerance. Taken together, the expression patterns of these genes in ZM3 showed agreement with the Solexa data. The apparent discrepancies should be attributed to the essentially different algorithms determined by the two techniques (Ekman et al. 2003). The deep sequencing method generates absolute rather than relative expression measurements. However, these results basically confirmed the reliability of our transcriptome analysis.

Table 2 List of the expression profiles selected for confirmation by RT -PCR
Fig. 3
figure 3

Relative expression of the 12 genes detected by qRT-PCR in different conditions of the cotton seedlings. The transcript levels were normalized to that of ubiquitin, and the level of each gene in the control samples (CK) was set at 1.0. Error bars represent the SE for three independent experiments. The gene expression in Solexa experiment and the descriptions are listed in Table 2. The primers used for each gene are listed in Appendix S6

Fig. 4
figure 4

The expression profiles of another ten selected unknown function genes detected by relative RT-PCR. The plants used in this experiment were 20-day-old cotton seedlings that were treated with 250 mM NaCl for 0, 6 and 12 h. The roots and leaves of cotton seedlings were harvested, respectively, for this experiment. A total of three independent experiments were carried out and the results were similar in expression changes. The gene expression in the Solexa experiment, their descriptions and primers are also listed in Table 2 and in Appendix S6, respectively

Fig. 5
figure 5

The expression profiles of 12 selected genes in salt-sensitive cultivar LM6 detected by relative RT-PCR. The plants used in this experiment were 20-day-old cotton seedlings that were treated with 250 mM NaCl for 24 h. The whole seedlings were harvested for this experiment. The results were illustrated by one of three independent experiments because of the similar expression changes. The gene descriptions and their primers are also listed in Table 2 and in Appendix S6, respectively

Discussion

The major goal of the present study is to preliminarily explore transcripts involved in salinity response of cotton seedlings, as well as to provide groundwork for investigating their regulating mechanisms. Several batches of cotton microarrays have been fabricated. The most number of gene elements are derived from fibers (Arpat et al. 2004) or ovales (Shi et al. 2006; Wu et al. 2006). To our knowledge, the results represent the first large-scale investigation for the transcriptional changes during salt stress in cotton seedlings by the Solexa deep sequencing technology. Recently, Wang et al. (2009) revealed the epigenetic modifications in maize shoots and roots by Solexa sequencing. Our work revealed maize brace root development by Solexa sequencing (Li et al. 2011). This demonstrated that Solexa sequencing analysis had emerged as an efficient and economical method for sampling transcript profiles under specific experimental conditions. In total, we could map approximately 52% of tags to unique or non-unique positions. We consider this as the limited amount of genomic sequences available for cotton and related species in GenBank. In the present study, only 87.8% of reference genes from the cotton EST library have the CATG sites, leaving 8,538 genes unanalyzed, which means that a number of the related genes involved in cotton salt responses would be neglected.

Solexa sequencing could provide a comprehensive and unbiased dataset in the global analysis of gene expression. In the present study, 2,239 differently expressed annotated and novel transcripts (P < 0.01) were explored. Except for unknown transcripts, the most differentially expressed genes with a log2ratio >2 or <−2 (P < 0.001) also participate in various biological pathways, such as environmental response, macromolecule metabolism, signal transduction and transcriptional regulation. Interestingly, many transcripts detected at very low copy numbers in the control library were significantly up-regulated in the salt library (Appendix S3), implying that these genes might be specifically expressed in salt stress. For example, the presence of LEA protein D-113, secretory laccase, putative dehydration-induced myb-related protein and class IV chitinase genes only in the salinity stress indicates that they begin to take on functions to survive under salt stress. Osmoprotection, detoxication and turgor maintenance processes also showed different changes, explaining the different cell responses to salt stress, such as galactinol and trehalose accumulation due to the induced galactinol synthase and TPSs. These changes are common in abiotic stresses. It has been reported that an A. thaliana mutant overexpression of a galactinol synthase gene was more tolerant to desiccation, cold and salinity stress (Taji et al. 2002; Ottow et al. 2005). Overexpression of TPS and d-pyrroline-5-carboxylase synthetase genes conferred tolerance to desiccation and other abiotic stresses in A. thaliana plants and rice (Ashraf and Foolad 2007). In addition, a number of LEA genes responsive to low temperature, drought, high salinity and ABA have been reported (Xiong et al. 2002; Rorat 2006; Hundertmark and Hincha 2008).

Previous studies have reported that the transcripts involved in the regulation of these stress responsive genes were regulated by phosphorylation/dephosphorylation cascades and specific transcription factors (Chinnusamy et al. 2004). In the present study, 18 protein phosphatase/protein kinase genes were regulated by salt stress out of 25 phosphorylation/dephosphorylation cascades (Appendix S3 and S4), indicating the lower phosphorylation regulation during salt stress. Moreover, large-scale transcriptome analysis using the areb1 areb2 abf3 triple mutant revealed downstream genes in response to water stress, including many LEA class and group-Ab PP2C genes (Yoshid et al. 2010). This indicated that ABA might participate in the phosphorylation/dephosphorylation pathway in salt stress. In concordance with this, we observed the differential expression of many transcriptional factors conserved in salt and/or ABA responses, such as NAC [4 genes (Fujita et al. 2004; Tran et al. 2007; Ohnishi et al. 2005; Nakashim et al. 2007], zinc finger (4 genes) (Tran et al. 2007; Guo et al. 2009), AREB (2 genes) (Fujita et al. 2005; Yoshid et al. 2010), ERFs (Qiao et al. 2008; Jin et al. 2010; Champion et al. 2009). Hence, the crosstalk between salt stress and ethylene and ABA responses could possibly be mediated by ERFs in cotton. Moreover, auxin-related genes were also regulated in the salt library (Appendix S3 and S4), including down-regulated auxin-related genes, implying that: the auxin signaling was negatively involved in salinity responses; and genes encoding for jasmonate synthesis and responses, such as allene oxide synthase, lipoxygenases, lipase, alcohol dehydrogenase and chitinases, were regulated under salinity. The increased levels of JA in barley plants under osmotic stress (Lehmann et al. 1995), and rice (Moons et al. 1997) and Iris hexagona (Wang et al. 2001) under salt stress have been reported. Recently, Walia et al. (2007) reported that the JA mediated adaptation of barley to salinity stress and that 38% of salt-induced genes were induced by JA. We found that 36 and 39% of salt-regulated genes were regulated by ABA and MeJA, respectively (unpublished data). These indicated the crosstalk of salt stress with auxin and the JA signaling pathway. Taken together, salt stress crosstalks with multiple phytohormones, such as ABA, ethylene, JA and auxin in cotton, in the present study, and the regulatory mechanisms appear to be commonly shared with respect to cotton salinity-regulated genes and other plant abiotic responsive genes.