Introduction

Expression of most eukaryotic genes rely on specific transcription factors that bind to or modulate DNA structure in the regulatory region of genes, which in turn affects the activity of RNA polymerases for initiation of transcription. In plants, more than 50 families of different transcription factors have been identified based on sequence analysis in model species such as Arabidopsis and rice (Riechmann et al. 2000; Xiong et al. 2005; Riano-Pachon et al. 2007), and numerous reports suggest transcription factors are involved in almost all aspects of cellular activities with their related roles in gene expression.

NAM, ATAF, and CUC (NAC) transcription factors comprise a large protein family. Proteins of this family contain a highly conserved N-terminal DNA-binding domain and a variable C-terminal domain (Xie et al. 2000; Duval et al. 2002; Ernst et al. 2004; Olsen et al. 2005). NAC was originally derived from the names of three proteins, no apical meristem (NAM), ATAF1-2, and CUC2 (cup-shaped cotyledon), that contain a similar DNA-binding domain (Souer et al. 1996; Aida et al. 1997). NAC proteins appear to be widespread in land plants but no homolog has been identified thus far in other eukaryotes (Riechmann et al. 2000). The early reported NAC transcription factors are implicated in various aspects of plant development. A few examples are NAM from Petunia (Souer et al. 1996) and CUC1-2 (Aida et al. 1997) from Arabidopsis which have roles in controlling the formation of boundary cells of the meristem; NAP (Sablowski and Meyerowitz 1998) from Arabidopsis which acts as a target gene of AP3/PI and functions in the transition between cell division and cell expansion in stamens and petals; and AtNAC1 which mediates auxin signaling to promote lateral root development (Xie et al. 2000). Recently, a few NAC transcription factors were reported to play an essential role in regulating senescence, cell division, and wood formation (Ishida et al. 2000; Takada et al. 2001; Vroemen et al. 2003; Weir et al. 2004; Kubo et al. 2005; Kim et al. 2006; Zhong et al. 2006; Demura and Fukuda 2007; Ko et al. 2007; Mitsuda et al. 2007; Zhong et al. 2007).

NAM, ATAF, and CUC proteins were also found to participate in plant responses to pathogens, viral infections, and environmental stimuli (Xie et al. 1999; Ren et al. 2000; Collinge and Boller 2001; Kim et al. 2007). In Arabidopsis, three NAC genes, ANAC019, ANAC055, and ANAC072, were induced by drought, salinity, and/or low temperature (Tran et al. 2004), and the transgenic Arabidopsis plants overexpressing these genes showed improved stress tolerance compared to the wild type (Tran et al. 2004). Furthermore, proteins of these genes can bind to a cis-element containing CATGTG motif (Tran et al. 2004). Several NAC genes appeared to be hormone-inducible (Hoth et al. 2002; Xie et al. 2002; Greve et al. 2003). An Arabidopsis NAC gene AtNAC2 that can be induced by high salinity, abscisic acid, aminocyclopropane carboxylic acid, and naphthalene acetic acid has been predicted to be a downstream gene in the ethylene and auxin signal pathways (He et al. 2005). Overexpression of AtNAC2 resulted in altered lateral root development and enhanced salt tolerance (He et al. 2005). Recently, a stress-responsive NAC gene, SNAC1, was also characterized in rice (Hu et al. 2006). Overexpression of this gene in rice resulted in significantly increased stomata closure and drought resistance in drought-stressed field conditions while the photosynthesis rate and yield of transgenic plants was not affected under normal growth conditions (Hu et al. 2006). Another stress-responsive NAC gene, OsNAC6, which is a member of ATAF subfamily (Kikuchi et al. 2000; Ooka et al. 2003), has been reported to be induced by abiotic stresses and jasmonic acid treatment (Ohnishi et al. 2005), and overexpression of this gene in rice resulted in increased stress resistance (Nakashima et al. 2007). The tomato gene StNAC, which belongs to the ATAF subfamily, was induced by pathogen attack and wounding (Collinge and Boller 2001). In Brassica napus, nine members of the NAC family (BnNAC) were identified for their differential expression after flea beetle feeding and cold temperature treatment (Hegedus et al. 2003). A NAC gene, NAM-B1, conferring nutrient remobilization from leaves to developing grains was reported in ancestral wild wheat (Uauy et al. 2006), which further exemplified the functional diversity of NAC gene family.

Although quite a few NAC transcription factors have been characterized for their diverse functions, the functions of the majority of members in this family remain unknown. In the first classification of the NAC family, only eight cDNA genes (OsNAC1OsNAC8) in rice were included (Kikuchi et al. 2000). The second classification of this family was based on 75 cDNAs representing 55 non-redundant genes in rice and 105 putative genes in Arabidopsis (Ooka et al. 2003). With the completion of genome sequencing in rice, more than 100 members of the NAC family have been proposed for the rice genome (Xiong et al. 2005), but the basic information of this family in rice remains to be discovered. In this study, the number of members in the NAC family of rice was reanalyzed and 140 putative NAC or NAC-like genes were identified. In addition, 12 tissue-specific and 20 stress-responsive genes were identified in this family.

Experimental procedures

Mining and bioinformatic analysis of NAC family

Three methods were used to identify all putative NAC genes in rice. First, the keyword “NAC”, “no apical meristem”, or “NAM” was used as a query to search against the annotation database of rice genome (http://www.tigr.org/, release 5). Second, the conserved NAM DNA-binding domains of known NAC proteins were used to search (BLASTP program) against the predicted protein database of rice genome (http://www.tigr.org/, release 5) with a threshold E value less than 1E-5. Third, the HMM profile of the NAM domain in the Pfam database (http://pfam.sanger.ac.uk/) was used to search the annotated rice protein database. All hits with expected values less than 1.0 were collected. The nonredundant sequences resulted from these three methods were then compared with the NAC family in the rice transcription factor database (RiceTFDB, http://ricetfdb.bio.uni-potsdam.de/v2.1/) (Riano-Pachon et al. 2007) and the previously reported annotation of this family (Ooka et al. 2003). All non-redundant putative NAC protein sequences were manually checked for the NAM domain.

Multiple alignments of the full length protein sequences were performed with CLUSTALX (Thompson et al. 1997) and the conserved NAC domains were extracted for constructing phylogenetic tree with the program MrBayes version 3.0 (Ronquist and Huelsenbeck 2003). The HMM profile of the NAM domain was reconstructed with the HMMbuild program in the HMMER package version 2.1 (Eddy 1998). The program multiple EM for motif elicitation (MEME); (Bailey and Elkan 1994) was used to predict the potential motifs. All motifs discovered by MEME with expected values lower than 1E-30 were searched in the InterPro database with InterProScan (Mulder et al. 2005).

To investigate putative cis-acting regulatory DNA elements (cis-elements) in the promoter regions of the tissue-specific or stress-responsive genes, promoter sequences (500 bp regions upstream the 5′ end of the full-length cDNA or predicted CDS) of these genes were extracted from rice genomic sequences and searched against the promoter database PLACE 26.0 (http://www.dna.affrc.go.jp/PLACE/index.html) (Higo et al. 1999). The promoter analysis and calculation of significance were performed as described previously (Nemhauser et al. 2004).

Biochemical assay in yeast

Yeast one-hybrid assay was performed using the Matchmaker one-hybrid system (Clontech, Palo Alto, CA, USA). Two pHIS2-cis reporter vectors, containing the NAC binding sequences from promoters of OsERD1 (Tran et al. 2004) and a putative ornithine aminotransferase gene (LOC_Os03g44150, OsOAT), respectively, were constructed. Activation vectors were constructed by fusing the open reading frames of selected NAC genes with different motif compositions to the GAL4 activation domain in the vector pGADT7-Rec2. Reporter and activation vectors were co-transformed into yeast strain Y187 for verification of the DNA–protein interactions. Primers used for yeast one-hybrid constructs were listed in the Supplemental material online.

Plant growth and stress treatment

To identify tissue-specifically expressed genes, seeds of Minghui 63 were grown under normal conditions. Thirteen tissues representing major tissues or organs of rice in an entire life cycle were collected for real-time PCR analysis.

To verify the expression profiles of stress-responsive rice NAC genes identified from DNA chip expression profiling, seedlings of Minghui 63 were grown on sandy-soil (one-third paddy soil mixed with two-thirds river sand). At the four-leaf stage, the seedlings were treated with salt and cold stresses. For high salinity treatment, sodium chloride (NaCl) was added for a final concentration of about 200 mM. For cold stress, the seedlings were transferred to a growth chamber at 4°C with 12 h light/12 h dark for 5 days and then back to normal growth conditions for recovery. Drought stress was induced by stopping watering at about 2 weeks before flowering and leaves were sampled according to the degree of leaf-rolling.

Real-time PCR analysis

Gene-specific primers (Electronic supplementary material Tables 1 and 2) were designed for NAC genes showing tissue-specific or stress-responsive expression in DNA chip analysis. For real-time PCR analysis, first-strand cDNAs were synthesized from DNaseI-treated total RNA using SuperscripIII reverse transcriptase (Invitrogen, Carlsbad, CA, USA) according to the manufacture instructions. Real-time PCR was performed in an optical 96-well plate with an ABI PRISM 7500 real-time PCR system (Applied Biosystems, Foster City, CA, USA). Each reaction contained 12.5 μL 2× SYBR Premix® Ex Taq TM (TAKARA), 0.5 μL 50× ROX reference dye II, 1.0 μL cDNA samples, and 200 nM of gene-specific primer in a final volume of 25 μL. The thermal cycle used is as follows: 95°C for 10 s; 45 cycles of 95°C for 5 s, 60°C for 34 s. Rice Actin1 gene (Accession number X16280) was used as internal control with primers 5′-TGGCATCTCTCAGCACATTCC-3′ and 5′-TGCACAATGGATGGGTCAGA-3′. The relative expression levels were determined as described previously (Livak and Schmittgen 2001).

Results and discussion

Annotation update of NAC family in rice

By keyword (“NAC”, “NAM”) search in the rice annotation database (TIGR, release 5, http://www.tigr.org), a total of 118 gene loci with annotations containing one of the three words were browsed. BLASTP search of the predicted rice protein database with the conserved DNA-binding domain of known NAC proteins resulted in 156 gene loci that included those from keyword search. The predicted protein sequences that resulted from keyword and BLASTP searches were further checked with profile Hidden Molkov Model (pHMM) of the NAM domain to remove false sequences (Eddy 1998). Taken together, a total of 138 gene loci were predicted to encode putative NAC or NAC-like proteins (Table 1). Compared to the annotation of NAC family in RiceTFDB (http://ricetfdb.bio.uni-potsdam.de/v2.1/) in which 123 loci were claimed (Riano-Pachon et al. 2007), the 138 loci identified in this study include all the loci in the RiceTFDB database. Among the 75 cDNA clones of the rice NAC family (ONAC001-075 representing 56 loci) annotated by Ooka et al. (2003), all except two clones are included in the 138 loci. The cDNA sequences of the two exceptional clones, AK068153 and AK105493, cannot be mapped to the genomic sequence of rice, although the predicted protein sequences of the two cDNAs have high similarity (>60% in the NAM DNA-binding domain) to other NAC sequences. To keep the nomenclature of this family consistent, the names of reported genes and those of the previous nomenclature system (i.e., ONAC001-075) are kept in this updated nomenclature of NAC family in rice (named ONAC001-140, Table 1). For the reported genes of this family, the published names are referenced by priority and followed by a systematic name. The systematic names of the redundant cDNA clones in the previous annotation were replaced with newly identified members. All the newly identified members were named according to their sequential locations on the chromosomes.

Table 1 Annotation update and expression pattern of NAC family genes in rice

Generally, the 138 loci of NAC family are evenly distributed among the 12 chromosomes of rice. However, there are eight gene clusters each with two to four genes tandem or closely located in the genome including a cluster containing four genes on chromosome 11 (LOC_Os11g31330, LOC_Os11g31340, LOC_Os11g31360, and LOC_Os11g31380). The sequences of the clustered members are highly similar to each other, suggesting a contribution of gene duplication in the expansion of this family.

Phylogenetic analysis of NAC family

To reveal the evolutionary relationship of NAC genes, all the putative NAC protein sequences of rice were collected for phylogenetic analysis. In addition, seven published NAC proteins from other plants were included as reference sequences. NAC protein sequences. The result suggested that the rice NAC family can be classified into five groups (I–V) and each subfamily has been largely diversified (Fig. 1). Group I has 54 rice NAC members, and can be further classified into five subgroups: I-1 (OsNAC7), I-2 (NAC1), I-3 (NAM/CUC), I-4 (GRAB2), and I-5 (NAC2). All the reported NAC transcription factors related to development were classified into subgroups I-2, I-3, and I-4. Group II also contains a large number of members (54 rice NAC sequences) including four sequences (LOC_Os11g31330, LOC_Os11g31340, LOC_Os11g31360, and LOC_Os11g31380) that are tandem located in chromosome 11 as a gene cluster and can be classified into a few subgroups. However, the tree structure of this group is more complex than that of group I, and group II contains none of the published NAC sequences. Group III, containing 14 ONAC genes, is named stress-related NAC (SNAC) because all the published stress-related genes of NAC family fall in this group. These published genes include SNAC1 (Hu et al. 2006) and OsNAC6 (Nakashima et al. 2007) from rice; ANAC019, ANAC055, and ANAC072 from Arabidopsis (Tran et al. 2004); and StNAC from tomato (Collinge and Boller 2001). TERN (GenBank accession number: AB021178) identified from tobacco (Benson et al. 1997) is also located in this group. Group IV contains 14 NAC members from rice. Group V has two NAC members of rice as well as the reported NAC gene SENU5 identified from tobacco (John et al. 1997). The high sequence diversification of the NAC family suggest that the function of this family has also been diversified, which can be supported by the reported NAC members involved in diverse aspects of plant growth, development, and stress responses as reviewed in the Introduction.

Fig. 1
figure 1

Phylogenetic analysis of NAC protein sequences. Unrooted phylogenetic tree was derived from the program MrBayes version 3.0 (Ronquist and Huelsenbeck 2003) based on the conserved NAC domain of rice and the published NAC genes. The numbers at the branching sites indicated the posterior probability values for nodal support. The names of the reported NAC genes continued to be used in order to keep a consistency with the previous references, while the unpublished NAC genes in rice were designated according to their locus ID from The Institute for Genomic Research (TIGR http://www.tigr.org/)

To further reveal the diversification of NAC family in rice, putative motifs were predicted by the program MEME, and 36 putative motifs were predicted in the NAC family of rice (Table 2). Based on the composition of motifs, the NAC family of rice can be classified into 15 types (A–O, Fig. 2). Most NAC proteins (including 97 rice NAC members in types A–E) contain a complete NAC DNA-binding domain and a diversified C-terminal activation domain. In the previous annotation of this family (Ooka et al. 2003), five motifs were identified in the DNA-binding domain of a typical NAC protein. Our results showed that each of five motifs in the previous report has at least two variants for different subgroups of NAC members (Fig. 2). We noticed that the sequences of type J (three genes), K (6), L (8), M (3), and N (5) do not have all five motifs in the DNA-binding domain, while the sequences of type F (10), G (6), H (6), and I (3) have three to four motifs with locations matching that of the five motifs of the typical NAC DNA-binding domain but the motif sequences are completely different. Therefore, these members may be more properly called NAC-like proteins. In the transcriptional regulation region (TRR), at least ten motifs were identified (Fig. 2). Some motifs are present only in one or two sequences, and for simplicity of classification, these sequences were classified into type E (18 genes), which has a complete DNA-binding domain but a “Variable” TRR, or into type O (11 genes), which contains unclassifiable sequences based on motif composition.

Table 2 Motifs identified in NAC family genes in rice
Fig. 2
figure 2

Classification of NAC family in rice based on motif composition. Putative motifs shared among a subset of NAC protein sequences were resulted from the analysis with MEME program (see Materials and methods). Numbered boxes represent different putative motifs (annotations are listed in Table 2). The number of NAC genes belonging to each type was indicated in the bracket after the type. The majority of NAC family proteins have a typical NAC DNA-binding domain containing five kinds of motifs. In the type E, diverse motifs were identified in the activation domain and each was present only in one to two members, thus designated as “Variable” for simplicity. The members in the type O share little structural similarity with other NAC proteins. M motif; TRR transcriptional regulation region

In general, most of the closely related members in the phylogenetic tree have the same or very similar motif composition (Table 1). However, the classification based on motif composition does not completely match the phylogenetic classification. This is not very surprising since the phylogenetic tree is based on the alignments of DNA-binding domain sequences, whereas the classification of motif composition is based on the combination of different motifs. Although the functions of most of the motifs remain to be determined (most of them do not have homologous sequences in the InterPro database, http://www.ebi.ac.uk/interpro/, release 6.1), the motif composition of these unknown NAC (or NAC-like) sequences may provide clues for further function analysis of these genes.

Yeast one-hybrid assay of ONAC proteins

Our previous work showed that SNAC1 and SNAC2 can bind the NAC recognition site (NACRS)-like sequence in the promoter of OsERD1 (Hu et al. 2006; Hu et al. 2008). To determine whether other ONAC proteins with different motif compositions can bind the NAC DNA binding sequence, activation constructs (pGADT7-ONAC) containing the opening reading frames of 12 ONAC genes (representing almost all the motif composition types) the fused to the GAL4 activation domain were co-transformed with pHIS-cis construct (containing the NAC DNA binding sequences in the promoters of OsERD1 gene or OsOAT gene) into the yeast strain Y187. The result showed that all the co-transformants of pGADT7-ONAC and pHIS-cis, but not the negative control, could grow very well on the SD/Leu-/Trp-/His-medium with 30 mM 3-AT (Fig. 3), suggesting that these ONAC proteins with different motif compositions could bind the NAC DNA binding sequence in yeast.

Fig. 3
figure 3

Yeast one-hybrid assay of ONAC proteins with different motif compositions. a The pGAD-ONAC and the pHIS-cis reporter constructs used for co-transformation of yeast strain Y187. b The positive transformants were examined by growth performance on the SD/Leu/Trp plate and the SD/Leu/Trp/His plate containing 30 mM 3-AT. P positive control (p53HIS2 plus pGAD-Rec2-53); N negative control (p53HIS2 plus pGAD-ONAC); 1–4 four different colonies of the co-transformants

Identification of tissue-specifically expressed NAC genes in rice

Increasing evidence suggest tissue-specifically expressed transcription factors play critical roles in plant growth and development (Mitsuda et al. 2007; Yoo et al. 2007). Nevertheless, tissue-specific NAC transcription factor has not been reported in rice. A preliminary check of our whole-genome expression profile data of rice for different tissues suggested that 17 genes showed tissue-specific expression (i.e. expression signal was detected only in 1–3 tissues of more than 30 tissues being investigated) (Wang et al., unpublished data). Real-time PCR was then performed to confirm the tissue-specific expression patterns of 12 genes in 13 representative tissues from an entire life cycle of rice. The results suggested that all the 12 genes examined showed tissue-specific expression in only 1–2 tissues (Fig. 4). Among these tissue-specific genes, seven genes (LOC_01g29840, LOC_02g12310, LOC_05g34310, LOC_11g31340, LOC_11g31330, LOC_11g31360 and LOC_11g31380) are specifically expressed in immature endosperm, three genes (LOC_02g15340, LOC_06g33940 and LOC_11g03370) are specifically expressed in callus (including one that is callus and root specific), two genes (LOC_07g37920 and LOC_12g22630) are specifically expressed in stamen. These tissue-specific expressed genes may deserve special notice for further investigation of their function since quite a few members of the NAC family in plants have been proven to play important roles in regulating growth and development.

Fig. 4
figure 4

Real-time PCR of genes showing tissue-specific expression. x-axes are representative tissues and y-axes are scales of relative expression level. Thirteen representative tissues are as follows: Cal callus at 15 days after subculture; Sd1 young shoots (3 days after germination); Sd2 three-leaf seedlings; Lf1 leaf from plants with two tillers; Rt root from plants with two tillers; Lf2 leaf from plants with young panicle 4–5 cm in length; Sh sheath from plants with young panicle 4–5 cm in length; Pan young panicle 4–5 cm in length; Stm stem from plants at 5 days before flowering; Sta stamen at 1 day before flowering; Esp1 endosperm at 7 days after pollination (DAP); Esp2 endosperm at 14 DAP; Esp3 endosperm at 21 DAP. The bars are standard deviations of technical repeats

Expression profiles of the NAC family under different stresses

To obtain an overview of the expression level changes of the rice NAC family under stress conditions, stress/nonstress signal ratios of the ONAC genes were extracted from our microarray data of ‘Minghui 63’ for drought and salt stresses at tillering stage (Zhou et al. 2007). In the leaves of drought-stressed plants at tillering stage, the stress/nonstress signal ratios of 20 genes were higher than 2.0 [with the range of 2.0–21.40 (Electronic supplementary material Figure 1)]. In the leaves of salt-stressed plants at tillering stage, 21 genes showed induction ratios higher than 2.0 (with the range of 2.1–7.5), and 14 of them also showed induction ratios higher than 2.0 under drought stress (Electronic supplementary material Figure 1). Compared to the number of induced genes, only a few genes showed suppressed expression (three and two genes with stress/nonstress signal ratio less than 0.4 under drought and salinity stress, respectively). At whole-genomic level, about 7% genes in the rice genome were induced by drought or salt stresses (Zhou et al. 2007), whereas in the ONAC family, at least 15% (21/140) genes were induced by one of the two stresses. These results suggest a significant portion of the NAC family genes in rice are responsive to abiotic stresses. Interestingly, we noticed that most of the genes in the group III of the phylogenetic tree (12 of 14 rice genes) are responsive to at least one of the stresses, while other stress-responsive genes are random distributed in other phylogenetic groups.

It should be noted that the number of stress-responsive genes might be underestimated with a twofold expression level change because some genes are also significantly induced or suppressed by the stresses based on statistical analysis, even if their expression level changes are less than twofold. Here we still take twofold as a threshold to estimate the stress-responsive genes because more than 95% of the genes with twofold or higher expression changes can be confirmed by real-time PCR analysis. For example, we checked 20 genes with expression level changes higher than twofold in the microarray data by real-time PCR analysis, and the results showed that almost all of them can be confirmed for their stress-responsive expressions (Fig. 5). Among the 20 genes, 5 genes were induced by drought (LOC_Os05g34830, LOC_Os11g03300, LOC_Os01g48130, LOC_Os06g04090, and LOC_Os02g34970), 19 genes (all except LOC_Os06g04090) were induced by salt, and 16 genes (all except four genes LOC_Os12g29330, LOC_Os06g46270, LOC_Os05g34830, and LOC_Os12g41680) were induced by cold. A few genes showed a slight difference in drought induction patterns between real-time PCR and microarray analysis, which may be resulted mainly from different batches of stressed samples used for real-time PCR and microarray analysis (the degree of drought stress in field conditions is difficult to control to the same degree between replications at different times).

Fig. 5
figure 5

Real-time PCR analysis of stress-responsive NAC genes. Stress-inducible expression patterns of the stress-responsive NAC genes are represented. x-axes are time courses of abiotic stress treatments and y-axes are scales of relative expression level. D drought; S salt; C cold. For drought stress, rice leaves were sampled before stress (D0) and after stress at slight leaf-rolling (D1), moderate leaf-rolling (D2), and severe leaf-rolling (D3) stages. For salt and cold stress, seedlings were sampled at 0 (S0), 1 (S1), 6 (S2), and 12 h (S3) and 0 (C0), 1 (C1), 3 (C2), 10 h (C3) after treatment, respectively. The bars are standard deviations of technical repeats

To date, a number of transcription factor genes from different families such as DREB (Liu et al. 1998; Kasuga et al. 1999; Yamaguchi-Shinozaki and Shinozaki 2001; Haake et al. 2002), MYB (Abe et al. 1997), bZIP (Uno et al. 2000), and zinc finger (Mukhopadhyay et al. 2004) have been reported as having an effect on improving stress tolerance. Increasing evidence suggest that some members of the NAC family also contribute to abiotic stress tolerance in Arabidopsis (Fujita et al. 2004; Tran et al. 2004). So far, only two genes of the NAC family in rice have been functionally characterized for their roles in abiotic stress tolerance. One is a stress-responsive gene, SNAC1 (ONAC002), reported in our previous study (Hu et al. 2006). This gene is induced specifically in guard cells by drought stress, and overexpression of this gene in rice can promote stomata closure and significantly improve drought resistance under the field conditions. Another is OsNAC6, which is also induced by various stresses, and transgenic rice plants overexpressing this gene showed significantly improved tolerance of dehydration stresses (Nakashima et al. 2007). In this study, more than 20 genes of NAC family in rice were identified as responsive to drought, salinity, or cold stress. These stress-responsive genes have large diversity both in stress-induced expression patterns and protein structures (based on sequence/motif composition), suggesting that they may participate in the regulation of a wide spectrum of responses to different abiotic stresses. Since rice is one of the most important crops and the model monocot species, functional characterization of these genes has important significance in genetic improvement of rice for abiotic stress tolerance.

In silico cis-element analysis of OsNAC genes

To identify putative cis-acting regulatory DNA elements enriched in OsNAC genes, promoter sequences upstream the 5′ end of the full-length cDNA of OsNAC genes were extracted and subjected to search against the PLACE database. The statistical analysis showed that 17 cis-elements are enriched in rice NAC family genes (Table 3). Among these cis-elements, four elements (ARFAT, PYRIMIDINEBOXHVEPB1, PYRIMIDINEBOXOSRAMY1A, and WBBOXPCWRKY1) are known to be responsive to plant hormones (auxin, GA and SA). Four elements (BOXIINTPATPB, GT1CORE, IBOXCORE, and LTRE1HVBLT49) are responsive to environmental stimulus such as light and cold. Three elements (RYREPEATLEGUMINBOX, RYREPEATVFLEB4, and RYREPEATBNNAPA) are related to seed-specific expression.

Table 3 Putative cis-elements enriched in promoters of rice NAC genes

Numerous cis-elements have been reported for their essential roles in determining the tissue-specific or stress-induced expression patterns of the genes. In this study, 12 ONAC genes were identified to be expressed in specific tissues or organs (Fig. 4), and 20 genes were confirmed to be induced by different stresses including drought, salt, and cold treatments (Fig. 5). These results promoted us to inspect the relationship of the expression pattern and existence of putative tissue-specific or stress-responsive elements of these genes. Almost all the genes with tissue-specific expression patterns contain at least one of the cis-elements related to the corresponding expression pattern (ESM Table 1). All of the 20 stress-inducible genes contain at least one of the stress responsive cis-elements such as ABRE (ABA-responsive element), DRE (dehydration-responsive element), and LIRE (low-temperature-responsive element) (Yamaguchi-Shinozaki and Shinozaki 1994; Shinwari et al. 1998; Narusaka et al. 2003) (ESM Table 2). In general, the genes containing predicted stress-responsive cis-elements are actually induced by the corresponding stresses. However, this is not always the case. For example, six ONAC genes (LOC_Os05g34830, LOC_Os02g36880, LOC_Os01g48130, LOC_Os02g34970, LOC_Os07g37920, and LOC_Os12g41680) are not induced by cold based on the result of real-time PCR, but they actually contain at least one of the cold-responsive cis-elements in their promoter regions. Such discrepancy might be due to the sequence diversity in the promoter regions between different rice varieties since indica rice Minghui 63 was used for real-time PCR analysis while the promoter sequences analyzed were from japonica rice Nipponbare. This is true at least for three of the six genes (LOC_Os05g34830, LOC_Os02g36880 and LOC_Os01g48130) that are responsive to cold (>twofold) in japonica rice Zhonghua 11 in our previous microarray analysis (Hu et al. 2008). It is remarkable that the expressions of two ONAC genes (Os11g03370 and Os07g37920) are both tissue-specific and stress-responsive. The gene Os11g03370 is expressed specifically in callus and root and induced by salt and cold, while the gene Os07g37920 is stamen-specific and induced by drought and salt. Interestingly, both tissue-specific and stress-responsive cis-elements are presented in their promoter regions (ESM Tables 1 and 2).

In conclusion, this study has provided not only an updated annotation of the NAC family in rice, but also the identification of many tissue-specifically expressed or stress-responsive genes. These data provide a very useful reference as well as a starting point for revealing the function of NAC family genes in rice, especially for those genes involved in the regulation of growth and development at specific stages and stress tolerance.