Abstract
Recently, a new Arabidopsis thaliana master regulator of plant cell wall biosynthesis was characterized. It was named SHINE transcription factor (SHINE TF). This work searched for homologous genes in Eucalyptus grandis genome draft. RNAseq data, phylogeny analysis and qRT-PCR experiments were performed to complement SHINE gene analysis. By similarity searches using A. thaliana SHINE genes, four sequences were identified in Eucalyptus. Two of them contain all conserved motifs and characteristic features of this family, being assumed as true SHINE TFs and named EgrSHN1 and EgrSHN2. The other two sequences contain an incomplete ‘mm’ motif and were not considered true SHINE TFs, being further referred as Egr33m and Egr40m. Expression analysis revealed that EgrSHN1 is more expressed in flowers than in leaves and immature xylem, and both EgrSHN1 and EgrSHN2 are absent from adult xylem RNAseq libraries. This expression profile is similar to A. thaliana orthologues. On the other hand, Egr33m and Egr40m expression was detected in adult xylems. The phylogenetic studies indicate that both EgrSHNs were originated by gene duplication events which, together with gene loss, are hypothesized as common events in SHINE evolution. In conclusion, it is possible that the overexpression of SHINE genes in Eucalyptus xylem can generate information about wood formation processes, allowing an effective increase in forest plantation productivity.
Avoid common mistakes on your manuscript.
Introduction
Eucalyptus wood is a competitive source for paper manufacture and bioenergy production (Grattapaglia and Kirst 2008). It is composed by lignin and hemicelluloses in similar amounts, about 25 % each (dry weight), while cellulose composes the remaining 50 % (Ramírez et al. 2009). Only a few Eucalyptus species attends the wood characteristics desirable in the biomass industry. In this context, adjusting the parameters of cell wall composition to the industrial demands is crucial to increase the process efficiency. As a major obstacle to accomplish it is the fact that the genetic control mechanisms behind secondary cell wall formation are still unclear (Zhou et al. 2009).
In the last years, some transcription factors (TFs) related to this process were discovered and characterized in Arabidopsis thaliana (Zhong et al. 2006; Demura and Fukuda 2007; Mitsuda et al. 2007) and in Eucalyptus (Goicoechea et al. 2005; Legay et al. 2007). Recently, a new TF assumed the position of the master regulator of plant cell wall biosynthesis. Ambavaram et al. (2010) demonstrated that SHINE/WAX INDUCER (SHN/WIN) clade of TFs coordinate cell wall components deposition by directly regulating a vast range of TFs (including MYB and NAC TFs). By this way, SHINE TFs regulate the accumulation of cellulose, lignin and cutin (the top three plant biomass polymers) in plant protective layers, such as those formed during tissue strengthening, abscission, dehiscence and wounding (Aharoni et al. 2004; Ambavaram et al. 2010; Shi et al. 2011).
There are three SHINE genes in A. thaliana, named AtSHN1, AtSHN2 and AtSHN3. They belong to the ERF-B6 (Ethylene Responsive Factor-B6) clade, which is a sub-group from the AP2/EREBP (APETALA 2/ethylene response element binding protein) TFs family (Dietz et al. 2010). All AP2/EREBP genes contain at least one conserved motif AP2 DNA Binding Domain. Additionally, SHINE genes contain more other two exclusive motifs: “mm” (middle domain, with approximately 61 amino acids) and “cm” (C-terminal domain, containing approximately 10 aminoacids). Another specific SHINE characteristic is the presence of just one intron positioned about 80 bp from the start codon (Aharoni et al. 2004).
In A. thaliana, there is another gene that contains the SHINE specific domains (At5g25190). However, overexpression of this gene did not result in typical morphological SHINE phenotype, most likely due to the presence of an incomplete “mm” motif in that gene (Aharoni et al. 2004).
Expression analysis using fusions of SHINE genes with promoter-β-glucuronidase provided evidences for their functional role. The AtSHN1 is strongly expressed in flowering organs that go through abscission and it is low expressed in leaves, stem and silique (Broun et al. 2004). In the case of AtSHN2, the expression is highly precise and temporary coordinated in anther and silique dehiscence zones. On the other hand, AtSHN3 is constitutively expressed in all plant tissues (Aharoni et al. 2004).
In a recent work, Ambavaram et al. (2010) demonstrated that the heterologous expression of AtSHN2 in rice (Oryza sativa) caused a 34 % increase in cellulose and 45 % decrease in lignin content without prejudice plant strength and development. Moreover, lignin composition was also altered in SHINE transgenic plants, leading to improved digestibility.
In this context, due to the economic potential of SHINE TFs in promoting an increase of wood productivity, this work searched for Eucalyptus homologous genes by similarity searches using E. grandis EST and genome data. Results were complemented by the expression analysis of the candidate genes, in order to investigate their putative role in cell wall biosynthesis.
Materials and methods
Bioinformatics analysis
Putative transcription factors of SHINE family were searched in Eucalyptus ssp. transcriptome database “Eucspresso” (Mizrachi et al. 2010) and E. grandis ESTs database generate by Genolyptus Project (http://www.lge.ibi.unicamp.br/eucalyptus/). Databases were queried using the three A. thaliana SHINE protein sequences public available (AtSHN1, gi: 28950720; AtSHN2, gi: 48479321; AtSHN3, gi: 28973112) and tBlastn algorithm (Altschul et al. 1997). Putative Eucalyptus SHINE EST sequences were anchored in the Eucalyptus grandis genome to fill gaps and define the entire coding sequences based on gene prediction produced during the Genome Project (Eucalyptus grandis Genome Project 2010, unpublished—http:://www.phytozome.net/eucalyptus). The complete sequences can be accessed on http://www.phytozome.net/cgi-bin/gbrowse/eucalyptus/ using the queries Eucgr.C04221.1, Eucgr.C01178.1, Eucgr.C02719.1 or Eucgr.F03947.1 for EgrSHN1, EgrSHN2, Egr33m or Egr40m, respectively.
Phylogenetic analysis
Multiple sequence alignments were carried on using ClustalW2.0 with default settings (Larkin et al. 2007). SHINE homologous gene sequences from A. thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa were obtained in NCBI databases using tBlastn and AtSHNs sequences as query. The amino acid sequences of all species were aligned to Eucalyptus sequences using Muscle tool (Edgar 2004) from MEGA5 software (Tamura et al. 2011). The phylogenetic trees were constructed using PhyML 3.0 program, available online at http://www.atgc-montpellier.fr/phyml/ (Guindon et al. 2010). It was used the Maximum Likelihood method and Jones-Taylor-Thornton (JTT) model (Jones et al. 1992) and a gamma distribution with 4 discrete categories. This model was chosen by Akaike information criterion available at MEGA5 software. Bootstrap values were calculated with 1,000 replicates.
Gene expression analysis
Gene expression was investigated by the exploration of the RNAseq database generated from xylem tissue from three Eucalyptus species: E. grandis, E. globulus and E. urophylla, as described in Salazar et al. (submitted).
Gene expression was also investigated by qReal Time-PCR using the mRNA obtained from immature xylem, flower and leaves from 5-years old Eucalyptus urograndis trees (n = 3). Total RNA was extracted as described by Zeng and Yang (2002). After cDNA synthesis (SuperScript™ II, Invitrogen), RT-qPCR was performed using the following primers: EgrSHN1 (F, 5′-GCAATCAAAGAAGTTCAGAGGAG-3, R, 5′-AAGGTGCCAAGCCAGACTCG-3′); EgrSHN2 (F, 5′-CCGAAATTCGCCATCCTCTG-3′, R, 5′-ATCAAGATGGCGGCCTGGTC-3′); housekeeping gene Histone H2B (F, 5′-GAGCGTGGAGACGTACAAGA-3′, R, 5′-GGCGAGTTTCTCGAAGATGT-3′). Data generated were analyzed accordingly to Pfaffl (2001).
Results and discussion
Gene discovery and bioinformatic analysis
The analysis using A. thaliana SHINE proteins to identify homologous genes in Eucalyptus genome indicated the existence of four putative Eucalyptus SHINE ESTs. These sequences were anchored in E. grandis genome to obtain the complete genes, using the database generated by Zander Myburg and colleagues (Eucalyptus grandis Genome Project 2010, http://www.phytozome.net/eucalyptus).
From the four genes, only two presented all the features required to classify them as true SHINE genes: a single intron located 80 bp (EgrSHN1) or 86 bp (EgrSHN2) from the start codon, an AP2 DNA binding domain and “mm” (middle domain) and “cm” (C-terminal domain). They were further given the nomenclature EgrSHN1 and EgrSHN2, following the respective similarity to AtSHN genes (Fig. 1).
The amino acid sequence similarity over the “mm domain” from both EgrSHNs and AtSHNs was high enough to corroborate on their identification as SHINE genes. While the minimum “mm domain” identity level between SHINE members is approximately 60 % (Aharoni et al. 2009), the similarity between EgrSHNs and AtSHNs is above 72 %. The similarity between EgrSHNs and AtSHNs is also elevated (56 %) when analyzing the full length protein sequence.
The other two Eucalyptus genes identified also presents the typical SHINE characteristics but with an exception: “mm” domain is not complete (Fig. 1). Phylogenetic analysis of these transcripts (Fig. 2a) reveals that both sequences are positioned out of ‘SHINE’ clade; instead, they are probable orthologs to At5g25190, which also presents all SHINE specific domains but lacks part of the “mm” motif (Dietz et al. 2010).
Phylogenetic analysis of EgrSHN1 and EgrSHN2 genes indicated that, when comparing both sequences to those identified as SHINE genes in other species (A. thaliana, Populus trichocarpa, Vitis vinifera and Oryza sativa), all of them grouped in the same clade (Fig. 2a, branch identified as SHINE clade and detailed in Fig. 2b). Interestingly, since the phylogeny was constructed using the sequences from each species after tBlastn alignments using the well-annotated AtSHN genes as query, and not by the use of keyword searches in the public database, thus at least in one case we retrieved results indicating wrong gene annotation.
In P. trichocarpa, the most similar sequences to SHINE proteins were those identified as DREB proteins (PtDREB27, PtDREB28, PtDREB29, PtDREB30 and PtDREB31), which is another subgroup from the AP2/EREBP family (Tuskan et al. 2006; Zhuang et al. 2008). According the phylogeny results, however, these sequences group in the SHINE clade, indicating a possible error in gene annotation (Fig. 2b). Some further analysis on these sequences demonstrates that P. trichocarpa genes are evolutionarily closer to AtERF-B6 genes than to AtDREB-A2 (Supplementary Fig. 1) and contain all typical SHINE specific motifs (Supplementary Fig. 2). Based on these evidences, our work suggests that PtDREB27, PtDREB28, PtDREB29, PtDREB30 and PtDREB31 sequences can be re-classified as PtSHNs.
Other result from phylogenetic analysis provides strong evidences that Eucalyptus EgrSHNs evolved from the same ancestor sequence by a duplication event (Fig. 2b). According to our analysis, this kind of gene evolution seems to be common in SHINE family, as for example AtSHN2 and AtSHN3, PtSHN30 and PtSHN31, OsSHN1 and OsSHN2. It is also possible to infer that some gene losses events probably occurred during SHINE evolution. For example, V. vinifera has an orthologous gene to PtDREB27 and PtDREB28, which indicates it was present in plant evolution before E. grandis speciation. Thus, since E. grandis does not have its correspondent orthologous gene, it can be concluded that it was probably lost. This ‘gene loss’ tendency is not surprising since SHINE paralogous develops redundant functions (Shi et al. 2011).
Gene expression results
Expression analysis was performed evaluating the RNAseq database generated from xylem of E. grandis, E. globulus and E. urophylla (Salazar et al. manuscript in submission). Results reveal the absence of EgrSHN1 or EgrSHN2 transcripts in the xylem of these three species, which is similar to the expression observed for AtSHN1 and AtSHN2 genes (Broun et al. 2004). Accordingly, qRT-PCR results indicated that EgrSHN1 expression is similar to AtSHN1 transcription in A. thaliana, i.e. higher expression in flower than in leaves and immature xylem (Fig. 3a) (Broun et al. 2004). In the case of EgrSHN2 gene, qRT-PCR results were inconclusive (data not shown).
The expression of Egr33m and Egr40m genes was also investigated in RNAseq database. At the contrary of EgrSHNs, both transcripts were detected in xylem of the three Eucalyptus species (Salazar et al. manuscript in submission). Besides, in an RNAseq database generated by other work in our group (Lepikson-Neto et al. manuscript in preparation), it was observed that Egr33m and Egr40m are inhibited in presence of flavonoids narigenin-chalcone and narigenin (Fig. 3b). Since flavonoid supplementation is proved as an efficient way to decrease wood lignification (Besseau et al. 2007), we hypothesize that both genes might be involved in lignin deposition.
In our work we did not perform qRT-PCR of Egr33m and Egr40m genes, since we focused in the characterization of the genes considered true SHN TF Eucalyptus genes.
As conclusion of the present work, two Eucalyptus SHINE genes were identified (EgrSHN1 and EgrSHN2), as well as two close sequences (Egr33m and Egr40m). Phylogenetic analysis indicated that the two Eucalyptus EgrSHNs evolved from the same ancestor sequence by a duplication event. Expression similarities between AtSHNs and EgrSHNs allow concluding that Eucalyptus SHINEs might actually develop SHINE functions. Additionally, Egr33m and Egr40m also might participate in cell wall biosynthesis once there expression is altered under a lignification inhibitory treatment, i.e. Eucalyptus seedlings flavonoid supplementation. The identification of SHINE transcription factors in Eucalyptus can generate information about wood formation processes allowing the increase in forest plantation productivity.
References
Aharoni A, Dixit S, Jetter R, Thoenes E, Arkel GV, Pereira A (2004) The SHINE clade of AP2 domain transcription factors activates wax biosynthesis, alters cuticle properties, and confers drought tolerance when overexpressed in Arabidopsis. Plant cell 16:2463–2480. doi:10.1105/tpc.104.022897
Aharoni A, Dixit C, Pereira A (2009) Shine clade of transcription factors and their use. United States Patent Application Publication No US20090300790. http://www.faqs.org/patents/app/20090300790#b. Accessed 3 Mar 2012
Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl Acids Res 25:3389–3402
Ambavaram MMR, Krishnan A, Trijatmiko KR, Pereira A (2010) Coordinated activation of cellulose and repression of lignin biosynthesis pathways in rice. Plant Physiol 155:916–931. doi:10.1104/pp.110.168641
Besseau S, Hoffmann L, Geoffroy P, Lapierre C, Pollet B, Legrand M (2007) Flavonoid accumulation in Arabidopsis repressed in lignin synthesis affects auxin transport and plant growth. Plant Cell 19:148–162. doi:10.1105/tpc.106.044495
Broun P, Poindexter P, Osborne E, Jiang CZ, Riechmann JL (2004) WIN1, a transcriptional activator of epidermal wax accumulation in Arabidopsis. Proc Natl Acad Sci 101:4706–4710. doi:10.1073/pnas.0305574101
Demura T, Fukuda H (2007) Transcriptional regulation in wood formation. Trends Plant Sci 12(2):64–70. doi:10.1016/j.tplants.2006.12.006
Dietz K-J, Vogel MO, Viehhauser A (2010) AP2/EREBP transcription factors are part of gene regulatory networks and integrate metabolic, hormonal and environmental signals in stress acclimation and retrograde signalling. Protoplasma 245:3–14. doi:10.1007/s00709-010-0142-8
Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucl Acids Res 32(5):1792–1797. doi:10.1093/nar/gkh340
Goicoechea M, Lacombe E, Legay S, Mihaljevic S, Rech P, Jauneau A, Lapierre C, Pollet B, Verhaegen D, Chaubet-Gigot N, Grima-Pettenati J (2005) EgMYB2, a new transcriptional activator from Eucalyptus xylem, regulates secondary cell wall formation and lignin biosynthesis. Plant J 43:553–567. doi:10.1111/j.1365-313X.2005.02480.x
Grattapaglia D, Kirst M (2008) Eucalyptus applied genomics: from gene sequences to breeding tools. New Phytol 179:911–929. doi:10.1111/j.1469-8137.2008.02503.x
Guindon S, Dufayard JF, Lefort V, Anisimova M, Hordijk W, Gascuel O (2010) New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of phyml 3.0. Syst Biol 59(3):307–321. doi:10.1093/sysbio/syq010
Jones DT, Taylor WR, Thornton JM (1992) The rapid generation of mutation data matrices from protein sequences. Comput Appl Biosci 8:275–282. doi:10.1093/bioinformatics/8.3.275
Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG (2007) Clustal W and Clustal X version 2.0. Bioinformatics 23:2947–2948. doi:10.1093/bioinformatics/btm404
Legay S, Lacombe E, Goicoechea M, Briere C, Seguin A, Mackay J, Grimapettenati J (2007) EgMYB1, an R2R3 MYB transcription factor from eucalyptus negatively regulates secondary cell wall formation in Arabidopsis and poplar. Plant Sci 173:542–549. doi:10.1111/j.1469-8137.2010.03432.x
Lepikson-Neto J, Camargo ELO, Salazar MM, Nascimento LC, Carazzolle, MF, Teixeira PJ, Marques WL, Deckmann AC, Pereira GAG. Influence of flavonoid supplementation on Eucalyptus urograndis gene expression (in preparation)
Mitsuda N, Iwase A, Yamamoto H, Yoshida M, Seki M, Shinozaki K, Ohme-Takagi M (2007) NAC transcription factors, NST1 and NST3, are key regulators of the formation of secondary walls in woody tissues of Arabidopsis. Plant Cell 19:270–280. doi:10.1105/tpc.106.047043
Mizrachi E, Hefer CA, Ranik M, Joubert F, Myburg AA (2010) De novo assembled expressed gene catalog of a fast-growing Eucalyptus tree produced by Illumina mRNA-Seq. BMC Genomics 11:681. doi:10.1186/1471-2164-11-681
Pfaffl MW (2001) A new mathematical model for relative quantification in real-time RT-PCR. Nucleic Acids Res 29:2002–2007
Ramírez M, Rodríguez J, Balocchi J, Peredo M, Elissetche JP, Mendonça R, Valenzuela S (2009) Chemical composition and wood anatomy of Eucalyptus globulus clones: variations and relationships with pulpability and hands heet properties. J Wood Chem Tech 29:43–58. doi:10.1080/02773810802607559
Salazar MM, Nascimento LC, Camargo ELO, Gonçalves DC, Lepikson Neto J, Marques WL, Teixeira PJPL, Mondego JMC, Carazzole MF, Deckmann AC, Pereira GAG. Xylem transcription profiles indicate potential metabolic responses for economically relevant characteristics of Eucalyptus species (in submission)
Shi JX, Malitsky S, De Oliveira S, Branigan C, Franke RB, Schreiber L, Aharoni A (2011) SHINE transcription factors act redundantly to pattern the archetypal surface of arabidopsis flower organs. PLoS Genet 7:e1001388. doi:10.1371/journal.pgen.1001388
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. doi:10.1093/molbev/msr121
Tuskan G, Difazio S, Jansson S et al (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray). Science 313:1596–1604. doi:10.1126/science.1128691
Zeng Y, Yang T (2002) RNA isolation from highly viscous samples rich in polyphenols and polysaccharides. Plant Mol Biol Rep 20:417a–417e. doi:10.1007/BF02772130
Zhong R, Demura T, Ye ZH (2006) SND1, a NAC domain transcription factor, is a key regulator of secondary wall synthesis in fibers of Arabidopsis. Plant Cell 18:3158–3170. doi:10.1105/tpc.106.047399
Zhou J, Lee C, Zhong R, Ye ZH (2009) MYB58 and MYB63 are transcriptional activators of the lignin biosynthetic pathway during secondary cell wall formation in Arabidopsis. Plant Cell 21:248–266. doi:10.1105/tpc.108.063321
Zhuang J, Cai B, Peng R-H, Zhu B, Jin X-F, Xue Y, Gao F, Fu X-Y, Tian Y-S, Zhao W, Qiao Y-S, Zhang Z, Xiong A-S, Yao Q-H (2008) Genome-wide analysis of the AP2/ERF gene family in Populus trichocarpa. Biochem Biophys Res Commun 371:468–474. doi:10.1016/j.bbrc.2008.04.087
Acknowledgments
This work was supported by International Paper do Brazil Ltda., which offered financial contribution and supplied this research with plant materials. The authors wish to thank Professor Andy Pereira, Ph.D (University of Arkansas; Virginia Bioinformatics Institute, Virginia Polytechnic Institute and State University) due to his contribution in experimental design; Dr. Jorge M. Mondego (Centro de Pesquisa e Desenvolvimento em Recursos Genéticos Vegetais, Instituto Agronômico de Campinas) and Dr. Ana C. Deckmann (Faculdade de Engenharia Química, Universidade Estadual de Campinas) for their valuable collaboration in the revision of this work. The four EgrSHNs sequence data were verified and completed using the unpublished Eucalyptus grandis genomic data available in “Eucalyptus grandis Genome Project 2010” (http://www.phytozome.net/eucalyptus), with the permission of Dr. Zander Myburg and his colleagues following the instructions detailed at http://www.phytozome.net/eucalyptus.php.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Marques, W.L., Salazar, M.M., Camargo, E.L.O. et al. Identification of four Eucalyptus genes potentially involved in cell wall biosynthesis and evolutionarily related to SHINE transcription factors. Plant Growth Regul 69, 203–208 (2013). https://doi.org/10.1007/s10725-012-9754-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10725-012-9754-7