Abstract
A computational analysis of pine transcripts was conducted to contribute to the functional annotation of conifer sequences. A statistical analysis of expressed sequential tags(ESTs) belonging the 7732 contigs in the TIGR Pinus Gene Index (PGI1.0) identified 260 differentially represented gene sequences across six cDNA libraries from loblolly pine secondary xylem. Cluster analysis of this subset of contigs resulted in five groups representing genes preferentially represented in one of the xylem samples (compression wood, plannings, root xylem, latewood) and one group containing mostly genes simultaneously present in compression and side wood libraries. To complement the sequence annotation, 27 cDNA clones representing selected transcripts were completely sequenced. Several genes were identified that could represent putative markers for xylem from different organs, at different stages of development. Several sequences encoding regulatory proteins were over-represented in root xylem as opposed to the other xylem samples. Some of them belonged to known families of plant transcription factors, but two genes were previously uncharacterized in plants. One transcript was homologous to the gene encoding the Smad4 interacting factor, a key co-activator in TGFβ (transforming growth factor) signalling in animals. Thus, the digital analysis of pine ESTs highlighted a putative gene function of potentially broad interest but that has yet to be investigated in plants. More generally, this study showed that the application of numerical approaches to EST databases should be helpful in establishing priorities among genes to consider for targeted functional studies. Thus, we illustrated the potential of extracting information from conifer sequences already accessible through well-structured public databases.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
S.M. Ahern-Djamali A.R. Comer C. Bachmann A.S. Kastenmeier S.K. Reddy M.C. Beckerle U. Walter F.M. Hoffmann (1998) ArticleTitleMutations in Drosophila enabled and rescue by human vasodilator-stimulated phosphoprotein (VASP) indicate important functional roles for Ena/VASP homology domain 1 (EVH1) and EVH2 domains Mol. Biol. Cell 9 2157–2171
I. Allona M. Quinn E. Shoop K. Swope S. St Cyr J. Carlis J. Riedl E. Retzel M. Campbell R. Sederoff R.W. Whetten (1998) ArticleTitleAnalysis of xylem formation in pine by cDNA sequencing Proc. Natl. Acad. Sci. USA 95 9693–9698
S.F. Altschul T.L. Maden A.A. Schaffer J. Zhang Z. Zhang W. Miller D.J. Lipman (1997) ArticleTitleGapped BLAST and PSI-BLAST: a new generation of protein database search programs Nucleic Acids Res. 25 3389–3402 Occurrence Handle10.1093/nar/25.17.3389 Occurrence Handle1:CAS:528:DyaK2sXlvFyhu7w%3D Occurrence Handle9254694
S. Audic J.M. Claverie (1997) ArticleTitleThe significance of digital gene expression profiles Genome Res. 7 986–995
R.Y. Bai C. Koester T. Ouyang S.A. Hahn M. Hammerschmidt C. Peschel J. Duyster (2002) ArticleTitleSMIF, a Smad4-interacting protein that functions as a co-activator in TGFβ signalling Nature Cell Biol. 4 181–190
L.J. Ball T. Jarchau H. Oschkinat U. Walter (2002) ArticleTitleEVH1 domains: structure, function and interactions FEBS Lett. 19 45–52
F. Bedon S. Blais V. Roy H. Bérubé M.J. Morency J. Mackay (2004) Characterization, expression and phylogeny of xylem expressed R2R3-MYB genes of spruce and pine trees Plant & Animal Genomes XII Conference San Diego, CA
S. Ben-Yaacov R. Le Borgne I. Abramson F. Schweisguth E.D. Schejter (2001) ArticleTitleWasp, the Drosophila Wiskott–Aldrich syndrome gene homologue, is required for cell fate decisions mediated by Notch signalling J. Cell Biol 152 1–13
M. Bonke S. Thitamadee A.P. Mahonen M.T. Hauser Y. Helariutta (2003) ArticleTitleAPL regulates vascular tissue identity in Arabidopsis Nature 426 181–186
S. Bortoluzzi F. d’Alessi C. Romualdi G.A. Danieli (2001) ArticleTitleDifferential expression of genes coding for ribosomal proteins in different human tissues Bioinformatics 17 1152–1157
S. Bortoluzzi F. d’Alessi G.A. Danieli (2000) ArticleTitleA computational reconstruction of the adult human heart transcriptional profile J. Mol. Cell Cardiol 32 1931–1938
S. Bortoluzzi G.A. Danieli (1999) ArticleTitleTowards an in silico analysis of transcription patterns Trends Genet 15 IssueID3 118–119
E.D. Brenner D.W. Stevenson R.W. McCombie M.S. Katari S.A. Rudd K.F. Mayer P.M. Palenchar S.J. Runko R.W. Twigg G. Dai et al. (2003) ArticleTitleExpressed sequence tag analysis in Cycas, the most primitive living seed plant Genome Biol. 4 R78
I. Callebaut (2002) ArticleTitleAn EVH1/WH1 domain as a key actor in TGFbeta signalling FEBS Lett 22 178–180
S. Chang J. Pureyear J. Cairney (1993) ArticleTitleA simple and efficient method for isolating RNA from pine trees Plant Mol. Biol. Rep. 11 113–116
J.M. Claverie (1999) ArticleTitleComputational methods for the identification of differential and coordinated gene expression Hum. Mol. Genet. 8 1821–1832
C. Dubos C. Plomion (2003) ArticleTitleIdentification of water-deficit responsive genes in maritime pine (Pinus pinaster Ait.) roots Plant Mol. Biol. 51 249–262
M.B. Eisen P.T. Spellman P.O. Brown D. Botstein (1998) ArticleTitleCluster analysis and display of genome-wide expression patterns Proc. Natl. Acad. Sci. USA 95 14863–14868
R.M. Ewing A.B. Kahla O. Poirot F. Lopez S. Audic J.M. Claverie (1999) ArticleTitleLarge-scale statistical analyses of rice ESTs reveal correlated patterns of gene expression Genome Res. 9 950–959
Felsenstein J. (1993). PHYLIP (Phylogeny Inference Package) version 3.6a2. Distributed by the author. Department of Genetics, University of Washington, Seattle.
R. Florin (1963) ArticleTitleThe distribution of conifer and taxad genera in time and space Acta Horti. Bergiani 20 121–312
C. Geourjon G. Deleage (1994) ArticleTitleSOPM: a self-optimized method for protein secondary structure prediction Protein Eng. 7 157–164
L.D. Greller F.L. Tobin (1999) ArticleTitleDetecting selective expression of genes and proteins Genome Res 9 282–296
M. Hertzberg H. Aspeborg J. Schrader A. Andersson R. Erlandsson K. Blomqvist R. Bhalerao M. Uhlen T.T. Teeri J. Lundeberg et al. (2001) ArticleTitleA transcriptional roadmap to wood formation Proc. Natl. Acad. Sci. USA 98 14732–14737
W.M. Karlowski A.M. Hirsch (2003) ArticleTitleThe over-expression of an alfalfa RING-H2 gene induces pleiotropic effects on plant growth and development Plant Mol. Biol. 52 121–133
M. Kirst A.F. Johnson C. Baucom E. Ulrich K. Hubbard R. Staggs C. Paule E. Retzel R. Whetten R. Sederoff (2003) ArticleTitleApparent homology of expressed genes from wood-forming tissues of loblolly pine (Pinus taeda L.) with Arabidopsis thaliana Proc. Natl. Acad. Sci. USA 100 7383–7388
Kosarev, P., Mayer, K.F. and Hardtke, C.S. 2002. Evaluation and classification of RING-finger domains encoded by the Arabidopsis genome. Genome Biol. 3, research0016.1-0016.12.
H.D. Kranz M. Denekamp R. Greco H. Jin A. Levya R.C. Meissner K. Petroni A. Urzainqui M. Bevan C. Martin et al. (1998) ArticleTitleTowards functional characterisation of the members of the R2R3-MYB gene family from Arabidopsis thaliana Plant J. 16 263–276
J. Mata J. Bahler (2003) ArticleTitleCorrelations between gene expression and gene conservation in fission yeast Genome Res. 13 2686–2690
G.J. McDougall (2000) ArticleTitleA comparison of proteins from the developing xylem of compression and non-compression wood of branches of Sitka spruce (Picea sitchensis) reveals a differentially expressed laccase J. Exp. Bot. 51 1395–1401
Megy, K., Audic, S. and Claverie, J.M. 2002. Heart-specific genes revealed by expressed sequence tag (EST) sampling. Genome Biol. 16: 3(9).
E.J. Mellerowicz M. Baucher B. Sundberg W. Boerjan (2001) ArticleTitleUnravelling cell wall formation in the woody dicot stem Plant Mol. Biol. 47 239–274
I.S. Mercy R.B. Meeley S.E. Nichols O.A. Olsen (2003) ArticleTitleZea mays ZmMybst1 cDNA, encodes a single Myb-repeat protein with the VASHAQKYF motif J. Exp. Bot. 54 1117–1119
Y. Nagano H. Furuhashi T. Inaba Y. Sasaki (2001) ArticleTitleA novel class of plant-specific zinc-dependent DNA-binding protein that binds to A/T-rich DNA sequences Nucleic Acids Res. 29 4097–4105
C. Notredame D.G. Higgins J. Heringa (2000) ArticleTitleT-Coffee: A novel method for fast and accurate multiple sequence alignment J. Mol. Biol. 302 205–217
Y. Ogihara K. Mochida Y. Nemoto K. Murai Y. Yamazaki I.T. Shin Y. Kohara (2003) ArticleTitleCorrelated clustering and virtual display of gene expression patterns in the wheat life cycle by large-scale statistical analyses of expressed sequence tags Plant J. 33 1001–1011
S.G. Olivier (1996) ArticleTitleFrom DNA sequence to biological function Nature 379 597–600
A. Patzlaff S. McInnis A. Courtenay C. Surman L.J. Newman C. Smith M.W. Bevan S. Mansfield R.W. Whetten R.R. Sederoff M.M. Campbell (2003) ArticleTitleCharacterization of a pine MYB that regulates lignification Plant J. 36 743–754
J. Quackenbush F. Liang I. Holt G. Pertea J. Upton (2000) ArticleTitleThe TIGR Gene Indices: reconstruction and representation of expressed gene sequences Nucleic Acids Res. 28 141–145
C. Romualdi S. Bortoluzzi G.A. Danieli (2001) ArticleTitleDetecting differentially expressed genes in multiple tag sampling experiments: comparative evaluation of statistical tests Hum. Mol. Genet. 10 2133–2141
C.M. Ronning S.S. Stegalkina R.A. Ascenzi O. Bougri A.L. Hart T.R. Utterbach S.E. Vanaken S.B. Riedmuller J.A. White J. Cho et al. (2003) ArticleTitleComparative analyses of potato expressed sequence tag libraries Plant Physiol. 131 419–429
B. Rost C. Sander (1993) ArticleTitlePrediction of protein secondary structure at better than 70 accuracy J. Mol. Biol 232 584–599
S. Rozen H.J. Skaletsky (2000) Primer3 on the WWW for general users and for biologist programmers S. Krawetz S. Misener (Eds) Bioinformatics Methods and Protocols: Methods in Molecular Biology. Humana Press Totowa, NJ 365–386
L. Savard P. Li S.H. Strauss M.W. Chase M. Michaud J. Bousquet (1994) ArticleTitleChloroplast and nuclear gene sequences indicate late Pennsylvanian time for the last common ancestor of extant seed plants Proc. Natl. Acad. Sci. USA 91 5163–5167
G. Stekel Y. Git F. Falciani (2000) ArticleTitleThe comparison of gene expression from multiple cDNA libraries Genome Res. 10 2055–2061
F. Sterky S. Regan J. Karlsson M. Hertzberg A. Rohde A. Holmberg B. Amini R. Bhalerao M. Larsson R. Villarroel et al. (1998) ArticleTitleGene discovery in the wood-forming tissues of poplar: analysis of 5,692 expressed sequence tags Proc. Natl. Acad. Sci. USA 95 13330–13335
L. Tamagnone A. Merida A. Parr S. Mackay F.A. Culianez-Macia K. Roberts C. Martin (1998) ArticleTitleThe AmMYB308 and AmMYB330 transcription factors from Antirrhinum regulate phenylpropanoid and lignin biosynthesis in transgenic tobacco Plant Cell. 10 135–154
R. Whetten Y.H. Sun Y. Zhang R. Sederoff (2001) ArticleTitleFunctional genomics and cell wall biosynthesis in loblolly pine Plant Mol. Biol. 47 275–291
X.M. Wu S.H. Lim W.C. Yang (2003) ArticleTitleCharacterization, expression and phylogenetic study of R2R3-MYB genes in orchid Plant Mol. Biol. 51 959–972
Y. Zhang R.R. Sederoff I. Allona (2000) ArticleTitleDifferential expression of genes encoding cell wall proteins in vascular tissues from vertical and bent loblolly pine trees Tree Physiol. 20 457–466
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Pavy, N., Laroche, J., Bousquet, J. et al. Large-scale statistical analysis of secondary xylem ESTs in pine. Plant Mol Biol 57, 203–224 (2005). https://doi.org/10.1007/s11103-004-6969-7
Received:
Accepted:
Issue Date:
DOI: https://doi.org/10.1007/s11103-004-6969-7