Abstract
Expressed sequence tags (ESTs) provide researchers with a quick and inexpensive route for discovering new genes, data on gene expression and regulation, and also provide genic markers that help in constructing genome maps. Cacao is an important perennial crop of humid tropics. Cacao EST sequences, as available in the public domain, were downloaded and made into contigs. Microsatellites were located in these ESTs and contigs using five softwares (MISA, TRA, TROLL, SSRIT and SSR primer). MISA gave maximum coverage of SSRs in cacao ESTs and contigs, although TRA was able to detect higher order (>5-mer) repeats. The frequency of SSRs was one per 26.9 kb in the known set of ESTs. One-third of the repeats in EST-contigs were found to be trimeric. A few rare repeats like 21-mer repeat were also located. A/T repeats were most abundant among the mononucleotide repeats and the AG/GA/TC/CT type was the most frequent among dimerics. Flanking primers were designed using Primer3 program and verified experimentally for PCR amplification. The results of the study are made available freely online database (http://riju.byethost31.com/cocoa/). Seven primer pairs amplified genomic DNA isolated from leaves were used to screen a representative set of 12 accessions of cacao.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
References
Aggarwal R. K., Prasad S. H., Varshney R. K., Prasanna R. B., Krishnakumar V., Lalji S. et al. 2007 Identification, characterization and utilization of EST-derived genic microsatellite markers for genome analyses of coffee and related species. Theor. Appl. Genet. 114, 359–372.
Araujo J. S., Intorne A. C., Pereire M. G., Lopes U. V. and deSouza Filko G. A. 2007 Development and characterization of novel tetra, tri and dinucletotide repeat microsatellite markers in cacao (Theobrama cacao L.) Mol. Breed. 20, 73–81.
Bilgen M., Karaca M., Onus A. and Ince A. 2004 A software program combining sequence motif searches with keywords for finding repeats containing DNA sequences. Bioinformatics 20, 3379–3386.
Borrone J. W., Brown J. S., Kuhn D. N., Motamayor J. C. and Schnell R. J. 2007 Microsatellite markers developed from Theobroma cacao L. expressed sequence tags. Mol. Ecol. Notes 7, 236–239.
Cardle L., Ramsay L., Milbourne D., Macaulay M., Marshall D. and Waugh R. 2000 Computational and experimental characterization of physically clustered simple sequence repeats in plants. Genetics 156, 847–854.
Couch J. A., Zintel H. A. and Fritz P. J. 1993 The genome of the tropical tree Theobroma cacao L. Mol. Gen. Genet. 237, 123–128.
Dice L. R. 1945 Measures of the amount of ecologic association between species. Ecology 26, 297–302.
Ewing B. and Green P. 1998 Basecalling of automated sequencer traces using phred II. error probabilities. Genome. Res. 8, 186–194.
Gao L. F., Tang J. F., Li H. W. and Jia J. Z. 2003 Analysis of microsatellites in major crops assessed by computational and experimental approaches. Mol. Breed. 12, 245–261.
Jurka J. and Pethiygoda C. 1995 Simple repetitive DNA sequences from primates: compilation and analysis. J. Mol. Evol. 40, 120–126.
Kantety R. V., La Rota M., Matthews D. E. and Sorrells M. E. 2002 Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum, and wheat. Plant Mol. Biol. 48, 501–510.
Katti M. V., Ranjekar P. K. and Gupta V. S. 2001 Differential distribution of simple sequence repeats in eukaryotic genome sequences. Mol. Biol. Evol. 18, 1161–1167.
Kumpatla S. P. and Mukhopadhyay S. 2005 Mining and survey of simple sequence repeats in expressed sequence tags of dicotyledonous species. Genome 48, 985–998.
Lawson M. J. and Zhang L. 2006 Distinct patterns of SSR distribution in the Arabidopsis thaliana and rice genomes. Genome Biol. 7, R14.
Lima L. S., Gramacho K. P., Gesteira A. S., Lopes U. V., Gaiotto F. A., Zaidan H. A. et al. 2008 Characterization of microsatellites from cacao-Moniliophthora perniciosa interaction expressed sequence tags. Mol. Breed. 22, 315–318.
Martins W., De Sousa D., Proite K., Guimaraes P., Moretzsohn M. and Bertioli D. 2006 New softwares for automated microsatellite marker development. Nucleic Acids Res. 34, e31.
Metzgar D., Bytof J. and Wills C. 2000 Selection against frameshift mutations limits microsatellite expansion in coding DNA. Genome Res. 10, 72–80.
Newcomb R. D., Crowhurst R. N., Gleave A. P., Rikkerink E. H. A., Allan A. C., Beuning L. L. et al. 2006 Analysis of expressed sequence tags from apple. Plant Physiol. 141, 147–166.
Pearson C., Sinden E. and Richard R. 1998 Trinucleotide repeat DNA structure: Dynamic mutations from dynamic DNA. Curr. Opin. Structl. Biol. 36, 884–889.
Powell W. W., Machery G. C. and Provan J. 1996 Polymorphism revealed by simple sequence repeats. Trends Genet. 1, 76–83.
Pugh T., Fouet O., Tisterucci A. M., Brottier P., Abouladze M., Deletrez C. et al. 2004 A new cacao linakage map based on codominant markers: development and integration of 201 new microsatellite markers. Theor. Appl. Genet. 108, 1151–1161.
Quackenbush J., Cho D., Lee F. L., Hott I., Karamychera S. and Parizi B. 2001 The TIGR gene indices: analysis of gene transient sequences in highly sample eukaryotic species. Nucleic Acids Res. 29, 159–164.
Rabello E., Nunes de Souza A., Saito D. and Tsai S.M. 2005 In silico characterization of microsatellites in Eucalyptus spp.: abundance, length variation and transposon associations. Genet. Mol. Biol. 28, 582–588.
Robinson A. J., Love C. G., Batley J., Barker G. and Edwards D. 2004 Simple sequence repeat marker loci discovery using SSR primer. Bioinformatics 20, 1475–1476.
Rohlf F. J. 1998 On applications of geometric morphometrics to studies of ontogeny and phylogeny. Sys. Biol. 47, 147–158.
Smith J. S. C., Chin E. C. L., Shu H., Smith O. S., Wall S. J., Senior M. L. et al. 1997 An evaluation of the utility of SSR loci as molecular markers in maize (Zea mays L.): comparisons with data from RFLPs and pedigree. Theor. Appl. Genet. 95, 163–173.
Sneath P. H. A. and Sokal R. R. 1973 Numerical taxonomy, the principles and practice of numerical classification. W. H. Freeman, San Francisco.
Sreenu V. B., Kumar P., Nagaraju J. and Nagarajaram H. A. 2007 Simple sequence repeats in mycobacterial genomes. J. Biosci. 32, 3–15.
Tang J. F., Gao L., Cao Y. and Jia J. 2006 Homologous analysis of EST-SSRs and transferability of wheat SSR-EST markers across barley, rice and maize. Euphytica 151, 87–93.
Temnykh S., DeClerck G., Lukashova A., Lipovich L., Cartinhour S. and McCouch S. 2001 Computational and experimental analysis of microsatellites in rice (Oryza sativa L.): frequency, length variation, transposon associations, and genetic marker potential. Genome Res. 11, 1441–1452.
Thiel T., Michalek V. and Graner A. 2003 Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106, 411–422.
Varshney R. K., Graner A. and Sorrells M. E. 2005 Genic microsatellite markers in plants: features and applications. Trends Biotech. 23, 48–55.
Varshney R. K., Thiel T., Stein N., Langridge P. and Graner A. 2002 In silico analysis on frequency and distribution of microsatellites in ESTs of some cereal species. Cell Mol. Biol. Lett. 7, 537–546.
Verica J., Maximova S., Strem M., Carlson J., Bailey B. and Guiltinan M. 2004 Isolation of ESTs from cacao (Theobroma cacao L.) leaves treated with inducers of the defense response Plant Cell Rep. 23, 404–413.
von Stackelberg M., Rensin S. A. and Reskhi R. 2006 Identification of genic moss SSR markers and a comparative analysis of 24 algal and plant gene indices reveal specific rather group specific characteristics of microsatellites. BMC Plant Biol. 6, 9.
Yasodha R., Sumathi R., Chezhian P., Kavitha S. and Ghosh M. 2008 Eucalyptus microsatellites mined in silico: survey and evaluation. J. Genet. 87, 21–25.
Yu J. K., La Rota M., Kantety R. V. and Sorrells M. E. 2004 EST derived SSR markers for comparative mapping in wheat and rice. Mol. Gen. Genomics 271, 742–751.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Riju, A., Rajesh, M.K., Sherin, P.T.P.F. et al. Mining of expressed sequence tag libraries of cacao for microsatellite markers using five computational tools. J Genet 88, 217–225 (2009). https://doi.org/10.1007/s12041-009-0030-1
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12041-009-0030-1