Abstract
The increasing pace of biotechnological advances produced an unprecedented amount of both experimental data and biological information mostly diffused on the web. However, the heterogeneity of the data organization and the different knowledge representations open the ways to new challenges in the integration and the extraction of biological information fundamental for correctly interpreter experimental results.
In the present work we introduce a new methodology for quantitatively scoring the degree of biological correlation among biological terms occurring in biomedical abstracts. The proposed flow is based on the latent semantic analysis of biomedical literature coupled with the UMLS Metathesarurs and PubMed literature information. The results demonstrate that the structured and consolidated knowledge in the UMLS and pathway database efficiently improves the accuracy of the latent semantic analysis of biomedical literature.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Doms, A., Schroeder, M.: GoPubMed: exploring PubMed with the Gene Ontology. Nucleic Acids Research (2005)
Plake, C., Royer, L., Winnenburg, R., Hakenberg, J., Schroeder, M.: GoGene: gene annotation in the fast lane. Nucleic Acids Research (2009)
PubMed., http://www.ncbi.nlm.nih.gov/pubmed/
MeSH, Medical Subject Headings (MeSH) Fact sheet. National Library of Medicine (2005)
The Gene Ontology Consortium: Gene ontology: tool for the unification of biology. Nature Genetics (2000)
Wang, J.Z., Du, Z., Payattakool, R., Yu, P.S., Chen, C.F.: A New Method to Measure the Semantic Similarity of GO Terms. Bioinformatics (2007)
Abate, F., Ficarra, E., Acquaviva, A., Macii, E.: An Automated Tool for Scoring Biomedical Terms Correlation Based on Semantic Analysis. In: International Conference on Complex, Intelligent and Software Intensive Systems (2010)
Gliozzo, A.M., Strapparava, C.: Domain Kernels for Text Categorization. In: Ninth Conference on Computational Natural Language Learning (2005)
Aronson, A.R.: Effective Mapping of Biomedical Text to the UMLS. Metathesaurus: The MetaMap Program. In: AMIA Fall Symposium (2001)
Bodenreider, O.: The Unified Medical Language System (UMLS): integrating biomedical terminology. Nucleic Acids Research (2004)
Hill, D.P., Smith, B., McAndrews-Hill, M.S., Blake, J.A.: Gene Ontology annotations: what they mean and where they come from. Bioinformatics (2008)
Kanehisa, M., Goto, S.: Kegg: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Research (1999)
Stark, C., Breitkreutz, B.J., Reguly, T., Boucher, L., Breitkreutz, A., Tyers, M.: BioGRID: a general repository for interaction datasets. Nucleic Acids Research (2006)
Romero, P., Wagg, J., Green, M.L., Kaiser, D., Krummenacker, M., Karp, P.D.: Computational prediction of human metabolic pathways from the complete human genome. Genome Biology (2004)
Pathway Commons (2007), http://www.pathwaycommons.org
Cerami, E.G., Bader, G.D., Gross, B.E., Sander, C.: cPath: open source software for collecting, storing, and querying biological pathways. Bioinformatics (2006)
Hermjakob, H., et al.: The HUPO PSI’s molecular interaction format community standard for the representation of protein interaction data. Natural Biotechnology (2004)
BioPAX: Biological Pathways Exchange (2007), http://www.biopax.org
Chakraborti, S., Mukras, R., Lothian, R., Wiratunga, N., Watt, S., Harper, D.: Sprinkling: Supervised Latent Semantic Indexing. In: Lalmas, M., MacFarlane, A., Rüger, S.M., Tombros, A., Tsikrika, T., Yavlinsky, A. (eds.) ECIR 2006. LNCS, vol. 3936, pp. 510–514. Springer, Heidelberg (2006)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Abate, F., Ficarra, E., Acquaviva, A., Macii, E. (2013). Improving Latent Semantic Analysis of Biomedical Literature Integrating UMLS Metathesaurus and Biomedical Pathways Databases. In: Fred, A., Filipe, J., Gamboa, H. (eds) Biomedical Engineering Systems and Technologies. BIOSTEC 2011. Communications in Computer and Information Science, vol 273. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29752-6_13
Download citation
DOI: https://doi.org/10.1007/978-3-642-29752-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29751-9
Online ISBN: 978-3-642-29752-6
eBook Packages: Computer ScienceComputer Science (R0)