Chapter Overview
Natural language processing is increasingly used to support biomedical applications that manipulate information rather than documents. Examples include automatic summarization, question answering, and literature-based scientific discovery. Semantic processing is a method of automatic language analysis that identifies concepts and relationships to represent document content. The identification of this information depends on structured knowledge, and in the biomedical domain, one such resource is the Unified Medical Language System. After providing some linguistic background, we discuss several semantic interpretation systems being developed in biomedicine. Finally, we briefly investigate two applications that exploit semantic information in MEDLINE citations; one focuses on automatic summarization and the other is directed at information extraction for molecular biology research.
Suggested Readings
Friedman, C. and Hripcsak, G. (1999). “Natural Language Processing and its Future in Medicine,” Acad Med, 74(8), 890–5.
Jurafsky, D. and Martin, J. H. (2000). Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, 1st ed., Upper Saddle River: Prentice Hall.
Rindflesch, T. C. and Aronson, A. R. (2002). “Semantic Processing for Enhanced Access to Biomedical Knowledge,” in Real World Semantic Web Applications, V. Kashyap and L. Shklar (Eds.), IOS Press, 157–72.
Copyright information
© 2005 Springer Science+Business Media, Inc.
About this chapter
Cite this chapter
Rindflesch, T.C., Fiszman, M., Libbus, B. (2005). Semantic Interpretation for the Biomedical Research Literature. In: Chen, H., Fuller, S.S., Friedman, C., Hersh, W. (eds) Medical Informatics. Integrated Series in Information Systems, vol 8. Springer, Boston, MA. https://doi.org/10.1007/0-387-25739-X_14
DOI: https://doi.org/10.1007/0-387-25739-X_14
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-24381-8
Online ISBN: 978-0-387-25739-6
eBook Packages: Medicine, Medicine (R0)