Abstract
Since Swanson’s introduction of literature-based discovery in 1986, new hypotheses have been generated by connecting disconnected scientific literatures. In this paper, we present the general discovery model and show how it can be used for drug discovery research. We have developed a discovery support tool that employs Natural Language Processing techniques to extract biomedical concepts from Medline titles and abstracts. Using semantic knowledge, the user, typically a biomedical scientist, can efficiently filter out irrelevant information. This chapter provides an algorithmic description of the system and presents a potential drug discovery. We conclude by discussing the current and future status of literature-based discovery in the biomedical research domain.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Drug Discovery
- Text String
- Natural Language Processing Technique
- Biomedical Concept
- American Medical Informatics Association
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Aronson, A.R.: The effect of textual variation on concept based information retrieval. In: Proceedings of the Annual Symposium of American Medical Informatics Association AMIA-96, Nashville, TN, pp. 373–377 (1996)
Aronson, A.R.: Effective mapping of biomedical text to the UMLS metathesaurus: The MetaMap program. In: Proceedings of the Annual Symposium of American Medical Informatics Association AMIA-01, Washington, DC, pp. 17–21 (2001)
Cory, K.A.: Discovering hidden analogies in an online humanities database. Computers and the Humanities 31, 1–12 (1997)
Gordon, M.D., Lindsay, R.K.: Toward discovery support systems: A replication, re-examination, and extension of Swanson’s work on literature-based discovery of a connection between Raynaud’s and fish oil. Journal of the American Society for Information Science 47, 116–128 (1996)
Hristovski, D., Stare, J., Peterlin, B., Dzeroski, S.: Supporting discovery in medicine by association rule mining in medline and umls. In: Proceedings of the Tenth World Congress on Medical Informatics, London, UK, pp. 1344–1348 (2001)
Langley, P.: The computational support of scientific discovery. International Journal of Human-Computer Studies 53, 393–410 (2000)
Lindberg, D.A.B., Humphreys, B.L., McCray, A.T.: The unified medical language system. Methods of Information in Medicine 32, 281–291 (1993)
Lindsay, R.K., Gordon, M.D.: Literature-based discovery by lexical statistics. Journal of the American Society for Information Science 50, 574–587 (1999)
Rikken, F.: Adverse drug reactions in a different context. Doctoral dissertation, University of Groningen, The Netherlands (1998)
Rikken, F., Vos, R.: How adverse drug reactions can play a role in innovative drug research. Pharmacy World & Science 17, 195–200 (1995)
Rindflesch, T.C., Aronson, A.R.: Ambiguity resolution while mapping free text to the UMLS Metathesaurus. In: Proceedings of the Annual Symposium of American Medical Informatics Association AMIA-94, San Francisco, CA, pp. 240–244 (1994)
Simon, H.A., Valdés-Pérez, R.E., Sleeman, D.H.: Scientific discovery and simplicity of method. Artificial Intelligence 91, 177–181 (1997)
Smalheiser, N.R.: Informatics and hypothesis-driven research. EMBO Reports 3, 702 (2002)
Smalheiser, N.R., Swanson, D.R.: Using ARROWSMITH: A computer-assisted approach to formulating and assessing scientific hypotheses. Computer Methods and Programs in Biomedicine 57, 149–153 (1998)
Spasser, M.A.: The enacted fate of undiscovered public knowledge. Journal of the American Society for Information Science 48, 707–717 (1997)
Swanson, D.R.: Fish oil, Raynaud’s syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine 30, 7–18 (1986)
Swanson, D.R.: Migraine and magnesium: Eleven neglected connections. Perspectives in Biology and Medicine 31, 526–557 (1988)
Swanson, D.R., Smalheiser, N.R.: An interactive system for finding complementary literatures: A stimulus to scientific discovery. Artificial Intelligence 91, 183–203 (1997)
Valdés-Pérez, R.E.: Principles of human computer collaboration for knowledge discovery in science. Artificial Intelligence 107, 335–346 (1999)
Vos, R.: Drugs looking for diseases. Kluwer Academic Publishers, The Netherlands, Dordrecht (1991)
Weeber, M.: Literature-based discovery in biomedicine. Doctoral dissertation, University of Groningen, The Netherlands (2001)
Weeber, M., Klein, H., Aronson, A.R., Mork, J.G., de Jong-van den Berg, L.T.W., Vos, R.: Text-based discovery in biomedicine: The architecture of the DAD-system. In: Proceedings of the Annual Symposium of American Medical Informatics Association AMIA-00, Los Angeles, CA, pp. 903–907 (2000a)
Weeber, M., Vos, R., Baayen, R.H.: Extracting the lowest-frequency words: Pitfalls and possibilities. Computational Linguistics 26, 301–317 (2000b)
Weeber, M., Vos, R., Klein, H., de Jong-van den Berg, L.T.W.: Using concepts in literature-based discovery: Simulating Swanson’s Raynaud – fish oil and migraine – magnesium discoveries. Journal of the American Society for Information Science and Technology 52, 548–557 (2001)
Weeber, M., Vos, R., Klein, H., de Jong-van den Berg, L.T.W., Molema, G.: Generating hypotheses by discovering implicit associations in the literature. a case report of a search for new potential therapeutic uses for thalidomide. Journal of the American Medical Informatics Association 10, 254–262 (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Weeber, M. (2007). Drug Discovery as an Example of Literature-Based Discovery. In: Džeroski, S., Todorovski, L. (eds) Computational Discovery of Scientific Knowledge. Lecture Notes in Computer Science(), vol 4660. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73920-3_14
Download citation
DOI: https://doi.org/10.1007/978-3-540-73920-3_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73919-7
Online ISBN: 978-3-540-73920-3
eBook Packages: Computer ScienceComputer Science (R0)