Abstract
Literature-based knowledge discovery generates potential discoveries from associations between specific concepts that have been previously reported in the literature. However, because the associations are generally between individual concepts, the knowledge of specific relationships between those concepts is lost. A description logic (DL) ontology adds a set of logically defined relationship types, called properties, to a classification of concepts for a particular knowledge domain. Properties can represent specific relationships between instances of concepts used to describe the things studied by a particular researcher. These relationships form a “triple” consisting of a domain instance, a range instance, and the property specifying the way those instances are related. A “relationship association” is a pair of relationship triples where one of the instances from each relationship can be determined to be semantically equivalent. In this paper, we report our work to structure a subset of more than 1300 terms from the Medical Subject Headings (MeSH) controlled vocabulary into a DL ontology, and to use that DL ontology to create a corpus of A-Boxes, which we call “semantic statements”, each of which describes one of 392 research articles that we selected from MEDLINE. Relationship associations were extracted from the corpus of semantic statements using a previously reported technique. Then, by making the assumption of the transitivity of association used in literature-based knowledge discovery, we generate hypothetical relationship associations by combining pairs of relationship associations. We then evaluate the “interestingness” of those candidate knowledge discoveries from a life science perspective.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Swanson, D.R.: Fish oil, Raynaud’s syndrome, and undiscovered public knowledge. Perspectives in Biology and Medicine 30, 7–18 (1986)
Swanson, D.R.: Somatomedin C. and Arginine: Implicit connections between mutually isolated literatures. Perspectives in Biology and Medicine 33(2), 157–179 (1990)
Weeber, M., Kors, J.A., Mons, B.: Online tools to support literature-based discovery in the life sciences. Briefings in Bioinformatics 6(3), 277–286 (2005)
Racunas, S.A., Shah, N.H., Albert, I., Fedoroff, N.V.: HyBrow: a prototype system for computer-aided hypothesis evaluation. Biofinformatics 20(suppl. 1), i257–i264 (2004)
Natarajan, J., Berrar, D., Dubitzky, W., Hack, C., Zhang, Y., DeSesa, C., Van Brocklyn, J.R., Bremer, E.G.: Text mining of full-text journal articles combined with gene expression analysis reveals a relationship between sphingosine-1-phosphate and invasiveness of a glioblastoma cell line. BMC Bioinformatics 7, 373 (2006)
Srinivasan, P.: Text Mining: Generating Hypotheses From MEDLINE. JASIST 55(5), 396–413 (2004)
van der Eijk, C.C., van Mulligen, E.M., Kors, J.A., Mons, B., van den Berg, J.: Constructing an associative concept space for literature-based discovery. JASIST 55(5), 436–444 (2004)
Yamamoto, Y., Takagi, T.: Biomedical knowledge navigation by literature clustering. Journal of Biomedical Informatics 40(2), 114–130 (2007)
Baader, F., Calvanese, D., McGuinness, D.L., Nardi, D., Patel-Schneider, P.F.: The Description Logic Handbook: Theory, Implementation, and Applications. Cambridge University Press, New York (2003)
Erhardt, R.A.-A., Schneider, R., Blaschke, C.: Status of text-mining techniques applied to biomedical text. Drug Discovery Today 11(7-8), 315–325 (2006)
Rinaldi, F., Schneider, G., Kaljurand, K., Hess, M., Romacker, M.: An environment for relation mining over richly annotated corpora: the case of GENIA. BMC Bioinformatics 7(suppl. 3), S3 (2006)
Ceol, A., Chatr-Aryamontri, A., Licata, L., Cesareni, G.: Linking Entries in Protein Interaction Database to Structured Text: the FEBS Letters Experiment. FEBS Letters 582(8), 1171–1177 (2008)
Rebholz-Schuhmann, D., Kirsch, H., Couto, F.: Facts from text–is text mining ready to deliver? PLoS Biol. 3(2), e65 (2005)
Gerstein, M., Seringhaus, M., Fields, S.: Structured digital abstract makes text mining easy. Nature 447, 142 (2007)
Seringhaus, M., Gerstein, M.: Manually structured digital abstracts: a scaffold for automatic text mining. FEBS Lett. 582, 1170 (2008)
Mons, B., et al.: Calling on a million minds for community annotation in WikiProteins. Genome Biol. 9(5), R89 (2008)
Pico, A.R., Kelder, T., van Iersel, M.P., Hanspers, K., Conklin, B.R., Evelo, C.: WikiPathways: Pathway Editing for the People. PLoS Biol. 6(6), e184+ (2008)
Hartley, J., Betts, L.: The effects of spacing and titles on judgments of the effectiveness of structured abstracts. JASIST 58(14), 2335–2340 (2007)
Cafarella, M.J., Re, C., Suciu, D., Etzioni, O.: Structured Querying of Web Text Data: A Technical Challenge. In: Proceedings of CIDR 2007 (2007)
O’donnell, M., Mellish, C., Oberlander, J., Knott, A.: ILEX: an architecture for a dynamic hypertext generation system. Nat. Lang. Eng. 7(3), 225–250 (2001)
Hunter, L., Cohen, K.B.: Biomedical language processing: what’s beyond PubMed? Mol. Cell. 21, 589–594 (2006)
Natarajan, J., Berrar, D., Hack, C.J., Dublitzky, W.: Knowledge discovery in biology and biotechnology texts: A review of techniques, evaluation strategies, and applications. Critical Rev. in Biotech. 25, 31–52 (2005)
Kraines, S.B., Guo, W., Kemper, B., Nakamura, Y.: EKOSS: A Knowledge-User Centered Approach to Knowledge Sharing, Discovery, and Integration on the Semantic Web. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 833–846. Springer, Heidelberg (2006)
Kraines, S.B., Makino, T., Guo, W., Mizutani, H., Takagi, T.: Bridging the Knowledge Gap between Research and Education through Textbooks. In: Proc. 9th Intl Conference on Web Learning, Shanghai, China (2010)
Soualmia, L.F., Golbreich, C., Darmoni Soualmia, S.J.: Representing the MeSH in OWL: Towards a Semi-Automatic Migration. In: Proceedings of the KR 2004 Workshop on Formal Biomedical Knowledge Representation, Whistler, BC, Canada (2004)
OWL Web Ontology Language Overview, http://www.w3.org/TR/2004/REC-owl-features-20040210
Life Science Dictionary Project, http://lsd.pharm.kyoto-u.ac.jp/en/service/weblsd/index.html
U.S. National Library of Medicine, http://www.nlm.nih.gov/pubs/factsheets/mesh.html
McCray, A.T.: An upper-level ontology for the biomedical domain. Comparative and Functional Genomics 4, 80–84 (2003)
Batres, R., West, M., Leal, D., Price, D., Masaki, K., Shimada, Y., Fuchino, T., Naka, Y.: An upper ontology based on ISO 15926. Computers & Chemical Eng. 31, 519–534 (2007)
Niles, I., Pease, A.: Towards a Standard Upper Ontology. In: Welty, C., Smith, B. (eds.) Proc. 2nd Intl Conf. on Formal Ontology in Information Systems, Ogunquit, Maine (2001)
Rector, A., Bechhofer, S., Goble, C., Horrocks, I., Nowlan, W., Solomon, W.: The GRAIL concept modelling language for medical terminology. Artificial Intelligence in Medicine 9, 139–171 (1997)
Kraines, S.B., Iwasaki, W., Usuki, H., Yamamoto, Y.: A description logics ontology for biomolecular processes (poster). In: Bio-Ontologies SIG Workshop, Vienna, Austria (2007)
Yee, K.P., Swearingen, K., Li, K., Hearst, M.: Faceted metadata for image search and browsing. In: CHI 2003: Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, pp. 401–408 (2003)
Kashyap, V., Borgida, A.: Representing the uMLSformula_image semantic network using OWL. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 1–16. Springer, Heidelberg (2003)
Allemang, D., Hender, J.: Semantic Web for the Working Ontologist. Morgan Kaufmann, Burlington (2008)
Rector, A., Drummond, N., Horridge, M., Rogers, J., Knublauch, H., Stevens, R., Wang, H., Wroe, C.: OWL Pizzas: Practical Experience of Teaching OWL-DL: Common Errors & Common Patterns. In: Motta, E., Shadbolt, N.R., Stutt, A., Gibbins, N. (eds.) EKAW 2004. LNCS (LNAI), vol. 3257, pp. 63–81. Springer, Heidelberg (2004)
Smith, B., Ceusters, W., Klagges, B., Kohler, J., Kumar, A., Lomax, J., Mungall, C., Neuhaus, F., Rector, A.L., Rosse, C.: Relations in biomedical ontologies. Genome Biol. 6(5), R46 (2005)
Kanehira, M., Katagiri, T., Shimo, A., Takata, R., Shuin, T., Miki, T., Fujioka, T., Nakamura, Y.: Oncogenic role of MPHOSPH1, a cancer-testis antigen specific to human bladder cancer. Cancer Research 67, 3276–3285 (2007)
Guo, W., Kraines, S.B.: Discovering Relationship Associations in Life Sciences Using Ontology and Inference. In: Proceedings of 1st International Conference on Knowledge Discovery and Information Retrieval 2009, Madeira, Portugal, pp. 10–17 (2009)
Guo, W., Kraines, S.B.: Extracting Relationship Associations from Semantic Graphs in Life Sciences. In: Fred, A., Dietz, J.L.G., Liu, K., Filipe, J., et al. (eds.) IC3K 2009. CCIS, vol. 128, pp. 53–67. Springer, Heidelberg (2011)
Kraines, S.B., Guo, W., Hoshiyama, D., Mizutani, H., Takagi, T.: Generating Literature-Based Knowledge Discoveries in Life Sciences Using Relationship Associations. In: Proc. 2nd Intl. Conf. on Knowledge Discovery and Information Retrieval, Valencia, Spain (2010)
Hristovski, D., Friedman, C., Rindflesch, T.C., Peterlin, B.: Exploiting Semantic Relations for Literature-Based Discovery. In: AMIA Annu. Symp. Proc. 2006, pp. 349–353 (2006)
Taweel, A., Rector, A., Rogers, J.: A collaborative biomedical research system. Journal of Universal Computer Science 12, 80–98 (2006)
Bontcheva, K., Wilks, Y.: Automatic Report Generation from Ontologies: The MIAKT Approach. In: Meziane, F., Métais, E. (eds.) NLDB 2004. LNCS, vol. 3136, pp. 324–335. Springer, Heidelberg (2004)
Wroe, C.J., Stevens, R., Goble, C.A., Ashburner, M.: A methodology to migrate the Gene ontology to a description logic environment using DAML+OIL. In: Pacific Symposium on Biocomputing, vol. 8, pp. 624–635 (2003)
Berners-Lee, T., Hendler, J.: Publishing on the Semantic Web. Nature 410, 1023–1024 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kraines, S.B. et al. (2013). Literature-Based Knowledge Discovery from Relationship Associations Based on a DL Ontology Created from MeSH. In: Fred, A., Dietz, J.L.G., Liu, K., Filipe, J. (eds) Knowledge Discovery, Knowledge Engineering and Knowledge Management. IC3K 2010. Communications in Computer and Information Science, vol 272. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-29764-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-642-29764-9_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-29763-2
Online ISBN: 978-3-642-29764-9
eBook Packages: Computer ScienceComputer Science (R0)