Abstract
Semantic similarity and relatedness measures between ontology concepts are useful in many research areas. While similarity only considers subsumption relations to assess how two objects are alike, relatedness takes into account a broader range of relations (e.g., part-of). In this paper, we present a framework, which maps the feature-based model of similarity into the information theoretic domain. A new way of computing IC values directly from an ontology structure is also introduced. This new model, called Extended Information Content (eIC) takes into account the whole set of semantic relations defined in an ontology. The proposed framework enables to rewrite existing similarity measures that can be augmented to compute semantic relatedness. Upon this framework, a new measure called FaITH (Feature and Information THeoretic) has been devised. Extensive experimental evaluations confirmed the suitability of the framework.
Chapter PDF
Similar content being viewed by others
References
Agirre, E., Alfonseca, E., Hall, K., Kravalova, J., Pasca, M., Soroa, A.: A Study on Similarity and Relatedness Using Distributional and WordNet-based Approaches. In: Proc. of NAACL-HLT (2009)
Borgida, A., Walsh, T., Hirsh, T.: Towards Measuring Similarity in Description Logics. In: Proc. of Description Logics (2005)
Danushka, B., Yutaka, M., Mitsuru, I.: Measuring Semantic Similarity Between Words using Web Search Engines. In: Proc. of WWW 2007, pp. 757–766 (2007)
D ’ Amato, C.: Similarity-based Learning Methods for the Semantic Web. PhD Thesis, University of Bari (2007)
Son, J.Y., Goldstone, R.L.: The Transfer of Scientific Principles using Concrete and Idealized Simulation. The Journal of the Learning Sciences (14), 69–110 (2005)
Hirst, G., St-Onge, D.: Lexical Chains as Representations of Context for the Detection and Correction of Malapropisms. In: Fellbaum, C. (ed.) WordNet. An Electronic Lexical Database, ch. 13, pp. 305–332
Hliaoutakis, A.: Semantic Similarity Measures in MeSH Ontology and their Application to Information Retrieval on Medline, Technical report, Technical Univ. of Crete, Dept. of Electronic and Computer Engineering (2005)
Hliaoutakis, A., Varelas, G., Voutsakis, E., Petrakis, E.G.M., Milios, E.E.: Information Retrieval by Semantic Similarity. Int. J. SWIS 2(3), 55–73 (2006)
Jiang, J.J., Conrath, D.W.: Semantic Similarity based on Corpus Statistics and Lexical Taxonomy. In: Proc. of ROCLING X (1997)
Leacock, C., Chodorow, M.: Combining Local Context and WordNet Similarity for Word Sense Identification. In: Fellbaum, C. (ed.) WordNet. An Electronic Lexical Database, ch. 11, pp. 265–283
Li, Y., Bandar, A., McLean, D.: An Approach for Measuring Semantic Similarity between Words Using Multiple Information Sources. IEEE TKDE 15(4), 871–882
Lin, D.: An Information-theoretic Definition of Similarity. In: Proc. of Conf. on Machine Learning, pp. 296–304 (1998)
Miller, G.A.: WordNet an on-line Lexical Database. International Journal of Lexicography 3(4), 235–312 (1990)
Miller, G.A., Charles, W.G.: Contextual Correlates of Semantic Similarity. Language and Cognitive Processes (6), 1–28 (1991)
Banerjee, S., Pedersen, T.: Extended Gloss Overlaps as a Measure of Semantic Relatedness. In: Proc. of IJCAI, pp. 805–810 (2003)
Pirró, G., Ruffolo, M., Talia, D.: SECCO: On Building Semantic Links in Peer to Peer Networks. Journal on Data Semantics XII, 1–36 (2009)
Pirró, G.: A Semantic Similarity Metric Combining Features and Intrinsic Information Content. Data Knowl. Eng. 68(11), 1289–1308 (2009)
Rada, R., Mili, H., Bicknell, M., Blettner, E.: Development and Application of a measure on Semantic Nets. IEEE TSMC (19), 17–30 (1989)
Resnik, P.: Information Content to Evaluate Semantic Similarity in a Taxonomy. In: Proc. of IJCAI, pp. 448–453 (1995)
Rodriguez, M.A., Egenhofer, M.J.: Determining Semantic Similarity among Entity Classes from Different Ontologies. IEEE TKDE 15(2), 442–456 (2003)
Rubenstein, H., Goodenough, J.B.: Contextual Correlates of Synonymy. CACM 8(10), 627–633 (1965)
Schickel-Zuber, V., Faltings, B.: OSS: A Semantic Similarity Function based on Hierarchical Ontologies. In: IJCAI, pp. 551–556 (2007)
Seco, N., Veale, T., Hayes, J.: An Intrinsic Information Content measure for Semantic Similarity in WordNet. In: Proc. of ECAI 2004, pp. 1089–1090 (2004)
Tversky, A.: Features of Similarity. Psychological Review 84(2), 327–352 (1977)
Wang, J., Du, Z., Payattakool, R., Yu, P., Chen, C.: A New Method to Measure the Semantic Similarity of GO Terms. Bioinformatics 23(10), 1274–1281 (2007)
Watanable, S.: Knowing and Guessing: A Quantitative Study of Inference and Information. Wiley, Chichester (1969)
Wu, Z., Palmer, M.: Verb semantics and Lexical Selection. In: Proc. of FQAS ACL 1994, pp. 133–138 (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Pirró, G., Euzenat, J. (2010). A Feature and Information Theoretic Framework for Semantic Similarity and Relatedness. In: Patel-Schneider, P.F., et al. The Semantic Web – ISWC 2010. ISWC 2010. Lecture Notes in Computer Science, vol 6496. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-17746-0_39
Download citation
DOI: https://doi.org/10.1007/978-3-642-17746-0_39
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-17745-3
Online ISBN: 978-3-642-17746-0
eBook Packages: Computer ScienceComputer Science (R0)