Abstract
This paper presents an approach that merges case relations into the well-known Vector Space Model (VSM), leading to a new model named C-VSM (Case relation-based VSM). A Chinese case system with 23 case relations is established, and a Chinese Olympic news corpus of 7,662 sentences, denoted COCS, is constructed by manual annotation with these 23 case relations. We use 50 queries on COCS as a test set. Experimental results on the test set show that C-VSM outperforms W-VSM (Word-based VSM) by 3.4% on the average 11-point precision. It is worth pointing out that almost all the previous studies on semantic IR obtained no better, even worse, results than W-VSM, our work thus validates the usefulness of case relations in IR through the validation is still preliminary. The proposed model is believed to be language-independent.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Khoo, S.G.: Using Cause-effect Relations in Text to Improve Information Retrieval Precision. Information Processing and Management 37, 119–145 (2001)
Liu, G.Z.: Semantic Vector Space Model: Implementation and Evaluation. Journal of the American Society for Information Science 48(5), 395–417 (1997)
Lin, X.G.: Lexical Semantics and Computational Linguistics. YuWen Press, Beijing (1999)
Lu, X.: An Application of Case Relations to Document Retrieval, Doctoral dissertation, University of Western Ontario (1990)
Fillmore, C.J.: The Case for Case. In: Universals in Linguistic Theory. Holt, Rinehart and Winston, Inc., New York (1968)
Somers, H.L.: Valency and Case in Computational Linguistics. Edinburgh University Press, Edinburgh (1987)
Lewis, D.A.: Case Grammar and Functional Relations. Doctoral dissertation, University of Western Ontario (1984)
Young, C.: Development of Language Analysis Procedures with Application to Automatic Indexing. Doctoral dissertation, The Ohio State University (1973)
Croft, W.B., Turtle, H.R., Lewis, D.D.: The Use of Phrases and Structured Queries in Information Retrieval. In: Proc. of the Fourteenth Annual International ACM/SIGIR Conference on Research and Development in Information Retrieval (1991)
Hyoudo, Y., Niimi, K., Ikeda, T.: Comparison between Proximity Operation and Dependency Operation in Japanese Full-text Retrieval. In: Proc. of the 21st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (1998)
Smeaton, A.F., O’Donnell, R., Kelledy, F.: Indexing Structures Derived from Syntax in TREC-3: System Description. In: Overview of the Third Text REtrieval Conference (TREC-3), National Institute of Standards and Technology Special Publication 500-225, pp. 55–67 (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hongtao, W., Maosong, S., Shaoming, L. (2005). Merging Case Relations into VSM to Improve Information Retrieval Precision. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2005. Lecture Notes in Computer Science, vol 3406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30586-6_62
Download citation
DOI: https://doi.org/10.1007/978-3-540-30586-6_62
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-24523-0
Online ISBN: 978-3-540-30586-6
eBook Packages: Computer ScienceComputer Science (R0)