Abstract
We developed a new indexing formalism that considers not only the terms in a document, but also the concepts to represent the semantic content of a document. In this approach, concept clusters are defined and a concept vector space model is proposed to represent the semantic importance of words and concepts within a document. Through experiments on the TREC-2 collection, we show that the proposed method outperforms an indexing method based on term frequency.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Salton, G., Buckley, C.: Term-weighting approaches in automatic text retrieval. IP&M 24(5), 513–523 (1988)
Morris, J., Hirst, G.: Lexical cohesion computed by thesaural relations as an indicator of the structure of text. Computational Linguistics 17(1), 21–43 (1991)
Kazmand, R., Al-Halimi, R., Hunt, W., Mantei, M.: Four paradigms for indexing video conferences. IEEE Multimedia 3(1), 63–73 (1996)
The full text is available from, http://dragon.kaist.ac.kr/cicling2004.pdf
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kang, BY., Kim, HJ., Lee, SJ. (2004). Performance Analysis of Semantic Indexing in Text Retrieval. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2004. Lecture Notes in Computer Science, vol 2945. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24630-5_52
Download citation
DOI: https://doi.org/10.1007/978-3-540-24630-5_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21006-1
Online ISBN: 978-3-540-24630-5
eBook Packages: Springer Book Archive