Abstract
This paper describes the current state of our system for structured retrieval. The system is based on an extension of the vector space model initially proposed by Fox [5]. The basic functions are performed using the Smart experimental retrieval system [11]. The major advance achieved this year is the inclusion of a flexible retrieval capability, which allows the system to retrieve at a desired level of granularity (i.e., at the element level). The quality of the resultant statistics depends largely on issues (in particular, ranking) that have yet to be resolved.
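As a rough illustration of the idea summarized above, the following Python sketch scores individual elements rather than whole documents, using a weighted sum of per-subvector cosine similarities in the spirit of the extended vector space model. It is not the authors' implementation and does not use the Smart system's interface. The subvector names (`title`, `body`), the coefficient values, the element paths, and all function names are hypothetical choices made only for this example.

```python
import math


def cosine(u, v):
    """Cosine similarity between two sparse term-weight vectors (dicts)."""
    dot = sum(w * v.get(term, 0.0) for term, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def extended_similarity(query, element, coefficients):
    """Weighted sum of cosine similarities, one term per subvector type."""
    return sum(
        coeff * cosine(query.get(ctype, {}), element.get(ctype, {}))
        for ctype, coeff in coefficients.items()
    )


def rank_elements(query, elements, coefficients):
    """Return (score, path) pairs sorted from best to worst match."""
    scored = [
        (extended_similarity(query, vectors, coefficients), path)
        for path, vectors in elements.items()
    ]
    return sorted(scored, reverse=True)


# Hypothetical data: one article and two of its sections, each indexed
# as its own retrievable unit with optional title and body subvectors.
coefficients = {"title": 0.3, "body": 0.7}  # assumed weights, not from the paper
query = {
    "title": {"retrieval": 1.0},
    "body": {"xml": 1.0, "retrieval": 1.0},
}
elements = {
    "/article[1]": {
        "title": {"retrieval": 1.0, "vector": 1.0},
        "body": {"xml": 1.0, "retrieval": 1.0, "smart": 1.0},
    },
    "/article[1]/bdy/sec[1]": {"body": {"xml": 1.0, "retrieval": 1.0}},
    "/article[1]/bdy/sec[2]": {"body": {"smart": 1.0}},
}

ranking = rank_elements(query, elements, coefficients)
for score, path in ranking:
    print(f"{score:.4f}  {path}")
```

Because every element is scored against the same query, the ranked list can mix whole articles with sections or paragraphs. How scores at different granularities should be compared and normalized is the kind of ranking question the abstract describes as unresolved.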
References
Bellamkonda, A.: Automation of Content-and-Structure query processing. Master’s Thesis, Dept. of Computer Science, University of Minnesota Duluth (2004), http://www.d.umn.edu/cs/thesis/bellamkonda.pdf
Crouch, C., Apte, S., Bapat, H.: Using the extended vector model for XML retrieval. In: Proc. of the First Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), Schloss Dagstuhl, pp. 99–104 (2002)
Crouch, C., Apte, S., Bapat, H.: An approach to structured retrieval based on the extended vector model. In: Proc. of the Second Workshop of the Initiative for the Evaluation of XML Retrieval (INEX), Schloss Dagstuhl, pp. 87–93 (2003)
Crouch, C., Crouch, D., Nareddy, K.: The automatic generation of extended queries. In: Proc. of the 13th Annual International ACM SIGIR Conference, Brussels, pp. 369–383 (1990)
Fox, E.A.: Extending the Boolean and vector space models of information retrieval with p-norm queries and multiple concept types. Ph.D. Dissertation, Department of Computer Science, Cornell University (1983)
Fox, E., Nunn, G., Lee, W.: Coefficients for combining concept classes in a collection. In: Proc. of the 11th Annual International ACM SIGIR Conference, Grenoble, pp. 291–307 (1988)
Fuhr, N., Großjohann, K.: XIRQL: A query language for information retrieval in XML documents. In: Proc. of the 24th Annual International ACM SIGIR Conference, New Orleans, pp. 172–180 (2001)
Kamps, J., de Rijke, M., Sigurbjörnsson, B.: Length normalization in XML retrieval. In: Proc. of the 27th Annual International ACM SIGIR Conference, Sheffield, England, pp. 80–87 (2004)
Liu, S., Zou, Q., Chu, W.: Configurable indexing and ranking for XML information retrieval. In: Proc. of the 27th Annual International ACM SIGIR Conference, Sheffield, England, pp. 88–95 (2004)
Mahajan, A.: Flexible retrieval in a structured environment. Master’s Thesis, Dept. of Computer Science, University of Minnesota Duluth (2004), http://www.d.umn.edu/cs/thesis/mahajan.pdf
Salton, G.: Automatic information organization and retrieval. Addison-Wesley, Reading (1968)
Salton, G., Wong, A., Yang, C.S.: A vector space model for automatic indexing. Comm. ACM 18(11), 613–620 (1975)
Salton, G., Buckley, C.: Term weighting approaches in automatic text retrieval. IP&M 24(5), 513–523 (1988)
Singhal, A.: AT&T at TREC-6. In: The Sixth Text REtrieval Conf. (TREC-6), NIST SP 500-240, pp. 215–225 (1998)
Singhal, A., Buckley, C., Mitra, M.: Pivoted document length normalization. In: Proc. of the 19th Annual International ACM SIGIR Conference, Zurich, pp. 21–29 (1996)
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Crouch, C.J., Mahajan, A., Bellamkonda, A. (2005). Flexible Retrieval Based on the Vector Space Model. In: Fuhr, N., Lalmas, M., Malik, S., Szlávik, Z. (eds) Advances in XML Information Retrieval. INEX 2004. Lecture Notes in Computer Science, vol 3493. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11424550_23
DOI: https://doi.org/10.1007/11424550_23
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26166-7
Online ISBN: 978-3-540-32053-1
eBook Packages: Computer Science, Computer Science (R0)