Abstract
As many metadata are encoded in XML, and many digital libraries need to manage XML documents, efficient techniques for searching in such formatted data are required. In order to efficiently process path expressions with wildcards on XML data, a new path index is proposed. Extensive evaluation confirms better performance with respect to other techniques proposed in the literature. An extension of the proposed technique to deal with the content of XML documents in addition to their structure is also discussed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Amato, G., Gennaro, C., Savino, P.: Indexing and retrieving documentary films: managing metadata in the ECHO system. In: 4th Intl. Workshop on Multimedia Information Retrieval, Juan-les-Pins, France, in conjunction with ACM Multimedia, December 6 (2002)
Gonnet, G., Baeza-Yates, R.: Handbook of data structure and algorithms, 2nd edn. Addison-Wesley, Reading (1991)
Brately, P., Choueka, Y.: Processing truncated terms in document retrieval systems. Information Processing & Management 18(5), 257–266 (1982)
Bruno, N., Koudas, N., Srivastava, D.: Holistic twig joins: Optimal XML pattern matching. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, Madison Wisconsin, USA, pp. 310–321. ACM, New York (2002)
Castelli, D., Pagano, P.: Opendlib: A digital library service system. In: Agosti, M., Thanos, C. (eds.) ECDL 2002. LNCS, vol. 2458, p. 292. Springer, Heidelberg (2002)
World Wide Web Consortium. XML path language (XPath), version 1.0, W3C. Recommendation (November 1999)
World Wide Web Consortium. XQuery 1.0: An XML query language. W3C Working Draft (November 2002), http://www.w3.org/TR/xquery
Day, N., Martnez, J.M.: Introduction to MPEG- 7 (v4.0). working document N4675 (2002) , Available at http://mpeg.telecomitalialab.com/workingdocuments.htm
Salton, G., McGill, M.J.: Introduction to Modern Information Retrieval. McGraw-Hill Book Company, New York (1983)
Yao, B.B., Tamer Özsu, M., Keenleyside, J.: XBench – a family of benchmarks for XML DBMSs. Technical Report TR-CS-2002-39, University of Waterloo (December 2002), http://db.uwaterloo.ca/~ddbms/projects/xbench/index.html
Zhang, C., Naughton, J.F., DeWitt, D.J., Luo, Q., Lohman, G.M.: On supporting containment queries in relational database management systems. In: Aref, W.G. (ed.) ACM SIGMOD Conference 2001, Proceedings, Santa Barbara, CA, USA, ACM, New York (2001)
Zobel, J., Moffat, A., Sacks-Davis, R.: Searching large lexicons for partially specified terms using compressed inverted files. In: Agrawal, R., Baker, S., Bell, D.A. (eds.) 19th International Conference on Very Large Data Bases, Proceedings, Dublin, Ireland, August 24-27, pp. 290–301. Morgan Kaufmann, San Francisco (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Amato, G., Debole, F., Zezula, P., Rabitti, F. (2003). YAPI: Yet Another Path Index for XML Searching. In: Koch, T., Sølvberg, I.T. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2003. Lecture Notes in Computer Science, vol 2769. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45175-4_17
Download citation
DOI: https://doi.org/10.1007/978-3-540-45175-4_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40726-3
Online ISBN: 978-3-540-45175-4
eBook Packages: Springer Book Archive