Abstract
XML is becoming the standard representation format for metadata. Metadata for multimedia documents, as for instance MPEG-7, require approximate match search functionalities to be supported in addition to exact match search. As an example, consider image search performed by using MPEG-7 visual descriptors. It does not make sense to search for images that are exactly equal to a query image. Rather, images similar to a query image are more likely to be searched. We present the architecture of an XML search engine where special techniques are used to integrate approximate and exact match search functionalities.
This work was partially supported by DELOS NoE [1], funded by the European Commission under FP6 (Sixth Framework Programme) and by the ECD project (Enhanced Content Delivery) [2], funded by the Italian government. We would like to thank Paolo Bolettieri for its contribution to the implementation of XMLSe.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Delos, http://www.delos.info/
ECD, Enhanced, Content, Delivery (2002), http://ecd.isti.cnr.it
XPath1.0 (1999), http://www.w3.org/tr/xpath
XQuery1.0 (2005), http://www.w3.org/tr/xquery
Amato, G., Debole, F., Rabitti, F., Savino, P., Zezula, P.: Signature-based approach for efficient relationship search on xml data collections. In: Bellahsène, Z., Milo, T., Rys, M., Suciu, D., Unland, R. (eds.) XSym 2004. LNCS, vol. 3186, pp. 82–96. Springer, Heidelberg (2004)
MPEG (2004), http://www.chiariglione.org/mpeg/
Florescu, D., Kossmann, D.: Storing and querying xml data using an rdbms. IEEE Data Engineering Bulletin 22(3), 27–34 (1999)
Shanmugasundaram, J., Tufte, K., He, G., Zhang, C., DeWitt, D., Naughton, J.: Relational databases for querying xml documents:limitations and opportunities. In: Proceedings of the 25th VLDB Conference, Edinburgh, Scotland (1999)
Shimura, T., Yoshikawa, M., Uemura, S.: Storage and retrieval of xml documents using object-relational databases. In: Bench-Capon, T.J.M., Soda, G., Tjoa, A.M. (eds.) DEXA 1999. LNCS, vol. 1677, pp. 206–217. Springer, Heidelberg (1999)
ECHO (2000), http://pc-erato2.iei.pi.cnr.it/echo/
Salton, G., McGill, M.: Introduction to Modern Information Retrieval. McGraw-Hill Book Company, New York (1984)
Tamino (2001), http://www1.softwareag.com/corporate/products/tamino/default.asp
Meier, W.: exist: An open source native xml database. In: Chaudhri, A.B., Jeckle, M., Rahm, E., Unland, R. (eds.) NODe-WS 2002. LNCS, vol. 2593, pp. 169–183. Springer, Heidelberg (2003)
Xindice, A.: (2001), http://xml.apache.org/xindice/
Carey, M.J., DeWitt, D.J., Franklin, M.J., Hall, N.E., McAuliffe, M.L., Naughton, J.F., Schuh, D.T., Solomon, M.H., Tan, C.K., Tsatalos, O.G., White, S.J., Zwilling, M.J.: Shoring up persistent applications, pp. 383–394 (1994)
Zhang, C., Naughton, J., DeWitt, D., Luo, Q., Lohman, G.: On supporting containment queries in relational database management systems. In: SIGMOD 2001: Proceedings of the 2001 ACM SIGMOD international conference on Management of data, pp. 425–436. ACM Press, New York (2001)
Goldman, R., Widom, J.: Dataguides: Enabling query formulation and optimization in semistructured databases. In: Jarke, M., Carey, M.J., Dittrich, K.R., Lochovsky, F.H., Loucopoulos, P., Jeusfeld, M.A. (eds.) VLDB 1997, Proceedings of 23rd International Conference on Very Large Data Bases, pp. 436–445. Morgan Kaufmann, San Francisco (1997)
Chung, C.W., Min, J.K., Shim, K.: Apex: An adaptive path index for xml data. In: Proceedings of the 2002 ACM SIGMOD International Conference on Management of Data, June 3-6. ACM Press, New York (2002)
Cooper, B., Sample, N., Franklin, M.J., Hjaltason, G.R., Shadmon, M.: A fast index for semistructured data. In: Apers, P.M.G., Atzeni, P., Ceri, S., Paraboschi, S., Ramamohanarao, K., Snodgrass, R.T. (eds.) VLDB 2001, Proceedings of 27th International Conference on Very Large Data Bases, Roma, Italy, September 11-14, pp. 341–350. Morgan Kaufmann, San Francisco (2001)
Amato, G., Debole, F., Zezula, P., Rabitti, F.: Yapi: Yet another path index for xml searching. In: Koch, T., Sølvberg, I.T. (eds.) ECDL 2003. LNCS, vol. 2769, pp. 176–187. Springer, Heidelberg (2003)
Amato, G., Debole, F., Zezula, P., Rabitti, F.: Tree signatures for xml querying and navigation. In: Bellahsène, Z., Chaudhri, A.B., Rahm, E., Rys, M., Unland, R. (eds.) XSym 2003. LNCS, vol. 2824, pp. 149–163. Springer, Heidelberg (2003)
Fuhr, N., Großjohann, K.: XIRQL: An extension of XQL for information retrieval. In: ACM SIGIR Workshop On XML and Information Retrieval, Athens, Greece (2000)
Guha, S., Jagadish, H.V., Koudas, N., Srivastava, D., Yu, T.: Approximate xml joins. In: SIGMOD ’02: Proceedings of the 2002 ACM SIGMOD international conference on Management of data, pp. 287–298. ACM Press, New York (2002)
Amato, G., Rabitti, F., Savino, P., Zezula, P.: Region proximity in metric spaces and its use for approximate similarity search. ACM Trans. Inf. Syst. 21, 192–227 (2003)
Lucene (2000), http://lucene.apache.org/java/docs/index.html
Zezula, P., Amato, G., Debole, F., Rabitti, F.: Tree Signatures for XML Querying and Navigation, pp. 149–163. Springer, Heidelberg (2003)
Milos (2002), http://milos.isti.cnr.it
Amato, G., Gennaro, C., Rabitti, F., Savino, P.: Milos: A multimedia content management system for digital library applications. In: Heery, R., Lyon, L. (eds.) ECDL 2004. LNCS, vol. 3232, pp. 14–25. Springer, Heidelberg (2004)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Amato, G., Debole, F. (2005). A Native XML Database Supporting Approximate Match Search. In: Rauber, A., Christodoulakis, S., Tjoa, A.M. (eds) Research and Advanced Technology for Digital Libraries. ECDL 2005. Lecture Notes in Computer Science, vol 3652. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551362_7
Download citation
DOI: https://doi.org/10.1007/11551362_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28767-4
Online ISBN: 978-3-540-31931-3
eBook Packages: Computer ScienceComputer Science (R0)