Abstract
With the increasing ubiquity of XML (eXtensible Markup Language), the industry is racing to provide XML infrastructure for e-commerce, information interchange, effective querying of diverse sources, and integration of diverse data. Large volumes of XML data are expected to be created manually from HTML documents or generated by WWW tools and electronic data interchange (EDI). In this paper, we examine how large amounts of XML data can be stored in a relational database. Our scheme accounts for the irregular features of XML: missing elements, multiple occurrences of the same element, and elements that have atomic values in some data items and structured values in others. A detailed experimental study demonstrates good query performance, effective space utilization, and scalability.
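To make the irregularities named in the abstract concrete, the following is a minimal sketch of storing an irregular XML fragment in a relational table. It uses a generic one-row-per-element layout and is not the mapping scheme proposed in the paper. The sample document, the `node` table, and the functions `shred` and `authors_per_book` are all assumptions made for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical sample: the second <book> has no <year>, repeats <author>,
# and gives one <author> a structured value instead of an atomic one.
SAMPLE = """
<library>
  <book><title>A</title><author>Smith</author><year>1999</year></book>
  <book><title>B</title>
    <author><first>Ann</first><last>Lee</last></author>
    <author>Jones</author>
  </book>
</library>
"""


def shred(xml_text, conn):
    """Store every element as one row: (id, parent, tag, pos, value)."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS node ("
        "id INTEGER PRIMARY KEY, parent INTEGER, "
        "tag TEXT, pos INTEGER, value TEXT)"
    )

    def visit(elem, parent, pos):
        text = (elem.text or "").strip()
        # Only leaf elements carry an atomic value; structured ones get NULL.
        value = text if text and len(elem) == 0 else None
        cur = conn.execute(
            "INSERT INTO node (parent, tag, pos, value) VALUES (?, ?, ?, ?)",
            (parent, elem.tag, pos, value),
        )
        node_id = cur.lastrowid
        for i, child in enumerate(elem):
            visit(child, node_id, i)

    visit(ET.fromstring(xml_text.strip()), None, 0)
    conn.commit()


def authors_per_book(conn):
    """Count <author> children per <book>; a missing element counts as 0."""
    return conn.execute(
        "SELECT b.id, COUNT(a.id) FROM node b "
        "LEFT JOIN node a ON a.parent = b.id AND a.tag = 'author' "
        "WHERE b.tag = 'book' GROUP BY b.id ORDER BY b.id"
    ).fetchall()


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    shred(SAMPLE, conn)
    for book_id, n_authors in authors_per_book(conn):
        print(book_id, n_authors)
```

In this layout a missing element is simply an absent row, a repeated element is several rows under the same parent, and a structured value is a row with a NULL `value` whose children hold the parts. The price is that most queries need self-joins on `node`, which is one reason more specialized mapping schemes are studied.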
Cite this article
Wang, W.Q., Lee, M.L., Ooi, B.C. et al. XStorM: A Scalable Storage Mapping Scheme for XML Data. World Wide Web 4, 101–119 (2001). https://doi.org/10.1023/A:1012412529611