Abstract
Recently, a large number of XML documents are available on the Internet. This trend motivated many researchers to analyze them multi-dimensionally in the same way as relational data. In this paper, we propose a new framework for multidimensional analysis of XML documents, which we call XML-OLAP. We base XML-OLAP on XML warehouses where every fact data as well as dimension data are stored as XML documents. We build XML cubes from XML warehouses. We propose a new multidimensional expression language for XML cubes, which we call XML-MDX. XML-MDX statements target XML cubes and use XQuery expressions to designate the measure data. They specify text mining operators for aggregating text constituting the measure data. We evaluate XML-OLAP by applying it to a U.S. patent XML warehouse. We use XML-MDX queries, which demonstrate that XML-OLAP is effective for multi-dimensionally analyzing the U.S. patents.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Abello, A., Samos, J., Saltor, F.: Understanding Facts in a Multidimensional Object-Oriented Model. In: The 4th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP 2001), Atlanta, pp. 32–39 (2001)
Gofarelli, M., Rizzi, S., Vrdoljak, B.: Data Warehouse Design from XML Sources. In: Proc. The 4th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP 2001), Atlanta, pp. 40–47 (2001)
Hümmer, W., Bauer, A., Harde, G.: XCube – XML For Data Warehouses. In: Proc. The 6th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP 2003), New Orleans, Louisiana, pp. 33–40 (2003)
Jensen, M.R., Mφller, T.H., Pedersen, T.B.: Specifying OLAP Cubes on XML Data. Journal of Intelligent Information Systems 17(2/3), 255–280 (2001)
Jensen, M.R., Mφller, T.H., Pedersen, T.B.: Converting XML Data To UML Diagrams For Conceptual Data Integration. In: Proc. The 1st Intl. Workshop on Data Integration Over The Web, pp. 17–31 (2001)
Lujan-Mora, S., Trujillo, J., Vassiliadis, P.: Advantages of UML for Multidimensional Modeling. In: Proc. the 6th Intl. Conf. on Enterprise Information Systems (ICEIS 2004), pp. 298–305. ICEIS Press, Porto (2004)
Nassis, V., Rajugan, R., Dillon, T.S., Rahayu, W.: Conceptual Design of XML Document Warehouses. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2004. LNCS, vol. 3181, pp. 1–14. Springer, Heidelberg (2004)
Niemi, T., Niinimaki, M., Nummenmaa, J., Thanisch, P.: Constructing an OLAP Cube from Distributed XML Data. In: Proc. the 5th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP 2002), McLean, pp. 22–27 (2002)
Pokorny, J.: Modelling Stars Using XML. In: Proc. The 4th ACM Intl. Workshop on Dara Warehousing and OLAP (DOLAP 2001), Atlanta, pp. 24–31 (2001)
Rusu, L.I., Rahayu, W., Taniar, D.: On Building XML Data Warehouses. In: Yang, Z.R., Yin, H., Everson, R.M. (eds.) IDEAL 2004. LNCS, vol. 3177, pp. 293–299. Springer, Heidelberg (2004)
Spofford, G.: MDX Solutions with Microsoft SQL Server Analysis Services. John Wiley & Sons, Chichester (2001)
United States Patent and Trademark Office, http://www.uspto.gov/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Park, BK., Han, H., Song, IY. (2005). XML-OLAP: A Multidimensional Analysis Framework for XML Warehouses. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2005. Lecture Notes in Computer Science, vol 3589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11546849_4
Download citation
DOI: https://doi.org/10.1007/11546849_4
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28558-8
Online ISBN: 978-3-540-31732-6
eBook Packages: Computer ScienceComputer Science (R0)