Abstract
The ability to store information contained in XML documents for future reference becomes a very important issue these days, as the number of applications which use and exchange data in XML format is growing continuously. Moreover, the contents of XML documents are dynamic and they change across time, so researchers are looking to efficient solutions to store the documents’ versions and eventually extract interesting information out of them. This paper proposes a novel approach for mining association rules from changes between versions of dynamic XML documents, in a simple manner, by using the information contained in the consolidated delta. We argue that by applying our proposed algorithm, important information about the behaviour of the changed XML document in time could be extracted and then used to make predictions about its future performance.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Rusu, L.I., Rahayu, W., Taniar, D.: Maintaining Versions of Dynamic XML Documents. In: Ngu, A.H.H., Kitsuregawa, M., Neuhold, E.J., Chung, J.-Y., Sheng, Q.Z. (eds.) WISE 2005. LNCS, vol. 3806, pp. 536–543. Springer, Heidelberg (2005)
Zhao, Q., Bhowmick, S.S., Mohania, M., Kambayashi, Y.: FCS Mining: Discovering Frequently Changing Structures from Historical Structural Deltas of Unordered XML. In: Proceedings of the 13th Conference on Information and Knowledge Management (CIKM 2004), pp. 188–197 (2004)
Wang, Y., DeWitt, D.J., Cai, J.Y.: X-Diff: An Effective Change Detection Algorithms for XML Documents. In: Proceedings of ICDE 2003, pp. 519–530. IEEE Computer Society Press, Los Alamitos (2003)
Zhao, Q., Bhowmick, S.S., Mohania, M., Kambayashi, Y.: Discovering Frequently Changing Structures from Historical Structural Deltas of Unordered XML. In: Proceedings of ACM CIKM 2004, Washington, US, November 8-13, 2004, pp. 188–197 (2004)
Zhao, Q., Bhowmick, S.S., Mandria, S.: Discovering Pattern-Based Dynamic Structures from Versions of Unordered XML Documents. In: Kambayashi, Y., Mohania, M., Wöß, W. (eds.) DaWaK 2004. LNCS, vol. 3181, pp. 77–86. Springer, Heidelberg (2004)
SIGMOD XML dataset, http://www.cs.washington.edu/datasets
Yin, M., Goh, D.H.-L., Lim, E.-P., Sun, A.: Discovery of Content Entities from Web Sites Using Web Unit Mining. International Journal of Web Information Systems 1(3), 123–136 (2005)
Zhou, B., Hui, S.C., Fong, A.C.M.: A Web Usage Lattice Based Mining Approach for Intelligent Web Perzonalization. International Journal of Web Information Systems 1(3), 137–146 (2005)
Quang, N.H., Rahayu, W.: XML Schema Design Approach. International Journal of Web Information Systems 1(3), 161–178 (2005)
Rusu, L.I., Rahayu, W., Taniar, D.: A methodology for building XML data warehouses. International Journal of Data Warehousing and Mining 1(2), 67–92 (2005)
Feng., L., Dillon, T.: An XML-enabled data mining query language: XML-DMQL. International Journal of Business Intelligence and Data Mining 1(1), 22–41 (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rusu, L.I., Rahayu, W., Taniar, D. (2006). Mining Changes from Versions of Dynamic XML Documents. In: Nayak, R., Zaki, M.J. (eds) Knowledge Discovery from XML Documents. KDXD 2006. Lecture Notes in Computer Science, vol 3915. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11730262_3
Download citation
DOI: https://doi.org/10.1007/11730262_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33180-3
Online ISBN: 978-3-540-33181-0
eBook Packages: Computer ScienceComputer Science (R0)