Abstract
The maintenance of data warehouses (DWs) is becoming an increasingly important topic due to the growing use, derivation, and integration of digital information. Most previous work has dealt with a single centralized data warehouse. In this paper, we focus on environments with multiple DWs, some of which may be derived from other DWs. In such a large-scale environment, data updates from base sources may arrive at individual data warehouses in different orders, resulting in inconsistent data warehouse extents. We propose to address this problem by employing a registry agent responsible for establishing one unique order for the propagation of updates from the base sources to the DWs. With this solution, individual DW managers can still maintain their respective extents autonomously and independently of each other, which allows them to apply any existing incremental maintenance algorithm from the literature. We demonstrate that this registry-based coordination approach (RyCo) indeed achieves consistency across all DWs.
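The ordering idea in the abstract can be illustrated with a small sketch. This is not the RyCo protocol itself, whose details the abstract does not give; it only shows, under stated assumptions, how a single registry that stamps updates with a global sequence number lets independent warehouses converge on the same order. The names Update, Registry, and Warehouse are hypothetical, and the sketch assumes every warehouse eventually receives every stamped update.

```python
import heapq
from dataclasses import dataclass
from itertools import count


@dataclass(frozen=True)
class Update:
    """A base-source update (hypothetical shape, for illustration only)."""
    source: str
    payload: str


class Registry:
    """Hypothetical registry agent: stamps each update with a global sequence number."""

    def __init__(self):
        self._next = count(1)

    def register(self, update):
        # The stamp defines the one unique order shared by all warehouses.
        return next(self._next), update


class Warehouse:
    """Hypothetical DW manager: applies stamped updates strictly in stamp order."""

    def __init__(self, name):
        self.name = name
        self.applied = []      # updates applied so far, in order
        self._pending = []     # min-heap of updates that arrived early
        self._expected = 1     # next sequence number to apply

    def receive(self, seq, update):
        # Buffer the update, then drain every update that is now in order.
        heapq.heappush(self._pending, (seq, update.source, update.payload))
        while self._pending and self._pending[0][0] == self._expected:
            _, source, payload = heapq.heappop(self._pending)
            self.applied.append((source, payload))
            self._expected += 1


registry = Registry()
stamped = [
    registry.register(Update("S1", "insert r1")),
    registry.register(Update("S2", "delete r7")),
    registry.register(Update("S1", "insert r2")),
]

dw_a, dw_b = Warehouse("A"), Warehouse("B")
for seq, upd in stamped:            # arrives in registry order
    dw_a.receive(seq, upd)
for seq, upd in reversed(stamped):  # arrives in the opposite order
    dw_b.receive(seq, upd)
```

In this sketch both warehouses end with the same applied list even though the updates reached them in different orders. How each warehouse computes its extent from an update is left out, since the abstract states that any existing incremental maintenance algorithm may be used for that step.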
This work was supported in part by several grants from NSF, namely, the NSF NYI grant #IRI 97-96264, the NSF CISE Instrumentation grant #IRIS 97-29878, and the NSF grant #IIS 97-32897. Dr. Rundensteiner would like to thank our industrial sponsors, in particular, IBM for the IBM partnership award, and GTE for partial support of Xin Zhang.
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ding, L., Zhang, X., Rundensteiner, E.A. (2000). Scalable Maintenance of Multiple Interrelated Data Warehousing Systems. In: Kambayashi, Y., Mohania, M., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2000. Lecture Notes in Computer Science, vol 1874. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44466-1_11
DOI: https://doi.org/10.1007/3-540-44466-1_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67980-6
Online ISBN: 978-3-540-44466-4
eBook Packages: Springer Book Archive