Abstract
In this paper we describe the design of a tool that supports the integration of independently developed data warehouses, a problem that arises in several common scenarios. The basic facility of the tool is a test that checks whether a matching between heterogeneous dimensions is valid with respect to a number of desirable properties. Two strategies are then provided to perform the actual integration. The first refers to a scenario of loosely coupled integration, in which we only need to identify the information common to the sources and perform drill-across queries over them. The second refers to a scenario of tightly coupled integration: its goal is to derive a materialized view by merging the sources, and queries are then performed against that view. We illustrate the architecture and functionality of the tool and the underlying techniques that implement the two integration strategies.
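To make the loosely coupled scenario concrete, the following is a minimal sketch of a drill-across query over two sources that share a matched dimension level. It is not the tool's implementation: the fact rows, the `matching` dictionary, the function names, and the one-to-one check (standing in for the desirable properties, which the abstract does not enumerate) are all assumptions made for illustration.

```python
from collections import defaultdict

# Hypothetical fact rows from two independently developed warehouses.
sales_a = [("Rome", 120), ("Milan", 80), ("Rome", 30)]   # (city, revenue)
orders_b = [("ROMA", 7), ("MILANO", 4), ("NAPOLI", 2)]   # (city code, orders)

# Hypothetical matching between members of the two city levels.
matching = {"Rome": "ROMA", "Milan": "MILANO"}


def is_one_to_one(mapping):
    """Return True if no two source members map to the same target member.

    Assumed stand-in for a validity test on the matching.
    """
    targets = list(mapping.values())
    return len(targets) == len(set(targets))


def roll_up(rows):
    """Sum the measure of each (member, value) row per member."""
    totals = defaultdict(int)
    for member, value in rows:
        totals[member] += value
    return dict(totals)


def drill_across(rows_a, rows_b, mapping):
    """Join the two aggregated sources on the matched members only."""
    if not is_one_to_one(mapping):
        raise ValueError("matching is not one-to-one")
    totals_a, totals_b = roll_up(rows_a), roll_up(rows_b)
    return {
        a: (totals_a[a], totals_b[b])
        for a, b in mapping.items()
        if a in totals_a and b in totals_b
    }


result = drill_across(sales_a, orders_b, matching)
```

Members without a counterpart in the matching (here `NAPOLI`) are left out of the result, which reflects the idea that only the information common to both sources takes part in a drill-across query.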
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Torlone, R., Panella, I. (2005). Design and Development of a Tool for Integrating Heterogeneous Data Warehouses. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2005. Lecture Notes in Computer Science, vol 3589. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11546849_11
DOI: https://doi.org/10.1007/11546849_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28558-8
Online ISBN: 978-3-540-31732-6
eBook Packages: Computer Science, Computer Science (R0)