Abstract
Extraction-Transformation-Loading (ETL) tools are pieces of software responsible for the extraction of data from several sources, their cleansing, customization and insertion into a data warehouse. In this paper, we delve into the logical design of ETL scenarios. We describe a framework for the declarative specification of ETL scenarios with two main characteristics: genericity and customization. Moreover, we present a palette of several templates, representing frequently used ETL activities along with their semantics and their interconnection. Finally, we discuss implementation issues and we present a graphical tool, ARKTOS II that facilitates the design of ETL scenarios, based on our model.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Gartner. ETL Magic Quadrant Update: Market Pressure Increases. Available at http://www.gartner.com/reprints/informatica/112769.html
H. Galhardas, D. Florescu, D. Shasha and E. Simon. Ajax: An Extensible Data Cleaning Tool. In Proc. ACM SIGMOD Intl. Conf. On the Management of Data, pp. 590, Dallas, Texas, (2000).
IBM. IBM Data Warehouse Manager. Available at http://www-3.ibm.com/software/data/db2/datawarehouse/
Informatica. PowerCenter. Available at http://www.informatica.com/products/data+integration/powercenter/default.htm
R. Kimbal, L. Reeves, M. Ross, W. Thornthwaite. The Data Warehouse Lifecycle Toolkit: Expert Methods for Designing, Developing, and Deploying Data Warehouses. John Wiley & Sons, February 1998.
Microsoft. Data Transformation Services. Available at http://www.microsoft.com
S. Naqvi, S. Tsur. A Logical Language for Data and Knowledge Bases. Computer Science Press 1989.
Oracle. Oracle Warehouse Builder Product Page. Available at http://otn.oracle.com/products/warehouse/content.html
V. Raman, J. Hellerstein. Potter’s Wheel: An Interactive Data Cleaning System. In Proceedings of 27 th International Conference on Very Large Data Bases (VLDB), pp. 381–390, Roma, Italy, (2001).
P. Vassiliadis, A. Simitsis, S. Skiadopoulos. Modeling ETL Activities as Graphs. In Proc. 4th Intl. Workshop on Design and Management of Data Warehouses (DMDW), pp. 52–61, Toronto, Canada, (2002).
P. Vassiliadis, A. Simitsis, S. Skiadopoulos. Conceptual Modeling for ETL Processes. In Proc. 5th ACM Intl. Workshop on Data Warehousing and OLAP (DOLAP), pp. 14–21, McLean, Virginia, USA (2002).
P. Vassiliadis, A. Simitsis, P. Georgantas, M. Terrovitis. A Framework for the design of ETL scenarios (long version). Available at http://cs.uoi.gr/~pvassil/publications/caise03_long.pdf
C. Zaniolo. LDL++ Tutorial. UCLA. http://pike.cs.ucla.edu/ldl/, Dec. 1998.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Vassiliadis, P., Simitsis, A., Georgantas, P., Terrovitis, M. (2003). A Framework for the Design of ETL Scenarios. In: Eder, J., Missikoff, M. (eds) Advanced Information Systems Engineering. CAiSE 2003. Lecture Notes in Computer Science, vol 2681. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45017-3_35
Download citation
DOI: https://doi.org/10.1007/3-540-45017-3_35
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-40442-2
Online ISBN: 978-3-540-45017-7
eBook Packages: Springer Book Archive