Skip to main content

Nuts and Bolts of ETL in Data Warehouse

  • Conference paper
  • First Online:
Emerging Trends in Expert Applications and Security

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 841))

Abstract

Data transformation from text files to database files, relational database management systems, and distributed database management systems in recent past has emerged a vast field of data warehouse. Currently data analytics is the most appealing field for the data scientists and challenges are very big as data volume is very huge. Not only data volume is high but the speed at which data is growing annually is exponentially. Data analytics has become a tool to grow the business by forecasting, business intelligence and decision support systems. In a simplified way, data is organized in the form of database, collective databases makes the data warehouse and the technologies like business intelligence, decision support system, and data analytics make use of data warehouse for their purpose. Big data is the enhanced form of the data warehouse which consists of the cloud storage and MapReduce-based architecture which consists of the clustering of data. This survey paper will give a high-level understanding of the existing data warehouse processing mechanisms including conventional processing and the distributed processing. Existing Extraction Transformation and Loading process will be analyzed for better understanding of the sub processes of the data warehouse building process.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Abdelaziz E (2015) Optimisation of the queries execution plan in cloud data warehouses, pp 129–133

    Google Scholar 

  2. Anand N, Kumar M (2013) anand2013. In: Modeling and optimization of extraction- transformation-loading (ETL) processes in data warehouse: an overview

    Google Scholar 

  3. Bondarev A, Zakirov D (2016) Data warehouse on Hadoop platform for decision support systems in education

    Google Scholar 

  4. Chen Z, Zhao T (2012) A new tool for ETL process. In: Proceedings of the 2012 international conference on image analysis signal processing IASP 2012, pp 269–273

    Google Scholar 

  5. Hamad MM, Jihad AA (2011) An enhanced technique to clean data in the data warehouse. In: Proceedings of the 4th international conference on developments in eSystems engineering DeSE 2011, pp 306–311

    Google Scholar 

  6. Ishizuka Y, Chen W, Paik I (2016) Workflow transformation for real-time big data processing

    Google Scholar 

  7. Kabiri A, Chiadmi D (2012) A method for modelling and organazing ETL processes. In: 2nd international conference on innovative computing technology INTECH 2012, pp 138–143

    Google Scholar 

  8. Maji G, Sen S (2015) A data warehouse based analysis on CDR to depict market share of different mobile brands. In: 2015 annual IEEE India conference, pp 1–6

    Google Scholar 

  9. Mukkamala R (2016) Privacy-aware big data warehouse architecture

    Google Scholar 

  10. Parul, Nawab S, Teggihalli S (2015) Performance optimization for extraction, transformation, loading and reporting of data. In: Global conference on communication technologies GCCT 2015, no. Gcct, pp 516–519

    Google Scholar 

  11. Prema A, Pethalakshmi A (2013) Novel approach in ETL. In: Proceedings of the 2013 international conference on pattern recognition, informatics and mobile engineering PRIME 2013, pp 429–434

    Google Scholar 

  12. Sharma S, Kumar K (2016) ETLR—Effective DWH design paradigm. In: Proceedings of the international conference on data engineering and communication technology. Springer, pp 149–157

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sharma Sachin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sachin, S., Goyal, S.K., Avinash, S., Kamal, K. (2019). Nuts and Bolts of ETL in Data Warehouse. In: Rathore, V., Worring, M., Mishra, D., Joshi, A., Maheshwari, S. (eds) Emerging Trends in Expert Applications and Security. Advances in Intelligent Systems and Computing, vol 841. Springer, Singapore. https://doi.org/10.1007/978-981-13-2285-3_1

Download citation

Publish with us

Policies and ethics