Abstract
Nowadays, producing streams of data is not helpful if you cannot store them somewhere. Applications, software, and objects generate huge masses of data, which need to be collected, stored, and made available for analysis. Moreover, these data are very valuable and need to be preserved. That is why Big Data has attracted global interest from all leaders of information technology and new ways of storing information have emerged and flourished. Accordingly, while proceeding our analysis on this subject, we note that in terms of Big Data architecture, the storage layer is very useful and is essential for the proper functioning of any Big Data system. In fact, there are two types of storage at this layer: Hadoop distributed file system (HDFS) and NoSQL databases. We relied on previous works in which we identified key storage concepts through comparative studies of main big data distributions. The storage layer is located directly above Data Sources and Data ingestion layers for which we already proposed a meta-model. Thus, in this paper, we applied techniques related to Model Driven Engineering ‘MDE’ to provide a universal Meta-modeling for the storage layer at the level of a Big Data system.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Richards, Ken: Machine Learning: For Beginners—Your Starter Guide For Data Management, Model Training, Neural Networks. CreateSpace Independent Publishing Platform, Machine Learning Algorithms (2018)
Erraissi, A., Belangour, A., Tragha, A.: A big data hadoop building blocks comparative study. Int. J. Comput. Trends Technol. Accessed 18 June 2017
Erraissi, A., Belangour, A., Tragha, A.: A comparative study of hadoop-based big data architectures. Int. J. Web Appl. IJWA. 9(4) (2017)
Erraissi, A., Belangour, A., Tragha, A.: Digging into hadoop-based big data architectures. Int. J. Comput. Sci. Issues IJCSI. 14(6), 52–59 (2017)
Erraissi, A., Belangour, A, Tragha, A.: Meta-Modeling of Data Sources and Ingestion Big Data Layers. SSRN Scholarly Paper. Rochester, Social Science Research Network, NY 26 May 2018. https://papers.ssrn.com/abstract=3185342
White, T.: Hadoop—The Definitive Guide 4e-. 4th ed. O'Reilly, Beijing (2015)
Alapati, S.R.: Expert Hadoop Administration: Managing, Tuning, and Securing Spark, YARN, and HDFS. Addison Wesley, Boston, MA (2016)
Raj, P., Deka, G.C.: A Deep Dive into NoSQL Databases: The Use Cases and Applications. S.l.: Academic Press (2018)
Dunning, T., Friedman, E.: Real-World Hadoop (2015)
Blokdyk, G.: MapReduce Complete Self-Assessment Guide. CreateSpace Independent Publishing Platform (2017)
N. Sawant, Shah, H.: Big data application architecture Q & A a problem-solution approach. Apress (2013)
Balasubramanian, S.: Big Data Hadoop The Premier Interview Guide (2017)
Borthakur, : HDFS architecture guide. Hadoop Apache Proj. http//hadoop apache …, pp. 1–13 (2008)
Banane, M., Belangour, A., El Houssine, L.: Storing RDF data into big data NoSQL databases. In: Mizera-Pietraszko J., Pichappan P., Mohamed L. (eds) Lecture Notes in Real-Time Intelligent Systems. RTIS 2017. Advances in Intelligent Systems and Computing. vol. 756. Springer, Cham
Banane, M., Belangour, A., Labriji, E.H.: RDF data management systems based on NoSQL Databases : a comparative study. Int. J. Comput. Trends Technol. (IJCTT). V58(2), 98–102 (2018)
Sadalage, P.: NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence, 1st edn. Addison Wesley, Upper Saddle River, NJ (2009)
Nayak, A., Poriya, A., Poojary, D.: Type of NoSQL Databases and its Comparison with Relational Databases. Int. J. Appl. Inf. Syst. 5(4), 16–19 (2013)
Seeger, M., Ultra-Large-Sites, S.: Key-value stores: a practical overview. … Sci. Media, pp. 1–21 (2009)
Carlson, J.L.: Redis in Action. Pap/Psc. Shelter Island. Manning Publications, NY (2013)
Meyer, M.: Riak Handbook (2011)
Akboka, B., Filipchuk, N., Zimanyi, E.:Advance database: Voldemort (2015)
Abadi, D.: The Design and Implementation of Modern Column-Oriented Database Systems. Found. Trends® Databases, 5(3), 197–280 (2012)
VLDB 2009 Tutorial Column-Oriented Database Systems Column-Oriented Database Systems
George, L.: Hbase: The Definitive Guide: Random Access to Your Planet-size Data. 2nd Revised edition. O’Reilly Media, Inc, USA (2018)
Chang, F. et al.: Bigtable: A distributed storage system for structured data. 7th Symp. Oper. Syst. Des. Implement. (OSDI ’06). pp. 205–218, Novemb. 6–8, Seattle, WA, USA (2006)
Carpenter, J., Eben Hewitt.: Cassandra—The Definitive Guide 2e. 2nd ed. Sebastopol, O'Reilly, CA (2015)
Amazon Web Services: Amazon DynamoDB Developer Guide API Version 2012-08-10 (2012)
Issa, A., Schiltz, F.: Document oriented Databases (2015)
Team, C.: CouchDB 2.0 Reference Manual. Samurai Media Limited (2015)
Syn-Hershko, I.: RavenDB in Action. Manning Publications (2016)
Bradshaw, Shannon, Chodorow, Kristina: Mongodb: The Definitive Guide: Powerful and Scalable Data Storage, 3rd edn. Place of publication not identified, O’Reilly Media Inc, USA (2018)
Robinson, I., Webber, J., Elfrem, E.: Graph Databases 2e. 2nd ed. O'Reilly, Beijing (2015)
Baton, J., Van Bruggen, R.: Learning Neo4j 3.x—Second Edition: Effective data modeling, performance tuning and data visualization techniques in Neo4j. 2nd Revised edition. Packt Publishing Limited (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Erraissi, A., Belangour, A. (2019). Capturing Hadoop Storage Big Data Layer Meta-Concepts. In: Ezziyyani, M. (eds) Advanced Intelligent Systems for Sustainable Development (AI2SD’2018). AI2SD 2018. Advances in Intelligent Systems and Computing, vol 915. Springer, Cham. https://doi.org/10.1007/978-3-030-11928-7_37
Download citation
DOI: https://doi.org/10.1007/978-3-030-11928-7_37
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-11927-0
Online ISBN: 978-3-030-11928-7
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)