Abstract
Multipurpose Computation is reached with the platform as a service with container orchestration (PaaSCO) which integrates containerized technologies in a package-based style. These platforms include Kubernetes, OpenShift, Distributed Cloud Operating System DC/OS, Cloud Foundry, and Docker Swarm. Using a PaaSCO facilitates and accelerates the construction of platforms for developing applications by unifying components and software frameworks to build applications for different use cases. Containers enable ubiquity, portability, and distributed computing capabilities, as well as the programming and development of different applications and services to engineering and sciences areas like the internet of things, software development, data analytics, microservices, and artificial intelligence. This means that the same solution can be deployed in different PaaSCO environments including public, private, and hybrid clouds to obtain the same service. The National Laboratory of Information Technologies of the Autonomous University of Ciudad Juárez (LaNTI) offers computing infrastructure services to researchers inside and outside the institution. Due to their lack of expertise in infrastructure and computing tools, most researchers have difficulty installing and configuring the software tools necessary for their research. We present a solution using the PaaSCO Distributed Cloud Operating System (DCOS) through the integration of a stack of components required by a Big Data architecture. By implementing the solution proposed in this work, we are contributing to the academic and research environment by accelerating the generation, and implementation of components required by the Big Data Analytics system. Using containerized tools makes it easy for researchers to focus on the substantive part of their research. In addition to allowing data analytics tasks with a focus on anomaly detection, two experiments demonstrating resource elasticity and isolation are presented to demonstrate the platform additional benefits.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
DC/OS-D2iQ docs. https://archive-docs.d2iq.com/mesosphere/dcos/. Accessed 29 Oct 2022
Getting started with Kubernetes-DZone refcardz. https://dzone.com/refcardz/getting-started-kubernetes. Accessed 05 Oct 2023
Akoka, J., Comyn-Wattiau, I., Laoufi, N.: Research on big data-a systematic mapping study. Comput. Stand. Interfaces 54, 105–115 (2017). https://doi.org/10.1016/j.csi.2017.01.004.
Beimborn, D., Miletzki, T., Wenzel, S.: Platform as a service (PaaS). WIRTSCHAFTSINFORMATIK 53(6), 371–375 (2011)
Cisneros, L., Rivera, G., Florencia, R., Sánchez-Solís, J.P.: Fuzzy optimisation for business analytics: a bibliometric analysis. J. Intell. Fuzzy Syst. 44(2), 2615–2630 (2023). https://doi.org/10.3233/JIFS-221573
D2iq, I.: Architecture. https://archive-docs.d2iq.com/mesosphere/dcos/2.2/overview/architecture/. Accessed 28 Oct 2023
Dean, J.: Big Data, Data Mining, and Machine Learning: Value Creation for Business Leaders and Practitioners. Wiley (2014)
Docker, I.: What is a container? (2021). https://www.docker.com/resources/what-container/. Accessed 23 Jan 2021
Dongarra, J., Lastovetsky, A.L.: High Performance Heterogeneous Computing. Wiley (2009)
GmbH, C.: Platform as a service (PaaS): definition, examples, and advantages (2021). https://www.linkedin.com/pulse/platform-service-paas-definition-examples-advantages-cubeware-gmbh/. Accessed 22 Oct 2022
Hindman, B., Konwinski, A., Zaharia, M.: Mesos: a platform for fine-grained resource sharing in the data center. In: Proceedings of NSDI 2011: 8th USENIX Symposium on Networked Systems Design and Implementation, pp. 295–308 (2011)
Kakadia, D.: Apache Mesos Essentials, 2015 edn. Packt Publishing Ltd., Birmingham (2015). www.packtpub.com
Kolanovic, M., Krishnamachari, R.: Big data and AI strategies: machine learning and alternative data approach to investing. Technical Report May, J.P. Morgan (2017)
Li, Z., Zhang, Y., Liu, Y.: Towards a full-stack devops environment (platform-as-a-service) for cloud-hosted applications (2017). https://doi.org/10.1109/TST.2017.7830891
Linthicum, D.S.: Moving to autonomous and self-migrating containers for cloud applications. IEEE Cloud Comput. 3(6), 6–9 (2016)
Mar-Cupido, R., García, V., Rivera, G., Sánchez, J.S.: Deep transfer learning for the recognition of types of face masks as a core measure to prevent the transmission of covid-19. Appl. Soft Comput. 125, 109207 (2022). https://doi.org/10.1016/j.asoc.2022.109207
Mohamed, M., Warke, A., Hildebrand, D., Engel, R., Ludwig, H., Mandagere, N.: Ubiquity: extensible persistence as a service for heterogeneous container-based frameworks. In: On the Move to Meaningful Internet Systems. OTM 2017 Conferences: Confederated International Conferences: CoopIS, C &TC, and ODBASE 2017, Rhodes, Greece, October 23–27, 2017, Proceedings, Part I, pp. 716–731. Springer (2017)
Mohanty, H., Bhuyan, P., Chenthati, D.: Big Data: A Primer, vol. 11. Springer (2015)
O’Brien, S.: What is PaaS?-platform as a service definition, benefits, platforms & providers (2021). https://www.ringcentral.com/gb/en/blog/definitions/platform-as-a-service-paas/. Accessed 23 Jan 2023
Pahl, C.: Containerization and the paas cloud. IEEE Cloud Comput. 2(3), 24–31 (2015)
Pratt, H., Coenen, F., Broadbent, D.M., Harding, S.P., Zheng, Y.: Convolutional Neural Networks for Diabetic Retinopathy (2016)
Red Hat: Red Hat OpenShift (2022). https://www.redhat.com/en/technologies/cloud-computing/openshift. Accessed 05 Oct 2022
Rivera, G., Cruz-Reyes, L., Fernandez, E., Gomez-Santillan, C., Rangel-Valdez, N., Coello Coello, C.A.: An ACO-based hyper-heuristic for sequencing many-objective evolutionary algorithms that consider different ways to incorporate the DM’s preferences. Swarm Evol. Comput. 76, 101211 (2023). https://doi.org/10.1016/j.swevo.2022.101211
Rivera, G., Porras, R., Florencia, R., Sánchez-Solís, J.P.: Lidar applications in precision agriculture for cultivating crops: a review of recent advances. Comput. Electron. Agric. 207, 107737 (2023). https://doi.org/10.1016/j.compag.2023.107737
Romero-Aroca, P., Sagarra Álamo, R.: La retinopatía diabética e hipertensiva (2018)
SaaS Scout Research Group: Big Data Statistics, Growth & Facts 2021 | SaaS Scout (formerly SoftwareFindr) (2020). https://saasscout.com/statistics/big-data-statistics/
Statista: Data Volume in The World 2010–2024 (2020). https://www.statista.com/statistics /871513/worldwide-data-created/. Accessed 29 Jun 2021
Vozábal, M.: Department of Computer Science and Engineering Master Thesis Tools and Methods for Big Data Analysis. Ph.D. thesis, University of West Bohemia (2016)
Wiggins, A.: The twelve-factor app (2017). https://12factor.net/. Accessed 5 Jan 2023
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this chapter
Cite this chapter
Hernández-Rivas, A., Morales-Rocha, V., Ruiz-Hernández, O. (2023). Big Data Platform as a Service for Anomaly Detection. In: Rivera, G., Cruz-Reyes, L., Dorronsoro, B., Rosete, A. (eds) Data Analytics and Computational Intelligence: Novel Models, Algorithms and Applications. Studies in Big Data, vol 132. Springer, Cham. https://doi.org/10.1007/978-3-031-38325-0_7
Download citation
DOI: https://doi.org/10.1007/978-3-031-38325-0_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-38324-3
Online ISBN: 978-3-031-38325-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)