Abstract
The term “Big Data” is used to designate very large data sets, more varied and with more complex structures. Its characteristics are usually related to other challenges such as data storage, processing, analysis, or security. A Health Information System (HIS) refers to a system designed to manage healthcare data. This includes systems that collect, store, manage, and transmit data that is related to patients or the operational management of a hospital, clinic, testing laboratory, or any health care facility. Today, the health sector produces a significant amount of data. This information is voluminous and is stored in many different forms and types. It has become difficult to secure this data with traditional techniques and methods. The objective of this work is to propose a health data security process in a Big Data environment. The idea is to take advantage of the performance of Apache Hadoop and its components to create a secure environment for health data. The proposed solution will be based on four layers of security, the first component to be implemented in Apache Hadoop is Kerberos, the second element will be Apache Ranger, then Knox and finally we will use the encryption of Hadoop Distributed File System (HDFS).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Chen, M., Mao, S., Liu, Y.: Big data: A survey. Mob. Netw. Appl. 19(2), 171–209 (2014)
White, T.: Hadoop: the definitive guide. O’Reilly Media, Inc. (2012)
Tayefi, M., et al.: Challenges and opportunities beyond structured data in analysis of electronic health records, Wiley Interdisciplinary Rev. Comput. Stat. 13(6) (2021)
Bhathal, G.S., Singh, A.: Big Data: Hadoop framework vulnerabilities, security issues and attacks. Array 1–2 (2019)
Bahga, A., Vijay, M.: Big Data Science & Analytics (2019)
What is Big Data?-Redsen. https://www.redsen-consulting.com/data-analyse/big-data/. Last accessed 15 Oct 2022
Kataria, M., Mittal, P.: International journal of computer science and mobile computing BIG DATA: a review. Int. J. Comput. Sci. Mobile Comput. 3(7), 106–110 (2014)
Hadoop Developer Training. https://www.formation-bigdata.com/developpeur-hadoop/. Last accessed 15 Oct 2022
Hadoop–Apache Hadoop 3.3.4. https://hadoop.apache.org/docs/stable/. Last accessed 15 Oct 2022
Kandrouch, I., Hmina, N., Hmina, H.C.N.: A novel security architecture based on haystack system for HDFS storage system: extended work. Int. J. Innov. Technol. Explor. Eng. 9(4), 709–719 (2020)
What is Data Security?| Oracle France. https://www.oracle.com/fr/security/database-security/what-is-data-security/. Last accessed 15 Oct 2022
Abouelmehdi, K., Beni-Hssane, A., Khaloufi, H., Saadi, M.: Big data security and privacy in healthcare: a review. Procedia Comput. Sci. 113, 73–80 (2017)
Overview of Data Security Technology-Intel. https://www.intel.fr/content/www/fr/fr/analytics/data-security.html. Last accessed 15 Oct 2022
Kandrouch, I., Saadi, C., Hmina, N., Chaoui, H.: Security measures assessment for big data management systems. In: International Conference on Optimization and Applications, ICOA 2019, pp. 1–5 (2019)
Healthcare Big Data Projects, Applications and Examples. https://www.projectpro.io/article/5-healthcare-applications-of-hadoop-and-big-data/85#toc-2. Last accessed 15 Oct 2022
Apache Ranger|Cloudera. https://fr.cloudera.com/products/open-source/apache-hadoop/apache-ranger.html. Last accessed 15 Oct 2022
Apache Knox, l’API gateway d’Hadoop. https://blog.ippon.fr/2020/02/17/apache-knox-api-gateway-hadoop/. Last accessed 15 Oct 2022.
Hortonworks Data Platform : Apache Knox Gateway Administrator Guide
Apache Knox, c’est facile ! | Adaltas.” https://www.adaltas.com/fr/2019/02/04/apache-knox-2/. Last accessed 15 Oct 2022
Raja, M.C.: Comprehensive and Coordinated Security of Knox Gateway in Big Data (2015)
Introduction to Hadoop Security|Key Terminologies|Edureka. https://www.edureka.co/blog/hadoop-security/. Last accessed 15 Oct 2022
Suganya, S., Selvamuthukumaran, S.: Hadoop distributed file system security -a review. In: Proceedings of the 2018 International Conference on Current Trends towards Converging Technologies, ICCTCT 2018, pp. 1–5 (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Ameskour, I., Benbrahim, H., Amine, A. (2023). Health Data Security in a Big Data Environment. In: Idrissi, N., Hair, A., Lazaar, M., Saadi, Y., Erritali, M., El Kafhali, S. (eds) Artificial Intelligence and Green Computing. ICAIGC 2023. Lecture Notes in Networks and Systems, vol 806. Springer, Cham. https://doi.org/10.1007/978-3-031-46584-0_17
Download citation
DOI: https://doi.org/10.1007/978-3-031-46584-0_17
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-46583-3
Online ISBN: 978-3-031-46584-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)