Skip to main content

Health Data Security in a Big Data Environment

  • Conference paper
  • First Online:
Artificial Intelligence and Green Computing (ICAIGC 2023)

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 806))

  • 124 Accesses

Abstract

The term “Big Data” is used to designate very large data sets, more varied and with more complex structures. Its characteristics are usually related to other challenges such as data storage, processing, analysis, or security. A Health Information System (HIS) refers to a system designed to manage healthcare data. This includes systems that collect, store, manage, and transmit data that is related to patients or the operational management of a hospital, clinic, testing laboratory, or any health care facility. Today, the health sector produces a significant amount of data. This information is voluminous and is stored in many different forms and types. It has become difficult to secure this data with traditional techniques and methods. The objective of this work is to propose a health data security process in a Big Data environment. The idea is to take advantage of the performance of Apache Hadoop and its components to create a secure environment for health data. The proposed solution will be based on four layers of security, the first component to be implemented in Apache Hadoop is Kerberos, the second element will be Apache Ranger, then Knox and finally we will use the encryption of Hadoop Distributed File System (HDFS).

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Chen, M., Mao, S., Liu, Y.: Big data: A survey. Mob. Netw. Appl. 19(2), 171–209 (2014)

    Article  Google Scholar 

  2. White, T.: Hadoop: the definitive guide. O’Reilly Media, Inc. (2012)

    Google Scholar 

  3. Tayefi, M., et al.: Challenges and opportunities beyond structured data in analysis of electronic health records, Wiley Interdisciplinary Rev. Comput. Stat. 13(6) (2021)

    Google Scholar 

  4. Bhathal, G.S., Singh, A.: Big Data: Hadoop framework vulnerabilities, security issues and attacks. Array 1–2 (2019)

    Google Scholar 

  5. Bahga, A., Vijay, M.: Big Data Science & Analytics (2019)

    Google Scholar 

  6. What is Big Data?-Redsen. https://www.redsen-consulting.com/data-analyse/big-data/. Last accessed 15 Oct 2022

  7. Kataria, M., Mittal, P.: International journal of computer science and mobile computing BIG DATA: a review. Int. J. Comput. Sci. Mobile Comput. 3(7), 106–110 (2014)

    Google Scholar 

  8. Hadoop Developer Training. https://www.formation-bigdata.com/developpeur-hadoop/. Last accessed 15 Oct 2022

  9. Hadoop–Apache Hadoop 3.3.4. https://hadoop.apache.org/docs/stable/. Last accessed 15 Oct 2022

  10. Kandrouch, I., Hmina, N., Hmina, H.C.N.: A novel security architecture based on haystack system for HDFS storage system: extended work. Int. J. Innov. Technol. Explor. Eng. 9(4), 709–719 (2020)

    Google Scholar 

  11. What is Data Security?| Oracle France. https://www.oracle.com/fr/security/database-security/what-is-data-security/. Last accessed 15 Oct 2022

  12. Abouelmehdi, K., Beni-Hssane, A., Khaloufi, H., Saadi, M.: Big data security and privacy in healthcare: a review. Procedia Comput. Sci. 113, 73–80 (2017)

    Article  Google Scholar 

  13. Overview of Data Security Technology-Intel. https://www.intel.fr/content/www/fr/fr/analytics/data-security.html. Last accessed 15 Oct 2022

  14. Kandrouch, I., Saadi, C., Hmina, N., Chaoui, H.: Security measures assessment for big data management systems. In: International Conference on Optimization and Applications, ICOA 2019, pp. 1–5 (2019)

    Google Scholar 

  15. Healthcare Big Data Projects, Applications and Examples. https://www.projectpro.io/article/5-healthcare-applications-of-hadoop-and-big-data/85#toc-2. Last accessed 15 Oct 2022

  16. Apache Ranger|Cloudera. https://fr.cloudera.com/products/open-source/apache-hadoop/apache-ranger.html. Last accessed 15 Oct 2022

  17. Apache Knox, l’API gateway d’Hadoop. https://blog.ippon.fr/2020/02/17/apache-knox-api-gateway-hadoop/. Last accessed 15 Oct 2022.

  18. Hortonworks Data Platform : Apache Knox Gateway Administrator Guide

    Google Scholar 

  19. Apache Knox, c’est facile ! | Adaltas.” https://www.adaltas.com/fr/2019/02/04/apache-knox-2/. Last accessed 15 Oct 2022

  20. Raja, M.C.: Comprehensive and Coordinated Security of Knox Gateway in Big Data (2015)

    Google Scholar 

  21. Introduction to Hadoop Security|Key Terminologies|Edureka. https://www.edureka.co/blog/hadoop-security/. Last accessed 15 Oct 2022

  22. Suganya, S., Selvamuthukumaran, S.: Hadoop distributed file system security -a review. In: Proceedings of the 2018 International Conference on Current Trends towards Converging Technologies, ICCTCT 2018, pp. 1–5 (2018)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Houssam Benbrahim .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ameskour, I., Benbrahim, H., Amine, A. (2023). Health Data Security in a Big Data Environment. In: Idrissi, N., Hair, A., Lazaar, M., Saadi, Y., Erritali, M., El Kafhali, S. (eds) Artificial Intelligence and Green Computing. ICAIGC 2023. Lecture Notes in Networks and Systems, vol 806. Springer, Cham. https://doi.org/10.1007/978-3-031-46584-0_17

Download citation

Publish with us

Policies and ethics