Skip to main content

Federated Learning: An Alternative Approach to Improving Medical Data Privacy and Security

  • Chapter
  • First Online:
Current and Future Trends in Health and Medical Informatics

Part of the book series: Studies in Computational Intelligence ((SCI,volume 1112))

Abstract

While medical data is integral to building robust predictive machine learning models for medical research, obtaining access to medical data is increasingly difficult. The challenges primarily arise from obtaining consent, concerns around the privacy and security of medical data and the technical challenge of migrating what can be huge datasets to a centralised location. As a result, this chapter analyses the question, “How can we make medical data more accessible for medical research whilst addressing the ethical and technical issues around data privacy and data-sharing?” Moreover, this work expands on federated learning that represents a paradigm shift in machine learning from both a technical and sociological perspective. From a technical perspective, federated learning enables machine learning models to be trained in a decentralised manner. It thus allows researchers to utilise data stored in separate locations. From a sociological perspective, federated learning represents a shift in the power dynamic between those providing and those using medical data for research. Under a federated learning framework, raw data never leaves the client’s device. Instead, the centralised server only receives encrypted parameter updates after a shared model is sent and trained locally on each client device. This ensures that the entities that provide medical data have more control over where their data is stored and what information is shared with other parties. Even though federated algorithms have a slightly lower accuracy when compared to non-federated algorithms, it comes with data privacy benefits that non-federated algorithms cannot provide.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. K. Abouelmehdi, A. Beni-Hessane, H. Khaloufi, Big healthcare data: preserving security and privacy. J. Big Data 5(1), 1–18 (2018)

    Google Scholar 

  2. A. Ballantyne, How should we think about clinical data ownership? J. Med. Ethics 46(5), 289–294 (2020)

    Google Scholar 

  3. R. Bey et al., Fold-stratified cross-validation for unbiased and privacy-preserving federated learning. J. Am. Med. Inf. Assoc. 27(8), 1244–1251 (2020)

    Google Scholar 

  4. S. Boughorbel et al., Federated uncertainty-aware learning for distributed hospital ehr data (2019). arXiv preprint arXiv:1910.12191

  5. A. Bourke, G. Bourke, Who owns patient data? the answer is not that simple (2020). https://blogs.bmj.com/bmj/2020/08/06/who-owns-patient-data-the-answer-is-not-that-simple/

  6. Y. Chen et al., Fedhealth: A federated transfer learning framework for wearable healthcare. IEEE Intell. Syst. 35(4), 83–93 (2020)

    Google Scholar 

  7. O. Choudhury, A. Gkoulalas-Divanis et al, Anonymizing data for privacy-preserving federated learning (2020). arXiv:2002.09096

  8. O. Choudhury, Y. Park et al., Predicting adverse drug reactions on distributed health data using federated learning, in AMIA Annual Symposium Proceedings, vol. 2019. (American Medical Informatics Association, 2019), p. 313

    Google Scholar 

  9. J. Cui et al., FeARH: Federated machine learning with anonymous random hybridization on electronic medical records. J. Biomed. Inf. 117, 103735 (2021)

    Google Scholar 

  10. S. Dash et al., Big data in healthcare: management, analysis and future prospects. J. Big Data 6(1), 1–25 (2019)

    Google Scholar 

  11. S. Day, M. Zweig, Beyond wellness for the healthy: digital health consumer adoption (2018). https://rockhealth.com/reports/beyond-wellness-for-the-healthy-digital-health-consumer-adoption-2018/

  12. D.-L. Donnelly, Privacy by design’ in the Eu general data protection regulation: a new privacy standard or the emperor’s new clothes? S. Afr. Law J. 139(3), 541–576 (2022)

    Google Scholar 

  13. E.S. Dove, M. Phillips, Privacy law, data sharing policies, and medical data: a comparative perspective, in Medical Data Privacy Handbook, pp. 639–678

    Google Scholar 

  14. Fallah, A., Mokhtari, A., Ozdaglar, A., Personalized federated learning with theoretical guarantees: a model agnostic meta-learning approach. Adv. Neural Inf. Process. Syst. 33, 3557–3568 (2020)

    Google Scholar 

  15. D. Gao et al., Hhhfl: Hierarchical heterogeneous horizontal federated learning for electroencephalography (2019). arXiv preprint arXiv:1909.05784

  16. A. Holst, Amount of data created, consumed, and stored 2010–2025. Technol. Telecommun. Retrieved, 06–29 (2021)

    Google Scholar 

  17. L. Huang, A. L. Shea et al., Patient clustering improves efficiency of federated machine learning to predict mortality and hospital stay time using distributed electronic medical records. J Biomed. Inf. 99, 103291 (2019)

    Google Scholar 

  18. L. Huang, Y. Yifeng et al., LoAdaBoost: loss-based AdaBoost federated machine learning with reduced computational complexity on IID and non-IID intensive care data. Plos One 15(4), e0230706 (2020)

    Google Scholar 

  19. C. Ju, D. Gao et al., Federated transfer learning for EEG signal classification, in 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC) (IEEE, 2020), pp. 3040–3045

    Google Scholar 

  20. C. Ju, R. Zhao et al., Privacy-preserving technology to help millions of people: federated prediction model for stroke prevention (2020). arXiv preprint arXiv:2006.10517

  21. M. Lawler, T. Maughan, From Rosalind Franklin to Barack Obama: data sharing challenges and solutions in genomics and personalised medicine. New Bioeth. 23(1), 64–73 (2017)

    Google Scholar 

  22. T. Li, A.K. Sahu et al., Federated optimization in heterogeneous networks, in Proceedings of Machine Learning and Systems, vol. 2, ed. by I. Dhillon, D. Papailiopoulos, V. Sze, pp. 429–450 (2020a). https://proceedings.mlsys.org/paper/2020/file/38af86134b65d0f10fe33d30dd76442e-Paper.pdf

  23. T. Li, A.K. Sahu et al., Federated optimization in heterogeneous networks. Proceed. Mach. Learn. Syst. 2, 429–450 (2020b)

    Google Scholar 

  24. T. Li, M. Sanjabi et al., Fair resource allocation in federated learning (2019). arXiv preprint arXiv:1905.10497

  25. X. Li et al., Multi-site fMRI analysis using privacy-preserving federated learning and domain adaptation: ABIDE results. Med. Image Anal. 65, 101765 (2020)

    Google Scholar 

  26. D. Liu et al., FADL: Federated-autonomous deep learning for distributed electronic health record (2018). arXiv preprint arXiv:1811.11400

  27. Y. Lu et al., Differentially private asynchronous federated learning for mobile edge computing in urban informatics. IEEE Trans. Ind. Inf. 16(3), 2134–2143 (2019)

    Google Scholar 

  28. B. McMahan et al., Communication-efficient learning of deep networks from decentralized data. Artif. Intell. Stati. PMLR 1273–1282 (2017)

    Google Scholar 

  29. M. Mirchev, I. Mircheva, A. Kerekovska, The academic viewpoint on patient data ownership in the context of big data: scoping review. J. Med. Internet Res. 22(8), e22214 (2020)

    Google Scholar 

  30. M. Mostert et al., Big Data in medical research and EU data protection law: challenges to the consent or anonymise approach. Eur. J. Hum. Genet. 24(7), 956–960 (2016)

    Google Scholar 

  31. J. Passerat-Palmbach et al., A blockchain-orchestrated federated learning architecture for healthcare consortia (2019). arXiv preprint arXiv:1910.12603

  32. S.R. Pfohl, A.M. Dai, K. Heller, Federated and differentially private learning for electronic health records (2019). arXiv preprint arXiv:1911.05861

  33. N. Rieke et al., The future of digital health with federated learning. NPJ Digit. Med. 3(1), 119 (2022)

    Google Scholar 

  34. A. Sadilek et al., Privacy-first health research with federated learning. NPJ Digit. Med. 4(1), 132 (2021)

    Google Scholar 

  35. Series, Cisco Cybersecurity (2020). Consumer Privacy Survey (Cisco, 2019)

    Google Scholar 

  36. N. Sethi, G.T. Laurie, Delivering proportionate governance in the era of eHealth: making linkage and privacy work together. Med. Law Int. 13(2–3), 168–204 (2013)

    Google Scholar 

  37. R. Shao et al., Stochastic channel-based federated learning for medical data privacy preserving (2019). arXiv preprint. arXiv:1910.11160

  38. P. Sharma, F.E. Shamout, D.A. Clifton, Preserving patient privacy while training a predictive model of in-hospital mortality (2019). arXiv preprint arXiv:1912.00354

  39. M. Sheller J et al., Multi-institutional deep learning modeling without sharing patient data: a feasibility study on brain tumor segmentation, in Brainlesion: Glioma, Multiple Sclerosis, Stroke and Traumatic Brain Injuries: 4th International Workshop, BrainLes 2018, Held in Conjunction with MICCAI 2018 (Springer, Granada, Spain, 2019). Revised Selected Papers, Part I 4., pp. 92–104

    Google Scholar 

  40. L. Siemons et al., Big Data for personalized healthcare. Int. J. Adv. Syst. Meas. 9(3/4), 220–229 (2016)

    Google Scholar 

  41. S.S. Silva Rincon, Complex anatomical patterns in mild cognitive impairment to Alzheimer’s disease/conversion (n.d.)

    Google Scholar 

  42. V. Smith et al., Federated multi-task learning. Adv. Neural Inf. Process. Syst. 30 (2019)

    Google Scholar 

  43. B. Van Asbroeck, J. Debussche, J. C’sar, Big data & issues & opportunities: data ownership. Bird & Bird (2019)

    Google Scholar 

  44. W.G. Van Panhuis et al., A systematic review of barriers to data sharing in public health. BMC Public Health 14(1), 1–9 (2014)

    Google Scholar 

  45. C. Wu et al., Communication-efficient federated learning via knowledge distillation. Nat. Commun. 13(1), 2032 (2022)

    Google Scholar 

  46. B. Yuan, S. Ge, W. Xing, A federated learning framework for healthcare IoT devices (2020). arXiv preprint arXiv:2005.05083

  47. H. Zhang et al., FedPCC: parallelism of communication and computation for federated learning in wireless networks, in IEEE Transactions on Emerging Topics in Computational Intelligence 6.6, pp. 1368–1377. https://doi.org/10.1109/TETCI.2022.3170471.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Farnaz Farid .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Chen, J., Farid, F., Polash, M. (2023). Federated Learning: An Alternative Approach to Improving Medical Data Privacy and Security. In: Daimi, K., Alsadoon, A., Seabra Dos Reis, S. (eds) Current and Future Trends in Health and Medical Informatics. Studies in Computational Intelligence, vol 1112. Springer, Cham. https://doi.org/10.1007/978-3-031-42112-9_13

Download citation

Publish with us

Policies and ethics