Abstract
Natural language processing (NLP) is the subfield of artificial intelligence that has the potential to make human language analyzable by computers. NLP is increasingly proving its importance in the medical field where a huge amount of data remains unstructured (free text) stored as electronic medical records (EMR); discharge summaries, lab reports, clinical notes, pa-thology reports, etc. Traditional Machine learning (ML) based approaches have been widely used for medical NLP tasks, but these methods require a set of manual work and still suffer in terms of accuracy. However, deep learning (DL) based methods have made significant improvement. The main goal of this study is to present the state-of-the-art DL based NLP tech-niques in healthcare. We started by presenting word embedding techniques and popular deep learning models used in this area, and then reviewed ap-plications of NLP tasks in medical domain such as classification, predic-tion, and information extraction. We concluded our study with analyzing cited architectures and showing the promising results of CNN and BiLSTM and BERT fine-tuning.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Liu, Z., Lin, Y., Sun, M.: Representation learning and NLP. In: Representation Learning for Natural Language Processing, pp. 1–11. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-5573-2_1
Liu, Z., Lin, Y., Sun, M.: Word representation. In: Representation Learning for Natural Language Processing, pp. 13–41. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-5573-2_2
Plate, T.: Distributed Representations. Cognitive Science, pp. 1–15 (2003)
Mikolov, T., Chen, K., Corrado, G., Dean, J.: Efficient estimation of word representations in vector space. arXiv, arXiv:1301.3781 (2013)
Banerjee, I., Chen, M.C., Lungren, M.P., Rubin, D.L.: Radiology report annotation using intelligent word embeddings: applied to multi-institutional chest CT cohort. J. Biomed. Inform. 77, 11–20 (2018). https://doi.org/10.1016/j.jbi.2017.11.012
Soriano, I.M., Castro, J.L., Fernandez-Breis, J.T., Román, I.S., Barriuso, A.A., Baraza, D.G.: Snomed2Vec: Representation of SNOMED CT Terms with Word2Vec, pp. 678–83. IEEE Computer Society (2019). https://doi.org/10.1109/CBMS.2019.00138
Pennington, J., Socher, R., Manning, C.D.: Glove: Global Vectors for Word Representation, vol. 14, 1532–1543 (2014). https://doi.org/10.3115/v1/D14-1162
Kalyan, K.S., Sangeetha, S.: SECNLP: A Survey of Embeddings in Clinical Natural Language Processing. J. Biomed. Inform. 101, 103323 (2020). https://doi.org/10.1016/j.jbi.2019.103323
Khattak, F.K., Jeblee, S., Pou-Prom, C., Abdalla, M., Meaney, C., Rudzicz, F.: A survey of word embeddings for clinical text. J. Biomed. Inf. 100(1-4) 100057 (2019). https://doi.org/10.1016/j.yjbinx.2019.100057
Habib, M., Faris, M., Alomari, A., Faris, H.: AltibbiVec: a word embedding model for medical and health applications in the Arabic language. IEEE Access 9, 133875–88 (2021). https://doi.org/10.1109/ACCESS.2021.3115617
Vaswani, A., et al.: Attention Is All You Need. arXiv (2017). https://doi.org/10.48550/arXiv.1706.03762
Liu, Y., et al.: RoBERTa: a robustly optimized bert pretraining approach. arXiv (2019). https://doi.org/10.48550/arXiv.1907.11692
Lee, J., et al.: BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinform. 36, btz682 (2019). https://doi.org/10.1093/bioinformatics/btz682
Rasmy, L., Xiang, Y., Xie, Z., Tao, C., Zhi, D.: Med-BERT: PRetrained contextualized embeddings on large-scale structured electronic health records for disease prediction. npj Digit. Med. 4(1), 1–13 (2021). https://doi.org/10.1038/s41746-021-00455-y
Huang, K., Altosaar, J., Ranganath, R.: ClinicalBERT: modeling clinical notes and predicting hospital readmission. ArXiv (2019)
Jia, Y., Kaul, C., Lawton, T., Murray-Smith, R., Habli, I.: Prediction of weaning from mechanical ventilation using convolutional neural networks. Artif. Intell. Med. 117, 102087 (2021). https://doi.org/10.1016/j.artmed.2021.102087
Borjali, A., Magnéli, M., Shin, D., Malchau, H., Mu-ratoglu, O.K., Varadarajan, K.M.: Natural language processing with deep learning for medical adverse event detection from free-text medical narratives: a case study of detecting total hip replacement dislocation. Comput. Biol. Med. 129, 104140 (2021). https://doi.org/10.1016/j.compbiomed.2020.104140
Guan, M., Cho, S., Petro, R., Zhang, W., Pasche, B., Topaloglu, U.: Natural language processing and recurrent network models for identifying genomic mutation-associated cancer treatment change from patient progress notes. JAMIA Open 2(1), 139–149 (2019). https://doi.org/10.1093/jamiaopen/ooy061
Bayrak, S., Yucel, E., Takci, H.: Epilepsy radiology reports classification using deep learning networks (2022). https://doi.org/10.32604/cmc.2022.018742
Chaib, R., Azizi, N., Schwab, D., Gasmi, I., Chaib, A.: GL-LSTM Model for multi label text classification of cardiovascular disease reports (2022). https://easychair.org/publications/preprint/BMRx
Mao, C., Yao, L., Luo, Y.: AKI-BERT: a pre-trained clinical language model for early prediction of acute kidney injury. arXiv (2022). https://doi.org/10.48550/arXiv.2205.03695
Liu, X., Chen, Y., Bae, J., Li, H., Johnston, J., Sanger, T.: Predicting heart failure readmission from clinical notes using deep learning. In: 2019 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 2642–48 (2019). https://doi.org/10.1109/BIBM47256.2019.8983095
Schneider, E.T.R., et al.: BioBERTpt - a portuguese neural language model for clinical named entity recognition. In: Proceedings of the 3rd Clinical Natural Language Processing Workshop. Association for Computational Linguistics (2020). https://doi.org/10.18653/v1/2020.clinicalnlp-1.7
Khalafi, S., Ghadiri, N., Moradi, M.: Hybrid deep learning methods for phenotype prediction from clinical notes. arXiv (2022). https://doi.org/10.48550/arXiv.2108.10682
Mahajan, D., Liang, J.J., Tsou, C.-H.: Extracting daily dosage from medication instructions in EHRs: an automated approach and lessons learned. arXiv (2021). https://doi.org/10.48550/arXiv.2005.10899
Jouffroy, J., Feldman, S., Lerner, I., Rance, B., Burgun, A., Neuraz, A.: MedExt: combining expert knowledge and deep learning for medication extraction from french clinical texts (Preprint) (2020). https://doi.org/10.2196/preprints.17934
Shen, Y., Zhang, Q., Zhang, J., Huang, J., Lu, Y., Lei, K.: Improving medical short text classification with semantic expansion using word-cluster embedding (2018). https://doi.org/10.48550/arXiv.1812.01885
Hsu, E., Malagaris, I., Kuo, Y.-F., Sultana, R., Roberts, K.: Deep learning-based NLP data pipeline for EHR scanned document information extraction. arXiv (2021). https://doi.org/10.48550/arXiv.2110.11864
Olthof, A.W., et al.: Machine learning based natural language processing of radiology reports in Orthopaedic trauma. Comput. Methods Programs Biomed. 208, 106304 (2021). https://doi.org/10.1016/j.cmpb.2021.106304
Ren, G.R., et al.: Differentiation of lumbar disc herniation and lumbar spinal stenosis using natural language processing-based machine learning based on positive symptoms. Neurosurg. Focus 52(4), E7 (2022). https://doi.org/10.3171/2022.1.FOCUS21561
Kumar, A.: Different types of CNN architectures explained: examples. Data Analytics (blog) (2022). https://vitalflux.com/different-types-of-cnn-architectures-explained-examples
Kumar, E.S., Jayadev, P.S.: Deep learning for clinical decision support systems: a review from the panorama of smart healthcare. In: Deep Learning Techniques for Biomedical and Health Informatics (2020) https://doi.org/10.1007/978-3-030-33966-1_5
Sandeep Kumar, E., Satya Jayadev, P.: Deep learning for clinical decision support systems: a review from the panorama of smart healthcare. In: Dash, S., Acharya, B.R., Mittal, M., Abraham, A., Kelemen, A. (eds.) Deep Learning Techniques for Biomedical and Health Informatics. SBD, vol. 68, pp. 79–99. Springer, Cham (2020). https://doi.org/10.1007/978-3-030-33966-1_5
Adnan, K., Akbar, R., Khor, S.W., Ali, A.B.A.: Role and challenges of unstructured big data in healthcare. In: Sharma, N., Chakrabarti, A., Balas, V.E. (eds.) Data Management, Analytics and Innovation. AISC, vol. 1042, pp. 301–323. Springer, Singapore (2020). https://doi.org/10.1007/978-981-32-9949-8_22
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
El M’hamdi, Z., Lazaar, M., Mahboub, O. (2023). A Survey of Deep Learning Based Natural Language Processing in Smart Healthcare. In: Aboutabit, N., Lazaar, M., Hafidi, I. (eds) Advances in Machine Intelligence and Computer Science Applications. ICMICSA 2022. Lecture Notes in Networks and Systems, vol 656. Springer, Cham. https://doi.org/10.1007/978-3-031-29313-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-031-29313-9_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-28845-6
Online ISBN: 978-3-031-29313-9
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)