Abstract
The enormous growth and availability of data poses a great challenge for extracting useful information from documents written in natural language, and information extraction has become a vital activity in all domains. Named entity recognition (NER) is the process of identifying the names of organizations, people, locations, or other entities in text. It is a subtask of information extraction and plays an important part in discovering and classifying names such as organization names, person names, or locations. NER is one of the trending fields and one of the most important steps in natural language processing (NLP) for text analysis, and research on it has changed considerably over the past decade. NER can automatically scan entire articles and reveal the people, organizations, and places mentioned in them. Knowing the relevant tags for each article helps in automatically organizing articles into well-defined hierarchies and supports smooth content discovery. The aim of this paper is to present a survey on NER. Its main contribution is a systematic review of state-of-the-art NER, organized according to the techniques used. The paper also covers tools, datasets, techniques, challenges, and future directions in the field of NER, with the aim of giving researchers substantial knowledge for further work.
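To make the task concrete, the following is a minimal sketch of what an NER system outputs: spans of text paired with entity types. It uses a plain dictionary (gazetteer) lookup, which is only one of the simplest possible approaches and is not claimed to be a method the paper reviews or recommends. The gazetteer entries, the labels PER/ORG/LOC, the function name `tag_entities`, and the example sentence are all hypothetical and chosen purely for illustration.

```python
import re

# Hypothetical toy gazetteer (assumption): surface form -> entity type.
GAZETTEER = {
    "Ada Lovelace": "PER",
    "Acme Corp": "ORG",
    "Paris": "LOC",
}


def tag_entities(text, gazetteer=GAZETTEER):
    """Return (surface, label, start, end) tuples in order of appearance."""
    # Put longer names first so a multi-word name is preferred over a
    # shorter name that happens to start at the same position.
    names = sorted(gazetteer, key=len, reverse=True)
    # \b keeps the lookup from firing inside a longer word.
    pattern = re.compile(
        r"\b(?:" + "|".join(re.escape(name) for name in names) + r")\b"
    )
    return [
        (m.group(0), gazetteer[m.group(0)], m.start(), m.end())
        for m in pattern.finditer(text)
    ]


if __name__ == "__main__":
    sentence = "Ada Lovelace joined Acme Corp in Paris."  # made-up sentence
    for surface, label, start, end in tag_entities(sentence):
        print(f"{label}\t{surface}\t[{start}:{end}]")
```

A lookup like this cannot recognize names missing from its dictionary or resolve ambiguous ones, which is part of why the field moved toward learned models.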
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sharma, A., Amrita, Chakraborty, S., Kumar, S. (2022). Named Entity Recognition in Natural Language Processing: A Systematic Review. In: Gupta, D., Khanna, A., Kansal, V., Fortino, G., Hassanien, A.E. (eds) Proceedings of Second Doctoral Symposium on Computational Intelligence. Advances in Intelligent Systems and Computing, vol 1374. Springer, Singapore. https://doi.org/10.1007/978-981-16-3346-1_66
DOI: https://doi.org/10.1007/978-981-16-3346-1_66
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3345-4
Online ISBN: 978-981-16-3346-1
eBook Packages: Intelligent Technologies and Robotics (R0)