Skip to main content

A Pronoun Replacement-Based Special Tagging System for Bengali Language Processing (BLP)

  • Conference paper
  • First Online:
Innovations in Computer Science and Engineering

Abstract

Natural language processing (NLP) is one of the most important thing for human machine interaction and a very important thing for machine learning system. In the world, over 27 crore people use Bengali as their first and mother language, and it has its own written system, so it is very much important to process Bengali language for natural language processing. In this research work, we have tried to demonstrate an upgraded parts of speech tagging system (POS) for Bengali language, where we have used special tagging system with general grammatical parts of speech based on many different things like—Considering suffixes for verb, where get 68% success rate. We have also added places name, occupation name, Bengali Name, Bengali repeated word, digit of Bengali in both written and digit form, English acronym, organization name for both cases. The success rate of tagging for genera tagging is 70 and 76% for special tagging which is ever highest. This tagging system can be used for Bengali language processing (BLP) like—sentiment analysis for Bengali, Bengali text summarization, etc.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 219.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 279.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Azmi, A.M., Al-Thanyyan, S.: A text summarizer for arabic. J. Comput. Speech Lang. 26(4), 260–273 (2012)

    Article  Google Scholar 

  2. Miller, G.A.: Wordnet: a lexical database for English. Commun. ACM 38(11), 39–41 (1995)

    Article  Google Scholar 

  3. Indian Statistical Institute: A Lexical Database for Bengali 2015 [Online]. Available https://www.isical.ac.in/∼lru/wordnetnew/index.php/site/aboutus. Accessed 28 Oct 2015

  4. Chakma, R. et al.: Navigation and tracking of AGV in ware house via wireless sensor network. In: 2019 IEEE 3rd International Electrical and Energy Conference (CIEEC), pp. 1686–1690. Beijing, China, 2019. https://doi.org/10.1109/CIEEC47146.2019.CIEEC-2019589

  5. Milu, S.A., et al.: Sentiment analysis of Bengali reviews for data and knowledge engineering: a Bengali language processing approach. In: Bindhu, V., Chen, J., Tavares, J. (eds.) International Conference on Communication, Computing and Electronics Systems. Lecture Notes in Electrical Engineering, vol. 637. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-2612-1_8

  6. Notes for Students: Rule Based System, Nov 2000 [Online]. Available https://www.jpaine.org/students/lectures/lect3/node5.html. Accessed 01 Apr 2017

  7. Gpedia: Gpedia, your encyclopaedia [Online]. Available www.gpedia.com/bn. Accessed 25 June 2016

  8. BdJobs.com: Occupation in Bangladesh, Name of Occupation in Largest Job Site in Bangladesh, Feb 2016 [Online]. Available https://bdjobs.com. Accessed 25 June 2016

  9. Emon, I.S., Ahmed, S.S., Milu, S.A., Mahtab, S.S.: Sentiment analysis of Bengali online reviews written with English letter using machine learning approaches. In: Proceedings of the 6th International Conference on Networking, Systems and Security (NSysS ’19). Association for Computing Machinery, New York, NY, USA, pp. 109–115 (2019). https://doi.org/10.1145/3362966.3362977

  10. Khan, M.F.S., Mahtab, S.S.: PLC based energy-efficient home automation system with smart task scheduling. In: 2019 IEEE Sustainable Power and Energy Conference (iSPEC), pp. 35–38. Beijing, China, 2019.https://doi.org/10.1109/iSPEC48194.2019.8975223

  11. Ahmed, S.S., et al.: Opinion mining of bengali review written with english character using machine learning approaches. In: Bindhu, V., Chen, J., Tavares, J. (eds.) International Conference on Communication, Computing and Electronics Systems. Lecture Notes in Electrical Engineering, vol. 637. Springer, Singapore (2020). https://doi.org/10.1007/978-981-15-2612-1_5

  12. Mahtab, S.S., Monsur, A., Ahmed, S.S., Chakma, R., Alam, M.J.: Design and optimization of perovskite solar cell with thin ZnO insulator layer as electron transport. In: 2018 International Conference on Advancement in Electrical and Electronic Engineering (ICAEEE), pp. 1–4. IEEE, Gazipur, Bangladesh (2018). https://doi.org/10.1109/ICAEEE.2018.8643012

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Sheikh Shahparan Mahtab .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Jahan, B., Emon, I.S., Milu, S.A., Hossain, M.M., Mahtab, S.S. (2021). A Pronoun Replacement-Based Special Tagging System for Bengali Language Processing (BLP). In: Saini, H.S., Sayal, R., Govardhan, A., Buyya, R. (eds) Innovations in Computer Science and Engineering. Lecture Notes in Networks and Systems, vol 171. Springer, Singapore. https://doi.org/10.1007/978-981-33-4543-0_80

Download citation

Publish with us

Policies and ethics