Skip to main content

A Rule-Based Approach for Marathi Part-of-Speech Tagging

  • Conference paper
  • First Online:
ICT with Intelligent Applications

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 248))

Abstract

This paper focused on developing the POS-tagger for Marathi. It is one of the very popular Indian languages spoken by the Marathi people. It has its semantic richness and standard in the literature and culture of Maharashtra. We deploy a technique to find Marathi words for their type, such as noun, verb or adjective, and so on. This task is carried out manually and marked in a corpus consisting of words already tagged with their corresponding part-of-speech. This system uses a rule-based approach based on the Marathi transformational grammar. It is important for preprocessing and developing NLP applications. In the absence or less information available related to other phrases and the possible existence of lexical or syntactic mistakes in the training corpus, our proposed system identifies a correct tag and finds its impact on their performance to verify usability for NLP applications. The overall accuracy of the system is 97.56%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Ekbal A., Mandal S.: POS tagging using HMM and rule-based chunking. In Proceedings of International Joint Conference on Artificial Intelligence Workshop on Shallow Parsing for South Asian Languages. IIIT Hyderabad, Hyderabad, India (2007)

    Google Scholar 

  2. Bagul, P., Mishra, A., Mahajan, P., Kulkarni, M., Dhopavkar, G.: Rule-based POS tagger for Marathi Text. Proc. Int. J. Comput. Sci. Inf. Technol. 5(2), 1322–1326 (2014)

    Google Scholar 

  3. Govilkar, S.: Part of speech tagger for the Marathi language. Int. J. Comput. Appl. 119(18), 0975–8887 (2015)

    Google Scholar 

  4. Singh, J., Joshi, N, Mathur, I.: Development of Marathi part of speech tagger using statistical approach. In: Advances in Computing, Communications and Informatics (ICACCI) (2013)

    Google Scholar 

  5. Nirve, J.: Parsing Indian languages with Malt parser. In: Proceedings of the 5th International Conference on Languages, Resources and Evaluation (LERC), pp. 2216–2219 (2009)

    Google Scholar 

  6. Bharati, A., Gupta, M., Yadav, V., Gali, K., Sharma, D.M.: Simple parser for Indian languages in a dependency framework. In: Proceedings of the Third Linguistic Annotation Workshop. Association for Computational Linguistics (2009)

    Google Scholar 

  7. Rao, A.P.D., Ravindran B.: Part of speech tagging and Chunking with HMM and CRF. In: Proceedings of NLPAI Machine Learning Workshop on Part of Speech Tagging and Chunking for Indian Languages. IIIT Hyderabad, India (2006)

    Google Scholar 

  8. Asif, E., Shivaji, B.: Web-based Bengali news corpus for lexicon development and POS tagging. In: Proceeding of Language Resource and Evaluation (2008)

    Google Scholar 

  9. Ekbal, A., Mandal, S.: POS tagging using HMM and rule-based chunking. In: Proceedings of International Joint Conference on Artificial Intelligence Workshop on Shallow Parsing for South Asian Languages. IIIT Hyderabad, India (2007)

    Google Scholar 

  10. Pattabhi, R.K., Sundar, R., Ram, R.V., Krishna, R.V., Sobha, L.: A text chunker and hybrid POS tagger for Indian languages. In Proceedings of International Joint Conference on Artificial Intelligence Workshop on Shallow Parsing for South Asian Languages. IIIT Hyderabad, India (2007)

    Google Scholar 

  11. Singh, J., Joshi, N., Mathur, I.: Part of speech tagging of Marathi text using trigram method. Int. J. Adv. Inf. Technol. (IJAIT) 3(2) (2013). https://doi.org/10.5121/ijait2013.3203

  12. Undarachi Topi Stories in Marathi. http://marathikavitasangrah.in/2018/01/undarachi-topi-stories-in-marathi (2018). Accessed date (2018/1)

  13. टोपीवाला आणि माकडे—Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-टोपीवाला-आणि-माकडे.html. (2020). Accessed date (2020/1)

    Google Scholar 

  14. कुत्र्याची हुशारी-–Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-stories/kids-story-115062900024_1.html (2020). Accessed date (2020/1)

  15. धक्का लागल्याने कळून येत व्यक्तिमत्व-Webdunia Marathi. http://marathi.webdunia.com/article/marathi-kids-stories/guru-shiya-kid-stroy-in-marathi-118082300008_1.html (2020). Accessed date (2020/1)

  16. गणपतीने आपल्या बुद्धिमत्तेने -Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-stories/ganesh-kartikeya-prithivi-parikrama-story-118091800013_1.html (2020). Accessed date (2020/1)

  17. उपकाराचेस्मरणकरणेहेचमाणसाचे-Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-stories/kids-story-18091900009_1.html (2020). Accessed date (2020/1)

  18. बालसंस्कार—Marathi Katha.https://www.hindujagruti.org/hinduism-for-kids-marathi/1449.html (2020). Accessed date (2020/1)

  19. Moral stories in Marathi on Trees-वेडेवाकडेझाड. http://marathikavitasangrah.in/2017/03/moral-stories-in-Marathi-on-trees-वेडेवाकडे-झाड.html. (2020). Accessed date (2020/1)

    Google Scholar 

  20. Moral stories in Marathi on peacock-गर्विष्ठमोरशहाणाकरकोचा. http://marathikavitasangrah.in/2017/04/moral-stories-in-marathi-on-peacock-गर्विष्ठ-मोर-शहाणा-करकोचा.html (2020). Accessed date (2020/1)

    Google Scholar 

  21. बालसंस्कार –Marathi Katha.सकारात्मकदृष्टीकोणकसाअसावा.https://www.hindujagruti.org/hinduism-for-kids-marathi/1481.html 11/21 (2020). Accessed date (2020/1)

  22. Sharvari, G., Bakal J.W.: Shubhangi Rathod: Part-of-speech tagger for Marathi. Int. J. Comput. Appli. (0975–8887) vol. 119-No.18 (2015)

    Google Scholar 

Download references

Acknowledgements

The authors would like to acknowledge and thank to Chhatrapati Shahu Maharaj Research Training and Human Development Institute (SARTHI), Pune. They awarded fellowship, CSRI DST Major Project sanctioned No.SR/CSRI/71/2015(G), and also thanks to Computational and Psycholinguistic Research Lab Facility supporting this work and Department of Computer Science and Information Technology, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, Maharashtra, India.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Vaishali, P.K., Kalpana, K., Namrata Mahender, C. (2022). A Rule-Based Approach for Marathi Part-of-Speech Tagging. In: Senjyu, T., Mahalle, P.N., Perumal, T., Joshi, A. (eds) ICT with Intelligent Applications. Smart Innovation, Systems and Technologies, vol 248. Springer, Singapore. https://doi.org/10.1007/978-981-16-4177-0_76

Download citation

Publish with us

Policies and ethics