Abstract
This paper focused on developing the POS-tagger for Marathi. It is one of the very popular Indian languages spoken by the Marathi people. It has its semantic richness and standard in the literature and culture of Maharashtra. We deploy a technique to find Marathi words for their type, such as noun, verb or adjective, and so on. This task is carried out manually and marked in a corpus consisting of words already tagged with their corresponding part-of-speech. This system uses a rule-based approach based on the Marathi transformational grammar. It is important for preprocessing and developing NLP applications. In the absence or less information available related to other phrases and the possible existence of lexical or syntactic mistakes in the training corpus, our proposed system identifies a correct tag and finds its impact on their performance to verify usability for NLP applications. The overall accuracy of the system is 97.56%.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Ekbal A., Mandal S.: POS tagging using HMM and rule-based chunking. In Proceedings of International Joint Conference on Artificial Intelligence Workshop on Shallow Parsing for South Asian Languages. IIIT Hyderabad, Hyderabad, India (2007)
Bagul, P., Mishra, A., Mahajan, P., Kulkarni, M., Dhopavkar, G.: Rule-based POS tagger for Marathi Text. Proc. Int. J. Comput. Sci. Inf. Technol. 5(2), 1322–1326 (2014)
Govilkar, S.: Part of speech tagger for the Marathi language. Int. J. Comput. Appl. 119(18), 0975–8887 (2015)
Singh, J., Joshi, N, Mathur, I.: Development of Marathi part of speech tagger using statistical approach. In: Advances in Computing, Communications and Informatics (ICACCI) (2013)
Nirve, J.: Parsing Indian languages with Malt parser. In: Proceedings of the 5th International Conference on Languages, Resources and Evaluation (LERC), pp. 2216–2219 (2009)
Bharati, A., Gupta, M., Yadav, V., Gali, K., Sharma, D.M.: Simple parser for Indian languages in a dependency framework. In: Proceedings of the Third Linguistic Annotation Workshop. Association for Computational Linguistics (2009)
Rao, A.P.D., Ravindran B.: Part of speech tagging and Chunking with HMM and CRF. In: Proceedings of NLPAI Machine Learning Workshop on Part of Speech Tagging and Chunking for Indian Languages. IIIT Hyderabad, India (2006)
Asif, E., Shivaji, B.: Web-based Bengali news corpus for lexicon development and POS tagging. In: Proceeding of Language Resource and Evaluation (2008)
Ekbal, A., Mandal, S.: POS tagging using HMM and rule-based chunking. In: Proceedings of International Joint Conference on Artificial Intelligence Workshop on Shallow Parsing for South Asian Languages. IIIT Hyderabad, India (2007)
Pattabhi, R.K., Sundar, R., Ram, R.V., Krishna, R.V., Sobha, L.: A text chunker and hybrid POS tagger for Indian languages. In Proceedings of International Joint Conference on Artificial Intelligence Workshop on Shallow Parsing for South Asian Languages. IIIT Hyderabad, India (2007)
Singh, J., Joshi, N., Mathur, I.: Part of speech tagging of Marathi text using trigram method. Int. J. Adv. Inf. Technol. (IJAIT) 3(2) (2013). https://doi.org/10.5121/ijait2013.3203
Undarachi Topi Stories in Marathi. http://marathikavitasangrah.in/2018/01/undarachi-topi-stories-in-marathi (2018). Accessed date (2018/1)
टोपीवाला आणि माकडे—Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-टोपीवाला-आणि-माकडे.html. (2020). Accessed date (2020/1)
कुत्र्याची हुशारी-–Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-stories/kids-story-115062900024_1.html (2020). Accessed date (2020/1)
धक्का लागल्याने कळून येत व्यक्तिमत्व-Webdunia Marathi. http://marathi.webdunia.com/article/marathi-kids-stories/guru-shiya-kid-stroy-in-marathi-118082300008_1.html (2020). Accessed date (2020/1)
गणपतीने आपल्या बुद्धिमत्तेने -Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-stories/ganesh-kartikeya-prithivi-parikrama-story-118091800013_1.html (2020). Accessed date (2020/1)
उपकाराचेस्मरणकरणेहेचमाणसाचे-Webdunia Marathi. https://marathi.webdunia.com/article/marathi-kids-stories/kids-story-18091900009_1.html (2020). Accessed date (2020/1)
बालसंस्कार—Marathi Katha.https://www.hindujagruti.org/hinduism-for-kids-marathi/1449.html (2020). Accessed date (2020/1)
Moral stories in Marathi on Trees-वेडेवाकडेझाड. http://marathikavitasangrah.in/2017/03/moral-stories-in-Marathi-on-trees-वेडेवाकडे-झाड.html. (2020). Accessed date (2020/1)
Moral stories in Marathi on peacock-गर्विष्ठमोरशहाणाकरकोचा. http://marathikavitasangrah.in/2017/04/moral-stories-in-marathi-on-peacock-गर्विष्ठ-मोर-शहाणा-करकोचा.html (2020). Accessed date (2020/1)
बालसंस्कार –Marathi Katha.सकारात्मकदृष्टीकोणकसाअसावा.https://www.hindujagruti.org/hinduism-for-kids-marathi/1481.html 11/21 (2020). Accessed date (2020/1)
Sharvari, G., Bakal J.W.: Shubhangi Rathod: Part-of-speech tagger for Marathi. Int. J. Comput. Appli. (0975–8887) vol. 119-No.18 (2015)
Acknowledgements
The authors would like to acknowledge and thank to Chhatrapati Shahu Maharaj Research Training and Human Development Institute (SARTHI), Pune. They awarded fellowship, CSRI DST Major Project sanctioned No.SR/CSRI/71/2015(G), and also thanks to Computational and Psycholinguistic Research Lab Facility supporting this work and Department of Computer Science and Information Technology, Dr. Babasaheb Ambedkar Marathwada University, Aurangabad, Maharashtra, India.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Vaishali, P.K., Kalpana, K., Namrata Mahender, C. (2022). A Rule-Based Approach for Marathi Part-of-Speech Tagging. In: Senjyu, T., Mahalle, P.N., Perumal, T., Joshi, A. (eds) ICT with Intelligent Applications. Smart Innovation, Systems and Technologies, vol 248. Springer, Singapore. https://doi.org/10.1007/978-981-16-4177-0_76
Download citation
DOI: https://doi.org/10.1007/978-981-16-4177-0_76
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-4176-3
Online ISBN: 978-981-16-4177-0
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)