Abstract
We describe in this paper a semi-automatic acquisition of morphological rules for morphological analyser in the case of under-resourced language, which is Iban language. We modify ideas from previous automatic morphological rules acquisition approaches, where the input requirements has become constraints to develop the analyser for under-resourced language. This work introduces three main steps in acquiring the rules from the under-resourced language, which are morphological data acquisition, morphological information validation and morphological rules extraction. The experiment shows that this approach gives successful results with 0.76 of precision and 0.99 of recall. Our findings also suggest that the availability of linguistic references and the selection of assorted techniques for morphology analysis could lead to the design of the workflow. We believe this workflow will assist other researchers to build morphological analyser with the validated morphological rules for the under-resourced languages.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Koskenniemi, K.: Two-Level Morphology: A General Computational Model for Word-form Recognition and Production. PhD thesis, University of Helsinki (1983)
Theron, P., Cloete, I.: Automatic acquisition of two-level morphological rules. In: Proceedings of the Fifth Conference on Applied Natural Language Processing, pp. 103–110. Association for Computational Linguistics (1997)
Sarawak Board Tourism: http://www.sarawaktourism.com/en/component/jumi/about-people
Ling, S.: Iban language for spm in 2008 (2008)
Karagol-Ayan, B.: Resource generation from structured documents for low-density languages. PhD thesis, University of Maryland, College Park (2007)
Kadriu, A., Zdravkova, K.: Semi-automatic learning of two-level phonological rules for agentive nouns. In: 10th International Conference on Computer Modelling Simulation (2008)
Yturralde, B.: Morphological rule acquisition for tagalog words using moving contracting window pattern algorithm. In: Proceedings of the 10th Philippine Computing Science Congress, Ateneo de Davao University (2002) ISSN 1908-1146
Beesley, K.R.: Computational Morphology and Finite-State Methods. IOS Press (2003)
Akshar, B., Rajeev, S., Dipti, M.S., Radhika, M.: Generic morphological analysis shell. In: SALTMIL Workshop on Minority Languages, (2004)
Feldman, A., Hana, J., Brew, C.: A cross-language approach to rapid creation of new morpho-syntactically annotated resources. In: LREC 2006, pp. 549–554 (2006)
Cucerzan, S., Yarowsky, D.: Bootstrapping a multilingual part-of-speech tagger in one person-day. In: Proceeding of the 6th Conference on Natural Language Learning - COLING 2002, pp. 1–7 (2002)
Goldsmith, J.: Unsupervised learning of the morphology of a natural language. Computational Linguistics 27(2), 153–198 (2001)
Creutz, M., Lagus, K.: Unsupervised morpheme segmentation and morphology induction from text corpora using morfessor 1.0, Helsinki University of Technology (2005)
Monson, C., Carbonell, J., Lavie, A., Levin, L.: Paramor: Finding paradigms across morphology (2009)
Karasimos, A., Petropoulou, E.: A crash test with linguistica in modern greek: The case of derivational affixes and bound stems. In: International Conference on Language Resources and Evaluation, LREC 2010 (2010)
Blancafort, H.: Learning morphology of romance, germanic and slavic languages with the tool linguistica. In: International Conference on Language Resources and Evaluation, LREC 2010 (2010)
Dasgupta, S., Vincent, N.: Unsupervised morphological parsing of bengali. Language Resources and Evaluation 40(3-4), 311–330 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Saee, S., Soon, LK., Lim, T.Y., Ranaivo-Malançon, B., Tang, E.K. (2013). Semi-automatic Acquisition of Two-Level Morphological Rules for Iban Language. In: Gelbukh, A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2013. Lecture Notes in Computer Science, vol 7816. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37247-6_15
Download citation
DOI: https://doi.org/10.1007/978-3-642-37247-6_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37246-9
Online ISBN: 978-3-642-37247-6
eBook Packages: Computer ScienceComputer Science (R0)