Abstract
Wordnet is a lexical database where nouns, verbs, adjectives, and adverbs are organized in a conceptual hierarchy linking semantically and lexically related concepts to each other. This paper reports on the prototype of the Tatar Wordnet which currently contains about 5,500 Tatar verbs. Within our project we are creating a model of the semantic system of Tatar verbs as a hierarchical structure considering specifics of the Tatar language. For this purpose we use the entries of available Tatar dictionaries (explanatory dictionaries and those of synonyms). As the first step the extraction of available verbal synonyms from the dictionary of synonyms of the Tatar language was carried out. Then the most frequent 5156 Tatar verbs were selected and classified into several groups (synsets) according to their dominant semantic components with the purpose of adding new synsets and enriching those already existing (currently about 1,500 core synsets were distinguished). Then semantic relations between synsets were mapped (the verbs were linked according to their troponymy, entailment, and causality). The paper presents the results obtained, and discusses some problems encountered along the way.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
References
WordNet. A lexical database for English, http://wordnet.princeton.edu
Miller, G.A.: WordNet: A Lexical Database for English. Communications of the ACM 3(11), 39–41 (1995)
Fellbaum, C.: WordNet: An Electronic Lexical Database. MIT Press, Cambridge (1998)
Bilgin, O., Özlem, C., Kemal, O.: Building a Wordnet for Turkish. Romanian Journal of Information Science and Technology 7(1-2), 163–172 (2004)
Piek, V., Bloksma, L., Rodriguez, H., Climent, S., Calzolari, N., Roventini, A., Bertagna, F., Bertagna, A., Peters, W.: The EuroWordNet Base Concepts and Top Ontology. Deliverable D017 D 34:D036 (1998)
Farreres, X., Rigau, G., Rodriguez, H.: Using WordNet for Building WordNets. In: COLING-ACL Workshop on Usage of Wordnet in Natural Language Processing Systems, Montreal, Canada (1998)
Khanbikova, S. S., Safiullina, F. S.: Dictionary of synonyms of Tatar language. Kazan (1999) (in Tatar)
The Tatar explanatory dictionary in 3 volumes. Kazan (1977-1981) (in Tatar)
The Tatar explanatory dictionary in 1 volume. Kazan (2005) (in Tatar)
ABBYY Lingvo, http://www.abbyy.ru/lingvo
Tatar National Corpus, http://web-corpora.net/TatarCorpus/search/?interface_language=en
Suleymanov, D., Nevzorova, O., Gatiatullin, A., Gilmullin, R., Khakimov, B.: National corpus of the Tatar language “Tugan Tel”: Grammatical Annotation and Implementation. Procedia — Social and Behavioral Sciences 95, 68–74 (2013)
Isahara, H., Bond, F., Uchimoto, K.: Development of the Japanese WordNet. In: 6th International Conference on Language Resources and Evaluation, Marrakech (2008)
Vossen, P. (ed.): EuroWordNet General Document. Version 3 (2002), http://vossen.info/docs/2002/EWNGeneral.pdf
Azarova, I.V., Sinopalnikova, A.A., Yavorskaya, M.V.: Guidelines for RussNet structuring, http://www.dialog-21.ru/Archive/2004/Sinopalnikova.htm (in Russian)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Galieva, A.M., Nevzorova, O.A., Gatiatullin, A.R. (2014). Towards Building Wordnet for the Tatar Language: A Semantic Model of the Verb System. In: Klinov, P., Mouromtsev, D. (eds) Knowledge Engineering and the Semantic Web. KESW 2014. Communications in Computer and Information Science, vol 468. Springer, Cham. https://doi.org/10.1007/978-3-319-11716-4_5
Download citation
DOI: https://doi.org/10.1007/978-3-319-11716-4_5
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11715-7
Online ISBN: 978-3-319-11716-4
eBook Packages: Computer ScienceComputer Science (R0)