Abstract
Most approaches for extracting hypernyms of a noun from the definition in an MRD rely on the lexico-syntactic patterns compiled by human experts. Not only these methods require high cost for compiling lexico-syntatic patterns but also it is very difficult for human experts to compile a set of lexical-syntactic patterns with a broad-coverage, because in natural languages there are various different expressions which represent the same concept. To alleviate these problems, this paper proposes a new method for extracting hypernyms of a noun from an MRD. In proposed approach, we use only syntactic(part-of-speech) patterns instead of lexico-syntactic patterns in identifying hypernyms to reduce the number of patterns while keeping their coverage broad. Our experiment shows that the classification accuracy of the proposed method is 92.37% which is significantly much better than those of previous approaches.
This study was financially supported by special research fund of Chonnam National University in 2004.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
- Function Word
- Word Sense Disambiguation
- Computational Linguistics
- Common Noun
- Machine Readable Dictionary
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Chodorow, M.S., Byrd, R.J., Heidorn, G.E.: Extracting Semantic Hierarchies From A Large On-Line Dictionary. In: Proceedings of the 23rd Conference of the Association for Computational Linguistics (1985)
Rigau, G., Rodriguez, H., Agirre, E.: Building Accurate Semantic Taxonomies from Mololingual MRDs. In: Proceedings of the 36th Conference of the Association for Computational Linguistics (1998)
Hearst, M.A.: Automatic acquisition of hyonyms from large text corpora. In: Proceedings of the Fourteenth International Conference on Computational Linguistics (1992)
Caraballo, S.A.: Automatic construction of a hypernym-labled noun hierarchy from text. In: Proceedings of the 37th Conference of the Association for Computational Linguistics (1999)
Pereira, F., Thishby, N., Lee, L.: Distributional clustering of English words. In: Proceedings of the 31th Conference of the Association for Computational Linguistics (1993)
Roark, B., Charniak, E.: Noun-phrase co-occurrence statistics for semi-automatic semantic lexicon construction. In: Proceedings of the 36th Conference of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics (1998)
Mitchell, T.M.: Machine Learning. Carnegie Mellon University. McGraw-Hill, New York (1997)
Choi, S., Park, H.: A New Method for Inducing Korean Dependency Grammars reflecting the Characteristics of Korean Dependency Relations. In: Proceedings of the 3rd Conterence on East-Asian Language Processing and Internet Information Technology (2003)
Moon, Y., Kim, Y.: Automatic Extraction of Hypernym in Korean. In: Preceedings of Korea Information Science Society, vol. 21(2), pp. 613–616 (1994)
Mon, Y.: The Design and Implementation of WordNet for Korean Nouns. In: Proceedings of Korea Information Science Society (1996)
Kim, M., Kim, T., Noh, B.: The Automatic Extraction of Hypernyms and the Development of WordNet Prototype for Korean Nouns using Koran MRD. In: Proceedings of Korea Information Processing Society (1995)
Jo, P., An, M., Ock, C., Lee, S.: A Semantic Hierarchy of Korean Nouns using the Definitions of Words in a Dictionary. In: Proceedings of Korea Cognition Society (1999)
Choi, Y., Chul, S.: Development of the Algorithm for the Automatic Extraction of Broad Term. In: Proceedings of Korea Information Management Society, pp. 227–230 (1998)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufman, San Mateo (1993), http://www.rulequest.com/Personal/
KORTERM.: KAIST language resources, http://www.korterm.or.kr/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Choi, S., Park, H. (2005). Extracting Semantic Taxonomies of Nouns from a Korean MRD Using a Small Bootstrapping Thesaurus and a Machine Learning Approach. In: Montoyo, A., Muńoz, R., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2005. Lecture Notes in Computer Science, vol 3513. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11428817_1
Download citation
DOI: https://doi.org/10.1007/11428817_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-26031-8
Online ISBN: 978-3-540-32110-1
eBook Packages: Computer ScienceComputer Science (R0)