Abstract
This paper describes an automatic clustering strategy for acquiring both syntactic and semantic subcategorization restrictions from corpora. In order to test our method, preliminary experiments have been performed on a law-case Portuguese corpus. The acquired information is then used for lexicon upgrading and it is validated by a parsing diagnosis system.
Research sponsored by CAPES and PUCRS - Brazil.
Research supported by the PRAXIS XXI project, FCT/MCT, Portugal.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Roberto Basili, Maria Pazienza, and Paola Velardi. Hierarchical clustering of verbs. In Workshop on Acquisition of Lexical Knowledge from Text, pages 56–70, Ohio State University, USA, 1993.
Gilles Bisson, Claire Nédellec, and Dolores Canamero. Designing clustering methods for ontology building: The mo’k workbench. In Internal rapport, http://citerseer.nj.nec.com/316335.html, 2000.
Eric Brill and Philip Resnik. A rule-based approach to prepositional phrase attachment disambiguation. In COLING, 1994.
Ido Dagan, Lillian Lee, and Fernando Pereira. Similarity-based methods of word coocurrence probabilities. Machine Learning, 43, 1998.
David Faure. Conception de méthode d’aprentissage symbolique et automatique pour l’acquisition de cadres de sous-catégorisation de verbes et de connaissances sémantiques à partir de textes: le système ASIUM. PhD thesis, Université Paris XI Orsay, Paris, France, 2000.
David Faure and Claire Nédellec. Asium: Learning subcategorization frames and restrictions of selection. In ECML98, Workshop on Text Mining, 1998.
Pablo Gamallo. Construction conceptuelle d’expressions complexes: traitement de la combinaison nom-adjectif. PhD thesis, Université Blaise Pascal, Clermont-Ferrand, France, 1998.
Pablo Gamallo, Alexandre Agustini, and Gabriel P. Lopes. Selection restrictions acquisition from corpora. In 10th Portuguese Conference on Artificial Intelligence (EPIA’01), Porto, Portugal, 2001. LNAI, Springer-Verlag.
Pablo Gamallo, Caroline Gasperin, Alexandre Agustini, and Gabriel P. Lopes. Syntactic-based methods for measuring word similarity. In V. Mautner, R. Moucek, and K. Moucek, editors, Text, Speech, and Discourse (TSD-2001), pages 116–125. Berlin: Springer Verlag, 2001.
Gregory Grefenstette. Explorations in Automatic Thesaurus Discovery. Kluwer Academic Publishers, USA, 1994.
Gregory Grefenstette. Evaluation techniques for automatic semantic extraction: Comparing syntatic and window based approaches. In Branimir Boguraev and James Pustejovsky, editors, Corpus processing for Lexical Acquisition, pages 205–216. The MIT Press, 1995.
Ralph Grishman and John Sterling. Generalizing automatically generated selectional patterns. In Proceedings of the 15th International on Computational Linguistics (COLING-94), 1994.
Donald Hindle and Mats Rooth. Structural ambiguity and lexical relations. Computational Linguistics, 19(1):103–120, 1993.
Martin Kay. Alghorith schemata and data structures in syntactic processing. Technical report, XEROX PARK, Palo Alto, Ca., Report CSL-80-12, 1980.
Dekang Lin. Automatic retrieval and clustering of similar words. In COLING-ACL’98, Montreal, 1998.
J. Gabriel Pereira Lopes, Vitor Rocio, and João Balsa da Silva. Superando a incompletude da informação lexical (overcoming lack of lexical information, in portuguese). In Mota M. A. Marrafa P., editor, Linguística Computacional: Investigação Fundamental e Aplicações, pages 121–149. Lisboa: Ediyções Colibri, 1999.
Nuno Marques. Uma Metodologia para a Modelação Estatística da Subcategorização Verbal. PhD thesis, Universidade Nova de Lisboa, Lisboa, Portugal, 2000.
Fernando Pereira, Naftali Tishby, and Lillian Lee. Distributional clustering of english words. In Proceedings of the 30th Annual Meeting of the Association of Comptutational Linguistics, pages 183–190, Columbos, Ohio, 1993.
James Pustejovsky. The Generative Lexicon. MIT Press, Cambridge, 1995.
Philip Resnik. Semantic similarity in taxonomy: An information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research, 11:95–130, 1999.
V. Rocio, E. de la Clergerie, and J. G. P. Lopes. Tabulation for multi-purpose partial parsing. Journal of Grammars, 4(1), 2001.
Luis Talavera and Javier Béjar. Integrating declarative knowledge in hierarchical clustering tasks. In Intelligent Data Analysis, pages 211–222, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Agustini, A., Gamallo, P., Lopes, G.P. (2002). Assessment of Selection Restrictions Acquisition. In: Bittencourt, G., Ramalho, G.L. (eds) Advances in Artificial Intelligence. SBIA 2002. Lecture Notes in Computer Science(), vol 2507. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36127-8_39
Download citation
DOI: https://doi.org/10.1007/3-540-36127-8_39
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-00124-9
Online ISBN: 978-3-540-36127-5
eBook Packages: Springer Book Archive