Abstract
This chapter gives an overview of the infrastructure, annotation practices, and current challenges of both the English and Arabic PropBank corpora. More details about the Hindi and Urdu PropBanks can be found in chapter “The Hindi/Urdu Treebank Project” (this volume). The focus of current efforts is on expanding the types of relations covered by PropBank. Previously, the annotation effort focused on event relations expressed solely by verbs. (A separate but related effort, NomBank, focused on nouns [26].) However, a complete representation of event relations within and across sentences requires expanding that focus to all syntactic realizations of event and state semantics, including expressions in the form of nouns, adjectives and multi-word expressions. This effort reflects a general desire to move to a deeper level of semantic understanding, abstracting away from language-particular syntactic facts. The chapter closes with a discussion of future directions for PropBank.
Notes
- 1.
- 2. The term “Roleset” is sometimes used interchangeably with the term “Frameset”.
- 3. Note that the actual Frame Files are transliterated following a modified version of the Buckwalter transliteration system (http://www.qamus.org/transliteration.htm), in line with Arabic TreeBank.
- 4.
- 5. ARZ is the ISO language code for Egyptian Arabic.
- 6. Notably, the lemmas included in this example may have been candidates for unification since they are clearly morphologically and semantically related. See Sect. 3.3 for a discussion relating to this issue.
References
Banarescu, L., Bonial, C., Cai, S., Georgescu, M., Griffitt, K., Hermjakob, U., Knight, K., Koehn, P., Palmer, M., Schneider, N.: Abstract Meaning Representation (AMR) 1.0 Specification. http://www.isi.edu/~ulf/amr/help/amr-guidelines.pdf
Bazrafshan, M., Gildea, D.: Semantic roles for string to tree machine translation. In: Association for Computational Linguistics (ACL-13) (2013)
Bonial, C.: Take a look at this! Form, Function and Productivity of English Light Verb Constructions. Ph.D. Thesis, Department of Linguistics, University of Colorado Boulder (2015)
Bonial, C., Stowe, K., Palmer, M.: Renewing and revising SemLink. In: Proceedings of The GenLex Workshop on Linked Data in Linguistics (GenLex-13). Pisa, Italy (2013)
Chen, W.-T., Bonial, C., Palmer, M.: English light verb construction identification using lexical knowledge. In: Proceedings of AAAI 2015. Austin, TX (2015)
Choi, J.D., Bonial, C., Palmer, M.: PropBank instance annotation guidelines using a dedicated editor, jubilee. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC’10). Valletta, Malta, 17 May 2010
Choi, J.D., Bonial, C., Palmer, M.: PropBank frameset annotation guidelines using a dedicated editor, cornerstone. In: Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC’10). Valletta, Malta, 17 May 2010
Choi, J.D., Bonial, C., Palmer, M.: Multilingual PropBank annotation tools: cornerstone and jubilee. In: Proceedings of NAACL-HLT’10: Demos. Los Angeles, CA (2010)
Dowty, D.: Thematic proto-roles and argument selection. Language 67(3), 547–619 (1991)
Fillmore, C.J., Johnson, C.R., Petruck, M.R.L.: Background to FrameNet. Int. J. Lexicogr. 16(3), 235–250 (2003)
Fillmore, C.J., Lee-Goldman, R., Rhodes, R.: The FrameNet construction. In: Boas, H., Sag, I. (eds.) Sign-Based Construction Grammar. CSLI Publications, Stanford (2012)
Goldberg, A.E.: Constructions: A Construction Grammar Approach to Argument-Structure. University of Chicago Press, Chicago (1995)
Goldberg, A.: Constructions at Work: The Nature of Generalization in Language. Oxford University Press, New York (2006)
Habash, N., Eskander, R., Hawwari, A.: A morphological analyzer for Egyptian Arabic. In: Proceedings of the Twelfth Meeting of the Special Interest Group on Computational Morphology and Phonology (SIGMORPHON2012). Montréal, Canada, 7 June 2012
Hawwari, A., Hwang, J.D., Mansouri, A., Palmer, M.: Classification and deterministic PropBank annotation of predicative adjectives in Arabic. In: Proceedings of the Sixth Joint ISO - ACL SIGSEM Workshop on Interoperable Semantic Annotation. Oxford, UK (2011)
Hwang, J.D., Palmer, M.: Identification of caused motion constructions. In: Proceedings of the Joint Conference on Lexical and Computational Semantics (StarSem). Denver, Colorado (2015)
Hwang, J.D., Bhatia, A., Bonial, C., Mansouri, A., Vaidya, A., Zhou, Y., Xue, N., Palmer, M.: PropBank annotation of multilingual light verb constructions. In: Proceedings of ACL Linguistic Annotation Workshop (LAW) IV. Uppsala, Sweden (2010)
Hwang, J.D., Zaenen, A., Palmer, M.: Criteria for identifying and annotating caused motion constructions in corpus data. In: Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14). Reykjavik, Iceland (2014)
Jespersen, O.: A Modern English Grammar on Historical Principles. Part VI: Morphology. With assistance of Christopherson, P., Haislund, N., Schibsbye, K. George Allen and Unwin, London. Ejnar Munksgaard, Copenhagen (1942)
Kingsbury, P., Palmer, M.: From Treebank to PropBank. In: Proceedings of the Third International Conference on Language Resources and Evaluation (LREC-02). Las Palmas, Canary Islands, Spain, 28 May–3 June 2002
Kingsbury, P., Palmer, M., Marcus, M.: Adding semantic annotation to the Penn Treebank. In: Proceedings of the Human Language Technology (2002)
Kipper, K., Korhonen, A., Ryant, N., Palmer, M.: A large-scale classification of English verbs. Lang. Res. Eval. J. 42, 21–40 (2008)
Maamouri, M., Bies, A., Buckwalter, T., Mekki, W.: The Penn Arabic Treebank, Building a Large-scale Annotated Arabic Corpus (2004)
Marcus, M., Santorini, B., Marcinkiewicz, M.A.: Building a large annotated corpus of English: the Penn treebank. Comput. Linguist. 19, 313–330 (1993)
Marcus, M., Kim, G., Ann Marcinkiewicz, M., MacIntyre, R., Bies, A., Ferguson, M., Katz, K., Schasberger, B.: The Penn Treebank: annotating predicate-argument structure. In: Proceedings of the 1994 Human Language Technology Workshop (1994)
Meyers, A., Reeves, R., Macleod, C., Szekely, R., Zielinska, V., Young, B., Grishman, R.: The NomBank project: an interim report. In: Proceedings of the Frontiers in Corpus Annotation, Workshop in conjunction with HLT/NAACL (2004)
Moreda, P., Navarro, B., Palomar, M.: Using semantic roles in information retrieval systems. NLDB 2005, 192–202 (2005)
Nunberg, G., Sag, I.A., Wasow, T.: Idioms. Language 70(3), 491–538 (1994)
Owens, J.: Early Arabic Grammatical Theory: Heterogeneity and Standardization. John Benjamins Publishing (1990)
Palmer, M.: Semlink: linking PropBank, VerbNet and FrameNet. In: Proceedings of the Generative Lexicon Conference (GenLex-09). Pisa, Italy (2009)
Palmer, M., Gildea, D., Kingsbury, P.: The proposition bank: an annotated corpus of semantic roles. Comput. Linguist. 31(1), 71–105 (2005)
Palmer, M., Babko-Malaya, O., Bies, A., Diab, M., Maamouri, M., Mansouri, A., Zaghouani, W.: A pilot Arabic PropBank. In: Proceedings of LREC, Marrakech, Morocco (2008)
Pradhan, S., Hovy, E., Marcus, M., Palmer, M., Ramshaw, L., Weischedel, R.: OntoNotes: a unified relational semantic representation. In: Proceedings of the First IEEE International Conference on Semantic Computing (2007)
Ryding, K.C.: A Reference Grammar of Modern Standard Arabic. Cambridge University Press, Cambridge (2005)
Sammons, M., Vinod Vydiswaran, V.G., Roth, D.: Ask not what textual entailment can do for you. In: ACL (2010)
Tu, Y., Roth, D.: Learning English light verb constructions: contextual or statistical. In: Proceedings of the Workshop on Multiword Expressions: from Parsing and Generation to the Real World (MWE 2011), pp. 31–39 (2011)
Wu, D., Fung, P.: Semantic roles for SMT: a hybrid two-pass model. In: Proceedings of the Joint Conference of the North American Chapter of ACL/Human Language Technology (NAACL HLT 2009). Boulder, Colorado (2009)
Yi, S.-t., Loper, E., Palmer, M.: Can semantic roles generalize across genres? In: Proceedings of HLT/NAACL-2007. Rochester, NY, 22–27 Apr 2007
Zaghouani, W., Diab, M., Mansouri, A., Pradhan, S., Palmer, M.: The revised Arabic PropBank. In: Proceedings of ACL Linguistic Annotation Workshop (LAW) IV. Uppsala, Sweden (2010)
Zapirain, B., Agirre, E., Màrquez, L., Surdeanu, M.: Selectional preferences for semantic role classification. Comput. Linguist. 39(3), 631–663 (2013). ISSN 0891-2017
Copyright information
© 2017 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Bonial, C. et al. (2017). Current Directions in English and Arabic PropBank. In: Ide, N., Pustejovsky, J. (eds) Handbook of Linguistic Annotation. Springer, Dordrecht. https://doi.org/10.1007/978-94-024-0881-2_27
DOI: https://doi.org/10.1007/978-94-024-0881-2_27
Published:
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-024-0879-9
Online ISBN: 978-94-024-0881-2
eBook Packages: Social Sciences (R0)