Abstract
Automatic extraction of semantic relationships between entity instances in an ontology is useful for attaching richer semantic metadata to documents. In this paper we propose an SVM based approach to hierarchical relation extraction, using features derived automatically from a number of GATE-based open-source language processing tools. In comparison to the previous works, we use several new features including part of speech tag, entity subtype, entity class, entity role, semantic representation of sentence and WordNet synonym set. The impact of the features on the performance is investigated, as is the impact of the relation classification hierarchy. The results show there is a trade-off among these factors for relation extraction and the features containing more information such as semantic ones can improve the performance of the ontological relation extraction task.
Chapter PDF
Similar content being viewed by others
Keywords
- Support Vector Machine
- Support Vector Machine Model
- Semantic Feature
- Automatic Extraction
- Dependency Tree
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
ACE. See, http://www.nist.gov/speech/tests/ace/
Annotation Guidelines for Entity Detection and Tracking (EDT) Version 4.2.6, (2004), http://www.ldc.upenn.edu/Projects/ACE/docs/EnglishEDTV4-2-6.PDF
Annotation Guidelines for Relation Detection and Characterization (RDC) Version 4.3.2, (2004), http://www.ldc.upenn.edu/Projects/ACE/docs/EnglishRDCV4-3-2.PDF
Appelt, D.: An Introduction to Information Extraction. Artificial Intelligence Communications 12(3), 161–172 (1999)
Culotta, A., Sorensen, J.: Dependency tree kernels for relation extraction. In: Proceedings of 42th Annual Meeting of the Association for Computational Linguistics, Barcelona, Spain, July 21-26 (2004)
Cunningham, H., Maynard, D., Bontcheva, K., Tablan, V.: GATE: A Framework and Graphical Development Environment for Robust NLP Tools and Applications. In: Proceedings of the 40th Anniversary Meeting of the Association for Computational Linguistics, Philadelphia (July 2002)
Freitag, D., McCallum, A.: Information extraction with HMM structures learned by stochastic optimization. In: Proceedings of the 7th Conference on Artificial Intelligence (AAAI 2000) and of the 12th Conference on Innovative Applications of Artificial Intelligence (IAAI 2000), pp. 584–589. AAAI Press, Menlo Park (2000)
Gaizauskas, R., Hepple, M., Saggion, H., Greenwood, M.A., Humphreys, K.: SUPPLE: A Practical Parser for Natural Language Engineering Applications. Technical report CS–05–08, Department of Computer Science, University of Sheffield (2005)
Handschuh, S., Staab, S., Ciravegna, F.: S-CREAM — Semi-automatic CREAtion of Metadata. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS, vol. 2473, p. 358. Springer, Heidelberg (2002)
Hsu, C.-W., Lin, C.-J.: A comparison of methods for multi-class support vector machines. IEEE Transactions on Neural Networks 13(2), 415–425 (2002)
Kambhatla, N.: Combining lexical, syntactic and semantic features with Maximum Entropy models for extracting relations. In: Proceedings of 42th Annual Meeting of the Association for Computational Linguistic, Barcelona, Spain, July 21-26 (2004)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. 18th International Conf. on Machine Learning, pp. 282–289. Morgan Kaufmann, San Francisco (2001)
Li, Y., Bontcheva, K., Cunningham, H.: SVM Based Learning System For Information Extraction. In: Winkler, J.R., Niranjan, M., Lawrence, N.D. (eds.) Deterministic and Statistical Methods in Machine Learning. LNCS, vol. 3635, pp. 319–339. Springer, Heidelberg (2005)
Lin, D.: Dependency-based Evaluation of MINIPAR. In: Workshop on the Evaluation of Parsing Systems, Granada, Spain (May 1998)
Miller, A.: WordNet: An On-line Lexical Resource. Special issue of the Journal of Lexicography 3(4) (1990)
Motta, E., VargasVera, M., Domingue, J., Lanzoni, M., Stutt, A., Ciravegna, F.: MnM: Ontology Driven Semi-Automatic and Automatic Support for Semantic Markup. In: Gómez-Pérez, A., Benjamins, V.R. (eds.) EKAW 2002. LNCS, vol. 2473, p. 379. Springer, Heidelberg (2002)
Wang, T., Bontcheva, K., Li, Y., Cunningham, H.: D2.1.2. Ontology-Based Information Extraction. SEKT Deliverable D2.1.2 (2005), http://www.sekt-project.org/rd/deliverables/index_html/
Zelenko, D., Aone, C., Richardella, A.: Kernel methods for relation extraction. Journal of Machine Learning Research, 1083–1106 (2003)
Zhou, G., Su, J., Zhang, J., Zhang, M.: Combining Various Knowledge in Relation Extraction. In: Proceedings of the 43th Annual Meeting of the Association for Computational Linguistics (2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wang, T., Li, Y., Bontcheva, K., Cunningham, H., Wang, J. (2006). Automatic Extraction of Hierarchical Relations from Text. In: Sure, Y., Domingue, J. (eds) The Semantic Web: Research and Applications. ESWC 2006. Lecture Notes in Computer Science, vol 4011. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11762256_18
Download citation
DOI: https://doi.org/10.1007/11762256_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-34544-2
Online ISBN: 978-3-540-34545-9
eBook Packages: Computer ScienceComputer Science (R0)