Abstract
The linguistics community is building a metadata-based infrastructure for the description of its research data and tools. At its core is the ISOcat registry, a collaborative platform to hold a (to be standardized) set of data categories (i.e., field descriptors). Descriptors have definitions in natural language and little explicit interrelations. With the registry growing to many hundred entries, authored by many, it is becoming increasingly apparent that the rather informal definitions and their glossary-like design make it hard for users to grasp, exploit and manage the registry’s content. In this paper, we take a large subset of the ISOcat term set and reconstruct from it a tree structure following the footsteps of schema.org. Our ontological re-engineering yields a representation that gives users a hierarchical view of linguistic, metadata-related terminology. The new representation adds to the precision of all definitions by making explicit information which is only implicitly given in the ISOcat registry. It also helps uncovering and addressing potential inconsistencies in term definitions as well as gaps and redundancies in the overall ISOcat term set. The new representation can serve as a complement to the existing ISOcat model, providing additional support for authors and users in browsing, (re-)using, maintaining, and further extending the community’s terminological metadata repertoire.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
DCR Style Guidelines. Version ”2010-05-16”, http://www.isocat.org/manual/DCRGuidelines.pdf (retrieved December 5, 2011)
Data Category specifications. Clarin-NL ISOcat workshop (May 2011), http://www.isocat.org/manual/tutorial/2011/ISOcat-DC-specifications.pdf (retrieved December 5, 2011)
Int’l Organization of Standardization. Data elements and interchange formats – Information interchange – Representation of dates and times (ISO-8601), Geneva (2009)
Int’l Organization of Standardization. Terminology and other language and content resources - Specification of data categories and management of a Data Category Registry for language resources (ISO-12620), Geneva (2009)
Int’l Organization of Standardization. Terminology work – Principles and methods (ISO-704), Geneva (2009)
Schuurman, I., Windhouwer, M.: Explicit semantics for enriched documents. What do ISOcat, RELcat and SCHEMAcat have to offer? In: Proceedings of Supporting Digital Humanities, SDH 2011 (2011)
Soldatova, L.N., King, R.D.: An ontology of scientific experiments. Journal of the Royal Society Interface 3(11), 795–803 (2006)
Wright, S.E., Kemps-Snijders, M., Windhouwer, M.A.: The OWL and the ISOcat: Modeling Relations in and around the DCR. In: LRT Standards Workshop at LREC 2010, Malta (May 2010)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zinn, C., Hoppermann, C., Trippel, T. (2012). The ISOcat Registry Reloaded. In: Simperl, E., Cimiano, P., Polleres, A., Corcho, O., Presutti, V. (eds) The Semantic Web: Research and Applications. ESWC 2012. Lecture Notes in Computer Science, vol 7295. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-30284-8_26
Download citation
DOI: https://doi.org/10.1007/978-3-642-30284-8_26
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-30283-1
Online ISBN: 978-3-642-30284-8
eBook Packages: Computer ScienceComputer Science (R0)