Abstract
Research Data repositories are growing in terms of volume rapidly and exponentially. Their main goal is to provide scientists the essential mechanism to store, share, and re-use datasets generated at various stages of the research process. Despite the fact that metadata play an important role for research data management in the context of these repositories, several factors - such as the big volume of data and its complex lifecycles, as well as operational constraints related to financial resources and human factors - may impede the effectiveness of several metadata elements. The aim of the research reported in this paper was to perform a descriptive analysis of the DC.Subject metadata element and to identify its data quality problems in the context of the Dryad research data repository. In order to address this aim a total of 4.557 packages and 13.638 data files were analysed following a data-preprocessing method. The findings showed emerging trends about the subject coverage of the repository (e.g. the most popular subjects and the authors that contributed the most for these subjects). Also, quality problems related to the lack of controlled vocabulary and standardisation were very common. This study has implications for the evaluation of metadata and the improvement of the quality of the research data annotation process.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Gargouri, Y., Hajjem, C., Lariviere, V., Gingras, Y., Brody, T., Carr, L., Harnad, S.: Self-Selected or Mandated, Open Access Increases Citation Impact for Higher Quality Research. PLOS ONE 5(10) (2010). http://www.plosone.org/article/info:doi/10.1371/journal.pone.0013636 (July 13, 2014)
Mabe, M., Amin, M.: Growth dynamics of scholarly and scientific journals. Scientometrics 51, 147–162 (2001). doi:10.1023/A:1010520913124
Hess, C., Ostrom, E.: A Framework for Analyzing the Knowledge Commons : A Chapter from Understanding Knowledge as a Commons: from Theory to Practice (2005). http://surface.syr.edu/cgi/viewcontent.cgi?article=1020&context=sul
Garoufallou, E., Papatheodorou, C.: A critical introduction to metadata for e science and e-research, special issue on metadata for e-science and e-research. International Journal of Metadata Semantics and Ontologies (IJMSO) 9(1), 1–4 (2014)
Currier, S., Barton, J., O’Beirne, R., Ryan, B.: Quality assurance for digital learning object repositories: issues for the metadata creation process. ALT-J, Research in Learning Technology 12(1), 5–20 (2004)
Heery, R., Anderson, S.: Digital repositories review. Other. Joint Information Systems Committee (2005). http://www.jisc.ac.uk/uploaded_documents/digital-repositories-review-2005.pdf
Greenberg, J., Vision, T.: The Dryad Repository: A New Path for Data Publication in Scholarly Communication. OCLC, Dublin, Ohio (2011). https://www.oclc.org/content/dam/oclc/community/presentations/guests/greenberg-20110425.pdf (January 22, 2015)
Greenberg, J, Swauger, S, Feinstein, E.M.: Metadata capital in a data repository. In: Proceedings of the International Conference on Dublin Core and Metadata Applications, pp. 140–150 (2013)
Beagrie, N., Eakin-Richards, L., Vision, T.: Business Models and Cost Estimation: Dryad Repository Case Study, iPRES2010 Vienna (2010)
Palavitsinis, N., Manouselis, N., Sanchez-Alonso, S.: Metadata quality in digital repositories: empirical results from the cross-domain transfer of a quality assurance process. Journal of the Association of Information Science and Technology 65(6), 1202–1216 (2014)
Rousidis, D., Garoufallou, E., Balatsoukas, P., Sicilia, M.A.: Data Quality Issues and Content Analysis for Research Data Repositories: The Case of Dryad, ELPUB2014. Let’s put data to use: digital scholarship for the next generation. In: 18th International Conference on Electronic Publishing, June 19–20, 2014, Thessaloniki, Greece (2014). http://elpub.scix.net/data/works/att/106_elpub2014.content.pdf
Dryad Digital Repository Wiki. Main Page, April 29, 2015. http://wiki.datadryad.org/Main_Page
Dryad Digital Repository. Frequently Asked Questions, April 29, 2015. http://datadryad.org/pages/faq
White, H., Carrier, S., Thompson, A., Greenberg, J., Scherle, R.: The Dryad data repository: a Singapore framework metadata architecture in a DSpace environment. In: The 2008 International Conference on Dublin Core and Metadata Applications, Berlin (2008)
Greenberg, J., White, H.C., Carrier, S., Scherle, R.: A metadata best practice for a scientific data repository. Journal of Library Metadata 9(3), 194–212 (2009). http://dx.doi.org/10.1080/19386380903405090 (February 15, 2014)
Greenberg, J.: Theoretical considerations of lifecycle modeling: an analysis of the Dryad repository demonstrating automatic metadata propagation, inheritance, and value system adoption. Cataloguing & Classification Quarterly 47(3/4), 380–402 (2009)
Peer, L.: The Role of Data Repositories in Reproducible Research. Yale (2013). http://isps.yale.edu/news/blog/2013/07/the-role-of-data-repositories-in-reproducible-research#.UzINafmSxyM
Greenberg, J.: Linking and Hiving Data in the Dryad Repository. The Semantic Web: Fact or Myth. CENDI, FLICC, and NFAIS Workshop. National Archives, Washington, DC, November 17, 2009 (2009b)
Sokvitne, L.: An Evaluation of the Effectiveness of current Dublin Core Metadata for Retrieval. Proceedings of VALA 2000. Victorian Association for Library Automation: Melbourne (2000)
Beagrie, N., Eakin-Richards, L., Vision, T.: Business Models and Cost Estimation: Dryad Repository Case Study, iPRES2010 Vienna (2010)
Dryad Digital Repository Wiki. Cataloging Guidelines (2009). http://wiki.datadryad.org/Cataloging_Guidelines_2009 (April 12, 2015)
Greenberg, J., Garoufallou, E.: Change and a future for metadata. In: Garoufallou, E., Greenberg, J. (eds.) MTSR 2013. CCIS, vol. 390, pp. 1–5. Springer, Heidelberg (2013)
Integrating Manuscript Processing with the Dryad Digital Repository, April 10, 2015. http://wiki.datadryad.org/images/c/c6/DryadIntegrationOverview.pdf
Rousidis, D., Garoufallou, E., Balatsoukas, P., Sicilia, M.A.: Metadata for big data: a preliminary investigation of metadata quality issues in research data repositories. Information Services and Use 34(3), 279–286 (2014)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Rousidis, D., Garoufallou, E., Balatsoukas, P., Sicilia, MA. (2015). Evaluation of Metadata in Research Data Repositories: The Case of the DC.Subject Element. In: Garoufallou, E., Hartley, R., Gaitanou, P. (eds) Metadata and Semantics Research. MTSR 2015. Communications in Computer and Information Science, vol 544. Springer, Cham. https://doi.org/10.1007/978-3-319-24129-6_18
Download citation
DOI: https://doi.org/10.1007/978-3-319-24129-6_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24128-9
Online ISBN: 978-3-319-24129-6
eBook Packages: Computer ScienceComputer Science (R0)