Abstract
Open Government Data often contain information that, in more or less detail, regard private citizens. For this reason, before publishing them, public authorities manipulate data to remove any sensitive information while trying to preserve their reliability. This paper addresses the lack of tools aimed at measuring the reliability of these data. We present two procedures for the assessment of the Open Government Data reliability, one based on a comparison between open and closed data, and the other based on analysis of open data only. We evaluate the procedures over data from the data.police.uk website and from the Hampshire Police Constabulary in the United Kingdom. The procedures effectively allow estimating the reliability of open data and, actually, their reliability is high even though they are aggregated and smoothed.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Bivand, R., Keitt, T., Rowlingson, B., Pebesma, E., Sumner, M., Hijmans, R.: RGDAL: Bindings for the Geospatial Data Abstraction Library (2010), https://r-forge.r-project.org/projects/rgdal/
Ceolin, D., Moreau, L., O’Hara, K., Schreiber, G., Sackley, A., Fokkink, W., van Hage, W.R., Shadbolt, N.: Reliability Analyses of Open Government Data. In: URSW, pp. 34–39. CEUR-ws.org (2013)
Ceolin, D., van Hage, W.R., Fokkink, W., Schreiber, G.: Estimating Uncertainty of Categorical Web Data. In: URSW, pp. 15–26. CEUR-WS.org (2011)
Cornelli, R.: Why people trust the police. An empirical study. PhD thesis, Università degli Studi di Trento, International Ph.D. in Criminology (February 13, 2003)
CrimeReports. Crimereports (2013), https://www.crimereports.co.uk/
Ebden, M., Huynh, T.D., Moreau, L., Ramchurn, S., Roberts, S.: Network analysis on provenance graphs from a crowdsourcing application. In: Groth, P., Frew, J. (eds.) IPAW 2012. LNCS, vol. 7525, pp. 168–182. Springer, Heidelberg (2012)
Human Inference. DataCleaner (2013), http://datacleaner.org
Jøsang, A.: A logic for uncertain probabilities. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems 9(3), 279–311 (2001)
Jøsang, A.: The consensus operator for combining beliefs. Artificial Intelligence Journal 142, 157–170 (2002)
Killick, R., Eckley, I.A.: changepoint: An R Package for Changepoint Analysis (2013), http://www.lancs.ac.uk/~killick/Pub/KillickEckley2011.pdf
Koch-Weser, I.N.: The Reliability of China’s Economic Data: An Analysis of National Output (2013), http://www.uscc.gov/sites/default/files/Research/TheReliabilityofChina'sEconomicData.pdf
Mapit. Mapit (2013), http://mapit.mysociety.orgs
Talend. Talend Open Studio for Data Quality (2013), http://www.talend.com/products/data-quality
The Open Data Institute. The Open Data Institute (2013), http://www.theodi.org
United Kingdom Police Home Office. data.police.uk (2013), http://data.police.uk
Wilcoxon, F.: Individual comparisons by ranking methods. Biometrics Bulletin 1(6), 80–83 (1945)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Ceolin, D. et al. (2014). Two Procedures for Analyzing the Reliability of Open Government Data. In: Laurent, A., Strauss, O., Bouchon-Meunier, B., Yager, R.R. (eds) Information Processing and Management of Uncertainty in Knowledge-Based Systems. IPMU 2014. Communications in Computer and Information Science, vol 442. Springer, Cham. https://doi.org/10.1007/978-3-319-08795-5_3
Download citation
DOI: https://doi.org/10.1007/978-3-319-08795-5_3
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08794-8
Online ISBN: 978-3-319-08795-5
eBook Packages: Computer ScienceComputer Science (R0)