Abstract
As the Web of Data grows at an ever-increasing speed, the lack of reliable query solutions for live public data becomes apparent. SPARQL implementations have matured and deliver impressive performance for public SPARQL endpoints, yet poor availability, especially under high loads, prevents their use in real-world applications. We propose to tackle this availability problem by defining triple pattern fragments, a specific kind of Linked Data Fragments that enable low-cost publication of queryable data by moving intelligence from the server to the client. This paper formalizes the Linked Data Fragments concept, introduces a client-side SPARQL query processing algorithm that uses a dynamic iterator pipeline, and verifies servers’ availability under load. The results indicate that, at the cost of lower performance, query techniques with triple pattern fragments lead to high availability, thereby allowing for reliable applications on top of public, queryable Linked Data.
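To make the division of labor concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes a server that can only answer single triple patterns and report how many triples match, and a client that evaluates a conjunction of patterns by always expanding the pattern with the fewest matches first. The names `FragmentServer`, `fragment`, `substitute`, and `evaluate` are hypothetical, the data is invented for illustration, and HTTP, paging, and hypermedia controls are left out.

```python
from typing import Dict, Iterator, List, Optional, Tuple

Triple = Tuple[str, str, str]
Pattern = Tuple[str, str, str]  # terms starting with "?" are variables
Binding = Dict[str, str]


def is_var(term: str) -> bool:
    return term.startswith("?")


class FragmentServer:
    """Hypothetical stand-in for a server that only answers single triple patterns."""

    def __init__(self, triples: List[Triple]) -> None:
        self.triples = triples

    def fragment(self, pattern: Pattern) -> Tuple[List[Triple], int]:
        """Return the matching triples and their count (no paging in this sketch)."""
        matches = [
            triple for triple in self.triples
            if all(is_var(p) or p == v for p, v in zip(pattern, triple))
        ]
        return matches, len(matches)


def substitute(pattern: Pattern, binding: Binding) -> Pattern:
    """Replace already-bound variables in a pattern by their values."""
    s, p, o = (binding.get(term, term) for term in pattern)
    return (s, p, o)


def evaluate(server: FragmentServer, patterns: List[Pattern],
             binding: Optional[Binding] = None) -> Iterator[Binding]:
    """Client-side evaluation: lazily yield every binding that satisfies all patterns."""
    binding = binding or {}
    if not patterns:
        yield dict(binding)
        return
    # Assumption of this sketch: use the reported counts to pick the
    # most selective remaining pattern, then recurse on the others.
    bound = [substitute(p, binding) for p in patterns]
    results = [server.fragment(p) for p in bound]
    best = min(range(len(bound)), key=lambda i: results[i][1])
    rest = patterns[:best] + patterns[best + 1:]
    for triple in results[best][0]:
        extended = dict(binding)
        consistent = True
        for term, value in zip(bound[best], triple):
            # A variable that occurs twice must receive the same value both times.
            if is_var(term) and extended.setdefault(term, value) != value:
                consistent = False
                break
        if consistent:
            yield from evaluate(server, rest, extended)


# Invented example data: who knows someone living in "ghent"?
server = FragmentServer([
    ("alice", "knows", "bob"),
    ("bob", "knows", "carol"),
    ("alice", "livesIn", "ghent"),
    ("carol", "livesIn", "ghent"),
])
query = [("?x", "knows", "?y"), ("?y", "livesIn", "ghent")]
solutions = list(evaluate(server, query))
```

In this sketch the server does nothing beyond filtering and counting, which is the kind of cheap operation the abstract has in mind when it speaks of low-cost publication. The join logic sits entirely in `evaluate`, a generator that produces solutions on demand.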
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Verborgh, R. et al. (2014). Querying Datasets on the Web with High Availability. In: Mika, P., et al. The Semantic Web – ISWC 2014. ISWC 2014. Lecture Notes in Computer Science, vol 8796. Springer, Cham. https://doi.org/10.1007/978-3-319-11964-9_12
DOI: https://doi.org/10.1007/978-3-319-11964-9_12
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11963-2
Online ISBN: 978-3-319-11964-9
eBook Packages: Computer Science, Computer Science (R0)