Abstract
Regular path queries, or RPQs, are basic querying mechanisms on graphs that play an increasingly important role over the past decade. In recent years, large amounts of RDF data are published on the Web since the development of Linked Data. Such a large-scale of data has posed serious challenges to the efficiency of RPQs. In this paper, we devise a double-layer bi-directional index structure that has a linear space complexity, and propose a novel traversal-based algorithm TraPath that achieves the fast evaluation of RPQs by using the index structure. We conduct extensive experiments to evaluate and compare the performance of our prototype system and the Sesame RDF repository with a real-world RDF dataset from DBpedia. The experimental results show that TraPath significantly outperforms the state-of-the-art methods.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Klyne, G., Carroll, J.J., McBride, B.: RDF 1.1 Concepts and Abstract Syntax. W3C Recommendation (2014)
Jupp, S., Malone, J., Bolleman, J., et al.: The EBI RDF platform: Linked Open Data for the Life Sciences. Bioinformatics (2014)
Breslin, J., Decker, S.: The Future of Social Networks on the Internet: the Need for Semantics. IEEE Internet Computing 11(6), 86–90 (2007)
Harris, S., Seaborne, A.: SPARQL 1.1 Query Language. W3C Recommendation (2013)
Mendelzon, A.O., Wood, P.T.: Finding Regular Simple Paths in Graph Databases. SIAM Journal of Computing 24(6), 1235–1258 (1995)
Bizer, C., Heath, T., Berners-Lee, T.: Linked Data - The Story So Far. International Journal on Semantic Web and Information Systems 5(3), 1–22 (2009)
Neumann, T., Weikum, G.: RDF-3X: A RISC-Style Engine for RDF. In: Proceedings of the VLDB Endowment, vol. 1(1), pp. 647–659 (2008)
Ladwig, G., Harth, A.: CumulusRDF: Linked Data Management on Nested Key-Value Stores. In: 7th International Workshop on Scalable Semantic Web Knowledge Base Systems, pp. 30–42 (2011)
Wang, X., Jiang, L., Shi, H., Feng, Z., Du, P.: Jingwei+: A distributed large-scale RDF data server. In: Sheng, Q.Z., Wang, G., Jensen, C.S., Xu, G. (eds.) APWeb 2012. LNCS, vol. 7235, pp. 779–783. Springer, Heidelberg (2012)
Alkhateeb, F., Baget, J.F., Euzenat, J.: Extending SPARQL with Regular Expression Patterns (for Querying RDF). Journal of Web Semantics 7(2), 57–73 (2009)
Kochut, K.J., Janik, M.: SPARQLeR: Extended SPARQL for Semantic Association Discovery. In: Franconi, E., Kifer, M., May, W. (eds.) ESWC 2007. LNCS, vol. 4519, pp. 145–159. Springer, Heidelberg (2007)
Koschmieder, A., Leser, U.: Regular Path Queries on Large Graphs. In: Ailamaki, A., Bowers, S. (eds.) SSDBM 2012. LNCS, vol. 7338, pp. 177–194. Springer, Heidelberg (2012)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Wang, X., Rao, G., Jiang, L., Lyu, X., Yang, Y., Feng, Z. (2014). TraPath: Fast Regular Path Query Evaluation on Large-Scale RDF Graphs. In: Li, F., Li, G., Hwang, Sw., Yao, B., Zhang, Z. (eds) Web-Age Information Management. WAIM 2014. Lecture Notes in Computer Science, vol 8485. Springer, Cham. https://doi.org/10.1007/978-3-319-08010-9_39
Download citation
DOI: https://doi.org/10.1007/978-3-319-08010-9_39
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-08009-3
Online ISBN: 978-3-319-08010-9
eBook Packages: Computer ScienceComputer Science (R0)