Abstract
Being a part of the Information Age, users are challenged with a tremendously growing amount of Web data which generates a need for more sophisticated information retrieval systems. The Semantic Web provides necessary procedures to augment the highly unstructured Web with suitable metadata in order to leverage search quality and user experience. In this article, we will outline an approach for creating a web-scale, precise and efficient information system capable of understanding keyword, entity and natural language queries. By using Semantic Web methods and Linked Data the doctoral work will present how the underlying knowledge is created and elaborated searches can be performed on top.
Chapter PDF
Similar content being viewed by others
References
SPARQL query language for RDF. Technical report, World Wide Web Consortium (January 2008)
Adida, B., Birbeck, M.: RDFa primer 1.0 embedding RDF in XHTML. W3c working draft, W3C (October 2007)
Auer, S., et al.: Managing the life-cycle of linked data with the LOD2 stack. In: Cudré-Mauroux, P., et al. (eds.) ISWC 2012, Part II. LNCS, vol. 7650, pp. 1–16. Springer, Heidelberg (2012)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual web search engine. In: Computer Networks and ISDN Systems, pp. 107–117. Elsevier Science Publishers B. V. (1998)
Bühmann, Usbeck, Ngomo Ngonga, Saleem, Crescenzi, Merialdo, Qui, Both: REX - Web-Scale Extension of RDF Knowledge Bases. Submitted to 11th Extended Semantic Web Conference, Anissaras, Crete, Greece, May 25-29 (2014)
Campinas, S., Ceccarelli, D., Perry, T.E., Delbru, R., Balog, K., Tummarello, G.: The sindice-2011 dataset for entity-oriented search in the web of data. In: 1st Int. Workshop on Entity-Oriented Search (EOS), pp. 26–32 (2011)
Cornolti, M., Ferragina, P., Ciaramita, M.: A framework for benchmarking entity-annotation systems. In: Proceedings of the 22nd International Conference on World Wide Web, WWW 2013, pp. 249–260. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva (2013)
Crescenzi, V., Merialdo, P., Qiu, D.: A framework for learning web wrappers from the crowd. In: Proceedings of the 22nd International Conference on World Wide Web, WWW 2013, pp. 261–272. International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva (2013)
Ding, L., Pan, R., Finin, T., Joshi, A., Peng, Y., Kolari, P.: Finding and ranking knowledge on the semantic web. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, pp. 156–170. Springer, Heidelberg (2005)
Flesca, S., Manco, G., Masciari, E., Rende, E., Tagarelli, A.: Web wrapper induction: a brief survey. AI Communications 17(2), 57–61 (2004)
Gentile, A.L., Zhang, Z., Augenstein, I., Ciravegna, F.: Unsupervised wrapper induction using linked data. In: Proceedings of the Seventh International Conference on Knowledge Capture, K-CAP 2013, pp. 41–48. ACM, New York (2013)
He, X., Baker, M.: xhrank: Ranking entities on the semantic web. In: ISWC Posters & Demos 2010 (2010)
He, X., Baker, M.: A graph-based approach to indexing semantic web data. In: 9th International Semantic Web Conference, ISWC 2010 (November 2010)
Hellmann, S., Lehmann, J., Auer, S., Brümmer, M.: Integrating NLP using linked data. In: Alani, H., et al. (eds.) ISWC 2013, Part II. LNCS, vol. 8219, pp. 98–113. Springer, Heidelberg (2013)
Hoffart, J., Yosef, M.A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., Weikum, G.: Robust Disambiguation of Named Entities in Text. In: Conference on Empirical Methods in Natural Language Processing, EMNLP 2011, Edinburgh, Scotland, pp. 782–792 (2011)
Hogan, A., Harth, A., Decker, S.: Reconrank: A scalable ranking method for semantic web data with context. In: 2nd Workshop on Scalable Semantic Web Knowledge Base Systems (2006)
Hogue, A., Karger, D.: Thresher: automating the unwrapping of semantic content from the world wide web. In: Proceedings of the 14th International Conference on World Wide Web, WWW 2005, pp. 86–95. ACM, New York (2005)
Klein, D., Manning, C.D.: Fast exact inference with a factored model for natural language parsing. In: NIPS, pp. 3–10 (2002)
Kleinberg, J.M.: Authoritative sources in a hyperlinked environment. J. ACM 46(5), 604–632 (1999)
Manning, C.D., Raghavan, P., Schtze, H.: Introduction to Information Retrieval (2008)
Manola, F., Miller, E. (eds.): RDF Primer. W3C Recommendation. World Wide Web Consortium (February 2004)
Mayfield, J., Finnin, T.: Information retrieval on the Semantic Web: Integrating inference and retrieval. In: Workshop on the Semantic Web at the 26th Intl. ACM SIGIR Conf. on Research and Development in Information Retrieval, Toronto, Canada (2003)
Mendes, P.N., Jakob, M., Garcia-Silva, A., Bizer, C.: Dbpedia spotlight: Shedding light on the web of documents. In: Proceedings of the 7th International Conference on Semantic Systems, I-Semantics (2011)
Ngonga, A.: Generating conjunctive queries for keyword search on rdf data. In: Sixth ACM WSDM (Web Search and Data Mining) Conference (2013) (submitted)
Ngonga Ngomo, A.-C., Heino, N., Lyko, K., Speck, R., Kaltenböck, M.: SCMS – semantifying content management systems. In: Aroyo, L., Welty, C., Alani, H., Taylor, J., Bernstein, A., Kagal, L., Noy, N., Blomqvist, E. (eds.) ISWC 2011, Part II. LNCS, vol. 7032, pp. 189–204. Springer, Heidelberg (2011)
Sharma, A.K., Gupta, P.: Ontology driven pre and post ranking based information retrieval in web search engines (2012)
Röder, M., Usbeck, R., Gerber, D., Hellmann, S., Both, A.: A collection of datasets for named entity recognition and disambiguation in the nlp interchange format N3. In: LREC. European Language Resources Association, ELRA (2014)
Shah, U., Finin, T., Joshi, A., Cost, R.S., Matfield, J.: Information retrieval on the semantic web. In: Proceedings of the Eleventh International Conference on Information and Knowledge Management, CIKM 2002 (2002)
Shekarpour, S., Marx, E., Ngomo, A.-C.N., Auer, S.: Sina: Semantic interpretation of user queries for question answering on interlinked data. Submitted to Journal of Web Semantics (2013)
Singhal, A.: The end of search as we know it. Presentation at Google I/O, San Francisco (2013)
Smyth, B., Balfe, E., Boydell, O., Bradley, K., Briggs, P., Coyle, M., Freyne, J.: A live-user evaluation of collaborative web search. In: IJCAI, pp. 1419–1424 (2005)
Stoyanovich, J., Bedathur, S.J., Berberich, K., Weikum, G.: Entityauthority: Semantically enriched graph-based authority propagation. In: WebDB (2007)
Unger, C., Bühmann, L., Lehmann, J., Ngonga Ngomo, A.-C., Gerber, D., Cimiano, P.: Template-based question answering over rdf data. In: Proceedings of the 21st International Conference on World Wide Web, pp. 639–648. ACM (2012)
Usbeck, Ngonga Ngomo, Roeder, Auer, Gerber, Both: AGDISTIS - Agnostic Disambiguation of Named Entities Using Linked Open Data. Submitted to 11th Extended Semantic Web Conference, Anissaras, Crete, Greece, May 25-29, 2014 (2013)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Usbeck, R. (2014). Combining Linked Data and Statistical Information Retrieval. In: Presutti, V., d’Amato, C., Gandon, F., d’Aquin, M., Staab, S., Tordai, A. (eds) The Semantic Web: Trends and Challenges. ESWC 2014. Lecture Notes in Computer Science, vol 8465. Springer, Cham. https://doi.org/10.1007/978-3-319-07443-6_58
Download citation
DOI: https://doi.org/10.1007/978-3-319-07443-6_58
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-07442-9
Online ISBN: 978-3-319-07443-6
eBook Packages: Computer ScienceComputer Science (R0)