Abstract
This paper describes the implementation of a semantic web search engine on conversation styled transcripts. Our choice of data is Hansard, a publicly available conversation style transcript of parliamentary debates. The current search engine implementation on Hansard is limited to running search queries based on keywords or phrases hence lacks the ability to make semantic inferences from user queries. By making use of knowledge such as the relationship between members of parliament, constituencies, terms of office, as well as topics of debates the search results can be improved in terms of both relevance and coverage. Our contribution is not algorithmic instead we describe how we exploit a collection of external data sources, ontologies, semantic web vocabularies and named entity extraction in the analysis of underlying semantics of user queries as well as the semantic enrichment of the search index thereby improving the quality of results.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Emmerich, W.W.: Distributed Component Technologies and their Software Engineering Implications. In: Proceedings of the 24th International Conference on Software Engineering, Orlando, Florida, pp. 537–546 (2002)
Junhui, Y., Chan, H.: Keywords Weights Improvement and Application of Information Extraction. In: Gaol, F.L., Nguyen, Q.V. (eds.) Proc. of the 2011 2nd International Congress on CACS. AISC, vol. 144, pp. 95–100. Springer, Heidelberg (2012)
Lam, M.I., Gong, Z., Muyeba, M.K.: A Method for Web Information Extraction. In: Zhang, Y., Yu, G., Bertino, E., Xu, G. (eds.) APWeb 2008. LNCS, vol. 4976, pp. 383–394. Springer, Heidelberg (2008)
Liu, Y., Scheuermann, P., Li, X., Zhu, X.: Using WordNet to Disambiguate Word Senses for Text Classification. In: Shi, Y., van Albada, G.D., Dongarra, J., Sloot, P.M.A. (eds.) ICCS 2007, Part III. LNCS, vol. 4489, pp. 781–789. Springer, Heidelberg (2007)
Navigli, R.: Word Sense Disambiguation: A Survey. ACM Computing Surveys 41(2), Article No. 10 (February 2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Onyimadu, O., Nakata, K., Wang, Y., Wilson, T., Liu, K. (2013). Entity-Based Semantic Search on Conversational Transcripts Semantic. In: Takeda, H., Qu, Y., Mizoguchi, R., Kitamura, Y. (eds) Semantic Technology. JIST 2012. Lecture Notes in Computer Science, vol 7774. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-37996-3_27
Download citation
DOI: https://doi.org/10.1007/978-3-642-37996-3_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-37995-6
Online ISBN: 978-3-642-37996-3
eBook Packages: Computer ScienceComputer Science (R0)