Abstract
Pseudo-relevance feedback is a popular and widely accepted query reformulation strategy for document retrieval and re-ranking. However, problems arise in this task when assumed-to-be relevant documents are actually irrelevant which causes a drift in the focus of the reformulated query. This paper focuses on news story retrieval and re-ranking, and offers a new perspective through the exploration of the pair-wise constraints derived from video near-duplicates for constraint-driven re-ranking. We propose a novel application of PageRank, which is a pseudo-relevance feedback algorithm, and use the constraints built on top of text to improve the relevance quality. Real-time experiments were conducted using a large-scale broadcast video database that contains more than 34,000 news stories.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
References
Rocchio, J.: Relevance Feedback in Information Retrieval. The SMART Retrieval System (1971)
Brin, S., Page, L.: The Anatomy of a Large-Scale Hypertextual Web Search Engine. Computer Networks 30, 107–117 (1998)
Ide, I., Mo, H., Katayama, N., Satoh, S.: Topic Threading for Structuring a Large-Scale News Video Archive. In: Enser, P.G.B., Kompatsiaris, Y., O’Connor, N.E., Smeaton, A., Smeulders, A.W.M. (eds.) CIVR 2004. LNCS, vol. 3115, pp. 123–131. Springer, Heidelberg (2004)
Mo, H., Yamagishi, F., Ide, I., Katayama, N., Satoh, S., Sakauchi, M.: Key Image Extraction from a News Video Archive for Visualizing Its Semantic Structure. In: Aizawa, K., Nakamura, Y., Satoh, S. (eds.) PCM 2004. LNCS, vol. 3331, pp. 650–657. Springer, Heidelberg (2004)
Qin, Z., Liu, L., Zhang, S.: Mining Term Association Rules for Heuristic Query Construction. In: Dai, H., Srikant, R., Zhang, C. (eds.) PAKDD 2004. LNCS (LNAI), vol. 3056, pp. 145–154. Springer, Heidelberg (2004)
Zhai, Y., Shah, M.: Tracking news stories across different sources. In: ACM Multimedia, pp. 2–10 (2005)
Zhang, B., Li, H., Liu, Y., Ji, L., Xi, W., Fan, W., Chen, Z., Ma, W.-Y.: Improving web search results using affinity graph. In: SIGIR, pp. 504–511 (2005)
Hsu, W.H., Chang, S.-F.: Topic Tracking Across Broadcast News Videos with Visual Duplicates and Semantic Concepts. In: ICIP, pp. 141–144 (2006)
Ngo, C.-W., Zhao, W., Jiang, Y.-G.: Fast tracking of near-duplicate keyframes in broadcast domain with transitivity propagation. In: ACM Multimedia, pp. 845–854 (2006)
Song, M., Song, I.-Y., Hu, X., Allen, R.B.: Integration of association rules and ontologies for semantic query expansion. Data Knowl. Eng. 63, 63–75 (2007)
Lin, F., Liang, C.-H.: Storyline-based summarization for news topic retrospection. Decis. Support Syst. 45, 473–490 (2008)
Wan, X., Yang, J.: Multi-document summarization using cluster-based link analysis. In: SIGIR, pp. 299–306 (2008)
Wu, X., Ngo, C.-W., Hauptmann, A.G.: Multimodal News Story Clustering With Pairwise Visual Near-Duplicate Constraint. IEEE Transactions on Multimedia 10, 188–199 (2008)
Otterbacher, J., Erkan, G., Radev, D.R.: Biased LexRank: Passage retrieval using random walks with question-based priors. Inf. Process. Manage. 45, 42–54 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, X., Ide, I., Satoh, S. (2010). PageRank with Text Similarity and Video Near-Duplicate Constraints for News Story Re-ranking. In: Boll, S., Tian, Q., Zhang, L., Zhang, Z., Chen, YP.P. (eds) Advances in Multimedia Modeling. MMM 2010. Lecture Notes in Computer Science, vol 5916. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11301-7_53
Download citation
DOI: https://doi.org/10.1007/978-3-642-11301-7_53
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11300-0
Online ISBN: 978-3-642-11301-7
eBook Packages: Computer ScienceComputer Science (R0)