Efficient Bag of Words Based Concept Extraction for Visual Object Retrieval

Ergun, Hilal; Sert, Mustafa

doi:10.1007/978-3-319-26154-6_30

Hilal Ergun¹² &
Mustafa Sert¹²

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 400))

583 Accesses
4 Citations

Abstract

Recent burst of multimedia content available on Internet is pushing expectations on multimedia retrieval systems to even higher grounds. Multimedia retrieval systems should offer better performance both in terms of speed and memory consumption while maintaining good accuracy compared to state-of-the-art implementations. In this paper, we discuss alternative implementations of visual object retrieval systems based on popular bag of words model and show optimal selection of processing steps. We demonstrate our offering using both keyword and example-based retrieval queries on three frequently used benchmark databases, namely Oxford, Paris and Pascal VOC 2007. Additionally, we investigate effect of different distance comparison metrics on retrieval accuracy. Results show that, relatively simple but efficient vector quantization can compete with more sophisticated feature encoding schemes together with the adapted inverted index structure.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

An Analysis of BoVW and cBoVW Based Image Retrieval

Exclusive Visual Descriptor Quantization

Enhanced bag of visual words representations for content based image retrieval: a comparative study

Article 27 May 2019

Keywords

References

Arandjelovic, R., Zisserman, A.: Three things everyone should know to improve object retrieval. In: 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2911–2918, June 2012
Google Scholar
Boureau, Y.-L., Bach, F., LeCun, Y., Ponce, J.: Learning mid-level features for recognition. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2559–2566, June 2010
Google Scholar
Bradski, G.: Dr. Dobb’s Journal of Software Tools (2000)
Google Scholar
Chang, C.-C., Lin, C.-J.: LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology 2, 27:1–27:27 (2011)
Article Google Scholar
Chatfield, K., Lempitsky, V.S., Vedaldi, A., Zisserman, A.: The devil is in the details: an evaluation of recent feature encoding methods. In: BMVC, vol. 2, p. 8 (2011)
Google Scholar
Chum, O., Philbin, J., Sivic, J., Isard, M., Zisserman, A.: Total recall: automatic query expansion with a generative feature model for object retrieval. In: IEEE 11th Int’l Conf. on Computer Vision, ICCV 2007, pp. 1–8. IEEE (2007)
Google Scholar
Everingham, M. Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2007 (VOC2007) Results
Google Scholar
Jegou, H., Douze, M., Schmid, C.: Hamming embedding and weak geometric consistency for large scale image search. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 304–317. Springer, Heidelberg (2008)
Chapter Google Scholar
Jiang, Y.-G., Ngo, C.-W., Yang, J.: Towards optimal bag-of-features for object categorization and semantic video retrieval. In: Proceedings of the 6th ACM International Conference on Image and Video Retrieval, pp. 494–501. ACM (2007)
Google Scholar
Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: IEEE Int’l Conf. on Computer Vision (ICCV 2005), vol. 1, pp. 604–610, October 2005
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, pp. 2169–2178 (2006)
Google Scholar
Liu, J.: Image retrieval based on bag-of-words model (2013). CoRR, abs/1304.5168
Google Scholar
Lowe, D.G.: Object recognition from local scale-invariant features. In: The proc. of the 7th IEEE Int’l Conf. on Computer Vision, 1999, vol. 2, pp. 1150–1157. IEEE (1999)
Google Scholar
Marszałek, M., Schmid, C., Harzallah, H., Van De Weijer, J.: Learning object representations for visual object class recognition. In: Visual Recognition Challange Workshop, in Conjunction with ICCV (2007)
Google Scholar
Perd’och, M., Chum, O., Matas, J.: Efficient representation of local geometry for large scale object retrieval. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 9–16. IEEE (2009)
Google Scholar
Perronnin, F., Dance, C.: Fisher kernels on visual vocabularies for image categorization. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8. IEEE (2007)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2007, pp. 1–8. IEEE (2007)
Google Scholar
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Lost in quantization: improving particular object retrieval in large scale image databases. In: IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) 2008, pp. 1–8. IEEE (2008)
Google Scholar
Sert, M., Ergun, H.: Video scene classification using spatial pyramid based features. In: 2014 22nd Signal Processing and Communications Applications Conference (SIU), pp. 1946–1949, April 2014
Google Scholar
Sivic, J., Zisserman, A.: Video google: a text retrieval approach to object matching in videos. In: Proceedings of the Ninth IEEE International Conference on Computer Vision, 2003, vol. 2, pp. 1470–1477, October 2003
Google Scholar
Van De Sande, K.E., Gevers, T., Snoek, C.G.: Evaluating color descriptors for object and scene recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(9), 1582–1596 (2010)
Article Google Scholar
Vedaldi, A., Fulkerson, B.: VLFeat: An open and portable library of computer vision algorithms (2008)
Google Scholar
Vedaldi, A., Zisserman, A.: Efficient additive kernels via explicit feature maps. IEEE Transactions on Pattern Analysis and Machine Intelligence 34(3), 480–492 (2012)
Article Google Scholar
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3360–3367, June 2010
Google Scholar
Yan, Z., Yu, Y.: Sparse similarity matrix learning for visual object retrieval. In: The 2013 Int’l Joint Conf. on Neural Networks (IJCNN), pp. 1–8. IEEE (2013)
Google Scholar
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2009, pp. 1794–1801, June 2009
Google Scholar
Zhang, J., Marszałek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision 73(2), 213–238 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Baskent University, Ankara, Turkey
Hilal Ergun & Mustafa Sert

Authors

Hilal Ergun
View author publications
You can also search for this author in PubMed Google Scholar
Mustafa Sert
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mustafa Sert .

Editor information

Editors and Affiliations

Dept. of Comm., Business & Info. Tech., Roskilde University, Roskilde, Denmark
Troels Andreasen
Dept. Computer Science, Roskilde University, Roskilde, Denmark
Henning Christiansen
Sys Res Intit of Polish Acad of Science, Intelligent Systems Laboratory, Warsaw, Poland
Janusz Kacprzyk
Department of Electronic Systems, Aalborg University, Esbjerg, Denmark
Henrik Larsen
Computer Science Department, University Milano-Biccoca, Milano, Italy
Gabriella Pasi
IRISA, University of Rennes I, Rennes, France
Olivier Pivert
Dept. of Tele. & Info. Processing, Ghent University, Gent, Belgium
Guy De Tré
Dept. of Comp. & Art. Int. Sci., University of Granada, Granada, Spain
Maria Amparo Vila
Department of Computer Engineering, Middle East Technical University, Ankara, Turkey
Adnan Yazici
Systems Research Institute, Polish Academy of Sciences, Warsaw, Poland
Sławomir Zadrożny

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ergun, H., Sert, M. (2016). Efficient Bag of Words Based Concept Extraction for Visual Object Retrieval. In: Andreasen, T., et al. Flexible Query Answering Systems 2015. Advances in Intelligent Systems and Computing, vol 400. Springer, Cham. https://doi.org/10.1007/978-3-319-26154-6_30

Download citation

DOI: https://doi.org/10.1007/978-3-319-26154-6_30
Published: 21 October 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26153-9
Online ISBN: 978-3-319-26154-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Efficient Bag of Words Based Concept Extraction for Visual Object Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

An Analysis of BoVW and cBoVW Based Image Retrieval

Exclusive Visual Descriptor Quantization

Enhanced bag of visual words representations for content based image retrieval: a comparative study

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Efficient Bag of Words Based Concept Extraction for Visual Object Retrieval

Abstract

Chapter PDF

Similar content being viewed by others

An Analysis of BoVW and cBoVW Based Image Retrieval

Exclusive Visual Descriptor Quantization

Enhanced bag of visual words representations for content based image retrieval: a comparative study

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation