Kernel Codebooks for Scene Categorization

van Gemert, Jan C.; Geusebroek, Jan-Mark; Veenman, Cor J.; Smeulders, Arnold W. M.

doi:10.1007/978-3-540-88690-7_52

Jan C. van Gemert⁴,
Jan-Mark Geusebroek⁴,
Cor J. Veenman⁴ &
…
Arnold W. M. Smeulders⁴

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 5304))

Included in the following conference series:

European Conference on Computer Vision

9060 Accesses
240 Citations

Abstract

This paper introduces a method for scene categorization by modeling ambiguity in the popular codebook approach. The codebook approach describes an image as a bag of discrete visual codewords, where the frequency distributions of these words are used for image categorization. There are two drawbacks to the traditional codebook model: codeword uncertainty and codeword plausibility. Both of these drawbacks stem from the hard assignment of visual features to a single codeword. We show that allowing a degree of ambiguity in assigning codewords improves categorization performance for three state-of-the-art datasets.

Download to read the full chapter text

Chapter PDF

Learning visual codebooks for image classification using spectral clustering

Article 27 November 2017

Open Issues on Codebook Generation in Image Classification Tasks

SceneNet: A Perceptual Ontology for Scene Understanding

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Leung, T., Malik, J.: Representing and recognizing the visual appearance of materials using three-dimensional textons. IJCV 43, 29–44 (2001)
Article MATH Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. IJCV 60, 91–110 (2004)
Article Google Scholar
Boutell, M., Luo, J., Brown, C.: Factor-graphs for region-based whole-scene classification. In: CVPR-SLAM (2006)
Google Scholar
van Gemert, J., Geusebroek, J., Veenman, C., Snoek, C., Smeulders, A.: Robust scene categorization by learning image statistics in context. In: CVPR-SLAM (2006)
Google Scholar
Vogel, J., Schiele, B.: Semantic modeling of natural scenes for content-based image retrieval. IJCV 72, 133–157 (2007)
Article Google Scholar
Bosch, A., Zisserman, A., Munoz, X.: Scene classification using a hybrid generative/discriminative approach. TPAMI 30, 712–727 (2008)
Article Google Scholar
Fei-Fei, L., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: CVPR (2005)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR, pp. 2169–2178 (2006)
Google Scholar
Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 490–503. Springer, Heidelberg (2006)
Chapter Google Scholar
Perronnin, F., Dance, C., Csurka, G., Bressan, M.: Adapted vocabularies for generic visual categorization. In: Leonardis, A., Bischof, H., Pinz, A. (eds.) ECCV 2006. LNCS, vol. 3954, pp. 464–475. Springer, Heidelberg (2006)
Chapter Google Scholar
Quelhas, P., Monay, F., Odobez, J., Gatica-Perez, D., Tuytelaars, T., Gool, L.V.: Modeling scenes with local descriptors and latent aspects. In: ICCV (2005)
Google Scholar
Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: ICCV, pp. 604–610 (2005)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: ICCV, vol. 2, pp. 1470–1477 (2003)
Google Scholar
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Heidelberg (2006)
MATH Google Scholar
Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
MATH Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: A holistic representation of the spatial envelope. IJCV 42, 145–175 (2001)
Article MATH Google Scholar
Snoek, C., Worring, M., van Gemert, J., Geusebroek, J., Smeulders, A.: The challenge problem for automated detection of 101 semantic concepts in multimedia. In: ACM Multimedia (2006)
Google Scholar
Naphade, M., Huang, T.: A probabilistic framework for semantic video indexing, filtering, and retrieval. Transactions on Multimedia 3, 141–151 (2001)
Article Google Scholar
Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: ICCV, pp. 1800–1807 (2005)
Google Scholar
Grauman, K., Darrell, T.: The pyramid match kernel: discriminative classification with sets of image features. In: ICCV, pp. 1458–1465 (2005)
Google Scholar
Bosch, A., Zisserman, A., Munoz, X.: Image classification using random forests and ferns. In: ICCV (2007)
Google Scholar
Larlus, D., Jurie, F.: Category level object segmentation. In: International Conference on Computer Vision Theory and Applications (2007)
Google Scholar
Marszałek, M., Schmid, C.: Accurate object localization with shape masks. In: CVPR (2007)
Google Scholar
Silverman, B., Green, P.: Density Estimation for Statistics and Data Analysis. Chapman and Hall, London (1986)
Book MATH Google Scholar
Fei-Fei, L., Fergus, R., Perona, P.: Learning generative visual models from few training examples: An incremental bayesian approach tested on 101 object categories. In: WGMBV (2004)
Google Scholar
Griffin, G., Holub, A., Perona, P.: Caltech-256 object category dataset. Technical Report UCB/CSD-04-1366, California Institute of Technology (2007)
Google Scholar
Chang, C., Lin, C.: LIBSVM: a library for support vector machines (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Intelligent Systems Lab Amsterdam (ISLA), University of Amsterdam, Kruislaan 403, 1098 SJ, Amsterdam, The Netherlands
Jan C. van Gemert, Jan-Mark Geusebroek, Cor J. Veenman & Arnold W. M. Smeulders

Authors

Jan C. van Gemert
View author publications
You can also search for this author in PubMed Google Scholar
Jan-Mark Geusebroek
View author publications
You can also search for this author in PubMed Google Scholar
Cor J. Veenman
View author publications
You can also search for this author in PubMed Google Scholar
Arnold W. M. Smeulders
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science Department, University of Illinois at Urbana Champaign, 3310 Siebel Hall, IL 61801, Urbana, USA
David Forsyth
Department of Computing, Oxford Brookes University, OX33 1HX, Wheatley, Oxford, UK
Philip Torr
Department of Engineering Science, University of Oxford, Parks Road, OX1 3PJ, Oxford, UK
Andrew Zisserman

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

van Gemert, J.C., Geusebroek, JM., Veenman, C.J., Smeulders, A.W.M. (2008). Kernel Codebooks for Scene Categorization. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5304. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88690-7_52

Download citation

DOI: https://doi.org/10.1007/978-3-540-88690-7_52
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88689-1
Online ISBN: 978-3-540-88690-7
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Kernel Codebooks for Scene Categorization

Abstract

Chapter PDF

Similar content being viewed by others

Learning visual codebooks for image classification using spectral clustering

Open Issues on Codebook Generation in Image Classification Tasks

SceneNet: A Perceptual Ontology for Scene Understanding

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Kernel Codebooks for Scene Categorization

Abstract

Chapter PDF

Similar content being viewed by others

Learning visual codebooks for image classification using spectral clustering

Open Issues on Codebook Generation in Image Classification Tasks

SceneNet: A Perceptual Ontology for Scene Understanding

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation