Abstract
Class hierarchies are commonly used to reduce the complexity of the classification problem. This is crucial when dealing with a large number of categories. In this work, we evaluate class hierarchies currently constructed for visual recognition. We show that top-down as well as bottom-up approaches, which are commonly used to automatically construct hierarchies, incorporate assumptions about the separability of classes. Those assumptions do not hold for visual recognition of a large number of object categories. We therefore propose a modification which is appropriate for most top-down approaches. It allows to construct class hierarchies that postpone decisions in the presence of uncertainty and thus provide higher recognition accuracy. We also compare our method to a one-against-all approach and show how to control the speed-for-accuracy trade-off with our method. For the experimental evaluation, we use the Caltech-256 visual object classes dataset and compare to state-of-the-art methods.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Everingham, M., van Gool, L., Williams, C., Winn, J., Zisserman, A.: Overview and results of classification challenge. In: The PASCAL VOC 2007 Challenge Workshop, in conj. with ICCV (2007)
Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. IJCV (2004)
Lindeberg, T.: Feature detection with automatic scale selection. IJCV (1998)
Lowe, D.: Distinctive image features form scale-invariant keypoints. IJCV (2004)
Willamowski, J., Arregui, D., Csurka, G., Dance, C.R., Fan, L.: Categorizing nine visual classes using local appearance descriptors. In: IWLAVS (2004)
Schölkopf, B., Smola, A.: Learning with Kernels: Support Vector Machines, Regularization, Optimization and Beyond (2002)
Lazebnik, S., Schmid, C., Ponce, J.: Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: CVPR (2006)
Zhang, J., Marszalek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classification of texture and object categories: A comprehensive study. IJCV (2007)
Fei-Fei, L., Fergus, R., Perona, P.: One-shot learning of object categories. PAMI (2007)
Griffin, G., Holub, A., Perona, P.: Caltech-256 object category dataset. Technical report (2007)
Chen, Y., Crawford, M., Ghosh, J.: Integrating support vector machines in a hierarchical output space decomposition framework. In: IGARSS (2004)
Liu, S., Yi, H., Chia, L.T., Deepu, R.: Adaptive hierarchical multi-class SVM classifier for texture-based image classification. In: ICME (2005)
Zhigang, L., Wenzhong, S., Qianqing, Q., Xiaowen, L., Donghui, X.: Hierarchical support vector machines. In: IGARSS (2005)
Yuan, X., Lai, W., Mei, T., Hua, X., Wu, X., Li, S.: Automatic video genre categorization using hierarchical SVM. In: ICIP (2006)
Zweig, A., Weinshall, D.: Exploiting object hierarchy: Combining models from different category levels. In: ICCV (2007)
He, X., Zemel, R.: Latent topic random fields: Learning using a taxonomy of labels. In: CVPR (2008)
Griffin, G., Perona, P.: Learning and using taxonomies for fast visual category recognition. In: CVPR (2008)
Nister, D., Stewenius, H.: Scalable recognition with a vocabulary tree. In: CVPR (2006)
Philbin, J., Chum, O., Isard, M., Sivic, J., Zisserman, A.: Object retrieval with large vocabularies and fast spatial matching. In: CVPR (2007)
Casasent, D., Wang, Y.C.: A hierarchical classifier using new support vector machines for automatic target recognition. Neural Networks (2005)
Shi, J., Malik, J.: Normalized cuts and image segmentation. PAMI (2000)
Rahimi, A., Recht, B.: Clustering with normalized cuts is clustering with a hyperplane. In: SLCV (2004)
Fowlkes, C., Belongie, S., Chung, F., Malik, J.: Spectral grouping using the Nyström method. PAMI (2004)
Everingham, M., Zisserman, A., Williams, C., van Gool, L.: The PASCAL visual object classes challenge 2006 (VOC 2006) results. Technical report (2006)
Marszałek, M., Schmid, C.: Semantic hierarchies for visual object recognition. In: CVPR (2007)
Perona, P., Griffin, G., Spain, M.: The Caltech 256 Workshop. In: Conj. with ICCV (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Marszałek, M., Schmid, C. (2008). Constructing Category Hierarchies for Visual Recognition. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5305. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88693-8_35
Download citation
DOI: https://doi.org/10.1007/978-3-540-88693-8_35
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88692-1
Online ISBN: 978-3-540-88693-8
eBook Packages: Computer ScienceComputer Science (R0)