Learning Class-to-Image Distance via Large Margin and L1-Norm Regularization

Wang, Zhengxiang; Gao, Shenghua; Chia, Liang-Tien

doi:10.1007/978-3-642-33709-3_17

Zhengxiang Wang^21,22,
Shenghua Gao^21,23 &
Liang-Tien Chia²¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7573))

Included in the following conference series:

European Conference on Computer Vision

11k Accesses
6 Citations

Abstract

Image-to-Class (I2C) distance has demonstrated its effectiveness for object recognition in several single-label datasets. However, for the multi-label problem, where an image may contain several regions belonging to different classes, this distance may not work well since it cannot discriminate local features from different regions in the test image and all local features have to be counted in the I2C distance calculation. In this paper, we propose to use Class-to-Image (C2I) distance and show that this distance performs better than I2C distance for multi-label image classification. However, since the number of local features in a class is huge compared to that in an image, the calculation of C2I distance is much more expensive than I2C distance. Moreover, the label information of training images can be used to help select relevant local features for each class and further improve the recognition performance. Therefore, to make C2I distance faster and perform better, we propose an optimization algorithm using L1-norm regularization and large margin constraint to learn the C2I distance, which will not only reduce the number of local features in the class feature set, but also improve the performance of C2I distance due to the use of label information. Experiments on MSRC, Pascal VOC and MirFlickr datasets show that our method can significantly speed up the C2I distance calculation, while achieves better recognition performance than the original C2I distance and other related methods for multi-labeled datasets.

Download to read the full chapter text

Chapter PDF

Image Annotation by Learning Label-Specific Distance Metrics

Nearest Neighbor with Multi-feature Metric for Image Annotation

Image Annotation by Propagating Labels from Semantic Neighbourhoods

Article 12 July 2016

References

Boiman, O., Shechtman, E., Irani, M.: In defense of nearest-neighbor based image classification. In: CVPR (2008)
Google Scholar
Bradley, P.S., Mangasarian, O.L.: Feature selection via concave minimization and support vector machines. In: ICML (1998)
Google Scholar
Yuan, G.X., Chang, K.W., Hsieh, C.J., Lin, C.J.: A comparison of optimization methods and software for large-scale L1-regularized linear classification. JMLR 11(52) (2010)
Google Scholar
Perronnin, F., Liu, Y., Sanchez, J., Poirier, H.: Large-scale image retrieval with compressed fisher vectors. In: CVPR (2010)
Google Scholar
Frome, A., Singer, Y., Sha, F., Malik, J.: Learning globally-consistent local distance functions for shape-based image retrieval and classification. In: ICCV (October 2007)
Google Scholar
Frome, A., Singer, Y., Malik, J.: Image retrieval and classification using local distance functions. In: NIPS, vol. 19 (2006)
Google Scholar
Wang, Z., Hu, Y., Chia, L.-T.: Image-to-Class Distance Metric Learning for Image Classification. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part I. LNCS, vol. 6311, pp. 706–719. Springer, Heidelberg (2010)
Chapter Google Scholar
Wang, Z., Hu, Y., Chia, L.T.: Improved learning of i2c distance and accelerating the neighborhood search for image classification. Pattern Recognition 44(10-11), 2384–2394 (2011)
Article Google Scholar
Behmo, R., Marcombes, P., Dalalyan, A., Prinet, V.: Towards Optimal Naive Bayes Nearest Neighbor. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 171–184. Springer, Heidelberg (2010)
Chapter Google Scholar
Tuytelaars, T., Fritz, M., Saenko, K., Darrell, T.: The nbnn kernel. In: ICCV, pp. 1824–1831 (2011)
Google Scholar
McCann, S., Lowe, D.G.: Local naive bayes nearest neighbor for image classification. In: CVPR (2012)
Google Scholar
Wang, H., Nie, F., Huang, H.: Learning instance specific distance for multi-instance classification. In: AAAI (2011)
Google Scholar
Wang, H., Huang, H., Kamangar, F., Nie, F., Ding, C.: Maximum margin multi-instance learning. In: NIPS, vol. 24 (2011)
Google Scholar
Verbeek, J., Guillaumin, M., Mensink, T., Schmid, C.: Image annotation with tagprop on the mirflickr set. In: MIR (2010)
Google Scholar
Lampert, C.H.: Maximum margin multi-label structured prediction. In: NIPS, vol. 24 (2011)
Google Scholar
Weinberger, K.Q., Saul, L.K.: Distance metric learning for large margin nearest neighbor classification. JMLR 10, 207–244 (2009)
MATH Google Scholar
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2011 (VOC 2011) (2011) (Results)
Google Scholar
Huiskes, M.J., Thomee, B., Lew, M.S.: New trends and ideas in visual concept detection: The mir flickr retrieval evaluation initiative. In: MIR, pp. 527–536 (2010)
Google Scholar
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. IJCV 60(2) (2004)
Google Scholar
Chum, O., Zisserman, A.: An exemplar model for learning object classes. In: CVPR (2007)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: a text retrieval approach to object matching in videos. In: ICCV (2003)
Google Scholar
Vedaldi, A., Fulkerson, B.: VLFeat: An open and portable library of computer vision algorithms (2008), http://www.vlfeat.org/
Timofte, R., Gool, L.V.: Iterative nearest neighbors for classification and dimensionality reduction. In: CVPR (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Nanyang Technological University, Singapore
Zhengxiang Wang, Shenghua Gao & Liang-Tien Chia
Fujitsu Research & Development Center Co., Ltd, Beijing, China
Zhengxiang Wang
Advanced Digital Sciences Center, Singapore
Shenghua Gao

Authors

Zhengxiang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Shenghua Gao
View author publications
You can also search for this author in PubMed Google Scholar
Liang-Tien Chia
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd, CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, Z., Gao, S., Chia, LT. (2012). Learning Class-to-Image Distance via Large Margin and L1-Norm Regularization. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7573. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33709-3_17

Download citation

DOI: https://doi.org/10.1007/978-3-642-33709-3_17
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33708-6
Online ISBN: 978-3-642-33709-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Learning Class-to-Image Distance via Large Margin and L1-Norm Regularization

Abstract

Chapter PDF

Similar content being viewed by others

Image Annotation by Learning Label-Specific Distance Metrics

Nearest Neighbor with Multi-feature Metric for Image Annotation

Image Annotation by Propagating Labels from Semantic Neighbourhoods

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Learning Class-to-Image Distance via Large Margin and L1-Norm Regularization

Abstract

Chapter PDF

Similar content being viewed by others

Image Annotation by Learning Label-Specific Distance Metrics

Nearest Neighbor with Multi-feature Metric for Image Annotation

Image Annotation by Propagating Labels from Semantic Neighbourhoods

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation