Taxonomic Multi-class Prediction and Person Layout Using Efficient Structured Ranking

Mittal, Arpit; Blaschko, Matthew B.; Zisserman, Andrew; Torr, Philip H. S.

doi:10.1007/978-3-642-33709-3_18

Arpit Mittal²¹,
Matthew B. Blaschko²²,
Andrew Zisserman²¹ &
…
Philip H. S. Torr²³

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7573))

Included in the following conference series:

European Conference on Computer Vision

11k Accesses
7 Citations

Abstract

In computer vision efficient multi-class classification is becoming a key problem as the field develops and the number of object classes to be identified increases. Often objects might have some sort of structure such as a taxonomy in which the mis-classification score for object classes close by, using tree distance within the taxonomy, should be less than for those far apart. This is an example of multi-class classification in which the loss function has a special structure. Another example in vision is for the ubiquitous pictorial structure or parts based model. In this case we would like the mis-classification score to be proportional to the number of parts misclassified.

It transpires both of these are examples of structured output ranking problems. However, so far no efficient large scale algorithm for this problem has been demonstrated. In this work we propose an algorithm for structured output ranking that can be trained in a time linear in the number of samples under a mild assumption common to many computer vision problems: that the loss function can be discretized into a small number of values.

We show the feasibility of structured ranking on these two core computer vision problems and demonstrate a consistent and substantial improvement over competing techniques. Aside from this, we also achieve state-of-the art results for the PASCAL VOC human layout problem.

Download to read the full chapter text

Chapter PDF

An $$\mathcal {O}(n \log n)$$ Cutting Plane Algorithm for Structured Output Ranking

A Fast and Simple Algorithm for Producing Candidate Regions

Learning to Rank Using High-Order Information

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Bengio, S., Weston, J., Grangier, D.: Label embedding trees for large multi-class tasks. In: NIPS (2010)
Google Scholar
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes (VOC) challenge. IJCV (2010)
Google Scholar
Tsochantaridis, I., Hofmann, T., Joachims, T., Altun, Y.: Support vector machine learning for interdependent and structured output spaces. In: Proc. ICML (2004)
Google Scholar
Joachims, T., Finley, T., Yu, C.N.J.: Cutting-plane training of structural SVMs. Machine Learning (2009)
Google Scholar
Li, Y., Huttenlocher, D.P.: Learning for stereo vision using the structured support vector machine. In: Proc. CVPR (2008)
Google Scholar
Blaschko, M.B., Lampert, C.H.: Learning to Localize Objects with Structured Output Regression. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part I. LNCS, vol. 5302, pp. 2–15. Springer, Heidelberg (2008)
Chapter Google Scholar
Szummer, M., Kohli, P., Hoiem, D.: Learning CRFs Using Graph Cuts. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 582–595. Springer, Heidelberg (2008)
Chapter Google Scholar
Blaschko, M.B., Vedaldi, A., Zisserman, A.: Simultaneous object detection and ranking with weak supervision. In: NIPS (2010)
Google Scholar
Rahtu, E., Kannala, J., Blaschko, M.B.: Learning a category independent object detection cascade. In: Proc. ICCV (2011)
Google Scholar
Zhang, Z., Warrell, J., Torr, P.H.S.: Proposal generation for object detection using cascaded ranking SVMs. In: Proc. CVPR (2011)
Google Scholar
Huang, J.C., Frey, B.J.: Structured ranking learning using cumulative distribution networks. In: NIPS (2008)
Google Scholar
Herbrich, R., Graepel, T., Obermayer, K.: Large margin rank boundaries for ordinal regression. In: Advances in Large Margin Classifiers (2000)
Google Scholar
Joachims, T.: Training linear SVMs in linear time. In: KDD (2006)
Google Scholar
Deng, J., Berg, A.C., Li, K., Fei-Fei, L.: What Does Classifying More Than 10,000 Image Categories Tell Us? In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part V. LNCS, vol. 6315, pp. 71–84. Springer, Heidelberg (2010)
Chapter Google Scholar
Binder, A., Müller, K.R., Kawanabe, M.: On taxonomies for multi-class image categorization. IJCV (2011)
Google Scholar
Cai, L., Hofmann, T.: Exploiting known taxonomies in learning overlapping concepts. In: IJCAI (2007)
Google Scholar
Imagenet: http://www.image-net.org/
Quattoni, A., Torralba, A.: Recognizing indoor scenes. In: Proc. CVPR (2009)
Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. IJCV (2004)
Google Scholar
Lazebnik, S., Schmid, C., Ponce, J.: Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories. In: Proc. CVPR (2006)
Google Scholar
Wang, J., Yang, J., Yu, K., Lv, F., Huang, T., Gong, Y.: Locality-constrained linear coding for image classification. In: Proc. CVPR (2010)
Google Scholar
Eichner, M., Ferrari, V.: Better appearance models for pictorial structures. In: Proc. BMVC (2009)
Google Scholar
Yang, Y., Ramanan, D.: Articulated pose estimation with flexible mixtures-of-parts. In: Proc. CVPR (2011)
Google Scholar
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL Visual Object Classes Challenge 2011, VOC 2011 (2011), http://www.pascal-network.org/challenges/VOC/voc2011/
Marin-Jimenez, M., Zisserman, A., Ferrari, V.: Heres looking at you, kid. detecting people looking at each other in videos. In: Proc. BMVC (2011)
Google Scholar
Mittal, A., Zisserman, A., Torr, P.H.S.: Hand detection using multiple proposals. In: Proc. BMVC (2011)
Google Scholar
VOC2010-Results: http://pascallin.ecs.soton.ac.uk/challenges/voc/voc2010/results/
Li, F., Carreira, J., Sminchisescu, C.: Object recognition as ranking holistic figure-ground hypotheses. In: Proc. CVPR (2010)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Engineering Science, University of Oxford, UK
Arpit Mittal & Andrew Zisserman
Center for Visual Computing, École Centrale Paris, France
Matthew B. Blaschko
Department of Computing, Oxford Brookes University, UK
Philip H. S. Torr

Authors

Arpit Mittal
View author publications
You can also search for this author in PubMed Google Scholar
Matthew B. Blaschko
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Zisserman
View author publications
You can also search for this author in PubMed Google Scholar
Philip H. S. Torr
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd, CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Mittal, A., Blaschko, M.B., Zisserman, A., Torr, P.H.S. (2012). Taxonomic Multi-class Prediction and Person Layout Using Efficient Structured Ranking. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7573. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33709-3_18

Download citation

DOI: https://doi.org/10.1007/978-3-642-33709-3_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33708-6
Online ISBN: 978-3-642-33709-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Taxonomic Multi-class Prediction and Person Layout Using Efficient Structured Ranking

Abstract

Chapter PDF

Similar content being viewed by others

An $$\mathcal {O}(n \log n)$$ Cutting Plane Algorithm for Structured Output Ranking

A Fast and Simple Algorithm for Producing Candidate Regions

Learning to Rank Using High-Order Information

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Taxonomic Multi-class Prediction and Person Layout Using Efficient Structured Ranking

Abstract

Chapter PDF

Similar content being viewed by others

An $$\mathcal {O}(n \log n)$$ Cutting Plane Algorithm for Structured Output Ranking

A Fast and Simple Algorithm for Producing Candidate Regions

Learning to Rank Using High-Order Information

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation