Learning and Incorporating Top-Down Cues in Image Segmentation

He, Xuming; Zemel, Richard S.; Ray, Debajyoti

doi:10.1007/11744023_27

Xuming He¹⁹,
Richard S. Zemel¹⁹ &
Debajyoti Ray¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 3951))

Included in the following conference series:

European Conference on Computer Vision

15k Accesses
58 Citations

Abstract

Bottom-up approaches, which rely mainly on continuity principles, are often insufficient to form accurate segments in natural images. In order to improve performance, recent methods have begun to incorporate top-down cues, or object information, into segmentation. In this paper, we propose an approach to utilizing category-based information in segmentation, through a formulation as an image labelling problem. Our approach exploits bottom-up image cues to create an over-segmented representation of an image. The segments are then merged by assigning labels that correspond to the object category. The model is trained on a database of images, and is designed to be modular: it learns a number of image contexts, which simplify training and extend the range of object classes and image database size that the system can handle. The learning method estimates model parameters by maximizing a lower bound of the data likelihood. We examine performance on three real-world image databases, and compare our system to a standard classifier and other conditional random field approaches, as well as a bottom-up segmentation method.

Download to read the full chapter text

Chapter PDF

Simple-to-Complex Discriminative Clustering for Hierarchical Image Segmentation

Closed-Form Approximate CRF Training for Scalable Image Segmentation

Assessing Hierarchies by Their Consistent Segmentations

Article Open access 18 March 2024

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Peterson, M., Gibson, B.: Shape recognition contributions to figure-ground organization in three-dimensional displays. Cognitive Psychology 25, 383–429 (1993)
Article Google Scholar
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. 18th ICML (2001)
Google Scholar
Kumar, S., Hebert, M.: Discriminative random fields: A discriminative framework for contextual interaction in classification. In: ICCV (2003)
Google Scholar
Ren, X., Malik, J.: Learning a classification model for segmentation. In: ICCV (2003)
Google Scholar
Liu, L., Sclaroff, S.: Region segmentation via deformable model-guided split and merge. In: ICCV (2001)
Google Scholar
Borenstein, E., Sharon, E., Ullman, S.: Combining top-down and bottom-up segmentation. In: Proceedings IEEE Workshop of Perceptual Organization in Computer Vision (2004)
Google Scholar
Yu, S., Shi, J.: Object-specific figure-ground segregation. In: CVPR (2003)
Google Scholar
Tu, Z., Chen, X., Yuille, A., Zhu, S.C.: Image parsing: Unifying segmentation, detection, and object recognition. International Journal of Computer Vision 63, 113–140 (2005)
Article Google Scholar
Murphy, K., Torralba, A., Freeman, W.: Using the forest to see the trees: A graphical model relating features, objects and scenes. In: NIPS-04 (2004)
Google Scholar
Carbonetto, P., de Freitas, N., Barnard, K.: A statistical model for general contextual object recognition. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 350–362. Springer, Heidelberg (2004)
Chapter Google Scholar
He, X., Zemel, R., Carreira-Perpinan, M.: Multiscale conditional random fields for image labelling. In: CVPR (2004)
Google Scholar
Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Trans. PAMI 22, 888–905 (2000)
Article Google Scholar
Torralba, A., Oliva, A.: Statistics of natural image categories. Network: Computation in neural systems 14, 391–412 (2003)
Article Google Scholar
Jacobs, R.A., Jordan, M.I., Nowlan, S., Hinton, G.E.: Adaptive mixtures of local experts. Neural Computation 3, 1–12 (1991)
Article Google Scholar
Martin, D., Fowlkes, C., Malik, J.: Learning to detect natural image boundaries using local brightness, color and texture cues. IEEE Trans. PAMI. 26, 530–549 (2003)
Article Google Scholar
Hinton, G.E.: Training products of experts by minimizing contrastive divergence. Neural Computation 14, 1771–1800 (2002)
Article MATH Google Scholar
Russell, B., Torralba, A., Murphy, K., Freeman, W.: LabelMe: A database and web-based tool for image annotation (2005)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, University of Toronto, Canada
Xuming He, Richard S. Zemel & Debajyoti Ray

Authors

Xuming He
View author publications
You can also search for this author in PubMed Google Scholar
Richard S. Zemel
View author publications
You can also search for this author in PubMed Google Scholar
Debajyoti Ray
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

University of Ljubljana, Ljubljana, Slovenia
Aleš Leonardis
Institute for Computer Graphics and Vision, TU Graz, Inffeldgasse 16, 8010, Graz, Austria
Horst Bischof
Vision-based Measurement Group, Inst. of El. Measurement and Meas. Sign. Proc. Graz, University of Technology, Austria
Axel Pinz

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

He, X., Zemel, R.S., Ray, D. (2006). Learning and Incorporating Top-Down Cues in Image Segmentation. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3951. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744023_27

Download citation

DOI: https://doi.org/10.1007/11744023_27
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33832-1
Online ISBN: 978-3-540-33833-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Learning and Incorporating Top-Down Cues in Image Segmentation

Abstract

Chapter PDF

Similar content being viewed by others

Simple-to-Complex Discriminative Clustering for Hierarchical Image Segmentation

Closed-Form Approximate CRF Training for Scalable Image Segmentation

Assessing Hierarchies by Their Consistent Segmentations

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Learning and Incorporating Top-Down Cues in Image Segmentation

Abstract

Chapter PDF

Similar content being viewed by others

Simple-to-Complex Discriminative Clustering for Hierarchical Image Segmentation

Closed-Form Approximate CRF Training for Scalable Image Segmentation

Assessing Hierarchies by Their Consistent Segmentations

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation