Semantic Texton Forests

Chapter

pp 173–203
Cite this chapter

Access provided by Autonomous University of Puebla

Computer Vision

Semantic Texton Forests

Matthew Johnson⁴ &
Jamie Shotton⁵

Part of the book series: Studies in Computational Intelligence ((SCI,volume 285))

4072 Accesses
7 Citations

Abstract

The semantic texton forest is an efficient and powerful low-level feature which can be effectively employed in the semantic segmentation of images. As ensembles of decision trees that act directly on image pixels, semantic texton forests do not need the expensive computation of filter-bank responses or local descriptors. They are extremely fast to both train and test, especially compared with k-means clustering and nearest-neighbor assignment of feature descriptors. The nodes in the trees provide (i) an implicit hierarchical clustering into semantic textons, and (ii) an explicit local classification estimate. The bag of semantic textons combines a histogram of semantic textons over an image region with a region prior category distribution. The bag of semantic textons can be used by an SVM classifier to infer an image-level prior over categories, allowing the segmentation to emphasize those categories that the SVM believes to be present. We will examine the segmentation performance of semantic texton forests on two datasets including the VOC 2007 segmentation challenge.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

Similar content being viewed by others

Semantic Texton Forests for Image Categorization and Segmentation

Chapter © 2013

Decision Tree Fields: An Efficient Non-parametric Random Field Model for Image Labeling

Chapter © 2013

Extremely Randomized Trees and Random Subwindows for Image Classification, Annotation, and Retrieval

Chapter © 2013

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Amit, Y., Geman, D.: Shape quantization and recognition with randomized trees. Neural Computation 9(7), 1545–1588 (1997)
Article Google Scholar
Bishop, C.: Pattern Recognition and Machine Learning. Springer-Verlag New York, Inc. (2006)
Google Scholar
Bosch, A., Zisermann, A., Muñoz, X.: Image classification using random forests and ferns. In: Proceedings of the International Conference on Computer Vision (2007)
Google Scholar
Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
Article MATH Google Scholar
Breiman, L., Friedman, J., Olshen, R.: Classification and Regression Trees. Wadsworth, Belmont (1984)
MATH Google Scholar
Csurka, G., Dance, C., Fan, L., Willamowski, J., Bray, C.: Visual categorization with bags of keypoints. In: Proceedings of the International Workshop on Statistical Learning in Computer Vision, ECCV (2004)
Google Scholar
Elkan, C.: Using the triangle inequality to accelerate k-means. In: Proceedings of the International Conference on Machine Learning, pp. 147–153 (2003)
Google Scholar
Everingham, M., Van Gool, L., Williams, C.K.I., Winn, J., Zisserman, A.: The PASCAL VOC Challenge (2007), http://www.pascal-network.org/challenges/VOC/voc2007/workshop/index.html
Fei-Fei, L., Perona, P.: A bayesian hierarchical model for learning natural scene categories. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2005)
Google Scholar
Geurts, P., Ernst, D., Wehenkel, L.: Extremely randomized trees. Machine Learning 36(1), 3–42 (2006)
Article Google Scholar
Grauman, K., Darrell, T.: The pyramid match kernel: Discriminative classification with sets of image features. In: Proceedings of the International Conference on Computer Vision (2005)
Google Scholar
Jain, A.K.: Fundamentals of Digital Image Processing. Prentice-Hall, New Jersey (1989)
MATH Google Scholar
Jurie, F., Triggs, B.: Creating efficient codebooks for visual recognition. In: Proceedings of the International Conference on Computer Vision, pp. 604–610 (2005)
Google Scholar
Lasserre, J., Kannan, A., Winn, J.: Hybrid learning of large jigsaws. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, Minneapolis (2007)
Google Scholar
Lepetit, V., Lagger, P., Fua, P.: Randomized trees for real-time keypoint recognition. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 775–781 (2005)
Google Scholar
Lowe, D.: Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision 60(2), 91–110 (2004)
Article Google Scholar
Malik, J., Belongie, S., Leung, T., Shi, J.: Contour and texture analysis for image segmentation. International Journal of Computer Vision 43(1), 7–27 (2001)
Article MATH Google Scholar
Marée, R., Geurts, P., Piater, J., Wehenkel, L.: Random subwindows for robust image classification. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 34–40 (2005)
Google Scholar
Mikolajczyk, K., Schmid, C.: Scale and affine invariant interest point detectors. International Journal of Computer Vision 60(1), 63–86 (2004)
Article Google Scholar
Moosmann, F., Triggs, B., Jurie, F.: Fast discriminative visual codebooks using randomized clustering forests. In: Proceedings of the International Conference on Neural Information Processing Systems (2006)
Google Scholar
Nistér, D., Stewénius, H.: Scalable recognition with a vocabulary tree. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2006)
Google Scholar
Nowak, E., Jurie, F., Triggs, B.: Sampling strategies for bag-of-features image classification. In: Proceedings of the International Conference on Computer Vision (2006)
Google Scholar
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42(3), 145–175 (2001)
Article MATH Google Scholar
Oliva, A., Torralba, A.: Building the gist of a scene: The role of global image features in recognition. Visual Perception, Progress in Brain Research 155(1), 23–26 (2006)
Google Scholar
Quelhas, P., Monay, F., Odobez, J.M., Gatica, D., Tuytelaars, T.: Modeling scenes with local descriptors and latent aspects. In: Proceedings of the International Conference on Computer Vision (2005)
Google Scholar
Rabinovich, A., Vedaldi, A., Galleguillos, C., Wiewiora, E., Belongie, S.: Objects in context. In: Proceedings of the International Conference on Computer Vision (2007)
Google Scholar
Russell, B., Torralba, A., Murphy, K., Freeman, W.T.: Labelme: a database and web-based tool for image annotation. Journal of Computer Vision 77(1-3), 157–173 (2008)
Article Google Scholar
Russell, B.C., Efros, A.A., Sivic, J., Freeman, W.T., Zisserman, A.: Using multiple segmentations to discover objects and their extent in image collections. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2006)
Google Scholar
Schindler, G., Brown, M., Szeliski, R.: City-scale location recognition. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, Minneapolis (2007)
Google Scholar
Shotton, J., Winn, J., Rother, C., Criminisi, A.: Textonboost for image understanding: Multi-class object recognition and segmentation by jointly modeling texture, layout, and context. International Journal of Computer Vision 81(1) (2009)
Google Scholar
Sivic, J., Russel, B., Efros, A., Zisserman, A., Freeman, W.: Discovering objects and their localization in images. In: Proceedings of the International Conference on Computer Vision, Beijing, China, pp. 370–377 (2005)
Google Scholar
Sivic, J., Zisserman, A.: Video Google: A text retrieval approach to object matching in videos. In: Proceedings of the International Conference on Computer Vision, vol. 2, pp. 1470–1477 (2003)
Google Scholar
Swain, M., Ballard, D.: Color indexing. Int. J. Computer Vision 7, 11–32 (1991)
Article Google Scholar
Tuytelaars, T., Schmid, C.: Vector quantizing feature space with a regular lattice. In: Proceedings of the International Conference on Computer Vision (2007)
Google Scholar
Varma, M., Zisserman, A.: A statistical approach to texture classification from single images. International Journal of Computer Vision 62(1-2), 61–81 (2005)
Article Google Scholar
Verbeek, J., Triggs, B.: Region classification with markov field aspect models. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Viola, P., Jones, M.: Rapid object detection using a boosted cascade of simple features. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition, pp. 511–518 (2001)
Google Scholar
Winder, S., Brown, M.: Learning local image descriptors. In: Proceedings of the International Conference on Computer Vision and Pattern Recognition (2007)
Google Scholar
Winn, J., Criminisi, A., Minka, T.: Object categorization by learned universal visual dictionary. In: Proceedings of the International Conference on Computer Vision, Beijing, China, pp. 1800–1807 (2005)
Google Scholar
Zhang, J., Marszałek, M., Lazebnik, S., Schmid, C.: Local features and kernels for classificaiton of texture and object categories: A comprehensive study. International Journal of Computer Vision 73(2), 213–238 (2007)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Nokia, San Francisco, USA
Matthew Johnson
Microsoft Research, Cambridge, UK
Jamie Shotton

Authors

Matthew Johnson
View author publications
You can also search for this author in PubMed Google Scholar
Jamie Shotton
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Engineering, University of Cambridge, CB2 1PZ, Cambridge, UK
Roberto Cipolla
Dipartimento di Matematica ed Informatica, University of Catania, Viale A. Doria 6, I, 95125, Catania, Italy
Sebastiano Battiato & Giovanni Maria Farinella &

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Johnson, M., Shotton, J. (2010). Semantic Texton Forests. In: Cipolla, R., Battiato, S., Farinella, G.M. (eds) Computer Vision. Studies in Computational Intelligence, vol 285. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12848-6_7

Download citation

DOI: https://doi.org/10.1007/978-3-642-12848-6_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12847-9
Online ISBN: 978-3-642-12848-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics