Abstract
This paper studies the prediction of head pose from still images, and summarizes the outcome of a recently organized competition, where the task was to predict the yaw and pitch angles of an image dataset with 2790 samples with known angles. The competition received 292 entries from 52 participants, the best ones clearly exceeding the state-of-the-art accuracy. In this paper, we present the key methodologies behind selected top methods, summarize their prediction accuracy and compare with the current state of the art.
Chapter PDF
Similar content being viewed by others
References
Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines. ACM Trans. on Intell. Syst. and Technology 2(3), 27 (2011)
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: IEEE Conf. Comp. Vis. and Patt. Recogn., vol. 1, pp. 886–893 (2005)
Fanelli, G., Gall, J., Van Gool, L.: Real time head pose estimation with random regression forests. In: IEEE Conf. Comp. Vis. and Patt. Recogn., pp. 617–624 (2011)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. Pattern Anal. Machine Intell. 32(9), 1627–1645 (2010)
Foytik, J., Asari, V.K.: A two-layer framework for piecewise linear manifold-based head pose estimation. Int. J. of computer vision, 270–287 (2013)
Geng, X., Xia, Y.: Head pose estimation based on multivariate label distribution. In: IEEE Conf. Comp. Vis. and Patt. Recogn., pp. 1837–1842, June 2014
Goodfellow, I.J., Warde-Farley, D., Lamblin, P., Dumoulin, V., Mirza, M., Pascanu, R., Bergstra, J., Bastien, F., Bengio, Y.: Pylearn2: a machine learning research library. arXiv preprint arXiv:1308.4214 (2013)
Gourier, N., Hall, D., Crowley, J.L.: Estimating face orientation from robust detection of salient facial structures. In: FG Net Workshop on Visual Observation of Deictic Gestures. FGnet (IST-2000-26434), Cambridge, UK, pp. 1–9 (2004)
Guo, G., Fu, Y., Dyer, C., Huang, T.: Head pose estimation: Classification or regression? In: Int. Conf. Pattern Recognition, pp. 1–4, December 2008
Hara, K., Chellappa, R.: Growing regression forests by classification: applications to object pose estimation. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014, Part II. LNCS, vol. 8690, pp. 552–567. Springer, Heidelberg (2014)
Huang, C., Ding, X., Fang, C.: Head pose estimation based on random forests for multiclass classification. In: Int. Conf. Pattern Recognition, pp. 934–937 (2010)
Huang, G.B., Ramesh, M., Berg, T., Learned-Miller, E.: Labeled faces in the wild: A database for studying face recognition in unconstrained environments. Tech. Rep. 07–49, University of Massachusetts, Amherst, October 2007
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Adv. in Neural Inf. Proc. Syst., pp. 1097–1105 (2012)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Murphy-Chutorian, E., Trivedi, M.M.: Head pose estimation in computer vision: A survey. IEEE Trans. Pattern Anal. Machine Intell., 607–626 (2009)
Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., Duchesnay, E.: Scikit-learn: Machine learning in Python. J. Machine Learning Res. 12, 2825–2830 (2011)
Pelaez, C., Garcia, G., de la Escalera, A., Armingol, J.: Driver monitoring based on low-cost 3-d sensors. IEEE Trans. on Intelligent Transportation Syst. 15(4), 1855–1860 (2014)
Scholkopf, B., Smola, A.J.: Learning with kernels: support vector machines, regularization, optimization, and beyond. MIT press (2001)
Schulter, S., Leistner, C., Wohlhart, P., Roth, P.M., Bischof, H.: Alternating regression forests for object detection and pose estimation. In: IEEE Conf. Computer Vision, pp. 417–424 (2013)
Taigman, Y., Yang, M., Ranzato, M., Wolf, L.: Deepface: closing the gap to human-level performance in face verification. In: IEEE Int. Conf. Computer Vision, pp. 1701–1708 (2014)
Tulyakov, S., Vieriu, R.L., Semeniuta, S., Sebe, N.: Robust real-time extreme head pose estimation. In: Int. Conf. Pattern Recogn., pp. 2263–2268, Aug 2014
Wolf, L., Hassner, T., Maoz, I.: Face recognition in unconstrained videos with matched background similarity. In: IEEE Int. Conf. Computer Vision, pp. 529–534 (2011)
Wolpert, D.H.: Stacked generalization. Neural Networks 5(2), 241–259 (1992)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Huttunen, H. et al. (2015). Computer Vision for Head Pose Estimation: Review of a Competition. In: Paulsen, R., Pedersen, K. (eds) Image Analysis. SCIA 2015. Lecture Notes in Computer Science(), vol 9127. Springer, Cham. https://doi.org/10.1007/978-3-319-19665-7_6
Download citation
DOI: https://doi.org/10.1007/978-3-319-19665-7_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19664-0
Online ISBN: 978-3-319-19665-7
eBook Packages: Computer ScienceComputer Science (R0)