U3PT: A New Dataset for Unconstrained 3D Pose Tracking Evaluation

Tran, Ngoc-Trung; Ababsa, Fakhreddine; Charbit, Maurice

doi:10.1007/978-3-319-23192-1_54

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 9256)

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

Abstract

3D pose tracking using monocular cameras is an important topic that has received great attention over the last decades. It is useful in many domains, such as video surveillance, human-computer interfaces, and biometrics. The problem becomes much more challenging in the presence of, for example, fast motion, out-of-plane rotation, illumination changes, facial expressions, or occlusions. Some datasets for 3D pose tracking evaluation have been reported in the literature; however, all of them retain simple backgrounds, no expressions, slow motion, frontal rotation, or no occlusion. This is not enough to test advances in in-the-wild tracking. Indeed, collecting accurate 3D pose ground truth is difficult because special devices or sensors are required. In addition, the magnetic sensors usually used for 3D pose ground truth are uncomfortable to wear and move with because of their wires. In this paper, we propose a new recording system that allows people to move more comfortably. We create a new challenging dataset, named U3PT (Unconstrained 3D Pose Tracking). It can be considered a benchmark for evaluating and comparing the robustness and precision of state-of-the-art methods that aim to work in the wild. This paper also presents the performance of two well-known state-of-the-art methods, compared with our method, on face tracking when applied to this database. We have carried out several experiments and report the advantages as well as some limitations to be improved in the future.

Author information

Authors and Affiliations

LTCI-CNRS, Telecom ParisTECH, 37-39, Rue Dareau, 75014, Paris, France
Ngoc-Trung Tran & Maurice Charbit
IBISC, University of Evry, 40, Rue du Pelvoux, 91020, Evry, France
Ngoc-Trung Tran & Fakhreddine Ababsa

Corresponding author

Correspondence to Ngoc-Trung Tran.

Editor information

Editors and Affiliations

University of Malta, Msida, Malta
George Azzopardi
University of Groningen, Groningen, The Netherlands
Nicolai Petkov

About this paper

Cite this paper

Tran, NT., Ababsa, F., Charbit, M. (2015). U3PT: A New Dataset for Unconstrained 3D Pose Tracking Evaluation. In: Azzopardi, G., Petkov, N. (eds) Computer Analysis of Images and Patterns. CAIP 2015. Lecture Notes in Computer Science, vol 9256. Springer, Cham. https://doi.org/10.1007/978-3-319-23192-1_54

DOI: https://doi.org/10.1007/978-3-319-23192-1_54
Published: 25 August 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-23191-4
Online ISBN: 978-3-319-23192-1
eBook Packages: Computer Science, Computer Science (R0)
