3D-freehand-pose initialization based on operator’s cognitive behavioral models

Feng, Zhiquan; Zhang, Minming; Pan, Zhigeng; Yang, Bo; Xu, Tao; Tang, Haokui; Li, Yi

doi:10.1007/s00371-010-0452-z

3D-freehand-pose initialization based on operator’s cognitive behavioral models

Original Article
Published: 17 April 2010

Volume 26, pages 607–617, (2010)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

The Visual Computer Aims and scope Submit manuscript

3D-freehand-pose initialization based on operator’s cognitive behavioral models

Download PDF

Zhiquan Feng¹,
Minming Zhang²,
Zhigeng Pan²,
Bo Yang³,
Tao Xu³,
Haokui Tang³ &
…
Yi Li³

184 Accesses
15 Citations
Explore all metrics

Abstract

Tracking, recognition and interaction based on 3D freehand are a part of our virtual assembly system, in which monocular camera is used to input online freehand videos and the hand pose tracker requires a reliable initial pose in the first frame. A novel approach to initializing 3D pose and position of freehand is put forward in this paper visualization of 3D hand model and modeling the operators’ cognitive behaviors. Our approach is composed of three phases: hand posture recognition, coarse-tuning and fine-tuning. The operator moves his/her hand onto the to meet the needs of our virtual assembly system. The main contribution of this paper is that the three core techniques are for the first time integrated together, including human–computer interaction (HCI) in the process of initializing, projection of the 3D hand model in the period of coarse-tuning time. Then, the computer repeatedly fine-tunes the 3D hand model until the projection of the 3D hand model is completely superimposed onto the operator’s hand image. We focus on exploring and modeling cognitive behavior of operator’s hand upon which we design our initialization algorithm. Our research shows that cognitive behavioral models are not only beneficial to reducing cognitive loads for operators, because it makes the computers cater for the changes of the operators’ hand poses, but also helpful to address high dimensionality of articulated 3D hand model. Our experimental results also show that the approach presented in this paper is easier, more pleasurable and satisfactory experience for the operators. Our initialization system has successfully been applied to our 3D freehand tracking system and a simulation virtual assembly system.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Erol, A., et al.: Vision-based hand pose estimation: a review. Comput. Vis. Image Underst. 108, 52–73 (2007)
Article Google Scholar
Julier, S.J., Uhlmann, J.K.: A new extension of the Kalman filter to nonlinear systems. In: Procedure of AeroSense: The 11th International Symposium on Aerospace/Defence Sensing, Simulation and Controls, pp. 82–193. SPIE, Bellingham (1997)
Google Scholar
Cui, J.S.: Studies on three-dimensional model based posture estimation and tracking of articulated objects. PhD Thesis, Tsinghua University, Beijing, China (2004)
Erol, A., Bebis, G., Nicolescu, M., Boyle, R., Twombly, X.: A review on vision-based full DOF hand motion estimation. In: Proceedings of the IEEE Workshop on Vision for Human–Computer Interaction (V4HCI), San Diego, California, vol. 3, pp. 75–83 (2005)
Triesch, J., von der Malsurg, C.: A system for person-independent hand posture recognition against complex background. IEEE Trans. Pattern Anal. Mach. Intell. 23(12), 84–95 (2001)
Article Google Scholar
Shimada, N., Kimura, K., Shirai, Y.: Real-time 3-D hand posture estimation based on 2-D appearance retrieval using monocular camera. In: Proc. Int. Workshop RATFG-RTS. pp. 23–30 (2001)
Rosales, R.: The specialized mappings architecture with applications to vision-based estimation of articulated body pose. PhD Thesis. Boston University, Boston (2002)
Athitsos, V., Sclaroff, S.: Estimating 3D hand pose from a cluttered image. Proc. IEEE Int. Conf. Comput. Vis. Pattern Recogn. 2, 432–439 (2003)
Google Scholar
Stenger, B., Thayananthan, A., Torr, P.H.S., Cipolla, R.: Filtering using a tree-based estimator. Proc. IEEE Int. Conf. Comput. Vis. 2, 1063–1070 (2003)
Article Google Scholar
Tomasi, C., Petrov, S., Sastry, A.: 3D tracking = classification + interpolation. In: The Ninth IEEE International Conference on Computer Vision, pp. 1441–1448 (2003)
Bray, M., Koller-Meier, E., Goo, L.V.: Smart particle filtering for 3D hand tracking. In: The Sixth IEEE International Conference on Automatic Face and Pose Recognition, pp. 675–680. IEEE Computer Society, Los Alamitos (2004)
Chapter Google Scholar
LaGorce, M., Paragios, N., Fleet, D.: Model based hand tracking with texture, shading and self-occlusions. IEEE Proc. IEEE Conf. Comput. Vis. Pattern Recogn. pp. 1–8 (2008)
John, B.E.: Cognitive modeling in human–computer interaction. In: Proceedings of Graphics Interface, pp. 161–167 (1998)
Card, S.K., Moran, T.P., Ewell, A.: The Psychology of Human–Computer Interaction. Lawrence Erlbaum Associates Inc., Hillsdale (1993)
Google Scholar
John, B.E., Kieras, D.E.: Using GOMS for operator interface design and evaluation: which technique? ACM Trans. Comput. Hum. Interact. 3(4), 287–319 (1996)
Article Google Scholar
Anderson, J.R., et al.: Cognitive tutors: lessons learned. J. Learn. Sci. 4(2), 167–207 (1995)
Article Google Scholar
Ritter, F.E., Baxter, G.D., Jones, G., Young, R.M.: Supporting cognitive models as operators. ACM Trans. Hum. Comput. Int. 7(2), 141–173 (2000)
Article Google Scholar
Tang, Z.: Visualization of 3D Datasets [M]. Singhua University Press, Beijing (1999)
Google Scholar
Cleveland, W.S.: Visualizing data [M]. Hobart Press, New Jersey (1993)
Google Scholar
Jaimes, A., Sebe, N.: Multimodal human–computer interaction: a survey. Comput. Vis. Image Underst. 108(1–2), 116–134 (2007)
Article Google Scholar
Karray, F., Alemzadeh, M., Saleh, J.A., Arab, M.N.: Human–computer interaction: overview on state of the art. Int. J. Smart Intell. Syst. 1(1), 137–159 (2008)
Google Scholar
Chapanis, A.: Man Machine Engineering. Wadsworth, Belmont (1965)
Google Scholar
Norman, D.: Cognitive engineering. In: Norman, D., Draper, S. (eds.) Operator Centered Design: New Perspective on Human–Computer Interaction. Lawrence Erlbaum, Hillsdale (1986)
Google Scholar
Picard, R.W.: Affective Computing. MIT Press, Cambridge (1997)
Google Scholar
Te’eni, D., Carey, J., Zhang, P.: Human Computer Interaction: Developing Effective Organizational Information Systems. Wiley, Hoboken (2007)
Google Scholar
John, B.E., Kieras, D.E.: The GOMS family of operator interface analysis techniques: comparison and contrast. ACM Trans. Comput. Hum. Interact., 320–351 (1996)
Gordon, N., Salmond, D.J., Smith, A.F.M.: Novel approach to nonlinear and non-Gaussian Bayesian state estimation. IEE Proc. F 140, 107–113 (1993)
Google Scholar
Feng el al, Z.: Research on features extraction from frame image sequences. In: International Symposium on Computer Science and Computational Technology (ISCSCT’2008), pp. 762–766 (2008)
Stefan, A., Athitsos, V., Alon, J., Sclaroff, S.: Translation and scale-invariant gesture recognition in complex scenes. In: Proceedings of the 1st International Conference on PErvasive Technologies Related to Assistive Environments, Athens, Greece, pp. 1–8 (2008)

Download references

Author information

Authors and Affiliations

School of Information Science and Engineering, University of Jinan, Shandong, 250022, P.R. China
Zhiquan Feng
State Key Laboratory of CAD&CG, Zhejiang University, Hangzhou, 310058, P.R. China
Minming Zhang & Zhigeng Pan
Provincial Key Laboratory for Network Based Intelligent Computing, University of Jinan, Shandong, 250022, P.R. China
Bo Yang, Tao Xu, Haokui Tang & Yi Li

Authors

Zhiquan Feng
View author publications
You can also search for this author in PubMed Google Scholar
Minming Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Zhigeng Pan
View author publications
You can also search for this author in PubMed Google Scholar
Bo Yang
View author publications
You can also search for this author in PubMed Google Scholar
Tao Xu
View author publications
You can also search for this author in PubMed Google Scholar
Haokui Tang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Zhiquan Feng.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Feng, Z., Zhang, M., Pan, Z. et al. 3D-freehand-pose initialization based on operator’s cognitive behavioral models. Vis Comput 26, 607–617 (2010). https://doi.org/10.1007/s00371-010-0452-z

Download citation

Published: 17 April 2010
Issue Date: June 2010
DOI: https://doi.org/10.1007/s00371-010-0452-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

3D-freehand-pose initialization based on operator’s cognitive behavioral models

Abstract

Article PDF

Similar content being viewed by others

Research on Initialization of 3D Hand Pose Based on User and Computer Interaction

Structure-Aware 3D Hand Pose Regression from a Single Depth Image

HRI: human reasoning inspired hand pose estimation with shape memory update and contact-guided refinement

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

3D-freehand-pose initialization based on operator’s cognitive behavioral models

Abstract

Article PDF

Similar content being viewed by others

Research on Initialization of 3D Hand Pose Based on User and Computer Interaction

Structure-Aware 3D Hand Pose Regression from a Single Depth Image

HRI: human reasoning inspired hand pose estimation with shape memory update and contact-guided refinement

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation