Abstract
This paper presents an algorithm to perform pedestrian pose estimation using a stereo vision system in the Advanced Driver Assistance Systems (ADAS) context. The proposed approach isolates the pedestrian point cloud and extracts the pedestrian pose using a visibility based pedestrian 3D model. The model accurately predicts possible self occlusions and uses them as an integrated part of the detection. The algorithm creates multiple pose hypotheses that are scored and sorted using a scheme reminiscent of the Monte Carlo techniques. The technique performs a hierarchical search of the body pose from the head position to the lower limbs. In the context of road safety, it is important that the algorithm is able to perceive the pedestrian pose as quickly as possible to potentially avoid dangerous situations, the pedestrian pose will allow to better predict the pedestrian intentions. To this end, a single pedestrian model is used to detect all pertinent poses and the algorithm is able to extract the pedestrian pose based on a single stereo depth point cloud and minimal orientation information. The algorithm was tested against data captured with an industry standard motion capture system. Accurate results were obtained, the algorithm is able to correctly estimate the pedestrian pose with acceptable accuracy. The use of stereo setup allows the algorithm to be used in many varied contexts ranging from the proposed ADAS context to surveillance or even human-computer interaction.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Schmidt, S., Färber, B.: Pedestrians at the kerb recognising the action intentions of humans. Transportation Research Part F: Traffic Psychology and Behaviour 12(4), 300–310 (2009)
Shotton, J., Sharp, T., Kipman, A., Fitzgibbon, A., Finocchio, M., Blake, A., Cook, M., Moore, R.: Real-time human pose recognition in parts from single depth images. Commun. ACM 56(1), 116–124 (2013)
Poppe, R.: Vision-based human motion analysis: An overview. Computer Vision and Image Understanding 108(1–2), 4–18 (2007)
Geronimo, D., Lopez, A., Sappa, A., Graf, T.: Survey of pedestrian detection for advanced driver assistance systems. IEEE Trans. Pattern Anal. Machine Intell. 32(7), 1239–1258 (2010)
Andriluka, M., Roth, S., Schiele, B.: Pictorial structures revisited: people detection and articulated pose estimation. In: IEEE Conference on Computer Vision and Pattern Recognition, (CVPR), pp. 1014–1021 (2009)
Andriluka, M., Roth, S., Schiele, B.: Monocular 3D pose estimation and tracking by detection. In: IEEE Conference on Computer Vision and Pattern Recognition, (CVPR), pp. 623–630 (2010)
Agarwal, A., Triggs, B.: Recovering 3D human pose from monocular images. IEEE Trans. Pattern Anal. Machine Intell. 28(1), 44–58 (2006)
Hofmann, M., Gavrila, D.M.: Multi-view 3D human pose estimation in complex environment. International Journal of Computer Vision 96(1), 103–124 (2012)
Plagemann, C., Ganapathi, V., Koller, D., Thrun, S.: Real-time identification and localization of body parts from depth images. In: IEEE International Conference on Robotics and Automation, (ICRA), pp. 3108–3113 (2010)
Urtasun, R., Fua, P.: 3D human body tracking using deterministic temporal motion models. In: Pajdla, T., Matas, J.G. (eds.) Computer Vision, (ECCV). LNCS, vol. 3023, pp. 92–106. Springer, Heidelberg (2004)
Yang, H.D., Lee, S.W.: Reconstruction of 3D human body pose from stereo image sequences based on top-down learning. Pattern Recognition 40(11), 3120–3131 (2007)
Ziegler, J., Nickel, K., Stiefelhagen, R.: Tracking of the articulated upper body on multi-view stereo image sequences. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, (CVPR), vol. 1, pp. 774–781 (2006)
Muhlbauer, Q., Kuhnlenz, K., Buss, M.: A model-based algorithm to estimate body poses using stereo vision. In: IEEE International Symposium on Robot and Human Interactive Communication, (RO-MAN), pp. 285–290 (2008)
Pellegrini, S., Iocchi, L.: Human posture tracking and classification through stereo vision and 3D model matching. J. Image Video Process. 2008, 7:1–7:12, January 2008
Keller, C.G., Hermes, C., Gavrila, D.M.: Will the pedestrian cross? probabilistic path prediction based on learned motion features. In: Mester, R., Felsberg, M. (eds.) Pattern Recognition. LNCS, vol. 6835, pp. 386–395. Springer, Heidelberg (2011)
Kohler, S., Goldhammer, M., Bauer, S., Doll, K., Brunsmann, U., Dietmayer, K.: Early detection of the pedestrian’s intention to cross the street. In: IEEE Conference on Intelligent Transportation Systems, ITSC, pp. 1759–1764 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Almeida, J., Santos, V. (2016). Pedestrian Pose Estimation Using Stereo Perception. In: Reis, L., Moreira, A., Lima, P., Montano, L., Muñoz-Martinez, V. (eds) Robot 2015: Second Iberian Robotics Conference. Advances in Intelligent Systems and Computing, vol 417. Springer, Cham. https://doi.org/10.1007/978-3-319-27146-0_38
Download citation
DOI: https://doi.org/10.1007/978-3-319-27146-0_38
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-27145-3
Online ISBN: 978-3-319-27146-0
eBook Packages: Computer ScienceComputer Science (R0)