Abstract
This paper presents a new technique for the perception and recognition of activities using statistical descriptions of their spatiotemporal properties. A set of motion energy receptive fields is designed in order to sample the power spectrum of a moving texture. Their structure relates to the spatio-temporal energy models of Adelson and Bergen where measures of local visual motion information are extracted by comparing the outputs of a triad of Gabor energy filters. Then the probability density function required for Bayes rule is estimated for each class of activity by computing multi-dimensional histograms from the outputs from the set of receptive fields. The perception of activities is achieved according to Bayes rule. The result at each instant of time is the map of the conditional probabilities that each pixel belongs to each one of the activities of the training set. Since activities are perceived over a short integration time, a temporal analysis of outputs is done using Hidden Markov Models.
The approach is validated with experiments in the perception and recognition of activities of people walking in visual surveillance scenari. The presented work is in progress and preliminary results are encouraging, since recognition is robust to variations in illumination conditions, to partial occlusions and to changes in texture. It is shown that it constitute a powerful early vision tool for human behaviors analysis for smart-environnements.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
E.H. Adelson and J.R. Bergen. Spatio-temporal energy models for the perception of motion. Optical Society of America, 2(2):284–299, 1985.
E.H. Adelson and J.R. Bergen. Computational Models of Visual Processing, chapter The Plenoptic function and the elements of early vision. M. Landy and J.A. Movshons, Cambridge, 1991. MIT Press.
C. Biernacki, G. Celeux, and G. Govaert. Assessing a mixture model for clustering eith integrated classification likelihood. Technical report, October 1998.
A. Bobick and J. Davis. An appearence based representation of action. In Proc. Int. Conference on Pattern Recognition, pages 307–312, 1996.
A.M. Baumberg and D.C. Hogg. Learning spatiotemporal models from training examples. Technical Report 95.9, School of Computer Studies, Division of Artificial Intelligence, University of Leeds, March 1995.
M.J. Black and A.D. Jepson. Eigentracking: Robust matching and tracking of articulated objects using a view-based representation. In European Conference on Computer Vision, pages 329–342, 1996.
V. Colin de Verdiére and J.L. Crowley. Visual recognition using local appearance. In European Conference on Computer Vision, pages 640–654, 1998.
T.F. Cootes, C.J. Taylor, A. Lanitis, D.H. Cooper, and J. Graham. Building and using flexible models incorporating gray-level information. In International Conference on Computer Vision, pages 242–246, May 1993.
D.J. Heeger. Optical flow using spatio-temporal filters. International Journal of Computer Vision, pages 279–302, 1988.
J.J. Koenderink and A.J. van Doorn. Generic neighborhood operators. Pattern Analysis and Machine Intelligence, 14(6):597–605, june 1992.
T. Lindeberg. Feature detection with automatic scale selection. International Journal of Computer Vision, 30(2):79–116, 1998.
H. Murase and S.K. Nayar. Visual learning and recognition of 3-d objects from appearance. International Journal of Computer Vision, 14:5–24, 1995.
B. Schiele. Object Recognition Using Multidimentional Receptive Field Histograms. PhD thesis, Institut National Polytechnique de Grenoble, july 1997.
A. Spinei, D. Pellerin, and J. Herault. Spatio-temporal energy-based me-thod for velocity estimation. Signal Processing, 65:347–362, 1998.
C. Wren, A. Azarbayejani, T. Darrel, and A. Pentland. Pfinder:Real-time tracking of the human body. In International Conference on Automatic face and Gesture Recognition, pages 51–56, 1996.
Y. Yacoob and M.J. Black. Parameterized modeling and recognition of activities. In International Conference on Computer Vision, 1998.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2000 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chomat, O., Martin, J., Crowley, J.L. (2000). A Probabilistic Sensor for the Perception and the Recognition of Activities. In: Computer Vision - ECCV 2000. ECCV 2000. Lecture Notes in Computer Science, vol 1842. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45054-8_32
Download citation
DOI: https://doi.org/10.1007/3-540-45054-8_32
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-67685-0
Online ISBN: 978-3-540-45054-2
eBook Packages: Springer Book Archive