Abstract
Occlusion and lack of visibility in dense crowded scenes make it very difficult to track individual people correctly and consistently. This problem is particularly hard to tackle in single camera systems. We present a multi-view approach to tracking people in crowded scenes where people may be partially or completely occluding each other. Our approach is to use multiple views in synergy so that information from all views is combined to detect objects. To achieve this we present a novel planar homography constraint to resolve occlusions and robustly determine locations on the ground plane corresponding to the feet of the people. To find tracks we obtain feet regions over a window of frames and stack them creating a space time volume. Feet regions belonging to the same person form contiguous spatio-temporal regions that are clustered using a graph cuts segmentation approach. Each cluster is the track of a person and a slice in time of this cluster gives the tracked location. Experimental results are shown in scenes of dense crowds where severe occlusions are quite common. The algorithm is able to accurately track people in all views maintaining correct correspondences across views. Our algorithm is ideally suited for conditions when occlusions between people would seriously hamper tracking performance or if there simply are not enough features to distinguish between different people.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Irani, M., Rousso, B., Peleg, S.: Computing Occluding and Transparent Motions. IJCV 12(1) (1994)
Gurdjos, P., Sturm, P.: Methods and Geometry for Plane-Based Self-Calibration. In: CVPR (2003)
Zhao, T., Nevatia, T.: Tracking Multiple Humans in Complex Situations. IEEE PAMI (2004)
Okuma, K., Taleghani, A., de Freitas, N., Little, J.J., Lowe, D.G.: A boosted particle filter: Multitarget detection and tracking. In: Pajdla, T., Matas, J(G.) (eds.) ECCV 2004. LNCS, vol. 3021, pp. 28–39. Springer, Heidelberg (2004)
Leibe, B., Seemann, E., Schiele, B.: Pedestrian Detection in Crowded Scenes. In: CVPR 2005 (2005)
McKenna, S.J., Jabri, S., Duric, Z., Rosenfeld, A., Wechsler, H.: Tracking Groups of People. In: CVIU 2000 (2000)
Rosales, R., Sclaroff, S.: 3D Trajectory Recovery for Tracking Multiple Objects and Trajectory Guided Recognition of Actions. In: CVPR 1999 (1999)
Sidenbladh, H., Black, M.J., Fleet, D.J.: Stochastic tracking of 3D human figures using 2D image motion. In: Vernon, D. (ed.) ECCV 2000. LNCS, vol. 1843, pp. 702–718. Springer, Heidelberg (2000)
Orwell, J., Massey, S., Remagnino, P., Greenhill, D., Jones, G.A.: A Multi-agent framework for visual surveillance. In: ICIP 1999 (1999)
Cai, Q., Aggarwal, J.K.: Automatic tracking of human motion in indoor scenes across multiple synchronized video streams. In: ICCV 1998 (1998)
Krumm, J., Harris, S., Meyers, B., Brumitt, B., Hale, M., Shafer, S.: Multi-camera multi-person tracking for easy living. In: IEEE International Workshop on Visual Surveillance (2000)
Mittal, A., Larry, S.D.: M2Tracker: A Multi-View Approach to Segmenting and Tracking People in a Cluttered Scene. IJCV (2002)
Laurentini, A.: The Visual Hull Concept for Silhouette Based Image Understanding. IEEE PAMI (1994)
Franco, J., Boyer, E.: Fusion of Multi-View Silhouette Cues Using a Space Occupancy Grid. In: ICCV 2005 (2005)
Cheung, K.M., Kanade, T., Bouguet, J.-Y., Holler, M.: A real time system for robust 3d voxel reconstruction of human motions. In: CVPR 2000 (2000)
Stauffer, C., Grimson, W.E.L.: Adaptive background mixture models for real-time tracking. In: CVPR 1999 (1999)
Gibson, J.J.: The Ecological Approach to Visual Perception. Houghton Mifflen, Boston (1979)
Marr, D.: Vision: A Computational Investigation into the Human Representation and Processing of Visual Information. W.H. Freeman, New York (1982)
Neisser, U.: Cognition and Reality: Principles and Implications of Cognitive Psychology. W.H. Freeman, San Francisco (1976)
Poore, A.B.: Multidimensional Assignments and Multitarget Tracking. In: Proc. Partitioning Data Sets; DIMACS Workshop (1995)
Reid, D.B.: An Algorithm for Tracking Multiple Targets. IEEE Trans. Automatic Control (1979)
Shi, J., Malik, J.: Normalized Cuts and Image Segmentation. IEEE PAMI (2000)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Khan, S.M., Shah, M. (2006). A Multiview Approach to Tracking People in Crowded Scenes Using a Planar Homography Constraint. In: Leonardis, A., Bischof, H., Pinz, A. (eds) Computer Vision – ECCV 2006. ECCV 2006. Lecture Notes in Computer Science, vol 3954. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11744085_11
Download citation
DOI: https://doi.org/10.1007/11744085_11
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33838-3
Online ISBN: 978-3-540-33839-0
eBook Packages: Computer ScienceComputer Science (R0)