Abstract
Tracking people in a dense crowd is a challenging problem for a single camera tracker due to occlusions and extensive motion that make human segmentation difficult. In this paper we suggest a method for simultaneously tracking all the people in a densely crowded scene using a set of cameras with overlapping fields of view. To overcome occlusions, the cameras are placed at a high elevation and only people’s heads are tracked. Head detection is still difficult since each foreground region may consist of multiple subjects. By combining data from several views, height information is extracted and used for head segmentation. The head tops, which are regarded as 2D patches at various heights, are detected by applying intensity correlation to aligned frames from the different cameras. The detected head tops are then tracked using common assumptions on motion direction and velocity. The method was tested on sequences in indoor and outdoor environments under challenging illumination conditions. It was successful in tracking up to 21 people walking in a small area (2.5 people per m2), in spite of severe and persistent occlusions.
Article PDF
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
References
Ali, S., & Shah, M. (2008). Floor fields for tracking in high density crowd scenes. In Proceedings of the European conference on computer vision (ECCV) (pp. 1–14).
Arsic, D., Hristov, E., Lehment, N., Hornler, B., Schuller, B., & Rigoll, G. (2008). Applying multi layer homography for multi camera person tracking. In International conference on distributed smart cameras.
Brostow, G. J., & Cipolla, R. (2006). Unsupervised Bayesian detection of independent motion in crowds. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR) (pp. 594–601).
Cai, Q., & Aggarwal, J. K. (1999). Tracking human motion in structured environments using a distributed-camera system. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21(11), 1241–1247.
Cheung, G. K. M., Kanade, T., Bouguet, J. Y., & Holler, M. (2000). A real time system for robust 3D voxel reconstruction of human motions. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR) (pp. 714–720).
Du, W., & Piater, J. H. (2007). Multi-camera people tracking by collaborative particle filters and principal axis-based integration. In Proceedings of the Asian conference on computer vision (ACCV) (pp. 365–374).
Eshel, R., & Moses, Y. (2008). Homography based multiple camera detection and tracking of people in a dense crowd. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).
Faugeras, O. D. (1993). Three-dimensional computer vision. Boston: MIT Press.
Felzenszwalb, P. F. (2001). Learning models for object recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR) (pp. 56–62).
Fleuret, F., Berclaz, J., Lengagne, R., & Fua, P. (2007). Multi-camera people tracking with a probabilistic occupancy map. IEEE transactions on pattern analysis and machine intelligence (PAMI).
Franco, J. S., & Boyer, E. (2005). Fusion of multi-view silhouette cues using a space occupancy grid. Proceedings of the International Conference on Computer Vision, 2, 1747–1753.
Garibotto, G., & Cibei, C. (2005). 3D scene analysis by real-time stereovision. In Proceedings of the international conference on image processing (ICIP) (pp. 105–108).
Gavrila, D. M., & Philomin, V. (1999). Real-time object detection for smart vehicles. In Proceedings of the international conference on computer vision (ICCV) (pp. 87–93).
Goldschmidt, R., & Moses, Y. (2008). Practical calibration and synchronization in a wide baseline multi-camera setup using blinking LEDs. Technical report. Interdisciplinary Center Herzliya ftp://ftp.idc.ac.il/Pub/Users/cs/yael/TR-2008/IDC-CS-TR-200801.
Hartley, R., & Zisserman, A. (2000). Multiple view geometry in computer vision. Cambridge: Cambridge University Press.
Isard, M., & MacCormick, J. (2001). BraMBLe: a Bayesian multiple-blob tracker. In Proceedings of the international conference on computer vision (ICCV) (pp. 34–41).
Kettnaker, V., & Zabih, R. (1999). Bayesian multi-camera surveillance. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR) (pp. 253–259).
Khan, S. M., & Shah, M. (2006). A multiview approach to tracking people in crowded scenes using a planar homography constraint. In Proceedings of the European conference on computer vision (ECCV) (pp. IV: 133–146).
Khan, S. M., Yan, P., & Shah, M. (2007). A homographic framework for the fusion of multi-view silhouettes. In Proceedings of the international conference on computer vision (ICCV).
Kim, K., & Davis, L. S. (2006). Multi-camera tracking and segmentation of occluded people on ground plane using search-guided particle filtering. In Proceedings of the European conference on computer vision (ECCV) (pp. 98–109).
Kobayashi, Y., Sugimura, D., Hirasawa, K., Suzuki, N., Kage, H., Sato, Y., & Sugimoto, A. (2006). 3D head tracking using the particle filter with cascaded classifiers. In Proceedings of the British machine vision conference (BMVC) (pp. I:37).
Krumm, J., Harris, S., Meyers, B., Brumitt, B., Hale, M., & Shafer, S. (2000). Multi-camera multi-person tracking for easyliving. In International workshop on visual surveillance.
Laurentini, A. (1994). The visual hull concept for Silhouette-based image understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence, 16(2), 150–162.
Leibe, B., Seemann, E., & Schiele, B. (2005). Pedestrian detection in crowded scenes. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR) (Vol. 1, pp. 878–885). Washington: IEEE Computer Society.
Lloyd, S. P. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129–136.
Mittal, A., & Davis, L. (2001). Unified multi-camera detection and tracking using region matching. In Proceedings of the IEEE workshop on multi-object tracking.
Mittal, A., & Davis, L. (2003). M2tracker: A multi-view approach to segmenting and tracking people in a cluttered scene. International Journal of Computer Vision, 51(3), 189–203.
Nummiaro, K., Koller Meier, E., Svoboda, T., Roth, D., & Van Gool, L. J. (2003). Color-based object tracking in multi-camera environments. In German pattern recognition symposium (pp. 591–599).
Orwell, J., Remagnino, P., & Jones, G. A. (1999). Multi-camera color tracking. In Proceedings of the IEEE workshop on visual surveillance (p. 14).
Papageorgiou, C., & Poggio, T. (1998). Trainable pedestrian detection. In IEEE conference on intelligent vehicles (pp. 35–39).
Polana, R., & Nelson, R. (1994). Low level recognition of human motion (or how to get your man without finding his body parts). In Proceedings of the IEEE workshop on motion of non-rigid and articulated objects (pp. 77–82).
Quaritsch, M., Kreuzthaler, M., Rinner, B., Bischof, H., & Strobl, B. (2007). Autonomous multicamera tracking on embedded smart cameras. Journal on Embedded Systems, 2007, 10.
Rodriguez, M. D., & Shah, M. (2007). Detecting and segmenting humans in crowded scenes. In Proceedings of the international conference on multimedia (pp. 353–356).
Shashua, A., Gdalyahu, Y., & Hayun, G. (2004). Pedestrian detection for driving assistance systems: single-frame classification. In IEEE conference on intelligent vehicles (pp. 1–6).
Smith, K., Gatica-Perez, D., & Odobez, J. (2005). Using particles to track varying numbers of interacting people. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR) (pp. 962–969).
Viola, P. A., Jones, M. J., & Snow, D. (2005). Detecting pedestrians using patterns of motion and appearance. International Journal of Computer Vision, 63(2), 153–161.
Wu, B., & Nevatia, R. (2007). Detection and tracking of multiple, partially occluded humans by Bayesian combination of edgelet based part detectors. International Journal of Computer Vision, 75(2), 247–266.
Yilmaz, A., Javed, O., & Shah, S. (2006). Object tracking: a survey. ACM Journal of Computing Surveys, 38(4), 13.
Yu, Q., Medioni, G., & Cohen, I. (2007). Multiple target tracking using spatio-temporal Markov chain Monte Carlo data association. In Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR).
Zhao, T., & Nevatia, R. (2004). Tracking multiple humans in complex situations. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(9), 1208–1221.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Eshel, R., Moses, Y. Tracking in a Dense Crowd Using Multiple Cameras. Int J Comput Vis 88, 129–143 (2010). https://doi.org/10.1007/s11263-009-0307-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11263-009-0307-0