Activity Forecasting

Kitani, Kris M.; Ziebart, Brian D.; Bagnell, James Andrew; Hebert, Martial

doi:10.1007/978-3-642-33765-9_15

Kris M. Kitani²¹,
Brian D. Ziebart²¹,
James Andrew Bagnell²¹ &
…
Martial Hebert²¹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 7575))

Included in the following conference series:

European Conference on Computer Vision

11k Accesses
249 Citations
3 Altmetric

Abstract

We address the task of inferring the future actions of people from noisy visual input. We denote this task activity forecasting. To achieve accurate activity forecasting, our approach models the effect of the physical environment on the choice of human actions. This is accomplished by the use of state-of-the-art semantic scene understanding combined with ideas from optimal control theory. Our unified model also integrates several other key elements of activity analysis, namely, destination forecasting, sequence smoothing and transfer learning. As proof-of-concept, we focus on the domain of trajectory-based activity analysis from visual input. Experimental results demonstrate that our model accurately predicts distributions over future actions of individuals. We show how the same techniques can improve the results of tracking algorithms by leveraging information about likely goals and trajectories.

Download to read the full chapter text

Chapter PDF

Long-Term Activity Forecasting Using First-Person Vision

A Hierarchical Framework for Motion Trajectory Forecasting Based on Modality Sampling

Context-Aware Activity Forecasting

Keywords

References

Munoz, D., Bagnell, J.A., Hebert, M.: Stacked Hierarchical Labeling. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part VI. LNCS, vol. 6316, pp. 57–70. Springer, Heidelberg (2010)
Chapter Google Scholar
Munoz, D., Bagnell, J.A., Hebert, M.: Co-inference for Multi-modal Scene Analysis. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 668–681. Springer, Heidelberg (2012)
Chapter Google Scholar
Ziebart, B., Ratliff, N., Gallagher, G., Mertz, C., Peterson, K., Bagnell, J., Hebert, M., Dey, A., Srinivasa, S.: Planning-based prediction for pedestrians. In: IROS (2009)
Google Scholar
Abbeel, P., Ng, A.: Apprenticeship learning via inverse reinforcement learning. In: ICML (2004)
Google Scholar
Baker, C., Saxe, R., Tenenbaum, J.: Action understanding as inverse planning. Cognition 113(3), 329–349 (2009)
Article Google Scholar
Ziebart, B., Maas, A., Bagnell, J., Dey, A.: Maximum entropy inverse reinforcement learning. In: AAAI (2008)
Google Scholar
Levine, S., Popovic, Z., Koltun, V.: Nonlinear inverse reinforcement learning with Gaussian processes. In: NIPS (2011)
Google Scholar
Morris, B., Trivedi, M.: A survey of vision-based trajectory learning and analysis for surveillance. Transactions on Circuits and Systems for Video Technology 18(8), 1114–1127 (2008)
Article Google Scholar
Ali, S., Shah, M.: Floor Fields for Tracking in High Density Crowd Scenes. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 1–14. Springer, Heidelberg (2008)
Chapter Google Scholar
Zen, G., Ricci, E.: Earth mover’s prototypes: A convex learning approach for discovering activity patterns in dynamic scenes. In: CVPR (2011)
Google Scholar
Mehran, R., Oyama, A., Shah, M.: Abnormal crowd behavior detection using social force model. In: CVPR (2009)
Google Scholar
Pellegrini, S., Ess, A., Schindler, K., Van Gool, L.J.: You’ll never walk alone: Modeling social behavior for multi-target tracking. In: ICCV (2009)
Google Scholar
Turek, M.W., Hoogs, A., Collins, R.: Unsupervised Learning of Functional Categories in Video Scenes. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part II. LNCS, vol. 6312, pp. 664–677. Springer, Heidelberg (2010)
Chapter Google Scholar
Huang, C., Wu, B., Nevatia, R.: Robust Object Tracking by Hierarchical Association of Detection Responses. In: Forsyth, D., Torr, P., Zisserman, A. (eds.) ECCV 2008, Part II. LNCS, vol. 5303, pp. 788–801. Springer, Heidelberg (2008)
Chapter Google Scholar
Kaucic, R., Amitha Perera, A., Brooksby, G., Kaufhold, J., Hoogs, A.: A unified framework for tracking through occlusions and across sensor gaps. In: CVPR (2005)
Google Scholar
Gong, H., Sim, J., Likhachev, M., Shi, J.: Multi-hypothesis motion planning for visual object tracking. In: ICCV (2011)
Google Scholar
Xing, Z., Pei, J., Dong, G., Yu, P.: Mining sequence classifiers for early prediction. In: SIAM International Conference on Data Mining (2008)
Google Scholar
Ryoo, M.: Human activity prediction: Early recognition of ongoing activities from streaming videos. In: ICCV (2011)
Google Scholar
Hoai, M., De la Torre, F.: Max-margin early event detectors. In: CVPR (2012)
Google Scholar
Bellman, R.: A Markovian decision process. Journal of Mathematics and Mechanics 6(5), 679–684 (1957)
MATH Google Scholar
Ratliff, N., Bagnell, J., Zinkevich, M.: Maximum margin planning. In: ICML (2006)
Google Scholar
Oh, S., Hoogs, A., Perera, A., Cuntoor, N., Chen, C., Lee, J., Mukherjee, S., Aggarwal, J., Lee, H., Davis, L., et al.: A large-scale benchmark dataset for event recognition in surveillance video. In: CVPR (2011)
Google Scholar
Wang, S., Lu, H., Yang, F., Yang, M.H.: Superpixel tracking. In: ICCV (2011)
Google Scholar

Download references

Author information

Authors and Affiliations

Carnegie Mellon University, Pittsburgh, PA, 15213, USA
Kris M. Kitani, Brian D. Ziebart, James Andrew Bagnell & Martial Hebert

Authors

Kris M. Kitani
View author publications
You can also search for this author in PubMed Google Scholar
Brian D. Ziebart
View author publications
You can also search for this author in PubMed Google Scholar
James Andrew Bagnell
View author publications
You can also search for this author in PubMed Google Scholar
Martial Hebert
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Microsoft Research Ltd., CB3 0FB, Cambridge, UK
Andrew Fitzgibbon
Dept. of Computer Science, University of North Carolina, 27599, Chapel Hill, NC, USA
Svetlana Lazebnik
California Institute of Technology, 91125, Pasadena, CA, USA
Pietro Perona
Institute of Industrial Science, The University of Tokyo, 153-8505, Tokyo, Japan
Yoichi Sato
INRIA, 38330, Montbonnot, France
Cordelia Schmid

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kitani, K.M., Ziebart, B.D., Bagnell, J.A., Hebert, M. (2012). Activity Forecasting. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds) Computer Vision – ECCV 2012. ECCV 2012. Lecture Notes in Computer Science, vol 7575. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-33765-9_15

Download citation

DOI: https://doi.org/10.1007/978-3-642-33765-9_15
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-33764-2
Online ISBN: 978-3-642-33765-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Activity Forecasting

Abstract

Chapter PDF

Similar content being viewed by others

Long-Term Activity Forecasting Using First-Person Vision

A Hierarchical Framework for Motion Trajectory Forecasting Based on Modality Sampling

Context-Aware Activity Forecasting

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Activity Forecasting

Abstract

Chapter PDF

Similar content being viewed by others

Long-Term Activity Forecasting Using First-Person Vision

A Hierarchical Framework for Motion Trajectory Forecasting Based on Modality Sampling

Context-Aware Activity Forecasting

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation