3D Spatial Layout Propagation in a Video Sequence

Rituerto, Alejandro; Manduchi, Roberto; Murillo, Ana C.; Guerrero, J. J.

doi:10.1007/978-3-319-11755-3_42

Alejandro Rituerto¹⁷,
Roberto Manduchi¹⁸,
Ana C. Murillo¹⁷ &
…
J. J. Guerrero¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8815))

Included in the following conference series:

International Conference Image Analysis and Recognition

2332 Accesses
1 Citations

Abstract

Intelligent autonomous systems need detailed models of their environment to achieve sophisticated tasks. Vision sensors provide rich information and are broadly used to obtain these models, particularly, indoor scene understanding has been widely studied. A common initial step to solve this problem is the estimation of the \(3\)D layout of the scene. This work addresses the problem of scene layout propagation along a video sequence. We use a Particle Filter framework to propagate the scene layout obtained using a state-of-the-art technique on the initial frame and propose how to generate, evaluate and sample new layout hypotheses on each frame. Our intuition is that we can obtain better layout estimation at each frame through propagation than running separately at each image. The experimental validation shows promising results for the presented approach.

This work was supported by the Spanish FPI grant BES-\(2010\)-\(030299\) and Spanish projects DPI\(2012\)-\(31781\), DGA-T\(04\)-FSE and TAMA.

Access provided by Autonomous University of Puebla. Download to read the full chapter text

Chapter PDF

3D Layout Propagation to Improve Object Recognition in Egocentric Videos

IVS3D: An Open Source Framework for Intelligent Video Sampling and Preprocessing to Facilitate 3D Reconstruction

Wide baseline pose estimation from video with a density-based uncertainty model

Article 13 June 2019

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Badrinarayanan, V., Galasso, F., Cipolla, R.: Label propagation in video sequences. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3265–3272 (2010)
Google Scholar
Coughlan, J.M., Yuille, A.L.: Manhattan world: Compass direction from a single image by bayesian inference. In: IEEE International Conference on Computer Vision (ICCV), pp. 941–947 (1999)
Google Scholar
Delage, E., Lee, H., Ng, A.Y.: A dynamic bayesian network model for autonomous 3d reconstruction from a single indoor image. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2418–2428 (2006)
Google Scholar
Flint, A., Murray, D., Reid, I.: Manhattan scene understanding using monocular, stereo, and 3d features. In: IEEE International Conference on Computer Vision (ICCV), pp. 2228–2235 (2011)
Google Scholar
Furlan, A., Miller, S., Sorrenti, D.G., Fei-Fei, L., Savarese, S.: Free your camera: 3d indoor scene understanding from arbitrary camera motion. In: British Machine Vision Conference (BMVC) (2013)
Google Scholar
Gupta, A., Efros, A.A., Hebert, M.: Blocks world revisited: image understanding using qualitative geometry and mechanics. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 482–496. Springer, Heidelberg (2010)
Chapter Google Scholar
Hedau, V., Hoiem, D., Forsyth, D.: Recovering the spatial layout of cluttered rooms. In: IEEE International Conference on Computer Vision (ICCV), pp. 1849–1856 (2009)
Google Scholar
Hedau, V., Hoiem, D., Forsyth, D.: Thinking inside the box: using appearance models and context based on room geometry. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part VI. LNCS, vol. 6316, pp. 224–237. Springer, Heidelberg (2010)
Chapter Google Scholar
Hoiem, D., Efros, A.A., Hebert, M.: Geometric context from a single image. In: IEEE International Conference onComputer Vision (ICCV), pp. 654–661 (2005)
Google Scholar
Hoiem, D., Efros, A.A., Hebert, M.: Recovering surface layout from an image. International Journal of Computer Vision 75(1), 151–172 (2007)
Article Google Scholar
Hoiem, D., Efros, A.A., Hebert, M.: Putting objects in perspective. International Journal of Computer Vision 80(1), 3–15 (2008)
Article Google Scholar
Kovesi, P.D.: MATLAB and Octave functions for computer vision and image processing
Google Scholar
Lee, D.C., Hebert, M., Kanade, T.: Geometric reasoning for single image structure recovery. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2136–2143 (2009)
Google Scholar
López-Nicolás, G., Omedes, J., Guerrero, J.: Spatial layout recovery from a single omnidirectional image and its matching-free sequential propagation. Robotics and Autonomous Systems (2014)
Google Scholar
Raza, S.H., Grundmann, M., Essa, I.: Geometric context from video. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2013)
Google Scholar
Rituerto, J., Murillo, A., Kosecka, J.: Label propagation in videos indoors with an incremental non-parametric model update. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 2383–2389 (2011)
Google Scholar
Rother, C.: A new approach to vanishing point detection in architectural environments. Image and Vision Computing 20(9), 647–655 (2002)
Article Google Scholar
Saxena, A., Sun, M., Ng, A.Y.: Make3d: Learning 3d scene structure from a single still image. IEEE Transactions on Pattern Analysis and Machine Intelligence 31(5), 824–840 (2009)
Article Google Scholar
Tsai, G., Kuipers, B.: Dynamic visual understanding of the local environment for an indoor navigating robot. In: IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 4695–4701 (2012)
Google Scholar

Download references

Author information

Authors and Affiliations

Instituto de Investigación en Ingeniería de Aragón, University of Zaragoza, Zaragoza, Spain
Alejandro Rituerto, Ana C. Murillo & J. J. Guerrero
Computer Vision Lab at University of California, Santa Cruz, USA
Roberto Manduchi

Authors

Alejandro Rituerto
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Manduchi
View author publications
You can also search for this author in PubMed Google Scholar
Ana C. Murillo
View author publications
You can also search for this author in PubMed Google Scholar
J. J. Guerrero
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Alejandro Rituerto .

Editor information

Editors and Affiliations

Faculty of Engineering, University of Porto, Porto, Portugal
Aurélio Campilho
Dept. of Electrical and Computer Eng., University of Waterloo, Waterloo, Ontario, Canada
Mohamed Kamel

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Rituerto, A., Manduchi, R., Murillo, A.C., Guerrero, J.J. (2014). 3D Spatial Layout Propagation in a Video Sequence. In: Campilho, A., Kamel, M. (eds) Image Analysis and Recognition. ICIAR 2014. Lecture Notes in Computer Science(), vol 8815. Springer, Cham. https://doi.org/10.1007/978-3-319-11755-3_42

Download citation

DOI: https://doi.org/10.1007/978-3-319-11755-3_42
Published: 10 October 2014
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11754-6
Online ISBN: 978-3-319-11755-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

3D Spatial Layout Propagation in a Video Sequence

Abstract

Chapter PDF

Similar content being viewed by others

3D Layout Propagation to Improve Object Recognition in Egocentric Videos

IVS3D: An Open Source Framework for Intelligent Video Sampling and Preprocessing to Facilitate 3D Reconstruction

Wide baseline pose estimation from video with a density-based uncertainty model

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

3D Spatial Layout Propagation in a Video Sequence

Abstract

Chapter PDF

Similar content being viewed by others

3D Layout Propagation to Improve Object Recognition in Egocentric Videos

IVS3D: An Open Source Framework for Intelligent Video Sampling and Preprocessing to Facilitate 3D Reconstruction

Wide baseline pose estimation from video with a density-based uncertainty model

Keywords

References

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation