Efficient vision-based navigation

Hornung, Armin; Bennewitz, Maren; Strasdat, Hauke

doi:10.1007/s10514-010-9190-3

Learning about the influence of motion blur

Published: 23 April 2010

Autonomous Robots, Volume 29, pages 137–149 (2010)

Abstract

In this article, we present a novel approach to learning efficient navigation policies for mobile robots that use visual features for localization. As fast movements of a mobile robot typically introduce motion blur in the acquired images, the robot's uncertainty about its pose increases in such situations. As a result, it can no longer be guaranteed that a navigation task is executed efficiently, since the robot's pose estimate might not correspond to its true location. We present a reinforcement learning approach to determine a navigation policy that reaches the destination reliably and, at the same time, as quickly as possible. Using our technique, the robot learns to trade off velocity against localization accuracy and implicitly takes the impact of motion blur on observations into account. We also developed a method to compress the learned policy via a clustering approach. In this way, the size of the policy representation is significantly reduced, which is especially desirable for memory-constrained systems. Extensive simulated and real-world experiments carried out with two different robots demonstrate that our learned policy significantly outperforms both constant-velocity policies and more advanced heuristics. We furthermore show that the policy is generally applicable to different indoor and outdoor scenarios with varying landmark densities as well as to navigation tasks of different complexity.
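
The trade-off between velocity and localization accuracy described in the abstract can be illustrated with a small, self-contained sketch. It is not the authors' method: the abstract does not specify the learning algorithm, state features, or reward, so tabular Q-learning is used here as a generic stand-in, and the corridor world, the two velocity levels, the three uncertainty levels, and the penalty values are all assumptions made for illustration.

```python
import random

SLOW, FAST = 0, 1          # hypothetical action set: two velocity levels
MAX_DIST = 12              # corridor length in cells (assumed)
MAX_UNCERTAINTY = 2        # discretized pose-uncertainty levels 0..2 (assumed)


def step(dist, unc, action):
    """Toy transition model; returns (next_dist, next_unc, reward)."""
    if action == FAST:
        if unc == MAX_UNCERTAINTY:
            # Blurred images on top of a poor pose estimate: the robot
            # stops to relocalize, makes no progress, and pays a penalty.
            return dist, 0, -5.0
        # Fast motion: two cells of progress, but uncertainty grows.
        return max(dist - 2, 0), unc + 1, -1.0
    # Slow motion: sharp images, one cell of progress, uncertainty shrinks.
    return dist - 1, max(unc - 1, 0), -1.0


def learn_policy(episodes=20000, alpha=0.5, epsilon=0.2, seed=0):
    """Tabular Q-learning with epsilon-greedy exploration (illustrative)."""
    rng = random.Random(seed)
    q = {(d, u): [0.0, 0.0]
         for d in range(MAX_DIST + 1)
         for u in range(MAX_UNCERTAINTY + 1)}
    for _ in range(episodes):
        # Exploring starts: begin each episode in a random non-goal state.
        dist = rng.randint(1, MAX_DIST)
        unc = rng.randint(0, MAX_UNCERTAINTY)
        while dist > 0:
            values = q[(dist, unc)]
            if rng.random() < epsilon:
                action = rng.choice((SLOW, FAST))
            else:
                action = SLOW if values[SLOW] >= values[FAST] else FAST
            nxt_dist, nxt_unc, reward = step(dist, unc, action)
            # Undiscounted Q-learning update; goal states keep value 0.
            target = reward + max(q[(nxt_dist, nxt_unc)])
            values[action] += alpha * (target - values[action])
            dist, unc = nxt_dist, nxt_unc
    # Greedy policy: the best-valued velocity for every state.
    return {s: (SLOW if v[SLOW] >= v[FAST] else FAST) for s, v in q.items()}
```

Calling `learn_policy()` returns a dictionary from `(distance, uncertainty)` states to a velocity level. In this toy model the time penalty of one unit per step rewards speed, while the relocalization penalty makes fast motion unattractive once the pose estimate is already poor, so the learned policy slows down in exactly those states. The real system would of course face stochastic observations and a far richer state description.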

Author information

Authors and Affiliations

Department of Computer Science, University of Freiburg, Georges-Koehler-Allee 79, 79110, Freiburg, Germany
Armin Hornung & Maren Bennewitz
Department of Computing, Imperial College London, 180 Queen’s Gate, South Kensington Campus, London, SW7 2AZ, UK
Hauke Strasdat

Corresponding author

Correspondence to Armin Hornung.

Additional information

This work has been supported by the German Research Foundation (DFG) under contract number SFB/TR-8.

Electronic Supplementary Material

The electronic supplementary material consists of two MPG files (11.3 MB and 18.1 MB).

About this article

Cite this article

Hornung, A., Bennewitz, M. & Strasdat, H. Efficient vision-based navigation. Auton Robot 29, 137–149 (2010). https://doi.org/10.1007/s10514-010-9190-3

Received: 18 June 2009
Accepted: 07 April 2010
Published: 23 April 2010
Issue Date: August 2010
DOI: https://doi.org/10.1007/s10514-010-9190-3
