Abstract
Two main challenges of robot action planning in real domains are uncertain action effects and dynamic environments. In this paper, an instance-based action model is learned empirically by robots trying actions in the environment. The action planning problem is modeled as a Markov decision process (MDP), and the learned action model is used to build its transition function. In static environments, standard value iteration is used to compute the optimal policy. In dynamic environments, an algorithm is proposed for fast replanning, which updates only a subset of the state-action values computed for the static environment. As a test-bed, the goal scoring task in the RoboCup 4-legged scenario is used. The algorithms are validated on the problem of planning kicks for scoring goals in the presence of opponent robots. The experimental results, both in simulation and on real robots, show that the instance-based action model boosts performance over the parametric models used previously, and that incremental replanning significantly improves on the original off-line planning.
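To make the planning step concrete, the following is a minimal sketch of the standard value iteration the abstract refers to, over a tabular MDP. The states, actions, transition probabilities, and rewards below are hypothetical placeholders for illustration, not the paper's learned goal-scoring model (where the transition function would instead be built from the instance-based action model).

```python
GAMMA = 0.9   # discount factor
THETA = 1e-6  # convergence threshold

# transition[s][a] -> list of (probability, next_state, reward);
# a state with no actions is treated as terminal.
transition = {
    0: {"kick_left":  [(0.8, 1, 0.0), (0.2, 0, 0.0)],
        "kick_right": [(0.6, 2, 0.0), (0.4, 0, 0.0)]},
    1: {"kick_left":  [(0.5, 2, 1.0), (0.5, 1, 0.0)],
        "kick_right": [(1.0, 0, 0.0)]},
    2: {},  # terminal state (e.g. goal scored)
}

def value_iteration(transition, gamma=GAMMA, theta=THETA):
    """Iterate the Bellman optimality backup until values converge."""
    V = {s: 0.0 for s in transition}
    while True:
        delta = 0.0
        for s, actions in transition.items():
            if not actions:  # terminal state keeps value 0
                continue
            # Best expected return over all actions available in s.
            best = max(
                sum(p * (r + gamma * V[s2]) for p, s2, r in outcomes)
                for outcomes in actions.values()
            )
            delta = max(delta, abs(best - V[s]))
            V[s] = best
        if delta < theta:
            return V

V = value_iteration(transition)
```

The incremental replanning of the paper would, on an environment change, re-run such backups only for the affected subset of states rather than sweeping the whole state space.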
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ahmadi, M., Stone, P. (2008). Instance-Based Action Models for Fast Action Planning. In: Visser, U., Ribeiro, F., Ohashi, T., Dellaert, F. (eds) RoboCup 2007: Robot Soccer World Cup XI. RoboCup 2007. Lecture Notes in Computer Science(), vol 5001. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68847-1_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-68846-4
Online ISBN: 978-3-540-68847-1