Solving Multi-agent Decision Problems Modeled as Dec-POMDP: A Robot Soccer Case Study

Aşık, Okan; Akın, H. Levent

doi:10.1007/978-3-642-39250-4_13

Okan Aşık²³ &
H. Levent Akın²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 7500))

Included in the following conference series:

Robot Soccer World Cup

2451 Accesses
3 Citations
2 Altmetric

Abstract

Robot soccer is one of the major domains for studying the coordination of multi-robot teams. Decentralized Partially Observable Markov Decision Process (Dec-POMDP) is a recent mathematical framework which has been used to model multi-agent coordination. In this work, we model simple robot soccer as Dec-POMDP and solve it using an algorithm which is based on the approach detailed in [1]. This algorithm uses finite state controllers to represent policies and searches the policy space with genetic algorithms. We use the TeamBots simulation environment. We use score difference of a game as a fitness and try to estimate it by running many simulations. We show that it is possible to model a robot soccer game as a Dec-POMDP and achieve satisfactory results. The trained policy wins almost all of the games against the standard TeamBots teams, and a reinforcement learning based team developed elsewhere.

Download to read the full chapter text

Chapter PDF

Using Monte Carlo Search with Data Aggregation to Improve Robot Soccer Policies

A framework based on evolutionary algorithm for strategy optimization in robot soccer

Article 12 July 2018

Collaborative Behavior in Soccer: The Setplay Free Software Framework

Keywords

References

Eker, B.: Evolutionary Algorithms for Solving DEC-POMDP Problems. PhD thesis, Boğaziçi University (2012)
Google Scholar
Bernstein, D.S., Hansen, E.A., Zilberstein, S.: Bounded Policy Iteration for Decentralized POMDPs. In: Proceedings of the Nineteenth International Joint Conference on Artificial Intelligence, Edinburgh, Scotland, pp. 1287–1292 (2005)
Google Scholar
Balch, T.: Teambots mobile robot simulator (2000)
Google Scholar
Meriçli, Ç., Meriçli, T., Levent Akın, H.: A Reward Function Generation Method Using Genetic Algorithms: A Robot Soccer Case Study. In: Proceedings of the 9th International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2010, Richland, SC, vol. 1, pp. 1513–1514 (2010); International Foundation for Autonomous Agents and Multiagent Systems
Google Scholar
Bernstein, D.S., Givan, R., Immerman, N., Zilberstein, S.: The Complexity of Decentralized Control of Markov Decision Processes. Math. Oper. Res. 27, 819–840 (2002)
Article MathSciNet MATH Google Scholar
Wu, F., Chen, X.: Solving Large-Scale and Sparse-Reward DEC-POMDPs with Correlation-MDPs. In: Visser, U., Ribeiro, F., Ohashi, T., Dellaert, F. (eds.) RoboCup 2007. LNCS (LNAI), vol. 5001, pp. 208–219. Springer, Heidelberg (2008)
Google Scholar
Stone, P., Sutton, R.S.: Scaling Reinforcement Learning toward RoboCup Soccer. In: Proc. 18th International Conf. on Machine Learning, pp. 537–544. Morgan Kaufmann, San Francisco (2001)
Google Scholar
Stone, P., Sutton, R.S., Singh, S.: Reinforcement Learning for 3 vs. 2 Keepaway. In: Stone, P., Balch, T., Kraetzschmar, G.K. (eds.) RoboCup 2000. LNCS (LNAI), vol. 2019, pp. 249–258. Springer, Heidelberg (2001)
Chapter Google Scholar
Stone, P., Sutton, R.S., Singh, S.: Reinforcement Learning for 3 vs. 2 Keepaway. In: Stone, P., Balch, T., Kraetzschmar, G.K. (eds.) RoboCup 2000. LNCS (LNAI), vol. 2019, pp. 249–258. Springer, Heidelberg (2001)
Chapter Google Scholar
Whiteson, S., Kohl, N., Miikkulainen, R., Stone, P.: Evolving Soccer Keepaway Players Through Task Decomposition. Machine Learning 59, 5–30 (2005), 10.1007/s10994-005-0460-9
Google Scholar
Stone, P., Kuhlmann, G., Taylor, M.E., Liu, Y.: Keepaway Soccer: From Machine Learning Testbed to Benchmark. In: Bredenfeld, A., Jacoff, A., Noda, I., Takahashi, Y. (eds.) RoboCup 2005. LNCS (LNAI), vol. 4020, pp. 93–105. Springer, Heidelberg (2006)
Chapter Google Scholar
Pietro, A.D., While, L., Barone, L.: Learning In RoboCup Keepaway Using Evolutionary Algorithms. In: GECCO 2002, pp. 1065–1072 (2002)
Google Scholar
Amato, C., Bernstein, D.S., Zilberstein, S.: Optimal Fixed-Size Controllers for Decentralized POMDPs. In: Proceedings of the AAMAS Workshop on Multi-Agent Sequential Decision Making in Uncertain Domains, Hakodate, Japan, pp. 61–71 (2006)
Google Scholar
Levent Akın, H.: Evolutionary Computation: A Natural Answer to Artificial Questions. In: Proceedings of ANNAL: Hints from Life to Artificial Intelligence, pp. 41–52. METU, Ankara (1994)
Google Scholar
Eker, B., Levent Akın, H.: Using evolution strategies to solve DEC-POMDP problems. Soft Computing-A Fusion of Foundations, Methodologies and Applications 14(1), 35–47 (2010)
Google Scholar
Meffert, K., Meseguer, J., Marti, E.D., Meskauskas, A., Vos, J., Rotstan, N.: Jgap: Java genetic algorithms package (2011)
Google Scholar
Meriçli, Ç., Levent Akın, H.: A Layered Metric Definition and Evaluation Framework for Multirobot Systems. In: Iocchi, L., Matsubara, H., Weitzenfeld, A., Zhou, C. (eds.) RoboCup 2008. LNCS, vol. 5399, pp. 568–579. Springer, Heidelberg (2009)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Engineering, Boǧaziçi University, 34342, İstanbul, Turkey
Okan Aşık & H. Levent Akın

Authors

Okan Aşık
View author publications
You can also search for this author in PubMed Google Scholar
H. Levent Akın
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Computer Science School, University of Science and Technology of China, 230027, Hefei, China
Xiaoping Chen
Department of Computer Science, The University of Texas at Austin, 78712-1757, Austin, TX, USA
Peter Stone
Instituto Nacional de Astrofísica, Óptica y Electrónica, Puebla, Mexico
Luis Enrique Sucar
Faculty of Mathematics and Natural Sciences, Institute for Artificial Intelligence and Cognitive Engineering, University of Groningen, 9747 AG, Groningen, The Netherlands
Tijn van der Zant

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Aşık, O., Akın, H.L. (2013). Solving Multi-agent Decision Problems Modeled as Dec-POMDP: A Robot Soccer Case Study. In: Chen, X., Stone, P., Sucar, L.E., van der Zant, T. (eds) RoboCup 2012: Robot Soccer World Cup XVI. RoboCup 2012. Lecture Notes in Computer Science(), vol 7500. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-39250-4_13

Download citation

DOI: https://doi.org/10.1007/978-3-642-39250-4_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-39249-8
Online ISBN: 978-3-642-39250-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Solving Multi-agent Decision Problems Modeled as Dec-POMDP: A Robot Soccer Case Study

Abstract

Chapter PDF

Similar content being viewed by others

Using Monte Carlo Search with Data Aggregation to Improve Robot Soccer Policies

A framework based on evolutionary algorithm for strategy optimization in robot soccer

Collaborative Behavior in Soccer: The Setplay Free Software Framework

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Solving Multi-agent Decision Problems Modeled as Dec-POMDP: A Robot Soccer Case Study

Abstract

Chapter PDF

Similar content being viewed by others

Using Monte Carlo Search with Data Aggregation to Improve Robot Soccer Policies

A framework based on evolutionary algorithm for strategy optimization in robot soccer

Collaborative Behavior in Soccer: The Setplay Free Software Framework

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation