Abstract
This work presents a model for Markov Decision Processes applied to the problem of keeping two agents in equilibrium with respect to the values they exchange when they interact. Interval mathematics is used to model the qualitative values involved in interactions. The optimal policy is constrained by the adopted model of social interactions. The MDP is assigned to a supervisor, that monitors the agents’ actions and makes recommendations to keep them in equilibrium. The agents are autonomous and allowed to not follow the recommendations. Due to the qualitative nature of the exchange values, even when agents follow the recommendations, the decision process is non-trivial.
This work was partially supported by CTINFO/CNPq and FAPERGS.
Access provided by Autonomous University of Puebla. Download to read the full chapter text
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Costa, A.C.R., Dimuro, G.P.: The Case for Using Exchange Values in the Modelling of Collaborative Learning Interactions. In: Mostow, J., Tedesco, P. (eds.) Proceedings of Workshop 9 in the 7th International Conference on Intelligent Tutoring Systems, ITS 2004, Maceió, pp. 19–24 (2004)
d’Inverno, M., Luck, M.: Understanding Agent Systems. Springer, Berlin (2001)
Kaelbling, L.P., Littman, M.L., Cassandra, A.R.: Planning and Acting in Partially Observabe Stochastic Domains. Artificial Intelligence 101(1), 99–134 (1998)
Homans, G.C.: Social Behavior - Its Elementary Forms. Harcourt, Brace &World, New York (1961)
Howard, R.A.: Dynamic Programming and Markov Processes. MIT Press, Cambridge (1960)
Moore, R.E.: Methods and Applications of Interval Analysis. SIAM, Philadelphia (1979)
Piaget, J.: Socialogical Studies. Routlege, London (1995)
Puterman, M.L.: Markov Decision Processes – Discrete Stochastic Dynamic Programming. Wiley, New York (1994)
Rodrigues, M.R., Costa, A.C.R., Bordini, R.: A System of Exchange Values to Support Social Interactions in Artificial Societes. In: Proceeding of the Second International Conference on Autonomous Agnets and Multiagents Systems, AAMAS 2003, Melbourne, Australia, pp. 81–88. ACM, New York (2003)
Rodrigues, M.R., Costa, A.C.R.: Using Qualitative Exchange Values to Improve the Modelling of Social Interactions. In: Hales, D., Edmonds, B., Norling, E., Rouchier, J. (eds.) Procedings of 4th Workshop on Agent Based Simulations, Melbourne, Australia. LNCS, vol. 2927, pp. 57–72 (2003)
Russel, S., Norvig, P.: Artificial Intelligence, a Modern Approach. Prentice Hall, Reading (2003)
White, D.J.: Markov Decision Processes. Wiley, New York (2002)
Wooldridge, M.: An Introduction to Multi-Agent Systems. Wiley, New York (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dimuro, G.P., Costa, A.C.R. (2006). Interval-Based Markov Decision Processes for Regulating Interactions Between Two Agents in Multi-agent Systems. In: Dongarra, J., Madsen, K., Waśniewski, J. (eds) Applied Parallel Computing. State of the Art in Scientific Computing. PARA 2004. Lecture Notes in Computer Science, vol 3732. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11558958_12
Download citation
DOI: https://doi.org/10.1007/11558958_12
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29067-4
Online ISBN: 978-3-540-33498-9
eBook Packages: Computer ScienceComputer Science (R0)