A finite step algorithm via a bimatrix game to a single controller non-zero sum stochastic game

Nowak, A. S.; Raghavan, T. E. S.

doi:10.1007/BF01581246

Published: March 1993

Volume 59, pages 249–259, (1993)

Mathematical Programming

Abstract

Given a non-zero sum discounted stochastic game with finitely many states and actions, one can form a bimatrix game whose pure strategies are the pure stationary strategies of the players and whose penalty payoffs consist of the total discounted costs over all states at any pure stationary pair. It is shown that any Nash equilibrium point of this bimatrix game can be used to find a Nash equilibrium point of the stochastic game whenever the law of motion is controlled by one player. The theorem is extended to undiscounted stochastic games with irreducible transitions when the law of motion is controlled by one player. Examples are worked out to illustrate the algorithm proposed.
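
The bimatrix construction described in the abstract can be illustrated with a small sketch. This is not the authors' algorithm, only a minimal illustration under stated assumptions: player 2 is taken to be the single controller, the two-state game data are made up, the discount factor is set to 1/2, and all function and variable names (`solve`, `discounted_total`, `bimatrix`, `cost1`, `cost2`, `trans`) are hypothetical. The sketch only builds the two cost matrices; it does not compute a Nash equilibrium of the bimatrix game or recover an equilibrium of the stochastic game from it.

```python
from fractions import Fraction
from itertools import product


def solve(A, b):
    """Solve A x = b by Gauss-Jordan elimination over exact fractions."""
    n = len(A)
    M = [[Fraction(x) for x in row] + [Fraction(b[i])] for i, row in enumerate(A)]
    for col in range(n):
        # Find a row with a non-zero pivot and move it into place.
        pivot = next(r for r in range(col, n) if M[r][col] != 0)
        M[col], M[pivot] = M[pivot], M[col]
        M[col] = [x / M[col][col] for x in M[col]]
        for r in range(n):
            if r != col and M[r][col] != 0:
                factor = M[r][col]
                M[r] = [x - factor * y for x, y in zip(M[r], M[col])]
    return [M[i][n] for i in range(n)]


def discounted_total(cost, trans, f, g, beta):
    """Total discounted cost, summed over all starting states, under the
    pure stationary pair (f, g). Assumption: trans[s][b] depends only on
    player 2's action b (player 2 is the single controller)."""
    n = len(cost)
    # v = (I - beta * P(g))^{-1} c(f, g)
    A = [[(1 if s == t else 0) - beta * trans[s][g[s]][t] for t in range(n)]
         for s in range(n)]
    c = [cost[s][f[s]][g[s]] for s in range(n)]
    return sum(solve(A, c))


def bimatrix(cost1, cost2, trans, beta):
    """Enumerate pure stationary strategies and build both cost matrices."""
    n = len(cost1)
    F = list(product(*[range(len(cost1[s])) for s in range(n)]))
    G = list(product(*[range(len(cost1[s][0])) for s in range(n)]))
    A = [[discounted_total(cost1, trans, f, g, beta) for g in G] for f in F]
    B = [[discounted_total(cost2, trans, f, g, beta) for g in G] for f in F]
    return F, G, A, B


# Made-up example data: cost[s][a][b] and trans[s][b][next_state].
cost1 = [[[1, 0], [0, 2]], [[3, 1], [0, 4]]]
cost2 = [[[0, 2], [1, 1]], [[2, 0], [3, 1]]]
trans = [[[1, 0], [0, 1]], [[0, 1], [1, 0]]]
beta = Fraction(1, 2)

F, G, A, B = bimatrix(cost1, cost2, trans, beta)
```

Each row of `A` and `B` corresponds to a pure stationary strategy of player 1 and each column to one of player 2, so the matrices grow exponentially with the number of states; the sketch is meant for small examples only.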

References

D. Blackwell, “Discrete dynamic programming,” Annals of Mathematical Statistics 33 (1962) 719–726.
J.A. Filar, “Ordered field property for stochastic games when the player who controls transition changes from state to state,” Journal of Optimization Theory and Applications 34 (1981) 503–513.
J.A. Filar, “On stationary equilibria of a single-controller stochastic game,” Mathematical Programming 30 (1984) 313–325.
J.A. Filar and T.E.S. Raghavan, “A matrix game solution of a single controller stochastic game,” Mathematics of Operations Research 9 (1984) 356–362.
A.M. Fink, “Equilibrium in a stochastic n-person game,” Journal of Science of Hiroshima University, Series A-I 28 (1964) 89–93.
D. Gillette, “Stochastic games with zero stop probabilities,” in: Contributions to the Theory of Games III. Annals of Mathematical Studies No. 39 (Princeton University Press, Princeton, NJ, 1957) pp. 179–187.
A. Hordijk and L.C.M. Kallenberg, “Linear programming and Markov games I, II,” in: O. Moeschlin and D. Pallaschke, eds., Game Theory and Mathematical Economics (North-Holland, Amsterdam, 1981).
T. Parthasarathy and T.E.S. Raghavan, Some Topics in Two-Person Games (American Elsevier Publishing Corporation, New York, 1971).
T. Parthasarathy and T.E.S. Raghavan, “An orderfield property for stochastic games when one player controls transition probabilities,” Journal of Optimization Theory and Applications 33 (1981) 375–392.
T.E.S. Raghavan and J.A. Filar, “Algorithms for stochastic games—a survey,” to appear in: Zeitschrift fur Operations Research.
S.M. Ross, “Non-discounted denumerable Markovian decision models,” Annals of Mathematical Statistics 39 (1968) 412–423.
L.S. Shapley, “Stochastic games,” Proceedings of the National Academy of Sciences of the U.S.A. 39 (1953) 1095–1100.
M.J. Sobel, “Noncooperative stochastic games,” Annals of Mathematical Statistics 42 (1971) 1930–1935.
M. Takahashi, “Equilibrium points of stochastic noncooperative n-person games,” Journal of Science of Hiroshima University, Series A-I 28 (1964) 95–99.
O.J. Vrieze, “Linear programming and undiscounted stochastic games in which one player controls transitions,” OR Spektrum 3 (1981) 29–35.

Author information

Authors and Affiliations

Institute of Mathematics, Wrocław Technical University, Wrocław, Poland
A. S. Nowak
Department of Mathematics, Statistics, and Computer Science, University of Illinois at Chicago, USA
T. E. S. Raghavan

Additional information

The work of this author was supported in part by the NSF grants DMS-9024408 and DMS 8802260.

About this article

Cite this article

Nowak, A.S., Raghavan, T.E.S. A finite step algorithm via a bimatrix game to a single controller non-zero sum stochastic game. Mathematical Programming 59, 249–259 (1993). https://doi.org/10.1007/BF01581246

Received: 06 March 1989
Revised: 24 June 1991
Issue Date: March 1993
DOI: https://doi.org/10.1007/BF01581246

Key words

Stochastic game theory