Abstract
This chapter discusses stochastic games. We focus on nonzero-sum games and provide a detailed survey of selected recent results. In Section 1, we consider stochastic Markov games. A correlation of strategies of the players, involving “public signals,” is described, and a correlated equilibrium theorem proved recently by Nowak and Raghavan for discounted stochastic games with general state space is presented. We also resport an extension of this result to a class of undiscounted stochastic games, satisfying some uniform ergodicity condition. Stopping games are related to stochastic Markov games. In Section 2, we describe a version of Dynkin’s game related to observation of a Markov process with random assignment mechanism of states to the players. Some recent constributions of the second author in this area are reported. The chapter also provides a brief overview of the theory of nonzero-sum stochastic games and stopping games.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
P. D. Rogers. Nonzero-Sum Stochastic Games. PhD thesis, University of California, Berkeley, 1969. Report ORC 69–8.
M. Sobel. Noncooperative stochastic games Ann. Math. Statist., vol. 42, pp. 1930–1935, 1971.
T. Parthasarathy and T. E. S. Raghavan. An orderfield property for stochastic games when one player controls transition probabilities, J. Optim. Theory Appl., vol. 33, pp. 375–392, 1981.
F. Thuijsman. Optimality and Equilibria in Stochastic Games. PhD thesis, University of Limburg, Maastricht, The Netherlands, 1989.
O. J. Vrieze and F. Thuijsman. On equilibria in repeated games with absorbing states, Internat. J. Game Theory, vol. 18, pp. 293–310, 1989.
T. Parthasarathy. Discounted, positive, and non-cooperative stochastic games, Internat. J. Game Theory, vol. 2, pp. 25–37, 1973.
A. Federgruen. On n-person stochastic games with denumerable state space, Adv. Appl. Probab., vol. 10, pp. 452–471, 1978.
V. Borkar and M. Ghosh. Denumerable state stochastic games with limiting payoff, J. Optimization Theory Appl., vol. 76, pp. 539–560, 1993.
A. S. Nowak. Stationary overtaking equilibria for non-zero-sum stochastic games with countable state spaces, mimeo, Institute of Mathematics, TU Wroclaw, 1994.
D. Duffle, J. Geanakoplos, A. Mas-Colell, and A. McLennan. Stationary Markov equilibria, Technical Report, Dept. of Economics, Harvard University, 1988.
P. K. Dutta. What do discounted optima converge to? A theory of discount rate asymptotics in economics models, J. Economic Theory, vol. 55, pp. 64–94, 1991.
I. Karatzas, M. Shubik, and W. D. Sudderth. Construction of stationary Markov equilibria in a strategic market game, Technical Report 92–05–022, Santa Fe Institute Working Paper, Santa Fe, New Mexico, 1992.
M. Majumdar and R. Sundaram. Symmetric stochastic games of resource extraction: The existence of non-randomized stationary equilibrium; In Stochastic Games and Related Topics, pp. 175–190, Dordrecht, The Netherlands: Kluwer Academic Publishers, 1991.
P. K. Dutta and R. Sundaram. Markovian equilibrium in a class of stochastic games: Existence theorems for discounted and undiscounted models, Economic Theory, vol. 2, 1992.
M. K. Ghosh and A. Bagchi. Stochastic games with average payoff criterion, Technical Report 985, Faculty of Applied Mathematics, University of Twente, Enschede, The Netherlands, 1991.
A. S. Nowak and T. E. S. Raghavan. Existence of stationary correlated equilibria with symmetric information for discounted stochastic games, Math. Oper. Res., vol. 17, pp. 519–526, 1992.
A. S. Nowak. Stationary equilibria for nonzero-sum average payoff ergodic stochastic games with general state space; In Advances in Dynamic Games and Applications (T. Bagar and A. Haurie, eds.), pp. 231–246, Birkhäuser, 1994.
C. Castaing and M. Valadier. Convex Analysis and Measurable Multifonctions, vol. 580 of Lecture Notes in Mathematics. New York: Springer-Verlag, 1977.
C. J. Himmelberg. Measurable relations, Fund. Math, vol. 87, pp. 53–72, 1975.
D. P. Bertsekas and S. E. Shreve. Stochastic Optimal Control: The Discrete Time Case. New York: Academic Press, 1978.
C. J. Himmelberg, T. Parthasarathy, T. E. S. Raghavan, and F. S. van Vleck. Existence of p-equilibrium and optimal stationary strategies in stochastic games, Proc. Amer. Math. Soc., vol. 60, pp. 245–251, 1976.
W. Whitt. Representation and approximation of noncooperative sequential games, SIAM J. Control Optim., vol. 18, pp. 33–48, 1980.
A. S. Nowak. Existence of equilibrium stationary strategies in discounted noncooperative stochastic games with uncountable state space, J. Optim. Theory Appl., vol. 45, pp. 591–602, 1985.
M. Breton and P. L’Ecuyer. Noncooperative stochastic games under a n-stage local contraction assumption, Stochastics and Stochastic Reports, vol. 26, pp. 227–245, 1989.
J.-F. Mertens and T. Parthasarathy. Equilibria for discounted stochastic games, Technical Report 8750, CORE Discussion Paper, Université Catholique de Louvain, 1987.
C. Harris. The existence of subgame-perfect equilibrium in games with simultaneous moves: A case for extensive-form correlation, mimeo, Nuffield College, Oxford, U.K., 1990.
F. Forges. An approach to communication equilibria, Econometrica, vol. 54, pp. 1375–1385, 1986.
R. L. Tweedie. Criteria for rates of convergence of Markov chains, with an application to queueing and storage theory; In Papers in Probability Statistics and Analysis (J. F. C. Kingman and G. E. H. Reuter, eds.), pp. 260–276, Cambridge, U. K.: Cambridge University Press, 1983.
O. Hernández-Lerma, J. C. Rennet, and J. B. Lasserre. Average cost Markov decision processes: Optimality conditions, J. Math. Anal. Appl., vol. 158, pp. 396–406, 1991.
O. Hernández-Lerma, R. Montes-de-Oca, and R. Cavazos-Cadena. Recurrence conditions for Markov decision processes with Borel state space, Ann. Oper. Res., vol. 28, pp. 29–46, 1991.
E. Nummelin. General Irreducible Markov Chains and Non-Negative Operators. London: Cambridge Univ. Press, 1984.
J. Neveu. Mathematical Foundations of the Calculus of Probability. San Francisco: Holden-Day, 1965.
M. Kurano. Markov decision processes with a Borel measurable cost function — the average case, Math. Oper. Res.,vol. 11, pp. 309–320, 1986.
K. Yamada. Duality theorem in Markovian decision problems, J. Math. Anal. Appl.,vol. 50, pp. 579–595, 1975.
S. Yakovitz. Dynamic programming applications in water resources, Water Resources Res., vol. 18, pp. 673–696, 1982.
J. Georgin. Controle de chaînes de Markov sur des espaces arbitraires, Ann. Inst. H. Poincaré, vol. 14,pp. 255–277, 1978.
T. Ueno. Some limit theorems for temporally discrete Markov processes, J. Fac. Science Univ. Tokyo, vol. 7, pp. 449–462, 1957.
A. Hordijk. Dynamic Programming and Markov Potential Theory. Amsterdam: Math. Centrum, 1977.
J. L. Doob. Stochastic Processes. New York: Wiley, 1953.
T. Parthasarathy. Existence of equilibrium stationary strategies in discounted stochastic games, Sankhya Series A, vol. 44, pp. 114–127, 1982.
T. Parthasarathy and S. Sinha Existence of stationary equilibrium strategies in non-zero-sum discounted stochastic games with uncountable state space and state independent transitions, Internat. J. Game Theory, vol. 18, pp. 189–194, 1989.
A. S. Nowak. Zero-sum average payoff stochastic games with general state space, Games and Economic Behavior, (to appear) 1994.
M. Schäl. Average optimality in dynamic programming with general state space, Math. Oper. Res., vol. 18, pp. 163–172, 1993.
E. Dynkin. Game variant of a problem on optimal stopping, Soviet Math. Dokl., vol. 10, pp. 270–274, 1969.
Y. Kifer. Optimal stopping games, T. Probab. Appl., vol. 16, pp. 185–189, 1971.
J. Neveu. Discrete-Parameter Martingales. Amsterdam: North-Holland, 1975.
M. Yasuda. On a randomized strategy in Neveu’s stopping problem, Stochastic Proc. and their Appl., vol. 21, pp. 159–166, 1985.
E. Frid. The optimal stopping for a two-person Markov chain with opposing interests, Theory Probab. Appl., vol. 14, no. 4, pp. 713–716, 1969.
N. Elbakidze. Construction of the cost and optimal policies in a game problem of stopping a Markov process, Theory Probab. Appl., vol. 21, pp. 163–168, 1976.
Y. Ohtsubo. A nonzero-sum extension of Dynkin’s stopping problem, Math. Oper. Res., vol. 12, pp. 277–296, 1987.
E. Ferenstein. A variation of the Dynkin’s stopping game, Math. Japonica, vol. 38, no. 2, pp. 371–379, 1993.
A. Bensoussan and A. Friedman. Nonlinear variational inequalities and differential games with stopping times, J. Funct. Anal., vol. 16, pp. 305–352, 1974.
A. Bensoussan and A. Friedman. Nonzero-sum stochastic differential games with stopping times and free boundary problems, Trans. Amer. Math. Soc., vol. 231, pp. 275–327, 1977.
N. Krylov. Control of Markov processes and W-spaces, Math. USSR-Izv., vol. 5, pp. 233–266, 1971.
J.-M. Bismut. Sur un problème de Dynkin, Z. Wahrsch. Ver. Gebite, vol. 39, pp. 31–53, 1977.
L. Stettner. Zero-sum Markov games with stopping and impulsive strategies, Appl. Math. Optim., vol. 9, pp. 1–24, 1982.
J. Lepeltier and M. Maingueneau. Le jeu de Dynkin en théorie générale sans l’hypothèse de Mokobodski, Stochastics, vol. 13, pp. 25–44, 1984.
K. Szajowski. Double stop by two decision makers, Adv. Appl. Probab., vol. 25, pp. 438–452, 1993.
T. Radzik and K. Szajowski. Sequential games with random priority, Sequential Analysis, vol. 9, no. 4, pp. 361–377, 1990.
K. Ano. Bilateral secretary problem recognizing the maximum or the second maximum of a sequence, J. Information & Optimization Sciences, vol. 11, pp. 177–188, 1990.
E. Enns and E. Ferenstein. On a multi-person time-sequential game with priorities, Sequential Analysis, vol. 6, pp. 239–256, 1987.
E. Ferenstein. Two-person non-zero-sum games with priorities; In Strategies for Sequential Search and Selection in Real Time,Proceedings of the AMS-IMS-SIAM Join Summer Research Conferences held June 21–27, 1990 (T. S. Ferguson and S. M. Samuels, eds.), vol. 125 of Contemporary Mathematics, (University of Massachusetts at Amherst), pp. 119–133, 1992.
M. Sakaguchi. Sequential games with priority under expected value maximization, Math. Japonica, vol. 36, no. 3, pp. 545–562, 1991.
E. Enns and E. Ferenstein. The horse game, J. Oper. Res. Soc. Japan, vol. 28, pp. 51–62, 1985.
M. Fushimi. The secretary problem in a competitive situation, J. Oper. Res. Soc. Jap., vol. 24, pp. 350–358, 1981.
A. Majumdar. Optimal stopping for a two-person sequential game in the discrete case, Pure and Appl. Math. Sci, vol. 23, pp. 67–75, 1986.
M. Sakaguchi. Multiperson multilateral secretary problem, Math. Japonica, vol. 35, pp. 459–473, 1989.
G. Ravindran and K. Szajowski. Non-zero sum game with priority as Dynkin’s game, Math. Japonica, vol. 37, no. 3, pp. 401–413, 1992.
K. Szajowski. On non-zero sum game with priority in the secretary problem, Math. Japonica, no. 3, pp. 415–426, 1992.
J. Gilbert and F. Mosteller. Recognizing the maximum of a sequence, J. Amer. Statist. Assoc.,vol. 61, no. 313, pp. 35–73, 1966.
P. Freeman. The secretary problem and its extensions: A review, Int. Statist. Rev., vol. 51, pp. 189–206, 1983.
J. Rose. Twenty years of secretary problems: A survey of developments in the theory of optimal choice, Management Studies, vol. 1,pp. 53–64, 1982.
T. Ferguson. Who solved the secretary problem?, Statistical Science, vol. 4, pp. 282–289, 1989.
K. Szajowski. Optimal stopping of a discrete Markov processes by two decision makers 1992, SIAM J. on Control and Optimization, vol. 33, no. 5, pp. 1392–1410, 1995.
A. Shiryaev. Optimal Stopping Rules. New York, Heidelberg, Berlin: Springer-Verlag, 1978.
R. Eidukjavicjus. Optimalna ostanovka markovskoj cepi dvumia momentami ostanovki, Lit. Mat. Sbornik, vol. 19, pp. 181–183, 1979. Inf. XIX conf. math.
R. Luce and H. Raiffa. Games and Decisions. New York: John Wiley and Sons, 1957.
G. Haggstrom. Optimal sequential procedures when more then one stop is required, Ann. Math. Statist., vol. 38, pp. 1618–1626, 1967.
W. Stadje. An optimal k-stopping problem for the Poisson process; In Proc. of the 6th Pannonian Symp. on Math. Stat. Bad Tazmannsdorf (Austria), D.Reidel Pub. Comp., 1987. in Mathematical Statistics and Probability Theory vol. B.
E. Dynkin and A. Yushkevich. Theorems and Problems on Markov Processes. New York: Plenum, 1969.
A. Mucci. Differential equations and optimal choice problem, Ann. Statist., vol. 1, pp. 104–113, 1973.
K. Szajowski. Optimal choice problem of a-th object, Matem. Stos., vol. 19, pp. 51–65, 1982. In Polish.
H. W. Kuhn. Extensive games and the problem of information; In Contribution to the Theory of Games (H. Kuhn and A. Tucker, eds.), vol. 24 of Annals of Mathematics Study,Princeton University Press, 1953. Vol. I.
U. Rieder. Equilibrium plans for non-zero-sum Markov games; In Game Theory and Related Topics (D. Moeschlin and D. Palaschke, eds.), pp. 91–101, North-Holland Publishing Company, 1979.
H. Moulin. Game Theory for the Social Sciences. New York: New York University Press, 1986.
K. Szajowski. Markov stopping games with random priority, Zeitschrift für Operations Research, no. 3, pp. 69–84, 1993.
L. S. Shapley. Stochastic games, Proceedings of the National Academy of Science U.S.A., vol. 39, pp. 1095–1100, 1953.
A. M. Fink. Equilibrium in a stochastic n-person game, J. Science of Hiroshima Univ., vol. 28, pp. 89–93, 1964.
M. Takahashi. Equilibrium points of stochastic noncooperative n-person games, J. Science of Hiroshima Univ., vol. 28, pp. 95–99, 1964.
A. S. Nowak and T. E. S. Raghavan. A finite step algorithm via bimatrix games to a single controller nonzero-sum stochastic games, Math. Programming, vol. 59, pp. 249–259, 1993.
T. E. S. Raghavan and J. A. Filar. Algorithms for stochastic games—a survey. Zeitschrift für Operations Research, pp. 437–472, 1991.
J. A. Filar and K. Vrieze. Competitive Markov Decision Processes, Springer-Verlag, New York, 1997.
F. Thuijsman and T. E. S. Raghavan. Perfect information stochastic games and related classes, Inter. J. Game Theory, vol. 26, pp. 403–408, 1997.
N. Vieille. Equilibrium in 2-person stochastic games I: A reduction. Technical Report 9745, CEREMADE, Université Paris 9 Dauphine, France, 1997.
N. Vieille. Equilibrium in 2-person stochastic games II: The case of recursive games. Technical Report 9745, CEREMADE, Université Paris 9 Dauphine, France, 1997.
N. Vieille. Large deviations and stochastic games. Technical Report 9746, CEREMADE, Université Paris 9—Dauphine, France, 1997.
E. Solan and N. Vieille. Correlated equilibrium in stochastic games. Technical Report 9848, CEREMADE, Université Paris 9—Dauphine, France, 1998.
J.-F. Mertens and J. Neyman. Stochastic games, Internat. J. Game Theory, vol. 10, pp. 53–66, 1981.
A. Maitra and W. D. Sudderth. Borel stochastic games with limsup payoff, Ann. Probab., vol. 21, pp. 861–885, 1993.
A. Maitra and W. D. Sudderth. Discrete Gambling and Stochastic Games, Springer-Verlag, New York, 1996.
A. S. Nowak. Non-cooperative non-stationary stochastic games. OPSEARCH, vol. 21, pp. 199–208, 1984.
U. Rieder. Non-cooperative dynamic games with general utility functions; In Stochastic Games and Related Topics, pp. 161–174, Kluwer Academic Publishers, Dordrecht, The Netherlands, 1991.
E. Altman, A. Hordijk, and F. M. Spieksma. Contraction conditions for average and cv-discount optimality in countable state Markov games with unbounded rewards, Math. Oper. Res., vol. 22, pp. 588–618, 1997.
L. I. Sennott. Nonzero-sum stochastic games with unbounded costs: Discounted and average cost cases, Zeitschrift für Operations Research, vol. 40, pp. 145–162, 1994.
F. M. Spieksma and O. Passchier. Almost sure and expected optimality in countable Markov games with unbounded rewards. Technical Report, Department of Mathematics and Computer Science, Leiden University, The Netherlands, 1995.
O. Passchier. The Theory of Markov Games and Queueing Control. PhD thesis, Leiden University, Leiden, The Netherlands, 1996.
A. S. Nowak. Sensitive equilibria for ergodic stochastic games with countable state spaces. Mimeo, Institute of Mathematics, Wroclaw University of Technology, 1998.
E. Altman. Nonzero-sum stochastic games in admission, service and routing control in queueing networksQUESTA, vol. 23, pp. 259–279 1996.
L. O. Curtat. Markov equilibria of stochastic games with complementarities, Games and Economic Behavior, vol. 17, pp. 177–199, 1996.
H.-U. Küenle. Stochastic games with complete information and average cost criterion; In Annals of Dynamic Games, (J. A. Filar and V. A. Gaitsgory, eds.), Birkhäuser, 1999.
A. S. Nowak. Existence of equilibrium stationary strategies in discounted noncooperative stochastic games with uncountable state space, J. Optim. Theory Appl., vol. 45, pp. 591–602, 1985.
A. S. Nowak and E. Altman. e-Nash equilibria for stochastic games with uncountable state spaces and unbounded costs. Mimeo, Institute of Mathematics, Wroclaw University of Technology, 1998.
S. P. Meyn and R. L. Tweedie. Markov Chains and Stochastic Stability, Springer-Verlag, New York, 1993.
U. Rieder. Average optimality in Markov games with general state space, Proceedings of the 3rd Int. Conference on Approximation and Optimization, Puebla, Mexico, 1995.
S.P. Meyn and R.L. Tweedie. Computable bounds for geometric convergence rates of Markov chains, Ann. Appl. Probab., vol. 4, pp. 981–1011, 1994.
A. S. Nowak. Optimal strategies in a class of zero-sum ergodic stochastic games. Mimeo, Institute of Mathematics, Wroclaw University of Technology, 1998.
E. G. Enns and E. Z. Ferenstein. On a multi-person time-sequential game with priorities, Sequential Analysis, vol. 6, pp. 239–256, 1987.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer Science+Business Media New York
About this chapter
Cite this chapter
Nowak, A.S., Szajowski, K. (1999). Nonzero-Sum Stochastic Games. In: Bardi, M., Raghavan, T.E.S., Parthasarathy, T. (eds) Stochastic and Differential Games. Annals of the International Society of Dynamic Games, vol 4. Birkhäuser, Boston, MA. https://doi.org/10.1007/978-1-4612-1592-9_7
Download citation
DOI: https://doi.org/10.1007/978-1-4612-1592-9_7
Publisher Name: Birkhäuser, Boston, MA
Print ISBN: 978-1-4612-7208-3
Online ISBN: 978-1-4612-1592-9
eBook Packages: Springer Book Archive