Two Person Zero-Sum Semi-Markov Games with Unknown Holding Times Distribution on One Side: A Discounted Payoff Criterion

Minjárez-Sosa, J. Adolfo; Luque-Vásquez, Fernando

doi:10.1007/s00245-007-9016-7

Published: 05 September 2007

Applied Mathematics and Optimization, Volume 57, pages 289–305 (2008)

Abstract

This paper deals with two-person zero-sum semi-Markov games with a possibly unbounded payoff function, under a discounted payoff criterion. Assuming that the distribution H of the holding times is unknown to one of the players, we combine suitable methods of statistical estimation of H with control procedures to construct an asymptotically discount optimal pair of strategies.
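
As a rough illustration of the estimation side of this idea, the sketch below estimates a discount quantity from observed holding times. It is not the authors' estimator: it assumes a single holding-time distribution H that does not depend on state or actions, a fixed discount rate `alpha`, and a plain empirical average in place of whatever estimation method the paper uses. The function name `empirical_discount_factor` and the exponential test distribution are hypothetical choices made only for this example.

```python
import math
import random


def empirical_discount_factor(holding_times, alpha):
    """Empirical estimate of E[exp(-alpha * T)], where T has distribution H.

    Assumption for this sketch: the observed holding times are i.i.d.
    draws from one unknown distribution H, independent of state and actions.
    """
    if not holding_times:
        raise ValueError("need at least one observed holding time")
    total = sum(math.exp(-alpha * t) for t in holding_times)
    return total / len(holding_times)


if __name__ == "__main__":
    # Hypothetical ground truth: exponential holding times with rate 2.0.
    # For this choice, E[exp(-alpha * T)] = rate / (rate + alpha).
    rate, alpha = 2.0, 0.5
    rng = random.Random(0)
    samples = [rng.expovariate(rate) for _ in range(20000)]

    # The estimate should approach the true value as more holding
    # times are observed.
    for n in (10, 100, 1000, 20000):
        estimate = empirical_discount_factor(samples[:n], alpha)
        print(n, round(estimate, 4), "true:", rate / (rate + alpha))
```

In an adaptive scheme of the kind the abstract describes, the player who does not know H would presumably recompute such an estimate as new holding times are observed and choose actions against the estimated model; the details of that control step are not given here and are not reproduced in this sketch.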

Author information

Authors and Affiliations

Departamento de Matemáticas, Universidad de Sonora, Rosales s/n, Centro, 83000, Hermosillo, Sonora, Mexico
J. Adolfo Minjárez-Sosa & Fernando Luque-Vásquez

Corresponding author

Correspondence to J. Adolfo Minjárez-Sosa.

Additional information

Work supported partially by Consejo Nacional de Ciencia y Tecnología (CONACyT) under Grant 46633-F.

About this article

Cite this article

Minjárez-Sosa, J.A., Luque-Vásquez, F. Two Person Zero-Sum Semi-Markov Games with Unknown Holding Times Distribution on One Side: A Discounted Payoff Criterion. Appl Math Optim 57, 289–305 (2008). https://doi.org/10.1007/s00245-007-9016-7

Issue Date: June 2008
