Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Cavazos-Cadena, Rolando

doi:10.1007/s00186-009-0285-6

Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Original Article
Published: 18 February 2009

Volume 71, pages 47–84, (2010)
Cite this article

Download PDF

Access provided by CONRICYT-eBooks

Mathematical Methods of Operations Research Aims and scope Submit manuscript

Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Download PDF

Rolando Cavazos-Cadena¹

173 Accesses
14 Citations
Explore all metrics

Abstract

This note concerns controlled Markov chains on a denumerable sate space. The performance of a control policy is measured by the risk-sensitive average criterion, and it is assumed that (a) the simultaneous Doeblin condition holds, and (b) the system is communicating under the action of each stationary policy. If the cost function is bounded below, it is established that the optimal average cost is characterized by an optimality inequality, and it is to shown that, even for bounded costs, such an inequality may be strict at every state. Also, for a nonnegative cost function with compact support, the existence an uniqueness of bounded solutions of the optimality equation is proved, and an example is provided to show that such a conclusion generally fails when the cost is negative at some state.

Article PDF

Contractive approximations in average Markov decision chains driven by a risk-seeking controller

Article 05 July 2023

Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion

Article 11 March 2016

Contractive Approximations in Risk-Sensitive Average Semi-Markov Decision Chains on a Finite State Space

Article 22 November 2021

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Arapostathis A, Borkar VK, Fernández-Gaucherand E, Gosh MK, Marcus SI (1993) Discrete-time controlled Markov processes with average cost criteria: a survey. SIAM J Control Optim 31: 282–334
Article MATH MathSciNet Google Scholar
Borkar VS, Meyn SP (2002) Risk-sensitive optimal control for Markov decison process with monotone cost. Math Oper Res 27: 192–209
Article MATH MathSciNet Google Scholar
Cavazos-Cadena R (1988) Necessary and sufficient conditions for a bounded solution to the optimality equation in average reward markov decision chains. Syst Control Lett 10: 71–78
Article MATH MathSciNet Google Scholar
Cavazos-Cadena R, Fernández-Gaucherand E (1999) Controlled Markov chains with risk-sensitive criteria: average cost, optimality equations and optimal solutions. Math Meth Oper Res 43: 121–139
Google Scholar
Cavazos-Cadena R, Fernández-Gaucherand E (2002) Risk-sensitive control in communicating average Markov decision chains. In: Dror M, L’Ecuyer P, Szidarovsky F (eds) Modelling uncertainty: an examination of stochastic theory, methods and applications. Kluwer, Boston, pp 525–544
Google Scholar
Cavazos-Cadena R, Hernández-Hernández D (2004) A characterization of exponential functionals in finite Markov chains. Math Methods Oper Res 60: 399–414
Article MATH MathSciNet Google Scholar
Di Masi GB, Stettner L (2000) Infinite horizon risk sensitive control of discrete time Markov processes with small risk. Syst Control Lett 40: 305–321
Article MathSciNet Google Scholar
Di Masi GB, Stettner L (2007) Infinite horizon risk sensitive control of discrete time Markov processes under minorization properrty. SIAM J Control Optim 46: 231–252
Article MATH MathSciNet Google Scholar
Fleming WH, McEneany WM (1995) Risk-sensitive control on an infinite horizon. SIAM J Control Optim 33: 1881–1915
Article MATH MathSciNet Google Scholar
Hernández-Hernández D, Marcus SI (1996) Risk-sensitive control of Markov processes in countable state space. Syst Control Lett 29: 147–155
Article MATH Google Scholar
Hernández-Hernández D, Marcus SI (1999) Existence of risk-sensitive optimal stationary policies for controlled Markov processes. Appl Math Optim 40: 273–285
Article MATH MathSciNet Google Scholar
Hernández-Lerma O (1988) Adaptive Markov control processes. Springer, New York
Google Scholar
Howard AR, Matheson JE (1972) Risk-sensitive Markov decision processes. Manage Sci 18: 356–369
Article MATH MathSciNet Google Scholar
Jacobson DH (1973) Optimal stochastic linear systems with exponential performance criteria and their relation to stochastic differential games. IEEE Trans Automat Contr 18: 124–131
Article MATH Google Scholar
Jaquette SC (1973) Markov decison processes with a new optimality criterion: discrete time. Ann Stat 1: 496–505
Article MATH MathSciNet Google Scholar
Jaquette SC (1976) A utility criterion for Markov decision processes. Manage Sci 23: 43–49
Article MATH MathSciNet Google Scholar
Jaśkiewicz A (2007) Average optimality for risk sensitive control with general state space. Ann Appl Probab 17: 654–675
Article MATH MathSciNet Google Scholar
Loève M (1980) Probability theory I. Springer, New York
Google Scholar
Puterman ML (1994) Markov decision processes. Wiley, New York
Book MATH Google Scholar
Seneta E (1980) Nonnegative matrices. Springer, New York
Google Scholar
Sennot L (1986) A new condition for the existence of optimum stationary policies in average cost Maarkov decision processes. Oper Res Lett 5: 17–23
Article MathSciNet Google Scholar
Sennot L (1995) Another set of conditions for average optimality in Maarkov control processes. Syst Control Lett 24: 147–151
Article Google Scholar
Thomas LC (1980) Conectedness conditions for denumerable state Markov decision processes. In: Hartley R, Thomas LC, White DJ (eds) Recent advances in Markov decision processes. Academic Press, New York
Google Scholar

Download references

Author information

Authors and Affiliations

Departamento de Estadística y Cálculo, Universidad Autónoma Agraria Antonio Narro, Buenavista, 25315, Saltillo, COAH, Mexico
Rolando Cavazos-Cadena

Authors

Rolando Cavazos-Cadena
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Rolando Cavazos-Cadena.

Additional information

Dedicated to Professor Onésimo Hernández-Lerma, on the occasion of his 60th birthday.

This work was supported by the PSF Organisation under Grant No. 08-05(450), and in part by CONACYT under Grant 25357.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Cavazos-Cadena, R. Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains. Math Meth Oper Res 71, 47–84 (2010). https://doi.org/10.1007/s00186-009-0285-6

Download citation

Received: 24 July 2008
Accepted: 26 January 2009
Published: 18 February 2009
Issue Date: February 2010
DOI: https://doi.org/10.1007/s00186-009-0285-6

Keywords

Mathematics Subject Classification (2000)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Abstract

Article PDF

Similar content being viewed by others

Contractive approximations in average Markov decision chains driven by a risk-seeking controller

Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion

Contractive Approximations in Risk-Sensitive Average Semi-Markov Decision Chains on a Finite State Space

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification (2000)

Navigation

Optimality equations and inequalities in a class of risk-sensitive average cost Markov decision chains

Abstract

Article PDF

Similar content being viewed by others

Contractive approximations in average Markov decision chains driven by a risk-seeking controller

Controlled Semi-Markov Chains with Risk-Sensitive Average Cost Criterion

Contractive Approximations in Risk-Sensitive Average Semi-Markov Decision Chains on a Finite State Space

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification (2000)

Search

Navigation