Abstract.
This paper is the second part of our study of Blackwell optimal policies in Markov decision chains with a Borel state space and unbounded rewards. We prove that a stationary policy is Blackwell optimal in the class of all history-dependent policies if it is Blackwell optimal in the class of stationary policies.
We also develop recurrence and drift conditions which ensure ergodicity and integrability assumptions made in the previous paper, and which are more suitable for applications. As an example we study a cash-balance model.
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Author information
Authors and Affiliations
Additional information
Manuscript received: October 1998
Rights and permissions
About this article
Cite this article
Hordijk, A., Yushkevich, A. Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards. Mathematical Methods of OR 50, 421–448 (1999). https://doi.org/10.1007/s001860050079
Issue Date:
DOI: https://doi.org/10.1007/s001860050079