Abstract
We consider a class of time-varying stochastic control systems, with Borel state and action spaces, and possibly unbounded costs. The processes evolve according to a discrete-time equation x n + 1=G n (x n , a n , ξn), n=0, 1, … , where the ξn are i.i.d. ℜk-valued random vectors whose common density is unknown, and the G n are given functions converging, in a restricted way, to some function G ∞ as n→∞. Assuming observability of ξn, we construct an adaptive policy which is asymptotically discounted cost optimal for the limiting control system x n+1=G ∞ (x n , a n , ξn).
Article PDF
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Hilgert, N., Minjárez-Sosa, J. Adaptive policies for time-varying stochastic systems under discounted criterion. Mathematical Methods of OR 54, 491–505 (2001). https://doi.org/10.1007/s001860100170
Published:
Issue Date:
DOI: https://doi.org/10.1007/s001860100170