Abstract
A continuous-time homogeneous irreducible Markov chain {X(t)}, t ∈ [0, ∞), taking values in N = {1, ..., k}, k < ∞, is considered. The matrix λ = (λ<sub>ij</sub>) of transition intensities λ<sub>ij</sub> from state i to state j is known. A unit of sojourn time in state i yields reward β<sub>i</sub>, so the total reward accumulated during time t is \(Y(t) = \int_0^t \beta_{X(s)}\,ds\). The reward rates {β<sub>i</sub>} are unknown and must be estimated. For that purpose, the following statistical data on r observations are available: (1) t, the observation time; (2) i, the initial state X(0); (3) j, the final state X(t); and (4) y, the acquired reward Y(t). Two estimation methods are used: the weighted least-squares method and the saddle-point method for the Laplace transform of the reward. A simulation study illustrates the suggested approaches.
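The reward model in the abstract can be illustrated with a short simulation sketch. The code below is not from the paper; it is a minimal, hypothetical illustration assuming an intensity matrix `Q` (off-diagonal entries λ<sub>ij</sub>) and reward rates `beta`. It simulates the chain by drawing exponential sojourn times and accumulates Y(t) = ∫₀ᵗ β<sub>X(s)</sub> ds, returning exactly the observation tuple the paper's estimators consume: the final state X(t) and the acquired reward Y(t).

```python
import random

def simulate_reward(Q, beta, x0, t_max, rng):
    """Simulate a finite CTMC with transition intensities Q[i][j] = lambda_ij
    (i != j) and accumulate the reward Y(t) = integral of beta_{X(s)} ds.
    Returns (final_state, total_reward) at time t_max."""
    t, x, y = 0.0, x0, 0.0
    k = len(Q)
    while True:
        total = sum(Q[x][j] for j in range(k) if j != x)
        # Sojourn time in state x is exponential with rate sum_j lambda_xj.
        dt = rng.expovariate(total)
        if t + dt >= t_max:
            # Observation window ends during this sojourn; truncate the reward.
            y += beta[x] * (t_max - t)
            return x, y
        y += beta[x] * dt
        t += dt
        # Jump to state j with probability lambda_xj / total.
        u = rng.random() * total
        acc = 0.0
        for j in range(k):
            if j == x:
                continue
            acc += Q[x][j]
            if u <= acc:
                x = j
                break
```

Running this r times (with the initial state, observation time, final state, and reward recorded each time) reproduces the kind of data set {t, i, j, y} on which the two estimation methods operate.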
Andronov, A. On a reward rate estimation for the finite irreducible continuous-time Markov chain. J Stat Theory Pract 11, 407–417 (2017). https://doi.org/10.1080/15598608.2017.1282895