
3.1 Problem Setting

Classical sampling theory assumes that the sample elements are independent and identically distributed (i.i.d.) random variables. Recently, considerable attention has been paid to dependence in probabilistic structures, for example, dependence between interarrival times of various flows, between service times, etc. Such dependence is usually described by so-called Markov-modulated processes, which are widely used in environmental, medical, industrial, and sociological research. We restrict ourselves to the case where the sample elements are positive random variables; it is convenient to regard them as lifetimes of unreliable elements.

Let us consider sample elements \(\{X_{i},i = 1,\ldots,n\}\) modulated by a finite continuous-time Markov chain (see [6]). For simplicity we say that the elements operate in a so-called random environment. The latter is described by an “external” continuous-time ergodic Markov chain \(J(t),t\geqslant 0\), with a finite state space \(E =\{ 1,2,\ldots,k\}\). Let \(\lambda_{i,j}\) be the transition rate from state i to state j.

Additionally, n identical binary elements are considered. Each element can be in two states: up (1) and down (0). The elements of the system fail one by one, in random order. For a fixed environment state i ∈ E, all n elements have the same failure rate \(\gamma_{i}(t)\) and are stochastically independent. When the external process changes its state from i to j at some random instant t, all elements that are alive at time t continue their life with the new failure rate \(\gamma_{j}(t)\). If on the interval \((t_{0},t)\) the random environment stays in state i ∈ E, then the residual lifetime \(\tau_{r} - t_{0}\) (time in the up state) of the rth element, \(r = 1,2,\ldots,n\), has a cumulative distribution function (CDF) with failure rate \(\gamma_{i}(t)\) at time moment t, and the variables \(\{\tau _{r} - t_{0},r = 1,2,\ldots,n\}\) are independent.

We wish to obtain statistical estimates for the unknown parameters \(\beta ^{(i)} = (\beta _{1,i},\beta _{2,i},\ldots,\beta _{m,i})^{T}\), \(i = 1,\ldots,k\). Note that in the process described above the sample elements \(\{X_{i},i = 1,\ldots,n\}\) are no longer i.i.d., as is assumed in classical sampling theory.

Further we make the following assumptions. Firstly, the transition rates \(\{\lambda_{i,j}\}\) of the modulating Markov chain are known. Secondly, a parametric setting is adopted for the hazard rates \(\gamma_{i}(t)\): each \(\gamma_{i}(t)\) is known up to m parameters \(\beta ^{(i)} = (\beta _{1,i},\beta _{2,i},\ldots,\beta _{m,i})^{T}\), so we write \(\gamma_{i}(t;\beta^{(i)})\). We collect the unknown parameters in the (m × k)-matrix \(\beta = (\beta ^{(1)},\beta ^{(2)},\ldots,\beta ^{(k)})\). Thirdly, the sample elements are recorded in the order of their appearance, so the order statistics \(X_{(1)},X_{(2)},\ldots,X_{(n)}\) are observed. Finally, the states of the random environment J(t) are known only at the time moments \(0,X_{(1)},X_{(2)},\ldots,X_{(n)}\).

The maximum likelihood estimates (see [5, 8, 9]) for the unknown parameters β are derived. Results of a simulation study illustrate the elaborated technique. The present paper continues our previous investigations [1, 2].

3.2 Transition Probabilities

In this section we cite a result from the paper of Andronov and Gertsbakh [3]. Define N(t) as the number of elements which are in the up state at time moment t. Obviously \(P\{N(0) = n\} = 1\). We denote

$$\displaystyle\begin{array}{rcl} p_{r,i,j}(t_{0},t)& =& \!P\{N(t)=r,\,J(t)=j\vert N(t_{0})=r,\,J(t_{0}) = i\},\;r \in \{ 1,\ldots,n\},\,i,j \in E, \\ p_{r,i}(t_{0},t)& =& \!\left (p_{r,i,1}(t_{0},t),\ldots,p_{r,i,k}(t_{0},t)\right )^{T}\!\!,\;P_{r}(t_{0},t)=\left (p_{r,1}(t_{0},t),\ldots,p_{r,k}(t_{0},t)\right ), \\ \varGamma (t,\beta )& =& \mathrm{diag}\left (\gamma _{1}(t,\beta ^{(1)}),\ldots,\gamma _{k}(t,\beta ^{(k)})\right ),\quad \varLambda =\mathrm{diag}\left (\sum _{j=1}^{k}\lambda _{1,j},\ldots,\sum _{j=1}^{k}\lambda _{k,j}\right ).\quad \quad {}\end{array}$$
(3.1)

It has been shown that

$$\displaystyle\begin{array}{rcl} \dot{P}_{r}(t_{0},t) = -\left (\varLambda +r\varGamma (t,\beta )\right )P_{r}(t_{0},t) +\lambda ^{T}P_{ r}(t_{0},t),0\leqslant t_{0}\leqslant t.& &{}\end{array}$$
(3.2)

Below we consider a simple time-homogeneous case when \(\gamma _{i}(t;\beta ^{(i)}) =\gamma _{i}(\beta ^{(i)})\) for all i, so that \(\varGamma (t,\beta ) =\varGamma (\beta ) =\mathrm{ diag}(\gamma _{1}(\beta ^{(1)}),\ldots,\gamma _{k}(\beta ^{(k)}))\). Therefore,

$$\displaystyle{\dot{P}_{r}(t_{0},t) = \left (\lambda ^{T} -\left (\varLambda +r\varGamma (\beta )\right )\right )P_{ r}(t_{0},t),\,0\leqslant t_{0}\leqslant t.}$$

In this case the solution can be represented by a matrix exponential (see [4, 7]):

$$\displaystyle\begin{array}{rcl} P_{r}(t_{0},t) =\exp \left ((t - t_{0})\left (\lambda ^{T} - (\varLambda +r\varGamma (\beta ))\right )\right ),\,0\leqslant t_{ 0}\leqslant t.& &{}\end{array}$$
(3.3)
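For orientation, (3.3) is easy to evaluate numerically. The following Python sketch is our own illustration (not part of the derivation); the transition and failure rates are the example values of Sect. 3.4.

```python
# Sketch: evaluating (3.3) numerically. The numerical rates are the
# illustrative values of Sect. 3.4 (an assumption for demonstration only).
import numpy as np
from scipy.linalg import expm

lam = np.array([[0.0, 0.2, 0.3],    # transition rates lambda_{i,j} of J(t)
                [0.1, 0.0, 0.2],
                [0.4, 0.2, 0.0]])
beta = np.array([0.1, 0.2, 0.3])    # failure rates gamma_i = beta_i
Lam = np.diag(lam.sum(axis=1))      # diagonal matrix of exit rates from each state
Gamma = np.diag(beta)

def P_r(r, t0, t):
    """P_r(t0, t) = exp((t - t0)(lambda^T - Lambda - r * Gamma)), cf. (3.3)."""
    return expm((t - t0) * (lam.T - Lam - r * Gamma))

# Entry (j, i) of P_r is p_{r,i,j}: e.g. the probability that all n = 5
# elements survive on (0, 1] and the environment moves from state 1 to state 3.
print(P_r(5, 0.0, 1.0)[2, 0])
```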

3.3 Maximum Likelihood Estimates

In the considered case, besides the initial state j(0) of the process J(t), a sample of size n is given: \((x,j) =\{ (x_{(r)},j(r)),r = 1,\ldots,n\}\), where \(x_{(r)}\) is the rth order statistic of the sample and \(j(r) = J(x_{(r)})\) is the corresponding state of the random environment. Setting \(x_{(0)} = 0\), we write the log-likelihood function as

$$\displaystyle\begin{array}{rcl} \mathrm{ll}(\beta;(x,j))& =& \sum _{r=0}^{n-1}[\ln p_{ n-r,j(r),j(r+1)}(x_{(r)},x_{(r+1)};\beta ) \\ & & +\ln (n - r) +\ln \gamma _{j(r+1)}(x_{(r+1)};\beta ^{(j(r+1))})].{}\end{array}$$
(3.4)
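Restricting to the time-homogeneous case (3.3) and anticipating the scalar parametrization \(\gamma _{i}(\beta ^{(i)}) =\beta _{i}\) used below, the log-likelihood (3.4) can be evaluated with a few lines of Python; the sketch below is our own illustration, with the sample taken from the example of Sect. 3.4.

```python
# Sketch: the log-likelihood (3.4) for the time-homogeneous case, with
# gamma_i(beta^{(i)}) = beta_i. States are numbered 1..k, x[0] = 0.
import numpy as np
from scipy.linalg import expm

def log_likelihood(beta, x, j, lam):
    n = len(x) - 1
    Lam = np.diag(lam.sum(axis=1))
    ll = 0.0
    for r in range(n):
        A = lam.T - Lam - (n - r) * np.diag(beta)
        p = expm((x[r + 1] - x[r]) * A)[j[r + 1] - 1, j[r] - 1]  # p_{n-r,j(r),j(r+1)}
        ll += np.log(p) + np.log(n - r) + np.log(beta[j[r + 1] - 1])
    return ll

lam = np.array([[0.0, 0.2, 0.3], [0.1, 0.0, 0.2], [0.4, 0.2, 0.0]])
x = np.array([0.0, 0.624, 1.502, 2.009, 8.711, 9.429])   # order statistics x_(r)
j = np.array([1, 1, 2, 1, 3, 3])                         # observed states j(r)
print(log_likelihood(np.array([0.1, 0.2, 0.3]), x, j, lam))
```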

Considering gradients with respect to the column vectors \(\beta^{(v)}\), \(v = 1,\ldots,k\), we get the maximum likelihood equations

$$\displaystyle\begin{array}{rcl} \frac{\partial } {\partial \beta ^{(v)}}\mathrm{ll}(\beta;(x,j))& =& \sum _{r=0}^{n-1} \frac{1} {p_{n-r,j(r),j(r+1)}(x_{(r)},x_{(r+1)};\beta )} \\ & & \times \frac{\partial } {\partial \beta ^{(v)}}p_{n-r,j(r),j(r+1)}(x_{(r)},x_{(r+1)};\beta ) \\ & & +\sum _{r=0}^{n-1} \frac{1} {\gamma _{j(r+1)}(x_{(r+1)};\beta ^{(j(r+1))})} \\ & & \times \frac{\partial } {\partial \beta ^{(v)}}\gamma _{j(r+1)}(x_{(r+1)};\beta ^{(j(r+1))}) = 0,\;v = 1,\ldots,k.{}\end{array}$$
(3.5)

Further, we consider a time-homogeneous case when each failure rate \(\gamma_{i}(\beta^{(i)})\) depends on one unknown scalar parameter \(\beta_{i}\) only, so \(\gamma _{i}(\beta ^{(i)}) =\beta _{i}\), \(i = 1,\ldots,k\). We will write \(p_{m,i,j}(t - t_{0}) = p_{m,i,j}(t_{0},t)\). Then the likelihood equations (3.5) take the following form:

$$\displaystyle\begin{array}{rcl} \frac{\partial } {\partial \beta ^{(v)}}\mathrm{ll}(\beta;(x,j))& =& \sum _{r=0}^{n-1} \frac{1} {p_{n-r,j(r),j(r+1)}(x_{(r+1)} - x_{(r)};\beta )} \\ & & \times \frac{\partial } {\partial \beta ^{(v)}}p_{n-r,j(r),j(r+1)}(x_{(r+1)} - x_{(r)};\beta ) \\ & & +\frac{1} {\beta _{v}}\sum _{r=0}^{n-1}\delta _{ v,j(r+1)} = 0, {}\end{array}$$
(3.6)

where \(\delta _{v,j(r+1)}\) is the Kronecker symbol: \(\delta _{v,j(r+1)} = 1\) if \(v = j(r + 1)\), and \(\delta _{v,j(r+1)} = 0\) otherwise.

Now we must obtain an expression for the derivative \(\frac{\partial } {\partial \beta ^{(v)}} p_{n-r,j(r),j(r+1)}(x_{(r+1)} - x_{(r)};\beta )\). For that we use an expression for the derivative of a matrix exponential (see the Lemma in the Appendix). Let \(D_{v}\) be the square matrix whose only non-zero element equals 1 and occupies the vth position of the main diagonal. Then, for the homogeneous case when \(\varGamma (\beta ) =\mathrm{ diag}(\gamma _{1}(\beta ^{(1)}),\ldots,\gamma _{k}(\beta ^{(k)})) =\varGamma =\mathrm{ diag}(\beta _{1},\ldots,\beta _{k})\), according to (3.3), we have for \(v = 1,\ldots,k\):

$$\displaystyle\begin{array}{rcl} \frac{\partial } {\partial \beta _{v}}P_{r}(t_{0},t)& =& \frac{\partial } {\partial \beta _{v}}\exp \{(t - t_{0})(\lambda ^{T} -\varLambda -r\varGamma )\} =\sum _{ i=1}^{\infty }\frac{1} {i!}(t - t_{0})^{i} {}\\ & & \sum _{j=0}^{i-1}(\lambda ^{T} -\varLambda -r\varGamma )^{j}\left (-rD_{ v} \frac{\partial } {\partial \beta _{v}}\beta _{v}\right )(\lambda ^{T} -\varLambda -r\varGamma )^{i-1-j} = {}\\ & & -r\sum _{i=1}^{\infty }\frac{1} {i!}(t - t_{0})^{i}\sum _{ j=0}^{i-1}(\lambda ^{T} -\varLambda -r\varGamma )^{j}D_{ v}(\lambda ^{T} -\varLambda -r\varGamma )^{i-1-j}. {}\\ \end{array}$$

Therefore

$$\displaystyle\begin{array}{rcl} \frac{\partial } {\partial \beta _{v}}P_{r}(t_{0},t)& =& -r\sum _{i=1}^{\infty }\frac{1} {i!}(t - t_{0})^{i}\sum _{ j=0}^{i-1}\left ((\lambda ^{T} -\varLambda -r\varGamma )^{j}\right )^{\langle v\rangle } \\ & & \left ((\lambda ^{T} -\varLambda -r\varGamma )^{i-1-j}\right )_{\langle v\rangle }, {}\end{array}$$
(3.7)

where \(M^{\langle v\rangle }\) and \(M_{\langle v\rangle }\) denote the vth column and the vth row of the matrix M, respectively.
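In the simulation study below, (3.7) is evaluated by truncating the series after K terms. A possible sketch of this computation (our own illustration; the default K = 20 mirrors the experiment design of Sect. 3.4) is the following.

```python
# Sketch: the truncated series (3.7) for the derivative of the matrix
# exponential with respect to beta_v (states v = 1..k, dt = t - t0).
import numpy as np

def dP_dbeta(r, v, dt, lam, beta, K=20):
    k = len(beta)
    A = lam.T - np.diag(lam.sum(axis=1)) - r * np.diag(beta)
    powers = [np.eye(k)]
    for _ in range(K):
        powers.append(powers[-1] @ A)          # A^0, A^1, ..., A^K
    D = np.zeros((k, k))
    factorial = 1.0
    for i in range(1, K + 1):
        factorial *= i
        for m in range(i):
            # outer product of the v-th column of A^m and the v-th row of A^(i-1-m)
            D += dt**i / factorial * np.outer(powers[m][:, v - 1],
                                              powers[i - 1 - m][v - 1, :])
    return -r * D
```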

Now we can use a numerical method for the solution of the likelihood equations (3.6). Note that the parameter \(\beta_{v}\) can be non-trivially estimated only if state v has been registered, i.e., j(r) = v for some \(r = 0,1,\ldots,n\).
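For example, the left-hand side of (3.6) can be assembled from the transition probabilities (3.3) and the derivatives (3.7), reusing dP_dbeta from the previous sketch, and then passed to any root-finding or gradient routine:

```python
# Sketch: the score vector, i.e. the left-hand side of (3.6), for one sample.
import numpy as np
from scipy.linalg import expm

def score(beta, x, j, lam, K=20):
    n, k = len(x) - 1, len(beta)
    Lam = np.diag(lam.sum(axis=1))
    g = np.zeros(k)
    for r in range(n):
        dt = x[r + 1] - x[r]
        A = lam.T - Lam - (n - r) * np.diag(beta)
        p = expm(dt * A)[j[r + 1] - 1, j[r] - 1]          # p_{n-r,j(r),j(r+1)}
        for v in range(1, k + 1):
            dp = dP_dbeta(n - r, v, dt, lam, beta, K)[j[r + 1] - 1, j[r] - 1]
            g[v - 1] += dp / p
        g[j[r + 1] - 1] += 1.0 / beta[j[r + 1] - 1]       # Kronecker-delta term
    return g
```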

3.4 Simulation Study

Below we present the results of a simulation study performed to analyze the efficiency of the described estimation procedure. As initial data, the data from the paper [3] have been used; let us describe them. The random environment has three states \((k = 3,\,E =\{ 1,2,3\})\). The transition intensities \(\{\lambda_{i,j}\}\) from state i to state j (i, j = 1, 2, 3) are given by the matrix

$$\displaystyle\begin{array}{rcl} \lambda = (\lambda _{i,j}) = \left (\begin{array}{*{10}c} 0 &0.2&0.3\\ 0.1 & 0 &0.2 \\ 0.4&0.2& 0 \end{array} \right ).& &{}\end{array}$$
(3.8)

Let the number of considered elements be n = 5. For environment state i ∈ E, all elements have the constant failure rate \(\gamma_{i}(t) =\beta_{i}\) and fail independently. Therefore, while the environment stays in state i, the remaining time until a given element fails has the exponential distribution with parameter \(\beta_{i}\). These parameters must be estimated. For that purpose a sample is given. It contains a sequence of n + 1 pairs: \((x,j) =\{ (x_{(r)},j(r)),\,r = 0,\ldots,n\}\), where \(x_{(r)}\) is the rth order statistic of the n-sample and \(j(r) = J(x_{(r)})\) is the environment state at the instant \(x_{(r)}\). The initial pair \((x_{(0)},j(0))\) equals (0, 1).

All the mentioned sampling data are given and used in the estimation procedure. The samples themselves are simulated for the following parameter values: \(\beta = (\beta _{1}\;\beta _{2}\;\beta _{3})^{T} = (0.1\;0.2\;0.3)^{T}\). It is convenient to present these data as a 3 × (n + 1) matrix. An example of such a matrix for n = 5 is the following:

$$\displaystyle\begin{array}{rcl} \mathit{Sample} = \left (\begin{array}{*{10}c} r &0& 1 & 2 & 3 & 4 & 5\\ x_{ (r)} & 0&0.624&1.502&2.009&8.711&9.429 \\ J(x_{(r)})&1& 1 & 2 & 1 & 3 & 3 \end{array} \right ).& & {}\\ \end{array}$$
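One possible way to generate such a sample (our own reconstruction of the data-generation step, not code from [3]) is by competing exponential clocks for the environment jumps and the element failures:

```python
# Sketch: simulating one sample (x, j) for n elements in the Markov-modulated
# environment, using competing exponential clocks.
import numpy as np

rng = np.random.default_rng(1)
lam = np.array([[0.0, 0.2, 0.3], [0.1, 0.0, 0.2], [0.4, 0.2, 0.0]])

def simulate_sample(n, lam, beta, j0=1):
    t, state, alive = 0.0, j0, n
    x, j = [0.0], [j0]
    while alive > 0:
        exit_rate = lam[state - 1].sum()        # rate of leaving the current state
        fail_rate = alive * beta[state - 1]     # total failure rate of alive elements
        t += rng.exponential(1.0 / (exit_rate + fail_rate))
        if rng.random() < fail_rate / (exit_rate + fail_rate):
            alive -= 1                          # an element fails: record (x_(r), J(x_(r)))
            x.append(t)
            j.append(state)
        else:                                   # the environment jumps to a new state
            state = 1 + rng.choice(len(beta), p=lam[state - 1] / exit_rate)
    return np.array(x), np.array(j)

x, j = simulate_sample(5, lam, np.array([0.1, 0.2, 0.3]))
print(np.vstack([np.arange(len(x)), np.round(x, 3), j]))   # the 3 x (n + 1) matrix above
```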

In the simulation process, samples are generated one by one; different samples are independent. Each sample corresponds to a prescribed initial state of the environment: the sample with number 3i + j corresponds to the initial state \(J(0) = j;\,j = 1,2,3;\,i = 0,1,\ldots \). Further, q such triples of samples (with the initial states j = 1, 2, 3) form a block containing 3q samples. A maximum likelihood estimate (MLE) \(\tilde{\beta }= (\tilde{\beta _{1}},\,\tilde{\beta _{2}},\,\tilde{\beta _{3}})^{T}\) is calculated for each block.

In broad outline, the procedure is as follows. For each sample, the evolution of the environment J(t),  t > 0, and the instants of element failures \(x_{(r)}\), \(r = 1,\ldots,n\), are simulated. Then, for the sample, the logarithm of the likelihood function (3.4) and its gradient (3.5) or (3.6) are computed. These expressions are used for the MLE calculation. As the optimization method, the gradient method has been used.

The gradient method is specified by the following parameters: n, the sample size (initial number of system elements); \(b_{0}\), the initial value of the parameter estimate; d, the step of moving along the gradient; \(\varepsilon\), the maximum absolute difference between successive values of the parameter estimate β at which the calculation is stopped; L, the limit on the number of gradient recalculations when moving from an initial point; K, the number of terms retained in the expansion of the matrix exponential (3.7); 3q, the number of samples in a block.

A set of numerical values of these parameters \((n,b_{0},d,\varepsilon,L,K,q)\) is called an experiment design. Below, the results of the simulation study are presented.
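A minimal sketch of such a gradient step, with the parameters \(b_{0},d,\varepsilon,L\) introduced above, is the following; for brevity the gradient of the block log-likelihood is computed numerically instead of via (3.6), and the fixed-length step along the gradient direction is only one possible reading of the parameter d (both are our simplifications).

```python
# Sketch: gradient method for the block MLE. It reuses log_likelihood()
# defined after (3.4); samples is a list of (x, j) pairs forming one block.
import numpy as np

def mle_gradient(samples, lam, b0, d=0.015, eps=0.01, L=20):
    beta = np.array(b0, dtype=float)
    for _ in range(L):
        grad = np.zeros_like(beta)
        for v in range(len(beta)):                    # numerical gradient of the
            h = np.zeros_like(beta); h[v] = 1e-6      # block log-likelihood
            grad[v] = sum(log_likelihood(beta + h, x, j, lam)
                          - log_likelihood(beta - h, x, j, lam)
                          for x, j in samples) / 2e-6
        step = d * grad / max(np.linalg.norm(grad), 1e-12)
        beta_new = np.maximum(beta + step, 1e-6)      # keep the rates positive
        if np.max(np.abs(beta_new - beta)) < eps:     # stopping rule with eps
            return beta_new
        beta = beta_new
    return beta

# Example: one block of 3q samples with initial states 1, 2, 3 repeated q times.
# samples = [simulate_sample(5, lam, np.array([0.1, 0.2, 0.3]), j0)
#            for _ in range(5) for j0 in (1, 2, 3)]
# print(mle_gradient(samples, lam, b0=np.array([0.08, 0.22, 0.328])))
```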

In Table 3.1 the corresponding results are presented for the experiment design n = 5, \(b_{0} = (0.08\;0.22\;0.328)^{T}\), d = 0.015, \(\varepsilon = 0.01\), L = 20, K = 20, r = 5, q = 5 and various values of the total block number N. The first column contains the initial value of the estimate \(b_{0} = (0.08\;0.22\;0.328)^{T}\); the following columns contain the estimate values obtained by averaging over N blocks. For a large number of blocks, the coefficients d and \(\varepsilon\) have been changed: for N = 15, 17, 19, 21 these values equal 0.002, and for N = 21, additionally, L = 40.

Table 3.1 Convergence of the estimates for the initial value \(b_{0} = (0.08\;0.22\;0.328)^{T}\)

An analysis of Table 3.1 shows that convergence to the true values (0.1 0.2 0.3) takes place, but very slowly.

In conclusion, we would like to remark that the considered approach allows improving probabilistic predictions of the functioning of various complex technical and economic systems.