Abstract
This paper is concerned with the optimal linear quadratic Gaussian (LQG) control problem for discrete time-varying system with multiplicative noise and multiple state delays. The main contributions are twofolds. First, in virtue of Pontryagin’s maximum principle, we solve the forward and backward stochastic difference equations (FBSDEs) and show the relationship between the state and the costate. Second, based on the solution to the FBSDEs and the coupled difference Riccati equations, the necessary and sufficient condition for the optimal problem is obtained. Meanwhile, an explicit analytical expression is given for the optimal LQG controller. Numerical examples are shown to illustrate the effectiveness of the proposed algorithm.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
The control problem for time-delay systems have received extensively attention since 1950s because of its wide applications in networked control system, intelligent cruise control system, finance, cable-driven manipulators and so on [1,2,3,4,5,6]. There have been a lot of researches on the optimal control and stabilization problem of time-delay systems in recent years, and many results have been surveyed, which concern with single input/state delay or multiple input/state delays [7,8,9,10]. For example, Yue et al. [7] proposed a Lyapunov–Krasovskii functional approach to design the delayed feedback controller of uncertain systems with time-varying input delay, by introducing some relaxation matrices and turning parameters. Lee et al. [9] studied the robust \(\mathrm{H}_{\infty }\) control problem for uncertain linear systems with a state-delay. Based on the obtained delay-dependent bounded real lemma, the delay-dependent condition for the existence of a robust controller was presented. Due to the existence of multiple delays, the optimal controller is related to the past variables, which makes the control problem more challenging.
On the other hand, stochastic uncertainties exist in many control processes, and some results have been shown in [11,12,13,14]. Qi et al. [12] presented the optimal estimation and the optimal output feedback controller of the discrete-time multiplicative noise system with intermittent observations by virtue of coupled Riccati equations. The stabilization condition for this system was developed in the mean square sense. Rami et al. [13] considered the discrete-time stochastic LQ problem subject to state and control-dependent noises. A necessary and sufficient condition for the existence of the optimal control was identified in terms of the solution to the proposed difference Riccati equation. As to meet the actual demand in different areas, the control systems with both stochastic uncertainties and time delay(s) have been thoroughly studied [15,16,17,18,19]. Zhang et al. [15] obtained the optimal linear quadratic regulation (LQR) controller for discrete-time system with input delay and multiplicative noise via the Riccati-ZXL difference equation, while the additive noise was not considered in this reference. In [16], Liang et al. took the state- and control-dependent noise, additive noise and input delay into account, and the optimal controller and the suboptimal linear state estimate feedback controller for the linear quadratic Gaussian (LQG) system were both derived, with only single time delay in the input. Besides, when there are multiplicative noise and multiple delays in the input, Li et al. [19] presented the optimal controller and the optimal cost under the necessary and sufficient condition. However, additive noise was not considered. It is obviously shown that the system models described in the above literatures are all discrete time-invariant and input-delay(s) systems. Moreover, the optimal problem involving simultaneously multiplicative noise, additive noise and multiple state delays are not mentioned. In addition, when the additive noise is related to the multiplicative noise, the analysis and synthesis for the control problem remain challenging.
Different from the existing research systems, the system considered in this paper contains simultaneously multiplicative noise, multiple state delays and additive noise, which is more complex than before. It should be emphasized that the additive noise and the multiplicative noise are dependent, and the coefficients in this paper are time-varying. The LQG control for our paper is much more sophisticated and unsolved. The main contributions of this paper are as follows: (1) The relations between the state and the costate in terms of the discrete time-varying LQG problem is given by lots of inductive calculations, which is also the solution to the forward and backward stochastic difference equations (FBSDEs). (2) If and only if a sequence of matrices are all positive definite, the optimal controller and the associated cost function will be obtained via the coupled difference Riccati equations, and the explicit expression of the unique controller is presented, which is obviously more complicated than LQR controller in [19]. Our approach is based on the stochastic maximum principle, and the key technique is the solution to the FBSDEs.
The rest of this paper is organized as follows. In Sect. 2, the discrete time-varying stochastic LQG control problem is described. In Sect. 3, the key tool to the solution is presented, and the necessary and sufficient condition for the optimal LQG control problem is shown. The solutions to the general LQG problem are derived. Numerical examples are shown in Sect. 4. Conclusions are provided in Sect. 5. Proofs of the Lemma and the Theorem are described in Appendixes.
Notation \({\mathbb {R}}^n\) denotes the n-dimensional real Euclidean space. I presents the unit matrix of appropriate dimension. The superscript \('\) denotes the transpose of the matrix. \(\{\varOmega , {\mathcal {F}},\) \({\mathcal {P}},\{{\mathcal {F}}_k\}_{k\geqslant 0}\}\) denotes a complete probability space on which random variable \(\nu _k\) and \(\mu _k\) are defined such that \(\{{\mathcal {F}}_k\}_{k\geqslant 0}\) is the natural filtration generated by \(\nu _k\) and \(\mu _k\), i.e., \({\mathcal {F}}_k=\sigma \{\nu _0,\ldots ,\nu _k,\mu _0,\ldots ,\) \(\mu _k\}\), augmented by all the \({\mathcal {P}}\)-null sets in \({\mathcal {F}}\). A symmetric \(A>0\,(\geqslant 0)\) means that it is a positive definite (positive semi-definite) matrix. \(\theta _{a,\,b}\) is the usual Kronecker function, i.e., \(\theta _{a,\,b}=0\) if \(a\ne b\), and \(\theta _{a,\,b}=1\) if \(a=b\). \(\mathrm{Tr}(P)\) represents the trace of matrix P.
2 Problem formulation
Consider the discrete time-varying stochastic LQG system with state delays and multiplicative scalar noise:
where \(x_k\in {\mathbb {R}}^n\) is the state, \(u_k\in {\mathbb {R}}^m\) is the input control, the positive integer d is the state delay, \(\nu _k\) is the scalar noise with zero mean and variance \(\gamma\), \(\mu _k\in {\mathbb {R}}^n\) is random variable satisfying \(\mathrm{E}[\mu _k|{\mathcal {F}}_{k-1}]={\bar{\mu }}_k\) and \(\mathrm{E}[\mu _k\mu _k'|{\mathcal {F}}_{k-1}]=Q_{\mu _k}\). The coefficient matrices with compatible dimensions \(\mathcal {C}_i(k),\bar{\mathcal {C}}_i(k),\mathcal {D}(k)\) and \(\bar{\mathcal {D}}(k)\) with \(i=0,\ldots ,d\) are time-varying. \(\nu _k\) and \(\mu _k\) are correlated, satisfying \(\mathrm{E}[\nu _k\mu _k'|{\mathcal {F}}_{k-1}]=\tau\), \(\mathrm{E}[\nu _k\mu _l'|{\mathcal {F}}_{k-1}]=0,\) \(k\ne l\). The initial states \(x_i\) for \(i=-d,\ldots ,0\) are deterministic and known.
Consider the associated cost function for system (1):
where \(Q_k\), \(R_k\) and \({\mathcal {P}}_{N+1}^0\) are positive semi-definite matrices with appropriate dimensions, and N is the horizon length. In view of the fact that \(x_k\) depends on \(\nu _{k-1}, \nu _{k-2}\), \(\ldots , \mu _{k-1}, \mu _{k-2}, \ldots\), and the controller obeys the causality constraint, \(u_k\) must be \(\mathcal {F}_{k-1}\)-measurable (see [13]). Then, the problem to be addressed is stated as follows.
Problem 1
Find the unique \({\mathcal {F}}_{k-1}\)-measurable state feedback controller \(u_k\), \(k=0,\ldots ,N\), for system (1) such that the cost function (2) is minimized.
3 Main results
For simplicity, we make the system (1) to be
where
Following the similar approach in [19], we apply stochastic Pontryagin’s maximum principle [20] to system (3) with the cost function (2) to yield the costate equations:
where \(\zeta _k\) is the costate variable with \(\zeta _k=0\) for \(k>N\).
For further study, we define the following Riccati coupled equations and make the backwards recursion for \(k=N, N-1, \ldots , 0\):
The terminal values of the above matrix sequences \({\mathcal {P}}_k^j\), \(\varOmega _k\) and \(N_k^j\), \(j=0,\ldots ,d\) are given by
It should be emphasized that the recursion will stop unless assuming that \(\varOmega _k\) is invertible. To give the main results of Problem 1, we need to obtain the solution to the FBSDEs (3) and (4)–(6), and then the following lemma is proposed.
Lemma 1
Assuming that \(\varOmega _k\) are positive definite, i.e., \(\varOmega _k>0\), for \(k=0,\ldots ,N\), then the following equation
is the solution to FBSDEs (3) and (4)–(6), where
with the terminal value \(\varPhi _{N+1}=0\), and \({\mathcal {P}}_k^j\), \(N_k^j\), \(\varOmega _k\) satisfy the coupled equations (7)–(10).
Proof
The proof of Lemma 1 is in Appendix A.\(\square\)
Remark 1
It is noted that the system model in this paper is discrete time-varying, and contains simultaneously multiplicative noise, additive noise and multiple state delays. Meanwhile, the multiplicative noise is related with the additive noise. Thus, the problem of optimal LQG control is particularly difficult.
Remark 2
We have defined \({\mathcal {P}}_k^j\), \(N_k^j\) with \(j\in [0,d]\) by the equation (10). As using the notations \({\mathcal {P}}_k^j\), \(N_k^j\) for \(j>d\), we extend the definition \({\mathcal {P}}_k^j=0\), \(N_k^j=0\) for \(j>d\). Besides, the coefficient matrices \(\mathcal {C}_j(k)\), \(\bar{\mathcal {C}}_j(k)\) are set to be 0 for \(j>d\).
Now, we are in the position to present the solution to Problem 1. The results are stated in the following theorem.
Theorem 1
Problem 1has a unique \({\mathcal {F}}_{k-1}\)-measurable \(u_k\) if and only if \(\varOmega _k\), for \(k=0,\ldots ,N\), are positive definite. In this context, the optimal controller \(u_k\) is calculated by
The minimum performance index is as
where
while \({\mathcal {P}}_k^j\), \(N_k^j\), \(\varOmega _k\) satisfy the coupled equations (7)–(10).
Proof
The proof of Theorem 1 is in Appendix B.\(\square\)
Remark 3
Different from the existing work [19], the difficulties caused by the additive noise are mainly as follows. First, this paper considers the optimal LQG control problem with both multiplicative noise and correlated additive noise, which is more challenging than [19]. Second, due to the existence of the additive noise, the key technique to this optimal control problem, i.e., the solution to the FBSDEs (11) is quite more difficult than that of [19]. Besides, the optimal controller \(u_k\) satisfying (13) and the associated optimal cost (14) are more difficult to obtain, and the expression of \(u_k\) and \(J^*_N\) are more complicated than [19].
Remark 4
For a stochastic discrete-time system with no state delays, i.e., \(d=0\) in system (3), it is obviously obtained that the coupled Riccati difference equations:
where
The optimal controller reduces to
where \(\varSigma _k\) as (15). In this context, the solution to the FBSDEs is as
with
Remark 5
From Theorem 1, when the disturbance term \(\mu _k\) and the multiplicative noise \(\nu _k\) are independent, i.e., \(\tau =0\), and when the additive noise is Gaussian white noise, i.e., \({\bar{\mu }}_k=0\), the Riccati difference equations are as (7)–(10), and the matrices can be rewritten as
As the terminal value \(\varPhi _{N+1}=0\), it is obviously obtained that \(\varSigma _k\) and \(\varPhi _k\) in (16), (17) always equal to be zero for \(k=0,\dots ,N\). Then, the optimal controller reduces to
and the solution to the FBSDEs is as
In this case, the associated optimal cost is given by
In view of obtaining the scalar case of optimal LQG control system (3), we derive the results to the general system with multiple delays and multiplicative noise.
Consider the following case of discrete time-varying system:
where \(\mathcal {V}_k=(\nu _k(1) \ldots \nu _k(f))'\) is a f-dimensional white noise defined on a complete probability \(\{\varOmega , {\mathcal {P}}, {\mathcal {F}}\}\). \(\mathcal {V}_k\) satisfies the variance \(\gamma\), i.e.,
Here \({\mathcal {F}}_k\) is the natural filtration generated by \(\mathcal {V}_k\) and \(\mu _k\), i.e., \({\mathcal {F}}_k\) is the \(\sigma\)-algebra generated by \(\{\mathcal {V}_0,\ldots ,\mathcal {V}_k,\mu _0,\) \(\ldots ,\mu _k\}\). Then, the general case of discrete time-varying LQG control problem is stated as follows.
Problem 2
Find the unique \(\mathcal {F}_{k-1}\)-measurable state feedback controller \(u_k\), \(k=0,\dots ,N\), for system (18) such that the cost function (2) is minimized.
To solve Problem 2, we derive the definition as
and the coupled difference Riccati equations (7)–(9) extend to
Based on the above definitions, the solution to Problem 2 is derived in the following theorem.
Theorem 2
Problem 2has a unique \({\mathcal {F}}_{k-1}\)- measurable \(u_k\) if and only if \(\varOmega _k\), for \(k=0,\dots ,N\), are positive definite. In this case, the optimal controller \(u_k\) is given by
and the associated optimal cost function is as
where \({\mathcal {P}}_k^j\), \(N_k^j\), \(\varOmega _k\) satisfy the coupled equations (19)–(21).
Remark 6
It is obviously that the multiplicative noise \(\mathcal {V}_k\) in the general LQG control system (18) is expanded by multiple dimensions of white noises. The existence of multi-dimensional white noise has no essential influence on the optimal control problem, and we can treat it as a whole. Then, the approach of Theorem 1 is also applied to the general situation. Thus, combining the mathematical characteristics of \({\mathcal {V}}_k\), Theorem 2 is derived as the above.
4 Numerical examples
Example 1
Consider the scalar case of time-varying LQG control system (3) in Theorem 1, as the additive noise \(\mu _k\) correlated with \(\nu _k\). Let the associated parameters be as
and the cost function (2) with
By direct calculation , it yields
It is obviously known that \(\varOmega _k\) is positive definite for \(k=0,1\). Thus, from Theorem 1, there exists a unique \(u_k\), which is given by
Second, we shall illustrate that \(u_k^*\) can minimize the cost function (2). Let the controller be arbitrary. For example,
Compare the cost function under \(u_k^*\) and \({\hat{u}}_k\) with different initial values as follows:
Hence, \(u_k^*\) and \(J^*\) are optimal. This demonstrates the correctness of our results.
Example 2
Consider LQG control system (3) with \(x_k\in {\mathbb {R}}^2\), \(u_k\in {\mathbb {R}}^2\), \(d=1\), \(\gamma =1\), \(\tau =[1,1]'\), and the coefficient matrixes are time-invariant with
and the cost function (2) with
Then by direct calculation, the solution to coupled difference Riccati equations is yielded as
Obviously, \(\varOmega _k>0\) for \(k=0,\ldots ,N\), therefore, there exist a optimal controller for Problem 1. In addition, when the initial values are
the optimal controller can be calculated as
Accordingly, the associated cost function is \(J^*=148.8\).
5 Conclusions
In this paper, the discrete time-varying LQG control problem with both multiplicative noise and multiple state delays has been studied. We obtain the solution to the FBSDEs for the discrete time-varying systems. A necessary and sufficient condition for the existence of a unique optimal controller is proposed. The basis of this approach is the stochastic maximum principle and the key is the relationship between the state and costate. In the future works, we expect that the results in this paper shall pave new ways for networked control system with both state delays and packet dropout.
References
Chang, P. H., & Lee, J. W. (1994). A model reference observer for time-delay control and its application to robot trajectory control. IFAC Proceedings Volumes, 27(14), 29–34.
Youcef-Toumi, K., Sasage, Y., Ardini, J., & Huang, S. Y. (1992). The application of time delay control to an intelligent cruise control system. In: American Control Conference, Chicago (pp. 1743–1747).
Anderson, R. J., & Spong, M. W. (1988). Bilateral control of teleoperators with time delay. In: IEEE Conference on Decision & Control, Austin (pp. 167–173).
Wang, Y., Yan, F., Chen, J., Ju, F., & Chen, B. (2019). A new adaptive time-delay control scheme for cable-driven manipulators. IEEE Transactions on Industrial Informatics, 15(6), 3469–3481.
Xu, J., & Zhang, H. (2017). Control for Itô stochastic systems with input delay. IEEE Transactions on Automatic Control, 62(1), 350–365.
Smith, O. J. M. (1959). A controller to overcome dead time. ISA Transactions, 6(2), 28–33.
Yue, D., & Han, Q. L. (2005). Delayed feedback control of uncertain systems with time-varying input delay. Automatica, 41(2), 233–240.
Zhang, H., Xie, L., & Duan, G. (2007). \({\rm H}^\infty\) control of discrete-time systems with multiple input delays. IEEE Transactions on Automatic Control, 52(2), 271–283.
Lee, Y. S., Moon, Y. S., Kwon, W. H., & Park, P. G. (2004). Delay-dependent robust \({\rm H}^\infty\) control for uncertain systems with a state-delay. Automatica, 40(1), 65–72.
Wang, H. P., Shieh, L. S., Tsai, J. S. H., & Zhang, Y. P. (2008). Optimal digital controller and observer design for multiple time-delay transfer function matrices with multiple input-output time delays. International Journal of Systems Science, 39(5), 461–476.
Primbs, J. A., & Sung, C. H. (2009). Stochastic receding horizon control of constrained linear systems with state and control multiplicative noise. IEEE Transactions on Automatic Control, 54(2), 221–230.
Qi, Q., & Zhang, H. (2017). Output feedback control and stabilization for multiplicative noise systems with intermittent observations. IEEE Transactions on Cybernetics, 48(7), 2128–2138.
Rami, M. A., Chen, X., & Zhou, X. Y. (2002). Discrete-time Indefinite LQ control with state and control dependent noises. Journal of Global Optimization, 23(3), 245–265.
Phillis, Y. (1985). Controller design of systems with multiplicative noise. IEEE Transactions on Automatic Control, 30(10), 1017–1019.
Zhang, H., Li, L., Xu, J., & Fu, M. (2015). Linear quadratic regulation and stabilization of discrete-time systems with delay and multiplicative noise. IEEE Transactions on Automatic Control, 60(10), 2599–2613.
Liang, X., Xu, J., & Zhang, H. (2017). Discrete-time LQG control with input delay and multiplicative noise. IEEE Transactions on Aerospace & Electronic Systems, 53(6), 3079–3090.
Liang, X., & Xu, J. (2018). Control for networked control systems with remote and local controllers over unreliable communication channel. Automatica, 98, 86–94.
Liang, X., Xu, J., & Zhang, H. (2020). Optimal control and stabilization for networked control systems with asymmetric information. IEEE Transactions on Control of Network Systems, 7(3), 1355–1365. https://doi.org/10.1109/TCNS.2020.2976296.
Li, L., Zhang, H., & Fu, M. (2017). Linear quadratic regulation for discrete-time systems with multiplicative noise and multiple input delays. Optimal Control Applications and Methods, 38(3), 295–316.
Yong, J., & Zhou, X. Y. (1999). Stochastic controls: Hamiltonian systems and HJB equations. IEEE Transactions on Automatic Control, 46(11), 1846–1846.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix A
With the stochastic maximum principle (4)–(6) to LQG control system (3) involving multiple state delays and multiplicative noise, we can obtain for \(k=N\),
Using Eqs. (8) and (9), the optimal controller \(u_N\) is as
where \(\varSigma _N=\mathcal {D}'(N){\mathcal {P}}_{N+1}^0{\bar{\mu }}_N+\bar{\mathcal {D}}'(N){\mathcal {P}}_{N+1}^0\tau\). From (4) and (5), we also have
Substituting (7), \(\zeta _{N-1}\) yields
where \(\varPhi _N\) satisfied (12) with the terminal values being zero.
Now, we have verified (11) for \(k=N\). Supposing that \(\zeta _{k-1}\) are as (11) for all \(k\geqslant n+1\), we will show that (11) also holds for \(k=n\). For \(k=n+1\), with (3) and (11), \(\zeta _n\) can be calculated as
Inserting \(\zeta _n\) to (6), (6) will become
Thus, the optimal controller is given by
In virtue of equations (3), (5) and (23), \(\zeta _{n-1}\) yields that
Plugging (3) and (23) into the above equation for times d, we can calculate \(\zeta _{n-1}\) as follows:
where \({\mathcal {P}}_{n+i+1}^{i+j}=0\) for \(i+j>d\) from Remark 1, and then
After inserting (7), we can summarize that
This completes the proof of the lemma.
Appendix B
(Necessity) Suppose that there exists the unique \({\mathcal {F}}_{k-1}\)-measurable \(u_k\) to make the cost function (2) minimized. We will show that \(\varOmega _k, k=0,\dots ,N\) are positive definite by induction and the optimal controller can be designed as (13). Define
When \(k=N\), J(N) is presented as
where \(x_N=0\) and \(x_{N-j}=0\) for \(j=0,\ldots ,d\) as the uniqueness of the optimal controller is unrelated with \(x_k\).
As J(N) can be expressed as a quadratic function of \(u_N\), and the performance index must be positive, it can be obviously know that \(\varOmega _N>0\), i.e., \(\varOmega _k\) is positive definite for \(k=N\). Assuming \(\varOmega _k>0\) for all \(k\geqslant n+1\), we will prove that \(\varOmega _n>0\). With (3), (5) and (6), for \(k\geqslant n+1\), we construct that
To obtain the form of J(N), we add both sides of (24) from \(k=n+1\) to \(k=N\), we have
Then,
Compared with (2), it yields that
Setting \(x_n=0\) and \(x_{n-i}=0\), for \(i=0,\ldots ,d\) as the same as the condition \(k=N\), and plugging (11) into (25), we obtain
Similarly to the case \(\varOmega _N>0\) above, we obviously get \(\varOmega _n>0\) for all \(k=0,\ldots , N\). This completes the proof of necessity.
(Sufficiency) Suppose that \(\varOmega _k>0\) for \(k=0,\dots ,N\) is ture, we will show the existence of the unique \({\mathcal {F}}_{k-1}\)-measurable \(u_k\) to minimize (2). Make the definition:
First, as \(k=k+1\), using the equivalent substitution \(l=l+1\), \(j=j-1\), and \(i=i-1\) in turn, the \(V(x_{k+1})\) can be calculated as
Constructing the form \(V(x_k)-V(x_{k+1})\), we have
Denote
We can obviously know that \(\varPhi _k=\sum \limits _{i=0}^d\phi _{k+i}^i\). Then,
By virtue of (26), the following equation becomes
Adding from \(k=0\) to \(k=N\), the following equation is obtained:
Then, the cost function (2) becomes
As \(\varOmega _k>0\), the unique optimal controller is
which minimized the cost function (2), and the optimal cost is
Now, the proof of Theorem 1 is completed.
Rights and permissions
About this article
Cite this article
Lu, X., Zhang, Q., Liang, X. et al. Optimal LQG control for discrete time-varying system with multiplicative noise and multiple state delays. Control Theory Technol. 19, 328–338 (2021). https://doi.org/10.1007/s11768-021-00053-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11768-021-00053-z