A Control Problem for Parabolic Systems with Incomplete Information

Ananyev, Boris I.

doi:10.1007/978-3-030-22629-9_29

Boris I. Ananyev ORCID: orcid.org/0000-0002-1378-0240

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 11548)

Included in the following conference series:

International Conference on Mathematical Optimization Theory and Operations Research


Abstract

In this paper, abstract parabolic control systems in a Hilbert space are considered. The state of the system is unknown, but a measurement equation is available at discrete instants. The initial state and disturbances are restricted by joint integral constraints. From the measurements, an information set is introduced that contains the true state of the system; this set consists of all states of the system that are compatible with the measurements. The primary aim of control is to minimize a terminal criterion depending on the information set. We suggest some statements of the problem based on the separation of control and observation processes. The optimal instants of transition from estimation to control are sought as well. The approach is applied to distributed systems with partial derivatives and to systems with the deviation of time of retarded and neutral types. An approximation scheme is suggested and examples are considered.


1 Introduction and Preliminaries

Problems of control under incomplete information have been investigated in many books and papers [3,4,5,6,7,8]. The authors use either the stochastic approach [7] or the minimax deterministic one going back to [3] and developed in subsequent works. We keep to the deterministic problem formulation of [3, 4]. Similar formulations were used and modified in [9,10,11]. In this work, we continue and complement [12, 13], trying to generalize some results from [14, 15] to the case of infinite-dimensional systems. An algorithm of solution is developed and special cases are considered for parabolic and hyperbolic partial differential systems. Examples are examined. We also consider finite-dimensional and numerical approximations for the problem.

1.1 Weak Solutions of Evolutionary Systems

Let V, H be two real Hilbert spaces with norms $\Vert \cdot \Vert $ and $|\cdot |$ respectively. Suppose that $V\subset H$, V is densely imbedded in H and separable, and $|v|\le \gamma \Vert v\Vert $ for every $v\in V$. The last inequality means that the imbedding of V into H is continuous and the dual space $V^*$ contains $H^*=H$. The spaces H and $H^*$ are identified. Let further a(u, v) be a continuous, bilinear and coercive form on V, such that $a(v,v)\ge \alpha \Vert v\Vert ^2$, $\forall v\in V$.

Let a function $f:[0,T]\rightarrow H$ be measurable and $ \int _0^T\Vert f(t)\Vert ^2dt<\infty . $ For every point $z_0\in H$ there exists a unique function $z(t)\in V$, $t>0$, continuous in H, such that

$$\begin{aligned} d\langle z(t),v\rangle /dt+a(z(t),v)=\langle f(t),v\rangle , \ \ \forall v\in V, \ \ z(0)=z_0. \end{aligned}$$

(1)

Here z(t) is implicitly supposed to be weakly absolutely continuous (see [1]).

The form a(u, v) defines a linear continuous operator $u\rightarrow Au\in V^*$ according to the equality $a(u,v)=\langle Au,v\rangle $. Denote by D(A) the set of all elements $h\in V$ for which $Ah\in H\subset V^*$. The operator $-A$ on H is the closed infinitesimal generator of a strongly continuous semigroup $S(t):H\rightarrow H$ (see [1, 2]). Moreover, the solution of (1) has the form

$$\begin{aligned} z(t)=S(t)z_0+\int _0^tS(t-s)f(s)ds, \end{aligned}$$

(2)

where the integral is understood in Bochner’s sense [2]. Note that the solution of (1) may be considered as a generalized solution of the Cauchy problem

$$\begin{aligned} \dot{z}+Az=f(t), \ \ z(0)=z_0\in H. \end{aligned}$$

(3)

The generalized solution of (3) exists, is unique and may be represented by (2). The solution z(t) is weakly differentiable in H, i.e. the weak limit $ \lim _{\delta \rightarrow 0}(z(t+\delta )-z(t))/\delta =dz(t)/dt $ exists a.e. on [0, T] in the weak topology of H.
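As an illustration of representation (2), the following minimal sketch treats the scalar analogue $\dot{z}+az=f(t)$, where the semigroup reduces to $S(t)=e^{-at}$. The function name `duhamel_scalar`, the midpoint quadrature, and the numerical values are assumptions made only for this illustration and are not taken from the paper.

```python
import math


def duhamel_scalar(a, z0, f, t, n=2000):
    """Evaluate z(t) = e^{-a t} z0 + int_0^t e^{-a (t - s)} f(s) ds.

    Scalar analogue of formula (2) with S(t) = exp(-a t); the integral
    is approximated by the midpoint rule on n subintervals.
    """
    h = t / n
    integral = 0.0
    for k in range(n):
        s = (k + 0.5) * h  # midpoint of the k-th subinterval
        integral += math.exp(-a * (t - s)) * f(s) * h
    return math.exp(-a * t) * z0 + integral


def closed_form_constant_forcing(a, z0, t):
    """Exact solution of z' + a z = 1, z(0) = z0 (requires a != 0)."""
    return math.exp(-a * t) * z0 + (1.0 - math.exp(-a * t)) / a
```

For constant forcing $f\equiv 1$ the quadrature value can be compared with the closed-form solution, which gives a quick check of the variation-of-constants formula in the simplest setting.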

2 The System and Measurements

Consider a controlled system of the form

$$\begin{aligned} \dot{z}+Az=Bu(t)+C\xi (t), \ \ z\in H. \end{aligned}$$

(4)

Suppose that the operator A is defined by a continuous bilinear form a(u, v) given on a separable Hilbert space $V\subset H$; B and C are continuous linear operators from Hilbert spaces $H_1$ and $H_2$ to H, respectively. Let $L_2(0,T;H_i)$ be the Hilbert space of weakly measurable functions $f(t)\in H_i$ such that $ \int _0^T\Vert f(t)\Vert ^2dt<\infty . $ According to Subsect. 1.1, each pair of functions $u(\cdot )\in L_2(0,T;H_1)$ and $\xi (\cdot )\in L_2(0,T;H_2)$ along with an initial state $z_0\in H$ defines a unique weak solution $z(t;z_0,u,\xi )$ of (4). This solution satisfies the equation

$$ d\langle z(t),v\rangle /dt+a(z(t),v)=\langle Bu(t)+C\xi (t),v\rangle , \ \ \forall v\in V, \ \ z(0)=z_0, $$

and may be represented as

$$\begin{aligned} z(t)=S(t)z_0+\int _0^tS(t-s)(Bu(s)+C\xi (s))ds. \end{aligned}$$

(5)

In what follows the state z(t) of (4) or (5) is unknown. The available information about it may be described as follows. Given a uniform partition $0=t_0<t_1<\dots <t_N=T$ of [0, T], $t_i-t_{i-1}=T/N=\delta $, at the instants $t_i$ a vector $y_i=Gz(t_{i-1})+w_i$ is observed, where $G:H\rightarrow R^m$ is a finite-dimensional linear operator. Unknown disturbances $\xi (\cdot )$, the initial state $z_0$, and vectors $w_i$ are restricted by the joint constraint

$$\begin{aligned} \Vert z_0\Vert _{P_0}^2+\int _0^T\Vert \xi (t)\Vert _Q^2dt+\sum _{i=1}^N\Vert w_i\Vert _R^2\le 1. \end{aligned}$$

(6)

Here and in what follows we use the notation $\Vert u\Vert _F^2=\langle u,Fu\rangle $ for a self-adjoint positive and coercive operator F; $\langle \cdot \,,\cdot \rangle $ is an inner product in the corresponding space. The operators $P_0$, Q, and the matrix R are assumed to have the same properties as F. Besides, we have a constraint on the control $u(\cdot )$:

$$\begin{aligned} \int _0^T\Vert u(t)\Vert _F^2dt\le 1. \end{aligned}$$

(7)

2.1 Transformation to a Discrete-Time System

System (5) with measurements and controls may be represented in a discrete-time form

$$\begin{aligned}&z_i=Sz_{i-1}+\eta _i+\xi _i, \ \ \text {where} \ \ S=S(\delta ), \ \ z_i=z(t_i), \\ \nonumber&\eta _i=\int _{t_{i-1}}^{t_i}S(t_i-s)Bu(s)ds, \ \ \xi _i=\int _{t_{i-1}}^{t_i}S(t_i-s)C \xi (s)ds,\\ \nonumber&\qquad \qquad \qquad \,\, y_i=Gz_{i-1}+w_i, \ \ i\in 1:N. \end{aligned}$$

(8)

Let us derive constraints on the parameters in (8). Denote by $\xi _i^N$ the set of elements $\{\xi _i,\dots ,\xi _N\}$. The symbol $w_i^N$ has the same meaning. If $i=1$, we write $\xi ^N=\xi ^N_1$. First we find the support function (see, for example, [6]) of all the parameters $\{z_0,\xi ^N,w^N\}$ according to constraints (6). Let $\chi _A(s)$ be the characteristic function of a set A. We have

$$\begin{aligned}&\max _{z_0,\xi (\cdot ),w^N}\left\{ \langle k,z_0\rangle +\sum _{i=1}^N\Big (\langle l_i, \xi _i\rangle + \langle m_i,w_i\rangle \Big )\right\} \\&\quad =\max _{z_0,\xi (\cdot ),w^N} \Bigg \{\langle k,z_0\rangle +\int _0^T\bigg \langle \sum _{i=1}^N\chi _{[t_{i-1},t_i]}(s) C^*S^*(t_i-s)l_i,\xi (s)\bigg \rangle ds+\sum _{i=1}^N\langle m_i,w_i\rangle \Bigg \} \\&\quad =\sqrt{\langle k,P_0^{-1}k\rangle +\sum _{i=1}^N\Big (\langle l_i,\mathbf {C}l_i\rangle + \langle m_i,R^{-1}m_i\rangle \Big )}, \end{aligned}$$

where the self-adjoint positive operator $\mathbf {C}$ is defined as

$$\begin{aligned} \mathbf {C}l=\int _0^\delta S(\delta -s)CQ^{-1}C^*S^*(\delta -s)lds\\= \int _{t_{i-1}}^{t_i}S(t_i-s)CQ^{-1}C^*S^*(t_i-s)lds. \end{aligned}$$

This operator does not depend on i. Doing the same with the control, we obtain

$$\begin{aligned}&\max _{u(\cdot )}\left\{ \sum _{i=1}^N\langle l_i, \eta _i\rangle \right\} =\max _{u(\cdot )} \Bigg \{\int _0^T\bigg \langle \sum _{i=1}^N\chi _{[t_{i-1},t_i]}(s) B^*S^*(t_i-s)l_i,u(s)\bigg \rangle ds\Bigg \}\\&\qquad \qquad \qquad \qquad \qquad \qquad =\sqrt{\sum _{i=1}^N\Big (\langle l_i,\mathbf {B}l_i\rangle \Big )}, \end{aligned}$$

where the self-adjoint positive operator $\mathbf {B}$ is defined as

$$\begin{aligned} \mathbf {B}l=\int _0^\delta S(\delta -s)BF^{-1}B^*S^*(\delta -s)lds. \end{aligned}$$

(9)

Now defining $\mathbb {B}=\mathbf {B}^{1/2}$ and $\mathbb {C}=\mathbf {C}^{1/2}$ we arrive at the following conclusion.

Lemma 1

The discrete-time system (8) with constraints (6), (7) is equivalent to the system

$$\begin{aligned}&\qquad \qquad z_i=Sz_{i-1}+\mathbb {B}u_i+\mathbb {C}v_i, \ \ \text {with constraints} \\ \nonumber&\sum _{i=1}^N\Vert u_i\Vert ^2\le 1, \ \ \Vert z_0\Vert ^2_{P_0}+\sum _{i=1}^N\big (\Vert v_i\Vert ^2+ \Vert w_i\Vert _R^2\big )\le 1, \\ \nonumber&\qquad \qquad y_i=Gz_{i-1}+w_i, \ \ i\in 1:N. \end{aligned}$$

(10)

Proof

It follows from the fact that the support functions of the sets $\{\mathbb {B}u^N\}$ and $\{z_0,\mathbb {C}v^N,w^N\}$ coincide with functions found above. $\square $

Note that the states $z_i$ of system (10) are not approximations of $z(t_i)$: the equality $z_i=z(t_i)$ holds exactly for suitable values of the parameters in the two systems.
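The reduction behind Lemma 1 can be checked numerically in the scalar analogue $\dot{z}+az=bu+c\xi $ with $a>0$, where $S=e^{-a\delta }$ and the integrals defining $\mathbf {B}$ and $\mathbf {C}$ have closed forms. The sketch below is only an illustration under these assumptions; the names `discretize_scalar` and `support_one_step` and all numerical values are hypothetical.

```python
import math


def discretize_scalar(a, b, c, F, Q, delta):
    """Return (S, B^{1/2}, C^{1/2}) for the scalar system z' + a z = b u + c xi.

    Scalar analogue of (8)-(10): S = exp(-a delta) and, for a > 0,
    B = (b^2 / F) * int_0^delta exp(-2 a (delta - s)) ds, similarly for C.
    """
    S = math.exp(-a * delta)
    gram = (1.0 - math.exp(-2.0 * a * delta)) / (2.0 * a)
    return S, math.sqrt(b * b / F * gram), math.sqrt(c * c / Q * gram)


def support_one_step(a, b, F, delta, l, n=20000):
    """Support function of eta_1 in the direction l under int F u^2 dt <= 1.

    By the Cauchy-Schwarz inequality it equals
    sqrt(int_0^delta (l b exp(-a (delta - s)))^2 / F ds),
    evaluated here by the midpoint rule.
    """
    h = delta / n
    total = 0.0
    for k in range(n):
        s = (k + 0.5) * h
        g = l * b * math.exp(-a * (delta - s))
        total += g * g / F * h
    return math.sqrt(total)
```

In this scalar setting the support function of the continuous-time term $\eta _1$ coincides with $|l|\,\mathbb {B}$, which is the support function of $\{\mathbb {B}u_1:|u_1|\le 1\}$, in agreement with the argument in the proof.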

3 Estimation for Discrete-Time Evolutionary Systems

For system (10) the information set $\mathcal {Z}_j(y,u)$ (see [4]) is defined as follows.

Definition 1

The set $\mathcal {Z}_j(y,u)\subset H$ is said to be informational if it consists of all vectors $z_j$ for which there exist elements $z_0,\,v_i,\,w_i$, such that Eqs. (10) are fulfilled for all $i\in 1:j$, the constraints in (10) hold, and the measurements $y_i=Gz_{i-1}+w_i$ are valid for all $i\in 1:j$.

Introduce the linear operator $\mathbb {S}(z,v)=Sz+\mathbb {C}v$. The representation of $\mathcal {Z}_i(y,u)$ is given by

Theorem 1

The information set is the ellipsoid $\mathcal {Z}_i(y,u)=\{z:\Vert z-\hat{z}_i\Vert ^2_{P_i}+h_i\le 1\}$ with parameters given by the formulas

$$\begin{aligned} P_i^{-1}= & {} SJ_i^{-1}S^*+\mathbf {C},\ \ J_i=P_{i-1} +G^*RG,\\ \nonumber \hat{z}_i= & {} \mathbb {B}u_i+S\check{z}_i, \ \ \check{z}_i=\hat{z}_{i-1}+ J_i^{-1}G^*R(y_i-G\hat{z}_{i-1}),\\ \nonumber \hat{z}_0= & {} 0,\ \ h_i=h_{i-1}+ \Vert y_i-G\hat{z}_{i-1}\Vert ^2_{\mathcal {G}_i},\\ \nonumber h_0= & {} 0, \ \ \mathcal {G}_i^{-1}=G P_{i-1}^{-1}G^*+R^{-1}. \end{aligned}$$

(11)

The sum $\Vert z-\hat{z}_i\Vert ^2_{P_i}+h_i$ is the minimum of the expression $\Vert z_0\Vert _{P_0}^2+\sum _{j=1}^i\big (\Vert v_j\Vert ^2+\Vert w_j\Vert _R^2\big )$ under the assumption that the parameters $z_0,\,v_j,\,w_j$ satisfy the boundary condition $z_i=z$ due to Eq. (10).

Proof

Theorem 1 may be proved by induction. Let $u_i=0$ and $F_i(z,v)= \Vert v\Vert ^2+\Vert y_i-Gz\Vert ^2_R$. Introduce some auxiliary sets and functions:

$$\begin{aligned} \mathcal {V}_i(y)= & {} \left\{ (z,v)\in H\times H:V_{i-1}(z)+F_i(z,v)\le 1\right\} , \nonumber \\ \mathcal {Z}_i(y)= & {} \mathbb {S}\mathcal {V}_i(y), \ \ V_0(z)=\Vert z\Vert ^2_{P_0},\quad i\in 1:N, \nonumber \\ V_i(z_i)= & {} \begin{array}{cc} {\left\{ \begin{array}{ll}\min \limits _{(z,v)\in \mathcal {V}_i(y)}\left\{ V_{i-1}(z)+F_i(z,v):z_i= \mathbb {S}(z,v)\right\} , \ &{}z_i\in \mathcal {Z}_i(y), \\ \qquad \qquad \qquad \qquad 2, \ &{}z_i\not \in \mathcal {Z}_i(y).\end{array}\right. } \end{array} \end{aligned}$$

(12)

The set $\mathcal {V}_i(y)$ is said to be compatible with the signal at the instant i, the set $\mathcal {Z}_i(y)$ is informational at the instant i. So, the sets $\mathcal {Z}_i(y)$ are images of $\mathcal {V}_i(y)$ according to (10). Let the signal $y^N$ be realized under the elements $z_0^*$, $v_i^*$, $w_i^*$, $i\in 1:N$. Then the constraints in (10) are fulfilled with these elements. We assert that the sets $\mathcal {V}_i(y)$ and $\mathcal {Z}_i(y)$ are not empty for all $i\in 1:N$. The function $V_i(z_i)$ is equal to the minimum of the functional $ \tilde{F}_i(z_0,v^i,y)=\Vert z_0\Vert ^2_{P_0}+\sum _{j=1}^iF_j(z_{j-1},v_j) $ over all the elements $z_0$, $v^i$ satisfying (10) and the boundary condition $z_i=\mathbb {S}(z_{i-1},v_i)$. The informational sets $\mathcal {Z}_i(y)$ are expressed by the inequality $\mathcal {Z}_i(y)=\{z\in H:V_i(z)\le 1\}$. Note that the functional $\tilde{F}_i(z_0^*,v^{i*},y)\le 1$ for all $i\in 1:N$. Therefore, the pair $(z_{i-1}^*,v_i^*)\in \mathcal {V}_i(y)$ and the element $z_i^*\in \mathcal {Z}_i(y)$ $\forall i$. The sets in (12) are not empty. The relation $\mathcal {Z}_i(y)=\left\{ z\in H:V_i(z)\le 1\right\} $ is obvious for $i=1$. Indeed, we have

$$\begin{aligned}&\mathcal {V}_1(y)=\big \{(z,v):\Vert z\Vert ^2_{P_0}+\Vert v\Vert ^2+\Vert y_1-Gz\Vert ^2_R=\Vert z-\check{z}_1\Vert ^2_{J_1} +\Vert v\Vert ^2+h_1\le 1\big \}, \\&\qquad \qquad \quad \mathcal {Z}_1(y)=\mathbb {S}\mathcal {V}_1(y)=\left\{ z:\Vert z-\hat{z}_1\Vert ^2_{P_1}+h_1=V_1(z)\le 1\right\} . \end{aligned}$$

Here we use the known inverse operator formula $R-RG(P+G^*RG)^{-1}G^*R=$ $(R^{-1}+ GP^{-1}G^*)^{-1}$. Let the relation $\mathcal {Z}_{i-1}(y)=\left\{ x\in H:V_{i-1}(x)\le 1\right\} $ be valid and formulas (11), (12), $i\ge 2$, be fulfilled for $i-1$. Now, from (12) it follows that the inclusion $z_i\in \mathcal {Z}_i(y)$ results in the existence of a pair $(z_{i-1},v_i)\in \mathcal {V}_i(y)$, for which $z_i=\mathbb {S}(z_{i-1},v_i)$. Therefore, $V_i(z_i)\le 1$. Conversely, if the last inequality is valid, then by definition there exists a pair such that $z_i=\mathbb {S}(z_{i-1},v_i)\in \mathbb {S}\mathcal {V}_i(y)=\mathcal {Z}_i(y)$. Moving back in the indices, we obtain that the inclusion $z\in \mathcal {Z}_i(y)$ is equivalent to the existence of the set $(z_0,v^i)$, for which $\tilde{F}_i(z_0,v^i,y)\le 1$ and $z=\mathbb {S}(z_{i-1},v_i)$ under Eq. (10). So, we get $\min _{z_0,v^i}\tilde{F}_i(z_0,v^i,y)=V_i(z)$ under the boundary condition $z=\mathbb {S}(z_{i-1},v_i)$. Suppose that $V_{i-1}(z)=\Vert z-\hat{z}_{i-1}\Vert ^2_{P_{i-1}}+h_{i-1}$, $i\ge 2$. Then

$$\begin{aligned}&\mathcal {V}_i(y)=\big \{(z,v):\Vert z-\hat{z}_{i-1}\Vert ^2_{P_{i-1}}+h_{i-1}+\Vert v\Vert ^2+\Vert y_i-Gz\Vert ^2_R\\&\qquad \qquad \qquad =\Vert z-\check{z}_i\Vert ^2_{J_i} +\Vert v\Vert ^2+h_i\le 1\big \},\\&\mathcal {Z}_i(y)=\mathbb {S}\mathcal {V}_i(y)=\left\{ z:\Vert z-\hat{z}_i\Vert ^2_{P_i}+h_i=V_i(z)\le 1\right\} . \end{aligned}$$

We see that the values $y_i-G\hat{z}_{i-1}$ and $h_i$ do not depend on the controls $u_i$. Therefore, the values $\mathbb {B}u_i$ enter additively only in the second equality in (11). $\square $
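The recursion (11) can be illustrated in the scalar case, where all operators are real numbers. The sketch below is an illustration under that assumption only; the function names `information_set_step` and `run_example` and the numerical data are hypothetical. It simulates a trajectory whose disturbances satisfy the joint constraint in (10) and reports, at each step, the value $\Vert z_i-\hat{z}_i\Vert ^2_{P_i}+h_i$ together with the part of the constraint actually spent.

```python
def information_set_step(P_prev, z_hat_prev, h_prev, y, u, S, B_half, C_half, G, R):
    """One step of recursion (11) for scalar data; returns (P_i, z_hat_i, h_i)."""
    J = P_prev + G * R * G                      # J_i = P_{i-1} + G* R G
    innovation = y - G * z_hat_prev             # y_i - G z_hat_{i-1}
    z_check = z_hat_prev + G * R * innovation / J
    z_hat = B_half * u + S * z_check
    P = 1.0 / (S * S / J + C_half * C_half)     # P_i^{-1} = S J_i^{-1} S* + C
    gain = 1.0 / (G * G / P_prev + 1.0 / R)     # script-G_i
    h = h_prev + gain * innovation * innovation
    return P, z_hat, h


def run_example():
    """Simulate three steps; return pairs (V_i(z_i), constraint spent so far)."""
    S, B_half, C_half, G, P0, R = 0.9, 0.3, 0.5, 1.0, 1.0, 4.0
    z0 = 0.5
    controls = [0.2, 0.0, -0.1]
    v = [0.3, -0.2, 0.1]
    w = [0.1, -0.1, 0.2]
    z, P, z_hat, h = z0, P0, 0.0, 0.0
    spent = P0 * z0 * z0
    report = []
    for u_i, v_i, w_i in zip(controls, v, w):
        y = G * z + w_i                         # y_i = G z_{i-1} + w_i
        P, z_hat, h = information_set_step(P, z_hat, h, y, u_i, S, B_half, C_half, G, R)
        z = S * z + B_half * u_i + C_half * v_i
        spent += v_i * v_i + R * w_i * w_i
        report.append((P * (z - z_hat) ** 2 + h, spent))
    return report
```

Since the first component is the minimum described in Theorem 1, it should not exceed the second one, and the true state stays inside the ellipsoid as long as the total does not exceed 1.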

4 Problem Formulation and General Solution

We are going to formulate a problem in which the processes of estimation and control are separated in time. First the estimation is performed under a given control and we get the information set $\mathcal {Z}_i(y,u)$. After that the minimax off-line procedure is carried out. Our main control problem consists in finding the instant i at which the observation is finished and a new control is applied on the rest of the time interval.

4.1 Minimax Off-Line Control

From now on we introduce the other compatible set $\mathbf {V}_i(y,u)$ of uncertain parameters consisting of all pairs $(z_i,v^N_{i+1})$ that are compatible with the signal $y^i$. The projection $\mathrm{proj}_H\mathbf {V}_i(y,u)$ of the compatible set on H coincides with the information set $\mathcal {Z}_i(y,u)$. This new compatible set is defined by the formula

$$\begin{aligned} \mathbf {V}_i(y,u)=\left\{ (z,v^N_{i+1}):\Vert z-\hat{z}_i\Vert ^2_{P_i}+\sum _{j=i+1}^N\Vert v_j\Vert ^2 \le 1-h_i\right\} , \end{aligned}$$

where the parameters are given in (11). Let $\tilde{u}=u^N_{i+1}$ be some controls and $\mathcal {Z}_N(\tilde{u}\mid \mathbf {V}_i(y,u))$ be the attainability domain of the first equation in (10) with respect to $\mathbf {V}_i(y,u)$ under given further controls $\tilde{u}$. Consider some functional $\varPhi (\mathcal {Z})$ that is defined on all bounded sets $\mathcal {Z}\subset H$. The primary objective of the controls consists in minimization of the cost $\varPhi (\mathcal {Z}_N(y,u))$ that depends on the information set. At the initial instant we choose an optimal control $u^{N,0}$ that solves the problem $\varPhi (\mathcal {Z}_N(u^N\mid \mathbf {V}_0))\rightarrow \min _{u^N}=r_0$ and after that it is corrected. Here $\mathbf {V}_0=\left\{ (z,v^N):\Vert z\Vert ^2_{P_0}+\sum _{j=1}^N\Vert v_j\Vert ^2 \le 1\right\} $ and the measurements are not taken into account.

At any instant $i=1,\dots ,N$ we solve the auxiliary control problem

$$\begin{aligned} \varPhi (\mathcal {Z}_N(u\mid \mathbf {V}_i(y,u^0)))\rightarrow \min _{u\in \mathbf {U}_i(u^0)}=r_i(y,u^0), \end{aligned}$$

(13)

where $u^0=u^{N,0}$ is a control chosen at initial instant; $\mathbf {U}_i(u^0)$ is a set of controls after the instant i, i.e. $\mathbf {U}_i(u^0)=\big \{u_{i+1}^N:\sum _{j=i+1}^N\Vert u_j\Vert ^2\le 1-\sum _{j=1}^i \Vert u_j^0\Vert ^2\big \}$. Suppose that there exists at least one optimal control $u_{i+1}^{N,i}$ in problem (13).

4.2 Finding of the Observation Stopping Time

Now we explain how to find the instant i at which the observation is finished and we pass to the new optimal control $u_{i+1}^{N,i}$ of problem (13) on the rest of the time interval. To make the choice we compare the value $r_i(y,u)$ with the forecast value

$$\begin{aligned} r_i(s,y^i,u^s)=\,\max _{y_{i+1}^s\in Y_{s,i}(y^i,u^s)}r_s(y,u), \end{aligned}$$

(14)

where $Y_{s,i}(y^i,u^s)=\{y_{i+1}^s\}$ is the set of all possible continuations of the signal $y^i$ up to the instant $s>i$. The value (14) is the worst result of control if the system is in the position $\{y^i,u^i\}$ and the control $u_{i+1}^s$ is used up to the instant s. We set $r_i(i,y^i,u^i)=r_i(y,u)$. Our problem can be repeated [14, 15]. Introduce one more value $ \underline{r}_{\,i}(y,u)=\min _{s\in i:N}r_i(s,y^i,u^s). $ Suppose we are already in the position $\{y^i,u^i\}$, where $u^i$ is a part of the control $u^N$ previously found. In this case, we verify the condition $\underline{r}_{\,i}(y,u)<r_i(y,u)$, $(i\in 1:N-1)$. If this holds, then the control $u_{i+1}^N$ does not change. Otherwise, we pass to the new control $u_{i+1}^{N,i}$ delivering the minimum in (13). So, the first instant i such that

$$\begin{aligned} \underline{r}_{\,i}(y,u)\ge r_i(y,u), \ \ \text {where} \ \ i\in 1:N-1, \end{aligned}$$

(15)

we call the observation stopping time. At this instant i the observation is stopped and we pass to the optimal off-line control in problem (13).

Consider some particular cases. Let $u=u^{N,0}$. If $\underline{r}_{\,1}(y,u)\ge r_1(y,u)$, then the observation is stopped at the first instant. On the other hand, suppose that relations (15) are not valid for all $i\in 1:N-1$ and $\sum _{i=1}^N\Vert u^0_i\Vert ^2<1$. In this case, the observation continues all the time, but the resource of control is not exhausted at the last instant N. Therefore, we can solve the minimax problem $\varPhi (\mathcal {Z}_N(y,u))\rightarrow \min _{u_N}$, $\Vert u_N\Vert ^2\le 1- \sum _{i=1}^N\Vert u^0_i\Vert ^2$, and regard the optimal $\tilde{u}_N$ as an additional control action at the last instant.

4.3 An Algorithm of Repeated Correction

If we can continue observation after any stopping time, then the following algorithm of repeated correction can be proposed.

1. We find the value $r_0$ and optimal control $u^{N,0}$ before any observations.
2. At $i=1$ we decide if this control has to be changed, i.e. if the value $\underline{r}_{\,1}(y,u^{N,0})<r_1(y,u^{1,0})$ then the control $u^{N,0}$ should be kept. Otherwise, we pass to the new control $u_2^{N,1},$ delivering the minimum in (13).
3. In position $\{y^i,u^i\}$, where $u^i$ is a part of control $u^N$ previously found, we verify the condition (15), where $i\in 1:N-1$. If this holds, then we pass to the optimal control $u_{i+1}^{N,i}$, delivering the minimum in (13).
4. In any case, if at the last instant N the inequality $\sum _{i=1}^N\Vert u_i\Vert ^2<1$ is obtained, we solve the minimax problem $\varPhi (\mathcal {Z}_N(y,u))\rightarrow \min _{u_N}$, $\Vert u_N\Vert ^2\le 1-\sum _{i=1}^N\Vert u_i\Vert ^2$, and regard optimal $\tilde{u}_N$ as an additional control action at the last instant.

According to the algorithm, we obtain the sequence $\{\tau _1,\tau _2,\dots \}$ of instants where the control has been changed. This sequence depends on the signal. In particular, the sequence may be empty when the observations are bad for control, or it may coincide with the set $1:N-1,$ when, on the contrary, the observations give essential information. The values $r_i=r_{\tau _i}(y,u)$ form a nonincreasing sequence. Here the strict inequalities $r_i >r_{i+1}$ hold if $\tau _{i+1}-\tau _i \ge 2$. In the case $\tau _{i+1}-\tau _i =1$ the strict inequality $r_i>r_{i+1}$ holds if and only if the signal $y_{\tau _{i+1}}$ is not the worst.

Instead of inequality (15) at every instant $i<N$, we may check the simpler condition $r_i(i+1,y^i,u^{i+1})<r_i(y,u)$. If it is fulfilled, then the control $u_{i+1}^N$ does not change. Otherwise, we pass to the new control $u_{i+1}^{N,i}$ in problem (13).
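The decision logic of the repeated-correction algorithm can be sketched as a loop over the instants $i\in 1:N-1$. The sketch below is only schematic: the callables `r_current`, `r_forecast` and `replan` are hypothetical placeholders for the solution of problems (13) and (14), which the code does not attempt to solve, and the name `observation_stopping_times` is an assumption of this illustration.

```python
def observation_stopping_times(N, r_current, r_forecast, replan):
    """Return the instants tau_1, tau_2, ... at which the control is re-planned.

    r_current(i)     -> r_i(y, u), the value of problem (13) at instant i
    r_forecast(i, s) -> r_i(s, y^i, u^s), the forecast value (14) for s > i
    replan(i)        -> switches to the control minimizing (13) after instant i
    All three callables are placeholders supplied by the caller.
    """
    switches = []
    for i in range(1, N):
        r_i = r_current(i)
        # underline-r_i is the minimum over s in i:N, where s = i gives r_i itself
        r_low = min([r_i] + [r_forecast(i, s) for s in range(i + 1, N + 1)])
        if r_low >= r_i:  # condition (15): waiting cannot improve the guarantee
            replan(i)
            switches.append(i)
    return switches
```

Because the case $s=i$ is included in the minimum, condition (15) holds exactly when no forecast value is smaller than the current one. In an actual implementation `replan` would change the control and therefore the values returned by the other two callables at later instants.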

5 A Special Case of the Terminal Cost

Let the terminal functional have the form $\varPhi (\mathcal {Z})=\max _{z\in \mathcal {Z}}\Vert \varDelta z\Vert ,$ where $\varDelta :H\rightarrow R^k$ is a linear finite-dimensional operator and $\Vert \cdot \Vert $ is the Euclidean norm. In this case, we can obtain formulas (13)–(15) in more detail.

First of all we describe all the continuations of the signal.

Lemma 2

A signal $y_{i+1}^s$ is a continuation of the signal $y^i$ iff there exists a sequence $\varphi _{i+1}^s$ such that $\sum \limits _{j=i+1}^s\Vert \varphi _j\Vert ^2_{\mathcal {G}_j} \le 1-h_i,$ and $\hat{z}_j=\mathbb {B}u_j+S(\hat{z}_{j-1} +J_j^{-1}G^*R\varphi _j),\ \ y_j=G\hat{z}_{j-1}+\varphi _j,$ for $j\in i+1:s.$

This lemma follows from Eq. (12). Below we use vectors $l\in R^k$ as column-vectors and the symbol $l'$ is used for row-vector. Then we have the relation

$$\begin{aligned}&r_i(y,u)=\max _{l'l\le 1}\bigg \{\gamma _i(l)\hat{z}_i -\bigg (1-\sum _{j=1}^i\Vert u_j\Vert ^2 \bigg )^{1/2}\bigg (\sum _{j=i+1}^N\gamma _j(l)\mathbf {B}\gamma ^*_j(l) \bigg )^{1/2}\nonumber \\&\qquad \qquad \quad +\,(1-h_i )^{1/2}\left( \pi _0(i)(1-l'l)+l'\varDelta P_{N,i}\varDelta ^*l\right) ^{1/2}\bigg \}, \end{aligned}$$

(16)

where $\gamma _j(l)=\gamma _{j+1}(l)S,\,\,\,\gamma _N(l)=l'\varDelta ;\,\,\,P_{j,i}=SP_{j-1,i}S^*+\mathbf {C},\,\ \, P_{i,i}=P_i^{-1}; \pi _0(i)=\max _{l'l\le 1}l'\varDelta P_{N,i}\varDelta ^*l.$ Using Lemma 2, we obtain

$$\begin{aligned}&r_i(s,y^i,u^i)=\max _{l'l\le 1}\bigg \{ \gamma _i(l)\hat{z}_i +\sum _{j=i+1}^s\gamma _j(l)\mathbb {B}u_j -\bigg (1-\sum _{j=1}^s\Vert u_j\Vert ^2\bigg )^{1/2} \nonumber \\&\quad \cdot \bigg ( \sum _{j=s+1}^N\gamma _j(l)\mathbf {B}\gamma ^*_j(l) \bigg )^{1/2}+(1-h_i)^{1/2}(\pi _0 (s)(1-l'l)+l'\varDelta P_{N,i}\varDelta ^*l)^{1/2}\bigg \}. \end{aligned}$$

(17)

Formulas (16)–(17) are established similarly to [4, 9]. In addition, let us note that the optimal control is given by the formula

$$\begin{aligned} u_j^0=-\mathbb {B}\gamma ^*_j(l^0)\bigg (1-\sum _{i=1}^j\Vert u_i\Vert ^2 \bigg )^{1/2}\bigg (\sum _{i=j+1}^N\gamma _i(l^0)\mathbf {B}\gamma ^*_i(l^0) \bigg )^{-1/2},\ \ j>i, \end{aligned}$$

where $l^0$ is a maximizer in formula (16) which does not convert the corresponding sum into zero.

6 A Finite-Dimensional Approximation

Let us return to general relations in Sect. 1, where V is a separable Hilbert space and a(u, v) is a bilinear form with properties:

$$\begin{aligned} a(v,v)\ge \alpha \Vert v\Vert ^2, \ \ a(u,v)\le \beta \Vert u\Vert \Vert v\Vert . \end{aligned}$$

(18)

Given finite-dimensional subspace $\mathcal {F}\subset V$, define Ritz’s projector $\varPi :V\rightarrow \mathcal {F}$ as $a(v,u-\varPi u)=0, \ \forall v\in \mathcal {F}$ (see [16]). The following estimate holds:

$$\begin{aligned} \Vert u-\varPi u\Vert \le \beta d(u,\mathcal {F})/\alpha , \ \ \text {where} \ \ d(u,\mathcal {F})=\min _{v\in \mathcal {F}}\Vert u-v\Vert . \end{aligned}$$

(19)

Consider an increasing sequence $\mathcal {F}^n$ of finite-dimensional subspaces $\mathcal {F}^n\subset \mathcal {F}^{n+1}\subset V$ such that the distance $d(u,\mathcal {F}^n)\rightarrow 0$ as $n\rightarrow \infty $ $\forall u\in V$. Such a sequence is called complete. The proof of the following lemma may be found in [16] or elsewhere.

Lemma 3

Let $u:[0,T]\rightarrow V$ be a continuous function and $\mathcal {F}^n$ be a complete sequence of finite-dimensional subspaces. Then the real function $\Vert u(t)-\varPi ^nu(t)\Vert $ tends to zero uniformly in $t\in [0,T]$, where $\varPi ^n:V\rightarrow \mathcal {F}^n$ is the Ritz projector.

Let H be another Hilbert space and let the space $V\subset H$ be densely imbedded in H as in Sect. 1. The linear operator A with a dense domain $D(A)\subset V$ has been defined as $a(u,v)=\langle Au,v\rangle _H$, $\forall v\in V$. The dual operator $A^*$ is defined by the relation $a(u,v)=\langle u,A^*v\rangle _H$, $\forall u\in V$. The operator $-A^*$ is an infinitesimal generator of the semigroup $S^*(t)$ (see, for example, [17]). In addition, the function $\psi (t)=S^*(t)\psi $, where $\psi \in H$, is a weak solution of the equation

$$\begin{aligned} d\langle v,\psi (t)\rangle _H/dt+a(v,\psi (t))=0 \ \ \forall v\in V, \ \ \psi (0)=\psi . \end{aligned}$$

This equation is similar to (1). Recall that the inclusion $z_0\in D(A)$ implies $z(t)=S(t)z_0\in D(A)$ for all $t\ge 0$ and

$$\begin{aligned} dz(t)/dt+Az(t)=0, \end{aligned}$$

(20)

i.e. z(t) is a strong solution of Eq. (20).

Suppose that the increasing sequence $\mathcal {F}^n\subset V$ of finite-dimensional subspaces is complete. Consider the problem

$$\begin{aligned} d\langle z^n(t), v^n\rangle /dt+a(z^n(t),v^n)=\langle f(t),v^n\rangle \ \ \forall v^n\in \mathcal {F}^n, \ \ z^n(0)=z^n, \end{aligned}$$

(21)

where one needs to find a function $z^n(t)\in \mathcal {F}^n$. The problem (21) is called the Galerkin-type finite-dimensional approximation of problem (1). We need the following

Theorem 2

([18]). Let $z^n\rightarrow z$ in the space H as $n\rightarrow \infty $. Then the solution $z^n(t)$ of problem (21) uniformly converges on [0, T] to the solution z(t) of problem (1) in the space H.

Let $e_1,\dots ,e_n$ be a basis in the space $\mathcal {F}^n$. We set

$$\begin{aligned} z^n(t)=\sum _{j=1}^nq^j(t)e_j, \ \ z^n=\sum _{j=1}^nq^je_j. \end{aligned}$$

A finite-dimensional approximation of problems in Sect. 4 with respect to the complete sequence $\mathcal {F}^n$ of subspaces is as follows. Problem (21) is equivalent to the solution of differential equations in matrix form:

$$\begin{aligned} M\dot{q}+Kq=\mathbf {f}(t), \ \ q_0=[q^1;\dots ;q^n], \ \ \mathbf {f}(t)=[\langle f(t),e_1\rangle ;\dots ; \langle f(t),e_n\rangle ], \end{aligned}$$

where M ($\det M\ne 0$) and K have elements $\langle e_i,e_j\rangle $ and $a(e_i,e_j)$ respectively. The solution of the system for our problems may be written similarly to (5):

$$\begin{aligned} q(t)=S^n(t)q_0+\int _0^tS^n(t-s)(B^n\mathbf {u}(s)+C^n\mathbf {v}(s))ds, \end{aligned}$$

(22)

where $S^n(t)=\exp (-M^{-1}Kt)$ is the $n\times n$ transition matrix, $\mathbf {u}(t)$ and $\mathbf {v}(t)$ are n-dimensional measurable functions. The matrices $B^n$ and $C^n$ have a similar structure: each is the product of the matrix $M^{-1}$ and the square root of the matrix with elements $\langle e_i,BF^{-1}B^*e_j\rangle $ and $\langle e_i,CQ^{-1}C^*e_j\rangle $ respectively. Constraints (6) and (7) are transformed to

$$\begin{aligned} \Vert q_0\Vert ^2_{P_0^n}+\int _0^T\Vert \mathbf {v}(s)\Vert ^2ds+\sum _{i=1}^N \Vert w_i\Vert ^2_R\le \nu ^n, \ \ \int _0^T\Vert \mathbf {u}(s)\Vert ^2ds\le \mu ^n. \end{aligned}$$

(23)

The measurement equation from (8) has the form

$$\begin{aligned} y_i=G^nq(t_{i-1})+w_i, \ \ G^n=[Ge_1,\dots ,Ge_n]\in R^{m\times n}. \end{aligned}$$

(24)

Problems of Sects. 4 and 5 may be solved for relations (22)–(24) as described above.

Let us explain the appearance of the numbers $\mu ^n$ and $\nu ^n$ in constraints (23). The point is that system (10) is infinite-dimensional and, therefore, the signal $y^N$ of this system in some cases cannot be realized in the finite-dimensional approximation (22)–(24) if we set $\mu ^n=1$, $\nu ^n=1$. But for some $\mu ^n>1$, $\nu ^n>1$ the finite-dimensional formulas like (13)–(17) are valid. Moreover, we get

Theorem 3

There exist sequences $\mu ^n\downarrow 1$, $\nu ^n\downarrow 1$ as $n\rightarrow \infty $ such that formulas like (11)–(17) for finite-dimensional approximation (22)–(24) hold and $r^n_i(y,u)\rightarrow r_i(y,u)$, $r_i^n(s,y^i,u^i)\rightarrow r_i(s,y^i,u^i)$ as $n\rightarrow \infty $ in relations (16), (17).

In the general case, it is hard to obtain estimates of the rate of convergence $\mu ^n\downarrow 1$, $\nu ^n\downarrow 1$ in terms of the parameters $\alpha $, $\beta $ in (18), (19).

6.1 An Application to Heat Equation

Let the controlled system be described by the equations

$$\begin{aligned}&z_t=z_{xx}+u(t)f(x), \ \ x\in [0,l], \ \ t\ge 0, \ \text {with boundary conditions}\\ \nonumber&\qquad \qquad z_x(t,0)=z(t,0), \ \ z_x(t,l)=-z(t,l). \end{aligned}$$

(25)

Here f(x) is a smooth function on [0, l], u(t) is a control. This system describes the heat process for a uniform bar. In our situation $H_1=R$, $C=0$, $V=H^1(0,l)$, $H=L_2(0,l)$ where $H^1(0,l)$ is the Sobolev space with parameter $k=1$. The operator $B:R\rightarrow L_2(0,l)$ has the form $Bu=uf(x)$. The dual operator $B^*:L_2(0,l)\rightarrow R$ is written as $B^*\phi =\int _0^lf(x)\phi (x)dx$, $\phi \in L_2(0,l)$. The weak form of the considered system is obtained by multiplying (25) by $\phi \in H^1(0,l)$ and then integrating over [0, l] using the boundary conditions. The form $a(\phi ,\psi )$ may be written as

$$\begin{aligned} a(\phi ,\psi )=\int _0^l\dot{\phi }(x)\dot{\psi }(x)dx+\phi (l)\psi (l)+\phi (0)\psi (0). \end{aligned}$$

The coercivity follows from Friedrichs’ inequality. So, relation (1) for system (25) looks like

$$\partial \int _0^lz(t,x)\phi (x)dx/\partial t+a(z(t,\cdot ),\phi (\cdot ))=u(t)\int _0^lf(x)\phi (x)dx $$

for all $\phi \in H^1(0,l)$, $z(0,x)=z_0(x)$.

Let us divide the segment [0, l] into n subsegments of length $l{/}n$. Let $x_i$, $i\in 0:n$, be the points of the partition. For the space $\mathcal {F}^n$ we consider piecewise-linear functions $e_i(x)$, for which $e_i(x_i)=1$ and $e_i(x_j)=0$ if $i\ne j$. The sequence of finite-dimensional subspaces $\mathcal {F}^n$ with basis $e_i(x)$, $i\in 0:n$, is complete. Therefore, we can perform the approximation. Suppose that the measurement equations are of the form

$$ y_i=\int _0^lb(x)z(t_{i-1},x)dx+w_i, \ \ i\in 1:N, \ \ \text {where} \ \ b(\cdot )\in L_2(0,l). $$

Consider the $(n+1)\times (n+1)$-matrices M with elements $M_{ij}=\int _0^le_i(x)e_j(x)dx$ and K with elements $K_{ij}=a(e_i,e_j)$. The matrix M is tridiagonal and symmetric, where $M_{00}=M_{nn}=l{/}(3n)$ and the other diagonal elements are equal to $2l{/}(3n)$. Its off-diagonal elements are equal to $l{/}(6n)$. The matrix K is also tridiagonal and symmetric, where $K_{00}=K_{nn}=n/l+1$ and the other diagonal elements are equal to $2n{/}l$. The off-diagonal elements of K are equal to $-n/l$. If $f(x)\equiv 1$, then we obtain the finite-dimensional system

$$\begin{aligned} M\dot{q}+Kq=u(t)\mathbf {f}, \ \ \text {where} \ \ \mathbf {f}=l[1;2;\dots ;2;1]/(2n)\in R^{n+1}. \end{aligned}$$

(26)
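The entries of M, K and $\mathbf {f}$ stated above can be reproduced by a standard element-by-element assembly for piecewise-linear basis functions on a uniform grid. The following sketch is an illustration only; the function name `assemble_heat_system` is an assumption, and the case $f(x)\equiv 1$ is hard-coded.

```python
def assemble_heat_system(n, l):
    """Assemble M, K and the load vector f of system (26) for f(x) = 1.

    Uses the n + 1 hat functions e_0, ..., e_n on a uniform grid of [0, l];
    matrices are returned as lists of rows.
    """
    h = l / n
    size = n + 1
    M = [[0.0] * size for _ in range(size)]
    K = [[0.0] * size for _ in range(size)]
    for e in range(n):                      # element [x_e, x_{e+1}]
        for p in (0, 1):
            for q in (0, 1):
                M[e + p][e + q] += (h / 3.0 if p == q else h / 6.0)
                K[e + p][e + q] += (1.0 / h if p == q else -1.0 / h)
    # boundary terms phi(0) psi(0) and phi(l) psi(l) of the form a(phi, psi)
    K[0][0] += 1.0
    K[n][n] += 1.0
    f = [h] * size                          # integrals of the interior hat functions
    f[0] = f[n] = h / 2.0                   # half hats at the end points
    return M, K, f
```

For instance, with $n=4$ and $l=2$ the assembly gives $M_{00}=l/(3n)$, $K_{00}=n/l+1$ and $\mathbf {f}=l[1;2;2;2;1]/(2n)$, in agreement with the values stated in the text.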

Let $b(x)\equiv 1$. Then measurement Eq. (24) has the form

$$ y_i=G^nq(t_{i-1})+w_i \ \ \text {where} \ \ G^n=\mathbf {f}'. $$

Suppose that initial constraints (6), (7) may be written as

$$ \int _0^lz^2(0,x)dx+\sum _{i=1}^Nw_i^2\le 1, \ \ \int _0^Tu^2(t)dt\le 1. $$

It follows that constraints (23) take the form

$$ \Vert q_0\Vert ^2_M+\sum _{i=1}^Nw_i^2\le \mu ^n, \ \ \int _0^Tu^2(t)dt\le 1. $$
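
The weighted norm $\Vert q_0\Vert ^2_M=q_0'Mq_0$ appears here because, for an initial state lying in $\mathcal {F}^n$, that is, $z(0,x)=\sum _{i=0}^nq_{0,i}e_i(x)$, the integral $\int _0^lz^2(0,x)dx$ equals $q_0'Mq_0$ exactly. The sketch below checks this identity numerically for a random coefficient vector. The helper names `mass_matrix` and `l2_norm_sq_piecewise_linear` are hypothetical, and the random $q_0$ is an arbitrary test vector, not data from the paper.

```python
import numpy as np

def mass_matrix(n, l=1.0):
    """Tridiagonal mass matrix M with the entries given in the text."""
    h = l / n
    main = np.full(n + 1, 2.0 * h / 3.0)
    main[[0, -1]] = h / 3.0
    off = np.full(n, h / 6.0)
    return np.diag(main) + np.diag(off, 1) + np.diag(off, -1)

def l2_norm_sq_piecewise_linear(q, l=1.0):
    """Integral of z(x)^2 over [0, l] for the piecewise-linear z with nodal values q.

    Simpson's rule on each subsegment is exact here, because z^2 is a
    quadratic polynomial on every subsegment.
    """
    n = len(q) - 1
    h = l / n
    left, right = q[:-1], q[1:]
    mid = 0.5 * (left + right)
    return float(np.sum(h / 6.0 * (left**2 + 4.0 * mid**2 + right**2)))

rng = np.random.default_rng(0)
n, l = 8, 1.0
q0 = rng.standard_normal(n + 1)          # arbitrary test coefficients
M = mass_matrix(n, l)
m_norm_sq = float(q0 @ M @ q0)           # ||q0||_M^2
l2_sq = l2_norm_sq_piecewise_linear(q0, l)
```

The two numbers `m_norm_sq` and `l2_sq` agree up to rounding, which is why the first constraint can be restated in terms of $q_0$ without reference to the function $z(0,\cdot )$.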

We need not increase the constraint on $u(\cdot )$, but we do so for $q_0$ and $w_i$ in order to include the sequence $y^N$ in the scope. After that, we need to convert the continuous-time system (26) to a discrete-time one of the type (9), (10). Many solved examples of such finite-dimensional problems were considered in [12, 13, 19].
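
One possible way to carry out the conversion of (26) to discrete time is sketched below. It rests on an assumption that is not made in the paper: the control is held constant on each interval between consecutive measurement instants, which are taken to be equally spaced with step `dt`. Under this assumption the step map $q_{k+1}=\varPhi q_k+\varGamma u_k$ is exact, and it is computed from the generalized eigenproblem for the pair (K, M). The names `discretize_zoh`, `Phi`, `Gamma`, the initial vector and the control values are all hypothetical; systems (9), (10) of the paper are not restricted to such controls.

```python
import numpy as np

def assemble_heat_fem(n, l=1.0):
    """Closed-form M, K and f from the text (hat functions, f(x) = 1)."""
    h = l / n
    m_diag = np.full(n + 1, 2.0 * h / 3.0)
    m_diag[[0, -1]] = h / 3.0
    m_off = np.full(n, h / 6.0)
    M = np.diag(m_diag) + np.diag(m_off, 1) + np.diag(m_off, -1)
    k_diag = np.full(n + 1, 2.0 / h)
    k_diag[[0, -1]] = 1.0 / h + 1.0
    k_off = np.full(n, -1.0 / h)
    K = np.diag(k_diag) + np.diag(k_off, 1) + np.diag(k_off, -1)
    f = np.full(n + 1, h)
    f[[0, -1]] = h / 2.0
    return M, K, f

def discretize_zoh(M, K, f, dt):
    """Exact step q_{k+1} = Phi q_k + Gamma u_k for M q' + K q = u f,
    assuming u is constant on the step of length dt."""
    L = np.linalg.cholesky(M)             # M = L L^T
    L_inv = np.linalg.inv(L)
    S = L_inv @ K @ L_inv.T               # symmetric positive definite
    lam, W = np.linalg.eigh(S)
    V = L_inv.T @ W                       # V^T M V = I, V^T K V = diag(lam)
    decay = np.exp(-lam * dt)
    Phi = V @ np.diag(decay) @ V.T @ M
    Gamma = V @ (((1.0 - decay) / lam) * (V.T @ f))
    return Phi, Gamma

n, l, dt, N = 10, 1.0, 0.05, 5
M, K, f = assemble_heat_fem(n, l)
Phi, Gamma = discretize_zoh(M, K, f, dt)
G = f                                     # G^n = f' when b(x) = 1
q = np.ones(n + 1)                        # hypothetical initial coefficients q_0
u = np.zeros(N)                           # hypothetical control values, one per step
y = []
for k in range(N):
    y.append(float(G @ q))                # noise-free measurement y_{k+1} = G^n q(t_k)
    q = Phi @ q + Gamma * u[k]
```

The modal form is used instead of a general matrix exponential because M and K are symmetric and positive definite, so the computation reduces to one symmetric eigendecomposition. For controls that vary within a step, the term `Gamma * u[k]` would have to be replaced by the corresponding convolution integral.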

7 Conclusion

We considered a control problem with incomplete information for abstract parabolic control systems in Hilbert space. Information about the system state is available at discrete instants. Based on the measurements, we introduced the information set, which contains the true state of the system and consists of all states compatible with the measurements. For a terminal criterion depending on the information set, we suggested some statements of the problem based on the separation of the control and observation processes. The optimal instants of transition from estimation to control were sought as well. The approach was applied to distributed systems with partial derivatives. An approximation scheme was suggested and an example with the heat equation was considered. Some aspects of this research demand more detailed study. For example, we need to obtain estimates for the values $\mu ^n$, $\nu ^n$ and for the convergence rate of the parameters in Theorem 3. It would also be interesting to extend the approach to the case of continuous measurements.

References

1. Lions, J.-L.: Some Aspects of the Optimal Control of Distributed Parameter Systems. SIAM, Philadelphia (1972)
2. Yosida, K.: Functional Analysis. Springer, Berlin (1980). https://doi.org/10.1007/978-3-642-61859-8
3. Krasovski, N.N.: Game problem on motion correction. Appl. Math. Mech. 33(3), 386–396 (1969)
4. Kurzhanski, A.B.: Control and Observation under Conditions of Uncertainty. Nauka, Moscow (1977). (in Russian)
5. Kurzhanski, A.B., Valyi, I.: Ellipsoidal Calculus for Estimation and Control. Birkhäuser, Boston (1997)
6. Kurzhanski, A.B., Varaiya, P.: Dynamics and Control of Trajectory Tubes, Theory and Computation. Systems & Control: Foundations & Applications, vol. 85. Birkhäuser, Basel (2014). https://doi.org/10.1007/978-3-319-10277-1
7. Liptser, R.S., Shiryayev, A.N.: Statistics of Random Processes, I General Theory, II Applications. Springer, New York (1978). https://doi.org/10.1007/978-3-662-13043-8
8. Schweppe, F.: Uncertain Dynamic Systems. Prentice-Hall, Englewood Cliffs (1973)
9. Anan’ev, B.I.: Minimax quadratic problem of motion correction. Prikl. Matem. i Mech. 41(3), 436–445 (1977). (in Russian)
10. Ananev, B.I., Anikin, S.A.: Problem of reconstructing input signals under communication constraints. Autom. Remote Control 70(7), 1153–1164 (2009)
11. Ananev, B.I.: Correction of motion under communication constraints. Autom. Remote Control 71(3), 367–378 (2010)
12. Ananyev, B.I.: A control problem for evolutionary systems with incomplete information. In: IEEE Xplore Digital Library. Add. 09 July 2018: Stability and Oscillations of Nonlinear Control Systems (STAB): 14th International Conference (Pyatnitskiy’s Conference), Moscow, 30 May–1 June, Proceedings (2018). https://doi.org/10.1109/STAB.2018.8408340
13. Ananyev, B.I., Gusev, M.I., Filippova, T.F.: Control and estimation of dynamical systems states with uncertainty. N.N. Krasovskii Institute of Mathematics and Mechanics UB of RAS. Siberian Branch of RAS, Novosibirsk (2018). (in Russian)
14. Ananyev, B.I., Gredasova, N.V.: Multistage motion correction of a linear-quadratic controlled system. Bull. UGTU-UPI 4(56), 280–288 (2005)
15. Ananyev, B.I., Gredasova, N.V.: Multistage correction of quasi-linear systems under discrete observations. Proc. Inst. Math. Mech. UB RAS 13(4), 3–13 (2007)
16. Strang, G., Fix, G.J.: An Analysis of the Finite Element Method. Prentice-Hall, Englewood Cliffs (1973)
17. Curtain, R.F., Pritchard, A.J.: Infinite Dimensional Linear Systems Theory. Springer, Berlin (1978)
18. Kerimov, A.K.: On the Galerkin’s approximation of optimal control problems for distributed systems of parabolic type. J. Comp. Math. Math. Phys. 19(4), 851–865 (1979). (in Russian)
19. Gredasova, N.V.: Problems of multiple correction for controlled systems. Dissertation. IMM UB of RAS, Yekaterinburg (2012). (in Russian)

Author information

Authors and Affiliations

N.N. Krasovskii Institute of Mathematics and Mechanics UB of RAS, Kovalevskaya Street 16, Yekaterinburg, 620990, Russia
Boris I. Ananyev
Ural Federal University, Mira 19, Yekaterinburg, Russia
Boris I. Ananyev

Corresponding author

Correspondence to Boris I. Ananyev.

Editor information

Editors and Affiliations

Krasovsky Institute of Mathematics and Mechanics, Ekaterinburg, Russia
Michael Khachay
Sobolev Institute of Mathematics, Novosibirsk, Russia
Yury Kochetov
University of Florida, Gainesville, FL, USA
Panos Pardalos

Cite this paper

Ananyev, B.I. (2019). A Control Problem for Parabolic Systems with Incomplete Information. In: Khachay, M., Kochetov, Y., Pardalos, P. (eds) Mathematical Optimization Theory and Operations Research. MOTOR 2019. Lecture Notes in Computer Science, vol 11548. Springer, Cham. https://doi.org/10.1007/978-3-030-22629-9_29

DOI: https://doi.org/10.1007/978-3-030-22629-9_29
Published: 12 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22628-2
Online ISBN: 978-3-030-22629-9
eBook Packages: Computer Science, Computer Science (R0)
