On a Solution of a Guarantee Optimization Problem Under the Functional Constraints on the Disturbance

Gomoyunov, Mikhail; Serkov, Dmitriy

doi:10.1007/s13235-018-0279-1

On a Solution of a Guarantee Optimization Problem Under the Functional Constraints on the Disturbance

Published: 04 August 2018

Volume 9, pages 700–723, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Dynamic Games and Applications Aims and scope Submit manuscript

On a Solution of a Guarantee Optimization Problem Under the Functional Constraints on the Disturbance

Download PDF

Mikhail Gomoyunov^1,2 &
Dmitriy Serkov^1,2

129 Accesses
2 Citations
Explore all metrics

Abstract

The paper deals with a control problem for a dynamical system under disturbances. A motion of the system is considered on a finite interval of time and described by a nonlinear ordinary differential equation. The control is aimed at minimization of a given quality index. In addition to geometric constraints on the control and disturbance, it is supposed that the disturbance satisfies a compact functional constraint. Namely, all disturbance realizations that can happen in the system belong to some unknown set that is compact in the space $L_1$. Within the game-theoretical approach, the problem of optimizing the guaranteed result of the control is studied. For solving this problem, we propose a new construction of the optimal control strategy. In the linear-convex case, this strategy can be numerically realized on the basis of the upper convex hulls method. Examples are considered. Results of numerical simulations are given.

Control with a Guide in the Guarantee Optimization Problem under Functional Constraints on the Disturbance

Article 01 December 2017

The Linear Constrained Control Problem for Discrete-Time Systems: Regulation on the Boundaries

A Survey on Regularity Conditions for State-Constrained Optimal Control Problems and the Non-degenerate Maximum Principle

Article 23 December 2019

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The paper deals with a control problem for a dynamical system under disturbances. A motion of the system is considered on a finite interval of time and described by a nonlinear ordinary differential equation. The admissible values of the control and disturbance are subject to geometric constraints. The control is aimed at minimization of a given quality index. Within the game-theoretical approach [5,6,7, 18], we study the problem of optimizing the guaranteed result of the control.

In addition to the standard formulation of the guarantee optimization problem, we suppose that the disturbance satisfies a compact functional constraint. According to [8] (see also [2, 16, 17]), it means that all disturbance realizations that can happen in the system belong to some unknown set that is compact in the space $L_1$. This notion of a functional constraint is quite general and can be used in order to formalize an additional available information about the properties of the possible disturbance realizations as functions of time. However, it should be noted that this type of functional constraints substantially differs from the situation when the specific set of the possible disturbance realizations is given. The guarantee optimization problem in the latter case seems to be more complicated problem than the one studied in the paper.

The considered control problem under the functional constraint on the disturbance is formulated in the class of control strategies with full memory (see, e.g., [5, 7, 8]). The corresponding value of the optimal guaranteed result is introduced. The main result of the paper is a new construction of the optimal control strategy. This strategy can be considered as a control procedure with a guide. The proximity between the motions of the original system and guide is provided by the technique of dynamic reconstruction of the disturbance based on the ideas from [10]. The quality of the control process is attained due to the use of the optimal counter-strategy with full memory in the guide. Furthermore, we consider also a particular case of the problem when the right-hand side of the dynamic equation satisfies a certain additional condition [2, 13, 17], which allows to simplify the construction of the optimal strategy.

The proposed new construction of the optimal control strategy makes it possible to develop numerical methods for solving the guarantee optimization problems under consideration. The main difficulty here is to construct the optimal counter-strategy used in the guide. However, in the so-called linear-convex case, this can be done efficiently, for example, by applying the upper convex hulls method [1, 4] (see also [9]). We consider some examples, which are close to pursuit–evasion games, and present the results of numerical simulations.

The paper is organized as follows. In Sect. 2, we give the informal statement of the guarantee optimization problem under the functional constraint on the disturbance. In Sect. 3, to emphasize the differences that arise in the mathematical statement of the problem because of the presence of this additional constraint, we consider the standard statements of guarantee optimization problem without functional constraints. The mathematical statement of the problem with the functional constraint on the disturbance is given in Sect. 4. In Sect. 5, we propose a new construction of the optimal control strategy with full memory. The proof of the corresponding result is given in Sect. 6. Section 7 is devoted to the particular case when the construction of the optimal control strategy can be simplified. Examples are considered in Sect. 8.

2 Statement of the Problem

In this section, we give the informal statement of a guarantee optimization problem under a functional constraint on the disturbance. A dynamical system and a quality index under consideration are described. The notion of a functional constraint on the disturbance is introduced. The strict mathematical statement of the problem is given in Sect. 4.

2.1 Dynamical System and Quality Index

We consider a dynamical system which motion is described by the following differential equation:

$$\begin{aligned} \begin{array}{c} \displaystyle \frac{\,\mathrm {d}x(t)}{\,\mathrm {d}t} = f(t, x(t), u(t), v(t)), \quad t \in T {\text {:=}}[t_0,\vartheta ], \\ \quad x(t) \in \mathbb {R}^n, \quad u(t) \in P \subset \mathbb {R}^p, \quad v(t) \in Q \subset \mathbb {R}^q, \end{array} \end{aligned}$$

(1)

with the initial condition

$$\begin{aligned} x(t_0) = x_0, \quad x_0 \in B(R_0){\text {:=}}\big \{x \in \mathbb {R}^n: \, \Vert x\Vert \le R_0 \big \}. \end{aligned}$$

(2)

Here t is the time, x is the state vector, u is the control vector, v is the disturbance vector; $t_0$ and $\vartheta $ are the initial and terminal times; P and Q are known compact sets; $x_0$ is the initial state of the system; $R_0 > 0$ is a fixed number; the symbol $\Vert \cdot \Vert $ denotes the Euclidian norm of a vector.

It is assumed that the function $f: T \times \mathbb {R}^n \times P \times Q \rightarrow \mathbb {R}^n$ has the following properties: f is continuous; for any compact set $D \subset \mathbb {R}^n$, there exists a number $L > 0$ such that

$$\begin{aligned} \Vert f(t, x, u, v) - f(t, x^\prime , u, v) \Vert \le L \Vert x - x^\prime \Vert , \quad t \in T, \ \ x, x^\prime \in D, \ \ u \in P, \ \ v \in Q; \end{aligned}$$

and, moreover, there exists a number $a > 0$ such that

$$\begin{aligned} \Vert f(t, x, u, v) \Vert \le a (1 + \Vert x\Vert ), \quad t \in T, \quad x \in \mathbb {R}^n, \quad u \in P, \quad v \in Q. \end{aligned}$$

We assume that the segment $T = [t_0, \vartheta ]$ is equipped with the Lebesgue measure. By admissible realizations $u(\cdot )$ of the control and $v(\cdot )$ of the disturbance, we mean measurable functions $u: T \rightarrow P$ and $v: T \rightarrow Q$. The sets of all such realizations are denoted by $\mathcal {U}$ and ${\mathcal {V}}$, respectively. One can show that, due to the properties of the function f, for any initial state $x_0 \in B(R_0)$ and any admissible realizations $u(\cdot ) \in \mathcal {U}$ and $v(\cdot ) \in {\mathcal {V}}$, there exists a unique motion $x(\cdot ) = x(\cdot ; x_0, u(\cdot ), v(\cdot ))$ of system (1) that is an absolutely continuous function $x: T \rightarrow \mathbb {R}^n$ that satisfies initial condition (2) and, together with the realizations $u(\cdot )$ and $v(\cdot )$, satisfies Eq. (1) for almost all $t \in T$. Moreover, one can choose (see, e.g., [7, pp. 8,14,15]) a number $R > 0$ such that, for any motion $x(\cdot ) = x(\cdot ; x_0, u(\cdot ), v(\cdot ))$ of system (1), generated from any initial state $x_0 \in B(R_0)$ by any realization $u(\cdot ) \in \mathcal {U}$ and $v(\cdot ) \in {\mathcal {V}}$, the following inclusions are valid:

$$\begin{aligned} x(t) \in B(R){\text {:=}}\big \{x \in \mathbb {R}^n: \, \Vert x\Vert \le R \big \}, \quad t \in T. \end{aligned}$$

(3)

Let quality of a motion $x(\cdot )$ of system (1) be evaluated by the index

$$\begin{aligned} \gamma = \sigma (x(\cdot )), \end{aligned}$$

(4)

where the function $\sigma : C(T, \mathbb {R}^n) \rightarrow \mathbb {R}$ is continuous.

The goal of the control is to minimize the value $\gamma $ of quality index (4). Since there are unknown disturbances acting in system (1), when we pose and solve this control problem, in accordance with the guaranteed result principle (see, e.g., [3, 5,6,7, 18]), we should take into account that, in the worst case, the disturbances may be aimed at maximization of $\gamma $.

2.2 Functional Constraint on the Disturbance

According to (1), for any time $t \in T$, the value of the disturbance v(t) satisfies the inclusion $v(t) \in Q$. Constraints of this kind are called geometric (or instantaneous). In the present paper, the case is considered when the disturbance satisfies an additional functional constraint imposed not on the instantaneous values v(t), but on the realization $v(\cdot )$ as a whole.

By a functional constraint on the disturbance, we mean a family of subsets $\mathbb {V}\subset 2^{\mathcal {V}}$ such that

$$\begin{aligned} {\mathcal {V}}= \bigcup _{V \in \mathbb {V}} V. \end{aligned}$$

(5)

We say that the disturbance satisfies the functional constraint $\mathbb {V}$ if there exists a set $V \in \mathbb {V}$ such that every disturbance realization $v(\cdot )$ that can happen in system (1) satisfies the inclusion $v(\cdot ) \in V$. Thus, it is assumed that, when forming control actions, we know only the constraint $\mathbb {V}$, but the specific set $V \in \mathbb {V}$ is not given. This notion of a functional constraint is quite general and can be used in order to formalize an additional information about the structure and properties of the possible disturbance realizations. A functional constraint $\mathbb {V}$ is called compact if it consists of compact in $L_1(T, \mathbb {R}^q)$ subsets $V \subset {\mathcal {V}}$. Let us give some typical examples when an additional information about the disturbance can be formalized with the help of such a functional constraint:

1.
It is known that every disturbance realization $v(\cdot )$ is a piecewise constant function with a fixed number $l \in \mathbb {N} \cup \{0\}$ of possible discontinuity points; however, this number l is unknown.
2.
It is known that every disturbance realization $v(\cdot )$ is a continuous function with a fixed modulus of continuity $\omega $; however, this modulus $\omega $ is unknown.
3.
It is known that every realization $v(\cdot )$ is generated by a Carathéodory function $W: T \times \mathbb {R}^n \rightarrow Q$ such that $v(t) = W(t, x(t))$, $t \in T$; however, this function W is unknown (see, e.g., [14]).

In the paper, we consider a guarantee optimization problem for system (1), initial condition (2) and quality index (4) in the case when the disturbance satisfies a compact functional constraint $\mathbb {V}$. In order to emphasize the differences that arise in the mathematical statement of the problem due to the presence of this additional constraint, in the next section we give the standard statements of guarantee optimization problem (1), (2) and (4) without functional constraints.

3 Guarantee Optimization Without Functional Constraints

The mathematical statement of guarantee optimization problem (1), (2) and (4) depends on the way of forming the control actions. In this section, we consider three types of control strategies: quasi-strategies, counter-strategies and strategies with full memory. For each of these types, the corresponding value of the optimal guaranteed result is introduced. The comparison between these values is given.

3.1 Quasi-Strategies

The notion of a quasi-strategy, originating from works [11, 12], formalizes one of the most general ways of forming the control actions in real time without using information about future. In the paper, by a quasi-strategy, we mean a function $\alpha : {\mathcal {V}}\rightarrow \mathcal {U}$ with the following property of nonanticipation: if, for any time $t \in T$ and any realizations $v(\cdot ), v^\prime (\cdot ) \in {\mathcal {V}}$, the equality $v(\tau ) = v^\prime (\tau )$ is valid for almost all $\tau \in [t_0, t]$, then the corresponding images $u(\cdot ) = \alpha (v(\cdot ))$ and $u^\prime (\cdot ) = \alpha (v^\prime (\cdot ))$ satisfy the equality $u(\tau ) = u^\prime (\tau )$ for almost all $\tau \in [t_0, t]$. The set of all quasi-strategies is denoted by $\mathbf {QS}$.

For any initial state $x_0 \in B(R_0)$, the value of the optimal guaranteed result in the class of quasi-strategies is defined as follows:

$$\begin{aligned} \varGamma ^0_{\mathbf {QS}}(x_0){\text {:=}}\inf _{\alpha \in \mathbf {QS}} \ \sup _{v(\cdot ) \in {\mathcal {V}}} \ \sigma \Big ( x\big (\cdot ; x_0, \alpha (v(\cdot )), v(\cdot ) \big ) \Big ). \end{aligned}$$

(6)

Note that any control procedure that forms the current value u(t) on the basis of the information about the initial state $x_0$ and the history of the disturbance actions $v(\cdot )|_{[t_0, t]}$ (including the current value v(t)) can be considered as a quasi-strategy. Therefore, any such control procedure cannot guarantee the value of quality index (4) less than $\varGamma ^0_{\mathbf {QS}}(x_0)$. It is known that the quasi-strategies are a convenient tool in theoretical constructions, but they are impractical in real control problems.

3.2 Counter-Strategies with Full Memory

In the paper, we use the following definition of a counter-strategy (with full memory), which goes back to the constructions from [6, 8]. Let $\varDelta $ be a partition of the time segment $T = [t_0, \vartheta ]$ by times $\tau _i$, $i \in {0}...{n_{\varDelta }}$, i.e.,

$$\begin{aligned} \varDelta = \big \{\tau _i: \tau _0 = t_0, \ \tau _{i - 1} < \tau _i, \ i \in {1}...{n_{\varDelta }}, \ \tau _{n_{\varDelta }} = \vartheta \big \}. \end{aligned}$$

The set of all such partitions is denoted by $\varDelta _T$. By a counter-control (with full memory) on the partition $\varDelta $, we mean a family $\overline{{\mathbf {U}}}^\varDelta = (\overline{{\mathbf {U}}}^\varDelta _i)_{i\in {0}...{(n_{\varDelta }-1)}}$ of mappings

$$\begin{aligned} \overline{{\mathbf {U}}}^\varDelta _i: C([t_0, \tau _i], \mathbb {R}^n) \rightarrow \mathbb {B}(Q, P), \quad i \in {0}...{(n_{\varDelta }-1)}, \end{aligned}$$

where ${\mathbb {B}}(Q,P)$ is the set of all Borel measurable functions from Q to P. Respectively, a counter-strategy is a family $\overline{{\mathbf {U}}}= (\overline{{\mathbf {U}}}^{\varDelta })_{\varDelta \in \varDelta _T}$ of counter-controls defined for every partition $\varDelta \in \varDelta _T$. The set of all counter-strategies is denoted by $\mathbf {CS}$.

Let $x_0 \in B(R_0)$ and $\overline{{\mathbf {U}}}= (\overline{{\mathbf {U}}}^\varDelta )_{\varDelta \in \varDelta _T} \in \mathbf {CS}$. For any partition $\varDelta \in \varDelta _T$, the corresponding counter-control $\overline{{\mathbf {U}}}^\varDelta = (\overline{{\mathbf {U}}}^\varDelta _i)_{i \in {0}...{(n_{\varDelta }- 1)}}$ in a pair with a disturbance realization $v(\cdot ) \in {\mathcal {V}}$ forms in system (1) a control realization $u(\cdot )$ by the following step-by-step feedback rule:

$$\begin{aligned} u(t) = \overline{{\mathbf {U}}}^\varDelta _i (x(\cdot )|_{[t_0,\tau _i]}) \big (v(t)\big ), \quad t \in [\tau _i, \tau _{i+1}), \quad i \in {0}...{(n_{\varDelta }- 1)}, \end{aligned}$$

(7)

where $x(\cdot )|_{[t_0, \tau _i]}$ is the motion history realized up to the time $\tau _i$. Note that, since the function $\overline{{\mathbf {U}}}^\varDelta _i(x(\cdot )|_{[t_0,\tau _i]})$ is Borel measurable, the obtained control realization is admissible, i.e., $u(\cdot ) \in \mathcal {U}$. Thus, from the initial state $x_0$, the counter-control $\overline{{\mathbf {U}}}^\varDelta $ in a pair with the disturbance realization $v(\cdot )$ uniquely generates the system motion, denoted by $x(\cdot ) = x(\cdot ; x_0, \overline{{\mathbf {U}}}^\varDelta , v(\cdot ))$. For the counter-strategy $\overline{{\mathbf {U}}}$, the value of the guaranteed result is defined as follows:

$$\begin{aligned} \varGamma _{\mathbf {CS}}(x_0; \overline{{\mathbf {U}}}){\text {:=}}\lim _{\delta \downarrow 0} \ \sup _{\varDelta \in \varDelta _T: {{\mathbf {D}}(\Delta )} \le \delta } \ \sup _{v(\cdot ) \in {\mathcal {V}}} \ \sigma \Big ( x\big (\cdot ; x_0, \overline{{\mathbf {U}}}^\varDelta , v(\cdot ) \big ) \Big ). \end{aligned}$$

(8)

Here and below, we denote by ${{\mathbf {D}}(\Delta )} {\text {:=}}\max _{i\in {1}...{n_{\varDelta }}} (\tau _i - \tau _{i-1})$ the diameter of the partition $\varDelta $. Respectively, the optimal guaranteed result in the class of counter-strategies is the following value:

$$\begin{aligned} \varGamma ^0_{\mathbf {CS}}(x_0){\text {:=}}\inf _{\overline{{\mathbf {U}}}\in \mathbf {CS}} \ \varGamma _{\mathbf {CS}}(x_0; \overline{{\mathbf {U}}}). \end{aligned}$$

(9)

Note that, according to the results of [6, 7], for any number $\zeta > 0$, there exists a $\zeta $-optimal counter-strategy $\overline{{\mathbf {U}}}_*\in \mathbf {CS}$ such that, for any initial state $x_0 \in B(R_0)$, we have

$$\begin{aligned} \varGamma _{\mathbf {CS}}(x_0; \overline{{\mathbf {U}}}_*) \le \varGamma ^0_{\mathbf {CS}}(x_0) + \zeta . \end{aligned}$$

(10)

Due to the necessity of the direct measurement of the current value of the disturbance v(t), the use of the counter-strategies is also quite complicated in practice.

3.3 Strategies with Full Memory

In accordance with [8], by analogy with the introduced above class of counter-strategies, we define the class of strategies (with full memory) in the following way. By a control (with full memory) on a partition $\varDelta = (\tau _i)_{i\in {0}...{n_{\varDelta }}} \in \varDelta _T$, we mean a family $\mathbf {U}^\varDelta {\text {:=}}(\mathbf {U}^\varDelta _i)_{i\in {0}...{(n_{\varDelta }-1)}}$ of mappings

$$\begin{aligned} \mathbf {U}^\varDelta _i: C([t_0, \tau _i], \mathbb {R}^n) \rightarrow \mathcal {U}|_{[\tau _i,\tau _{i+1})}, \quad i \in {0}...{(n_{\varDelta }-1)}, \end{aligned}$$

where the set $\mathcal {U}|_{[\tau _i,\tau _{i+1})}$ consists of the restrictions on $[\tau _i, \tau _{i+1})$ of all the functions $u(\cdot ) \in \mathcal {U}$. A strategy is a family $\mathbf {U}= (\mathbf {U}^{\varDelta })_{\varDelta \in \varDelta _T}$ of controls defined for every partition $\varDelta \in \varDelta _T$. The set of all strategies is denoted by $\mathbf {S}$.

Let $x_0 \in B(R_0)$ and $\mathbf {U}= (\mathbf {U}^\varDelta )_{\varDelta \in \varDelta _T} \in \mathbf {S}$. For any partition $\varDelta \in \varDelta _T$, the corresponding control $\mathbf {U}^\varDelta = (\mathbf {U}^\varDelta _i)_{i \in {0}...{(n_{\varDelta }- 1)}}$ in a pair with a disturbance realization $v(\cdot ) \in {\mathcal {V}}$ forms in system (1) a control realization $u(\cdot )$ by the following step-by-step feedback rule:

$$\begin{aligned} u(t) = \mathbf {U}^\varDelta _i (x(\cdot )|_{[t_0,\tau _i]}) (t), \quad t \in [\tau _i, \tau _{i+1}), \quad i \in {0}...{(n_{\varDelta }- 1)}. \end{aligned}$$

Thus, from the initial state $x_0$, the control $\mathbf {U}^\varDelta $ in a pair with the disturbance realization $v(\cdot )$ uniquely generates the system motion, denoted by $x(\cdot ) = x(\cdot ; x_0, \mathbf {U}^\varDelta , v(\cdot ))$. The value of the guaranteed result of the strategy $\mathbf {U}$ and the value of the optimal guaranteed result in the class of strategies are defined as follows:

$$\begin{aligned}&\varGamma _{\mathbf {S}}(x_0; \mathbf {U}){\text {:=}}\lim _{\delta \downarrow 0} \ \sup _{\varDelta \in \varDelta _T: {{\mathbf {D}}(\Delta )} \le \delta } \ \sup _{v(\cdot ) \in {\mathcal {V}}} \ \sigma \Big ( x\big (\cdot ; x_0, \mathbf {U}^\varDelta , v(\cdot ) \big ) \Big ), \end{aligned}$$

(11)

$$\begin{aligned}&\varGamma ^0_{\mathbf {S}}(x_0){\text {:=}}\inf _{\mathbf {U}\in \mathbf {S}} \ \varGamma _{\mathbf {S}}(x_0; \mathbf {U}). \end{aligned}$$

(12)

Note that, when using strategies, there is no need in any information about the disturbance. It makes this way of forming the control actions more preferable in comparison with the quasi-strategies and counter-strategies.

3.4 Comparison of Optimal Guaranteed Results

The following relations between the values of optimal guaranteed results (6), (9) and (12) are valid:

$$\begin{aligned} \varGamma ^0_{\mathbf {QS}}(x_0) = \varGamma ^0_{\mathbf {CS}}(x_0), \quad \varGamma ^0_{\mathbf {QS}}(x_0) \le \varGamma ^0_{\mathbf {S}}(x_0), \quad x_0 \in B(R_0). \end{aligned}$$

(13)

The equality in (13) is derived from the results of [6, §§28, 29] [see also [5, §9]]. The inequality in (13) is a straightforward consequence of the given definitions. Note that this inequality can be strict, and a sufficient condition for the equality is the equilibrium condition in a small game (see, e.g., [7, p. 8]) or, in another terminology, the Isaacs’ condition [3]:

$$\begin{aligned} \begin{array}{c} \displaystyle \min _{u \in P} \max _{v \in Q} \langle s, f(t, x, u, v) \rangle = \max _{v \in Q} \min _{u \in P} \langle s, f(t, x, u, v) \rangle , \\ t \in T, \quad x \in \mathbb {R}^n, \quad s \in \mathbb {R}^n, \end{array} \end{aligned}$$

(14)

where the symbol $\langle \cdot , \cdot \rangle $ denotes the inner product of vectors. Thus, the result $\varGamma ^0_{\mathbf {QS}}(x_0)$ can always be guaranteed with the help of the counter-strategies, but, in general, without condition (14), cannot be guaranteed with the help of the strategies.

Another situation when the optimal guaranteed results in the classes $\mathbf {QS}$ and $\mathbf {S}$ coincide regardless of condition (14) is described in the paper and related to compact functional constraints on the disturbance.

4 Guarantee Optimization Under a Functional Constraint on the Disturbance

In this section, for system (1), initial condition (2) and quality index (4), we define the value of the optimal guaranteed result in the class of strategies $\mathbf {S}$ in the case when the disturbance satisfies a compact functional constraint $\mathbb {V}$. In accordance with the informal statement of the problem (see Sect. 2.2), the presence of the functional constraint $\mathbb {V}$ leads to the fact that, in definition (11) of the guaranteed result, we split the operation of taking the upper bound over all disturbance realizations $v(\cdot ) \in {\mathcal {V}}$ into two parts. Firstly, inside, the upper bound is taken only over disturbance realizations $v(\cdot )$ from a set $V \in \mathbb {V}$, and, after that, outside, the upper bound is taken over the sets $V \in \mathbb {V}$.

Let us note that, for any initial state $x_0 \in B(R_0)$, we can define the value of the optimal guaranteed result in the class of quasi-strategies $\mathbf {QS}$ under the functional constraint $\mathbb {V}$ as follows:

$$\begin{aligned} \varGamma ^0_{\mathbf {QS}}(x_0 \mid \mathbb {V}) {\text {:=}}\inf _{\alpha \in \mathbf {QS}} \ \sup _{V \in \mathbb {V}} \ \sup _{v(\cdot ) \in V} \ \sigma \Big ( x\big (\cdot ; x_0, \alpha (v(\cdot )), v(\cdot ) \big ) \Big ). \end{aligned}$$

(15)

However, due to (5), (6) and (15), we have

$$\begin{aligned} \varGamma ^0_{\mathbf {QS}}(x_0 \mid \mathbb {V}) = \varGamma ^0_{\mathbf {QS}}(x_0). \end{aligned}$$

(16)

Therefore, the optimal guaranteed result in the class of quasi-strategies does not depend on the presence of the functional constraints.

Further, let us define the value of the guaranteed result of a strategy $\mathbf {U}= (\mathbf {U}^{\varDelta })_{\varDelta \in \varDelta _T} \in \mathbf {S}$ under the functional constraint $\mathbb {V}$ and the corresponding value of the optimal guaranteed result in the class of strategies $\mathbf {S}$ under the functional constraint $\mathbb {V}$ in the following way:

$$\begin{aligned}&\varGamma _{\mathbf {S}}(x_0; \mathbf {U}\mid \mathbb {V}){\text {:=}}\sup _{V \in \mathbb {V}} \lim _{\delta \downarrow 0} \sup _{\varDelta \in \varDelta _T: {{\mathbf {D}}(\Delta )} \le \delta } \sup _{v(\cdot ) \in V} \sigma \Big ( x\big (\cdot ; x_0, \mathbf {U}^\varDelta , v(\cdot ) \big ) \Big ), \nonumber \\&\quad \varGamma ^0_{\mathbf {S}}(x_0 \mid \mathbb {V}){\text {:=}}\inf _{\mathbf {U}\in \mathbf {S}} \ \varGamma _{\mathbf {S}}(x_0; \mathbf {U}\mid \mathbb {V}). \end{aligned}$$

(17)

According to [17], for any compact functional constraint $\mathbb {V}$, the following equality holds:

$$\begin{aligned} \varGamma ^0_{\mathbf {S}}(x_0 \mid \mathbb {V}) = \varGamma ^0_{\mathbf {QS}}(x_0 \mid \mathbb {V}), \quad x_0 \in B(R_0). \end{aligned}$$

Hence, due to (16), when the disturbance satisfies a compact functional constraint $\mathbb {V}$, the optimal guaranteed result $\varGamma ^0_{\mathbf {QS}}(x_0)$ in the class of quasi-strategies $\mathbf {QS}$ can be guaranteed with the help of the strategies with full memory. Thus, the considered in the paper guarantee optimization problem in the class of strategies $\mathbf {S}$ under the functional constraint $\mathbb {V}$ can be formulated as follows. For any number $\zeta > 0$, we should find a strategy $\mathbf {U}_*\in \mathbf {S}$ such that, for any initial state $x_0 \in B(R_0)$, the following inequality is valid:

$$\begin{aligned} \varGamma _{\mathbf {S}}(x_0; \mathbf {U}_*\mid \mathbb {V}) \le \varGamma ^0_{\mathbf {QS}}(x_0) + \zeta . \end{aligned}$$

(18)

The main contribution of the paper is a new construction of this $\zeta $-optimal strategy $\mathbf {U}_*.$

5 Construction of Optimal Strategy

Let $\varepsilon \in (0, 1)$ be an accuracy parameter. Let us define a strategy $\mathbf {U}_\varepsilon = (\mathbf {U}_\varepsilon ^\varDelta )_{\varDelta \in \varDelta _T} \in \mathbf {S}$ such that it satisfies inequality (18) for any sufficiently small values of $\varepsilon .$ Let us introduce the necessary notations and constructions.

According to (10) and (13), let us fix an $\varepsilon $-optimal counter-strategy $\overline{{\mathbf {U}}}_\varepsilon = (\overline{{\mathbf {U}}}_\varepsilon ^\varDelta )_{\varDelta \in \varDelta _T} \in \mathbf {CS}$ such that

$$\begin{aligned} \varGamma _{\mathbf {CS}}(x_0; \overline{{\mathbf {U}}}_\varepsilon ) \le \varGamma ^0_{\mathbf {QS}}(x_0) + \varepsilon , \quad x_0 \in B(R_0). \end{aligned}$$

(19)

For the compact set P, which determines the geometric constraint on the control (see (1)), let us choose an $\varepsilon $–net $(u^\varepsilon _j)_{j\in {1}...{n_\varepsilon }} \subset P:$

$$\begin{aligned} \max _{u\in P} \min _{j\in {1}...{n_\varepsilon }} \Vert u - u^\varepsilon _j\Vert \le \varepsilon . \end{aligned}$$

(20)

At first, it is convenient to define the control $\mathbf {U}^\varDelta _\varepsilon = (\mathbf {U}^\varDelta _{\varepsilon i})_{i \in {0}...{(n_\varDelta -1)}}$ only for partitions $\varDelta \in \varDelta _T$ that satisfy the following condition:

$$\begin{aligned} {{\mathbf {D}}(\Delta )}\le 3 {\mathbf {d}}(\Delta ), \end{aligned}$$

(21)

where ${\mathbf {d}}(\Delta )= \min _{i\in {1}...{n_{\varDelta }}} (\tau _i - \tau _{i-1})$ is the inner diameter of the partition $\varDelta $. Let us consider the auxiliary times

$$\begin{aligned} \tau _i' {\text {:=}}\tau _i - \varepsilon {\mathbf {d}}(\Delta ), \quad \tau '_{ij} {\text {:=}}\tau _{i}' + \frac{j(\tau _{i}-\tau _{i}')}{n_{\varepsilon }}, \quad j \in {0}...{n_{\varepsilon }}, \quad i \in {1}...{(n_{\varDelta }- 1)}. \end{aligned}$$

(22)

Note that, since $\varepsilon \in (0, 1)$, the following inclusions are valid:

$$\begin{aligned} \tau '_{ij} \in (\tau _{i-1},\tau _{i}], \quad i \in {1}...{(n_{\varDelta }-1)}, \quad j \in {0}...{n_\varepsilon }. \end{aligned}$$

For any $i \in {0}...{(n_{\varDelta }- 1)}$ and any function $x^{(i)}(\cdot )\in C([t_0, \tau _i], \mathbb {R}^n)$, let us choose a vector $\nu _i (x^{(i)}(\cdot ))$ such that

$$\begin{aligned} \nu _i (x^{(i)}(\cdot )) \in {\left\{ \begin{array}{ll} Q, &{} i = 0, \\ \mathop {\mathrm{argmin}}\limits _{v\in Q} \max \limits _{j\in {1}...{n_\varepsilon }} \big \Vert d_{ij}(x^{(i)}(\cdot ))-f(\tau _i, x^{(i)}(\tau _i),u_j^\varepsilon ,v) \big \Vert , &{} i \in {1}...{(n_{\varDelta }-1)}, \end{array}\right. } \end{aligned}$$

(23)

where, for $i \in {1}...{(n_{\varDelta }- 1)}$ and $j \in {1}...{n_{\varepsilon }}$, we denote by ${d}_{ij}(x^{(i)}(\cdot ))$ the divided difference

$$\begin{aligned} d_{ij}(x^{(i)}(\cdot )) {\text {:=}}\frac{x^{(i)}(\tau '_{ij})-x^{(i)}(\tau '_{i(j-1)})}{\tau '_{ij}-\tau '_{i(j-1)}}. \end{aligned}$$

(24)

Before proceeding to the formal definition of the control $\mathbf {U}_\varepsilon ^\varDelta $ on the partition $\varDelta $, let us describe it as a control procedure with a guide.

5.1 Optimal Control with a Guide

The control $\mathbf {U}_\varepsilon ^\varDelta $ on the partition $\varDelta $ can be treated as a control procedure with a guide (see, e.g., [7, §8.2]). A motion of the guide is considered as an auxiliary motion $y(\cdot )$ of system (1). We suppose that this motion $y(\cdot )$ satisfies the same initial condition $y(t_0) = x_0$ as the motion $x(\cdot )$ of the original system, and we denote by ${{\overline{u}}}(\cdot ) \in \mathcal {U}$ and ${{\overline{v}}}(\cdot ) \in {\mathcal {V}}$ the corresponding control and “disturbance” realizations that determine this motion $y(\cdot )$. Thus, according to the introduced notations, we have $y(\cdot ) = x(\cdot ; x_0, {{\overline{u}}}(\cdot ), {{\overline{v}}}(\cdot )).$ Note that, according to choice (3) of the number R, for any such auxiliary motion $y(\cdot )$, the inclusions $y(t) \in B(R)$, $t \in T$, are valid.

Let us describe this control procedure. In order to choose the “disturbance” ${{\overline{v}}}(\cdot )$ in the guide, when forming the control $u(\cdot )$ in the original system, we use the series of the test control actions $u_j^\varepsilon $, $j\in {1}...{n_\varepsilon }$, on the small part $[\tau _{i+1}^\prime , \tau _{i+1})$ of every step $[\tau _i, \tau _{i+1})$ of the partition $\varDelta $. By the observations of the corresponding reactions of the original system to these test controls, as in the theory of inverse problem of dynamics (see, e.g., [10]), we choose “on the fly” the “disturbance” $\overline{v}(\cdot )$ in the guide that in a some sense approximates the disturbance $v(\cdot )$ acting in the original system. After that, by the found approximation ${{\overline{v}}}(\cdot )$, we choose the control ${{\overline{u}}}(\cdot )$ in the guide according to the fixed $\varepsilon $-optimal counter-control $\overline{{\mathbf {U}}}_\varepsilon ^\varDelta $. Finally, the constructed control ${{\overline{u}}}(\cdot )$ is used in the original system on the current step of the partition except for the “test” part $[\tau _{i+1}^\prime , \tau _{i+1})$. Under a suitable choice of the parameters (see Lemma 1), the obtained motion $x(\cdot )$ of the original system is close to the constructed $\varepsilon $-optimal motion $y(\cdot )$ of the guide.

Thus, we consider the following step-by-step procedure of forming a control realization $u(\cdot ) \in \mathcal {U}$ in the original system and piecewise constant realizations ${{\overline{u}}}(\cdot )$ and ${{\overline{v}}}(\cdot )$ of the form

$$\begin{aligned} {{\overline{u}}}(t) = {\overline{u}}_i \in P, \quad {{\overline{v}}}(t) = {\overline{v}}_i \in Q, \quad t \in [\tau _i, \tau _{i + 1}), \quad i \in {0}...{(n_{\varDelta }- 1)}, \end{aligned}$$

(25)

in the guide. For any $i \in {0}...{(n_{\varDelta }- 1)}$, let $x(\cdot )|_{[t_0, \tau _i]}$ and $y(\cdot )|_{[t_0, \tau _i]}$ be, respectively, the histories of the motions of the original system and guide realized up to the time $\tau _i$. “Reconstructing” the disturbances acting in the original system on the interval $[\tau _{i-1}, \tau _i)$, we set

$$\begin{aligned} {\overline{v}}_i = \nu _i\big (x(\cdot )|_{[t_0, \tau _i]}\big ). \end{aligned}$$

(26)

Using the fixed counter-control $\overline{{\mathbf {U}}}^\varDelta _{\varepsilon } = (\overline{{\mathbf {U}}}^\varDelta _{\varepsilon i})_{i \in {0}...{(n_{\varDelta }- 1)}}$ in the guide, according to (7), we put

$$\begin{aligned} {\overline{u}}_i = \overline{{\mathbf {U}}}_{{\varepsilon } i}^\varDelta \big (y(\cdot )|_{[t_0, \tau _i]} \big ) ({\overline{v}}_i). \end{aligned}$$

(27)

After that, we define

$$\begin{aligned} u(t) = {\left\{ \begin{array}{ll} {\overline{u}}_i, &{}\quad t \in [\tau _i, \tau _{i+1}'), \\ u_j^\varepsilon , &{}\quad t \in [\tau '_{(i+1)(j-1)}, \tau '_{(i+1)j}), \quad j \in {1}...{n_\varepsilon }, \end{array}\right. } \end{aligned}$$

(28)

where $u_j^\varepsilon $ are the elements of the chosen $\varepsilon $-net.

An illustration to the described control procedure with the guide is given in Fig. 1.

5.2 Optimal Control Strategy with Full Memory

The control procedure with guide (26)–(28) is formalized as the control with full memory $\mathbf {U}^\varDelta _\varepsilon = (\mathbf {U}^\varDelta _{\varepsilon i})_{i \in {0}...{(n_{\varDelta }- 1)}}$ on the partition $\varDelta $ as follows. Let $i \in {0}...{(n_\varDelta -1)}$ and $x^{(i)}(\cdot )\in C([t_0, \tau _i],\mathbb {R}^n).$ Set

$$\begin{aligned} {{\overline{v}}}(t) = \nu _k \big (x^{(i)}(\cdot )|_{[t_0, \tau _k]}\big ), \quad t \in [\tau _k, \tau _{k+1}), \quad k \in {0}...{i}. \end{aligned}$$

(29)

Consider the auxiliary motion $y^{(i)}(t) {\text {:=}}x(t; x_0, \overline{{\mathbf {U}}}^\varDelta _\varepsilon , {{\overline{v}}}(\cdot ))$, $t \in [t_0, \tau _i]$, of system (1) and put

$$\begin{aligned} \mathbf {U}^\varDelta _{\varepsilon i}\big (x^{(i)}(\cdot )\big )(t){\text {:=}}{\left\{ \begin{array}{ll} \overline{{\mathbf {U}}}^\varDelta _{\varepsilon i}\big (y^{(i)}(\cdot )\big )({{\overline{v}}}(\tau _i)), &{} t \in [\tau _i, \tau _{i+1}'),\\ u^\varepsilon _j, &{} t \in [\tau '_{(i+1)(j-1)}, \tau '_{(i+1)j}), \quad j \in {1}...{n_\varepsilon }. \end{array}\right. } \end{aligned}$$

(30)

Thus, the control with full memory $\mathbf {U}^\varDelta _\varepsilon $ is defined for any partition $\varDelta $ that satisfies condition (21). Let a partition $\varDelta = (\tau _i)_{i\in {0}...{n_{\varDelta }}}$ do not satisfy this condition. Then, firstly, we “thin” this partition to a partition $\varDelta ^*\in \varDelta _T$, $\varDelta ^*\subset \varDelta $, satisfying (21). This can always be done, for instance, as follows:

$$\begin{aligned}&\varDelta ^*{\text {:=}}\Big \{\tau ^*_0{\text {:=}}t_0, \tau ^*_{n_{\varDelta ^*}}{\text {:=}}\vartheta , \\&\tau ^*_{i}{:=} \hbox {argmin} \{\tau \in \varDelta : \ \tau \ge i 2 {{\mathbf {D}}(\Delta )} \}, \ i\in \mathbb {N}, \ i\le (\vartheta - t_0) / (2 {{\mathbf {D}}(\Delta )}) \Big \}. \end{aligned}$$

Note that this “thinned” partition satisfies the condition

$$\begin{aligned} {{\mathbf {D}}(\Delta ^*)} \le 3 {{\mathbf {D}}(\Delta )}. \end{aligned}$$

(31)

After that, the control $\mathbf {U}_\varepsilon ^\varDelta $ is defined with the help of the control $\mathbf {U}_\varepsilon ^{\varDelta ^*}$ for the corresponding “thinned” partition by ignoring the times $\tau _i \in \varDelta \setminus \varDelta ^*$. Note that, according to this definition, for any initial state $x_0 \in B(R_0)$ and any disturbance realization $v(\cdot ) \in {\mathcal {V}}$, we have $x(\cdot ; x_0, \mathbf {U}_\varepsilon ^\varDelta , v(\cdot )) = x(\cdot ; x_0, \mathbf {U}_\varepsilon ^{\varDelta ^*}, v(\cdot ))$.

Hence, the strategy $\mathbf {U}_\varepsilon = (\mathbf {U}^\varDelta _\varepsilon )_{\varDelta \in \varDelta _T}$ is completely defined.

Theorem 1

Let $\mathbb {V}$ be a compact functional constraint on the disturbance. Then, for any number $\zeta > 0$, there exists a number $\varepsilon ^*\in (0, 1)$ such that, for any initial state $x_0 \in B(R_0)$ and any number $\varepsilon \in (0, \varepsilon ^*]$, the strategy with full memory $\mathbf {U}_\varepsilon \in \mathbf {S}$, defined by relations (29), (30), satisfies the inequality

$$\begin{aligned} \varGamma _{\mathbf {S}}(x_0; \mathbf {U}_\varepsilon \mid \mathbb {V}) \le \varGamma ^0_{\mathbf {QS}}(x_0) + \zeta . \end{aligned}$$

(32)

The proof of the theorem is given in the next section.

6 Proof of Theorem 1

The basis of the proof of Theorem 1 constitutes the following lemma, which is also of independent interest.

Lemma 1

For any number $\xi > 0$, there exists a number $\varepsilon _*\in (0,1)$ such that, for any number $\varepsilon \in (0,\varepsilon _*]$ and any set $V \subset {\mathcal {V}}$ compact in $L_1(T, \mathbb {R}^q)$, we can specify a number $\delta _*> 0$ such that, for any initial state $x_0 \in B(R_0)$ and any satisfying (21) partition $\varDelta \in \varDelta _T$ with the diameter ${{\mathbf {D}}(\Delta )} \le \delta _*$, the following statement holds. Let the motions $x(\cdot )$ and $y(\cdot )$ of system (1) be generated from the initial state $x_0$ by realizations $u(\cdot )$, $v(\cdot )$ and ${{\overline{u}}}(\cdot )$, ${{\overline{v}}}(\cdot )$, respectively. Let the inclusion $v(\cdot ) \in V$ be valid and these realizations satisfy relations (25) and (26), (28) for $i \in {0}...{(n_{\varDelta }- 1)}$. Then the following inequality holds:

$$\begin{aligned} \Vert x(t) - y(t)\Vert \le \xi , \quad t \in T. \end{aligned}$$

(33)

Before proving the lemma, let us introduce the necessary notations. Due to the properties of the function f from the right-hand side of Eq. (1) and compactness of the sets T, B(R), P and Q, we choose numbers $\varkappa > 0$ and $L > 0$ such that, for any $t \in T$, $x \in B(R)$, $u \in P$ and $v \in Q$, we have

$$\begin{aligned} \Vert f(t, x, u, v)\Vert \le \varkappa , \quad \Vert f(t, x, u, v) - f(t, x^\prime , u, v)\Vert \le L \Vert x - x^\prime \Vert . \end{aligned}$$

(34)

Let us denote

$$\begin{aligned} \begin{array}{rcl} \mu _t (\delta ) &{} {\text {:=}}&{} \max \Big \{ \Vert f(t, x, u, v) - f(t^\prime , x, u, v)\Vert : \\ &{} &{} \qquad t, t^\prime \in T, \ x \in B(R), \ u \in P, \ v \in Q, \ |t - t^\prime | \le \delta \Big \}, \\ \mu _u (\delta ) &{} {\text {:=}}&{} \max \Big \{ \Vert f(t, x, u, v) - f(t, x, u', v)\Vert : \\ &{} &{} \qquad t \in T, \ x \in B(R), \ u, u' \in P, \ v\in Q, \ \Vert u - u'\Vert \le \delta \Big \}, \\ \mu _v (\delta ) &{} {\text {:=}}&{} \max \Big \{ \Vert f(t, x, u, v) - f(t, x, u, v^\prime )\Vert : \\ &{} &{} \qquad t \in T, \ x \in B(R), \ u \in P, \ v, v^\prime \in Q, \ \Vert v - v^\prime \Vert \le \delta \Big \}, \\ \psi (\delta ) &{} {\text {:=}}&{} \mu _t(\delta ) + L \varkappa \delta , \quad \delta > 0. \end{array} \end{aligned}$$

Note that these functions $\mu _t(\delta )$, $\mu _u(\delta )$, $\mu _v(\delta )$ and $\psi (\delta )$ are nondecreasing and tend to zero when $\delta \downarrow 0$. Note also that, for any motion $x(\cdot )$ of system (1) generated from an initial state $x_0 \in B(R_0)$ by realizations $u(\cdot ) \in \mathcal {U}$ and $v(\cdot ) \in {\mathcal {V}}$, the inequality

$$\begin{aligned} \big \Vert f(t, x(t), u, v) - f(t^\prime , x(t^\prime ), u, v^\prime ) \big \Vert \le \psi (|t - t^\prime |) + \mu _v (\Vert v - v^\prime \Vert ) \end{aligned}$$

(35)

holds for any $t, t' \in T$, $u\in P$ and $v, v'\in Q$.

Proof of Lemma 1

Fix a number $\xi > 0$ and choose a number $\xi _* > 0$ from the condition

$$\begin{aligned} \xi _* \exp ({L (\vartheta - t_0)}) \le \xi . \end{aligned}$$

(36)

Let a number $\varepsilon _* \in (0, 1)$ be such that

$$\begin{aligned} 2(\vartheta -t_0)(\varepsilon _*\varkappa +\mu _u(\varepsilon _*))\le \xi _*/3. \end{aligned}$$

(37)

Fix a number $\varepsilon \in (0,\varepsilon _*]$ and a set $V \subset {\mathcal {V}}$ compact in $L_1(T, \mathbb {R}^q)$. Taking into account [2, Assertion 3], one can specify a number $\delta _1 > 0$ such that, for any $\delta \in (0, \delta _1]$ and any function $v(\cdot ) \in V$, the following inequality holds:

$$\begin{aligned} \int _T\frac{1}{4\delta }\int _{s-2\delta }^{s+2\delta }\mu _v\big (\Vert v(s)-v(\tau )\Vert \big ) \,\mathrm {d}\tau \,\mathrm {d}s \le \frac{\varepsilon \xi _*}{72n_\varepsilon }. \end{aligned}$$

(38)

Here and below, it is assumed that $v(t) = 0$ for $t \notin T$. Further, choose a number $\delta _2 > 0$ such that

$$\begin{aligned} 2\varkappa \delta _2+2(\vartheta -t_0)\big ( 2 \mu _t(2 \delta _2) + 3 L \varkappa \delta _2 \big ) \le \xi _*/ 3. \end{aligned}$$

(39)

Put $\delta _*= \min \{\delta _1, \delta _2\} > 0$. Let us show that the assertion of the lemma holds for the chosen parameters.

In accordance with the statement of the lemma, let us assume that an initial state $x_0 \in B(R_0)$, a partition $\varDelta \in \varDelta _T$, realizations $u(\cdot )$, $v(\cdot )$ and ${{\overline{u}}}(\cdot )$, ${{\overline{v}}}(\cdot )$, and the corresponding motions ${x}(\cdot )$ and ${y}(\cdot )$ of system (1) are fixed. Let $\delta {\text {:=}}{{\mathbf {D}}(\Delta )}\le \delta _*$. Let us estimate the value $\Vert x(t) - y(t)\Vert $ for $t \in T$. Since the motions ${x}(\cdot )$ and ${y}(\cdot )$ are generated from the same initial state, we have

$$\begin{aligned} \Vert {x}(t)-{y}(t)\Vert =\bigg \Vert \int _{t_0}^t \big ( f(s,{x}(s),{u}(s),v(s))-f(s,{y}(s),{{\overline{u}}}(s),{{\overline{v}}}(s)) \big ) \,\mathrm {d}s\bigg \Vert . \end{aligned}$$

In the right-hand side, we add and subtract under the integral sign the quantities $f (s,{x}(s),{{\overline{u}}}(s),v(s))$ and $f(s,{x}(s),{{\overline{u}}}(s),{{\overline{v}}}(s))$ (we continue the estimate):

$$\begin{aligned}&\displaystyle \le \left\Vert\int _{t_0}^t \big ( f(s,{x}(s),{u}(s),v(s))-f(s,{x}(s),{{\overline{u}}}(s),v(s)) \big ) \,\mathrm {d}s\right\Vert\nonumber \\&\quad \displaystyle +\left\Vert\int _{t_0}^t \big ( f(s,{x}(s),{{\overline{u}}}(s),v(s))-f(s,{x}(s),{{\overline{u}}}(s),{{\overline{v}}}(s)) \big ) \,\mathrm {d}s\right\Vert\nonumber \\&\quad \displaystyle +\left\Vert\int _{t_0}^t \big ( f(s,{x}(s),{{\overline{u}}}(s),{{\overline{v}}}(s))-f(s,{y}(s),{{\overline{u}}}(s),{{\overline{v}}}(s)) \big ) \,\mathrm {d}s\right\Vert\nonumber \\&\quad \displaystyle {\text {:=}}I_1+I_2+I_3. \end{aligned}$$

(40)

Let us estimate the integral $I_1$ in (40). Due to (22), (25) and (28), the measure of the set E that consists of all points $t\in T$ such that ${u}(t) \ne {{\overline{u}}}(t)$ does not exceed $\varepsilon (\vartheta - t_0)$. Therefore, applying (34), we obtain

$$\begin{aligned} I_1 \le \int _E \Vert f(s,{x}(s),{u}(s),v(s))-f(s,{x}(s),{{\overline{u}}}(s),v(s)) \Vert \,\mathrm {d}s \le 2 \varkappa \varepsilon (\vartheta -t_0). \end{aligned}$$

(41)

Let us estimate $I_2$ in (40). By the definition of $\varepsilon $-net (see (20)), we derive

(42)

Let $i\in {1}...{(n_{\varDelta }-1)}$, $j\in {1}...{n_\varepsilon }$ and $s\in [\tau _{i},\tau _{i+1})$. We have

(43)

Let us estimate the first term from the right-hand side of this inequality. Since, due to (24) and (30), the equality ${u}(\tau ) = u^\varepsilon _j$ holds for $\tau \in [\tau '_{i(j-1)}, \tau '_{ij})$, we get

$$\begin{aligned}&\displaystyle \bigg \Vert f(s,{x}(s),u^\varepsilon _j,v(s))-\frac{{x}(\tau '_{ij})-{x}(\tau '_{i(j-1)})}{\tau '_{ij}-\tau '_{i(j-1)}}\bigg \Vert \\&\quad \displaystyle =\bigg \Vert \int ^{\tau '_{ij}}_{\tau '_{i(j-1)}}\frac{f(s,{x}(s),u^\varepsilon _j,v(s))-f(\tau ,{x}(\tau ),u^\varepsilon _j,v(\tau ))}{\tau '_{ij}-\tau '_{i(j-1)}}\,\mathrm {d}\tau \bigg \Vert \end{aligned}$$

$$\begin{aligned}&\displaystyle \le \int ^{\tau '_{ij}}_{\tau '_{i(j-1)}}\frac{\psi (s-\tau )+\mu _v(\Vert v(s)-v(\tau )\Vert )}{\tau '_{ij}-\tau '_{i(j-1)}}\,\mathrm {d}\tau \nonumber \\&\displaystyle \le \psi (2\delta )+\frac{n_\varepsilon }{\varepsilon {\mathbf {d}}(\Delta )}\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert )\,\mathrm {d}\tau . \end{aligned}$$

(44)

For the second term from the right-hand side of (43), taking into account (23), (26) and the inclusion $s \in [\tau _{i}, \tau _{i+1})$, we derive

$$\begin{aligned}&\displaystyle \max _{j\in {1}...{n_\varepsilon }}\Vert d_{ij}({x}(\cdot ))-f(s,{x}(s),u^\varepsilon _j,{{\overline{v}}}(s))\Vert \nonumber \\&\displaystyle \quad = \max _{j\in {1}...{n_\varepsilon }}\Vert d_{ij}({x}(\cdot ))-f(s,{x}(s),u^\varepsilon _j,{\overline{v}}_i(\tau _i))\Vert \nonumber \\&\quad \displaystyle \le \max _{j\in {1}...{n_\varepsilon }}\Vert d_{ij}({x}(\cdot ))-f(\tau _i,{x}(\tau _i),u^\varepsilon _j,{\overline{v}}_i(\tau _i))\Vert +\psi (\delta ) \nonumber \\&\quad \displaystyle \le \max _{j\in {1}...{n_\varepsilon }}\Vert d_{ij}({x}(\cdot ))-f(\tau _i,{x}(\tau _i),u^\varepsilon _j,v(s))\Vert +\psi (\delta ) \nonumber \\&\quad \displaystyle \le \max _{j\in {1}...{n_\varepsilon }}\Vert d_{ij}({x}(\cdot ))-f(s,{x}(s),u^\varepsilon _j,v(s))\Vert +2\psi (\delta )\nonumber \\&\quad \displaystyle \le \psi (2\delta )+2\psi (\delta )+\frac{n_\varepsilon }{\varepsilon {\mathbf {d}}(\Delta )}\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert )\,\mathrm {d}\tau . \end{aligned}$$

(45)

From (42)–(45), adding the estimate of difference (43) on the interval $[\tau _0,\tau _1]$, we obtain

$$\begin{aligned} I_2\le & {} 2\varkappa \delta +2(\vartheta -t_0) \big ( \psi (2\delta )+\psi (\delta )+\mu _u(\varepsilon ) \big ) \nonumber \\&+\, \frac{2n_\varepsilon }{\varepsilon {\mathbf {d}}(\Delta )}\int _T\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert ) \,\mathrm {d}\tau \,\mathrm {d}s. \end{aligned}$$

(46)

For the integral $I_3$ in (40), according to (34), we have

$$\begin{aligned} I_3 \le \int _{t_0}^t L\Vert {x}(s)-{y}(s)\Vert \,\mathrm {d}s. \end{aligned}$$

(47)

Thus, from estimates (41), (46) and (47), for any $t \in T$, we derive

$$\begin{aligned}&\displaystyle \Vert {x}(t)-{y}(t)\Vert \\&\quad \displaystyle \le \int _{t_0}^t L\Vert {x}(s)-{y}(s)\Vert \,\mathrm {d}s +2\varkappa \delta + 2(\vartheta -t_0)\big (\varkappa \varepsilon +\psi (2\delta )+\psi (\delta )+\mu _u(\varepsilon )\big ) \\&\qquad \displaystyle + \frac{2n_\varepsilon }{\varepsilon {\mathbf {d}}(\Delta )}\int _T\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert )\,\mathrm {d}\tau \,\mathrm {d}s \\&\quad \displaystyle \le \int _{t_0}^t L\Vert {x}(s)-{y}(s)\Vert \,\mathrm {d}s+2\varkappa \delta +2(\vartheta -t_0)\big (\varkappa \varepsilon +\mu _u(\varepsilon )+3L\varkappa \delta +2\mu _t(2\delta )\big )\\&\qquad \displaystyle +\sup _{v'(\cdot )\in V}\bigg \{\frac{6n_\varepsilon }{\varepsilon \delta }\int _T\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v'(s)-v'(\tau )\Vert )\,\mathrm {d}\tau \,\mathrm {d}s\bigg \}\\&\quad \displaystyle {\text {:=}}\int _{t_0}^t L\Vert {x}(s)-{y}(s)\Vert \,\mathrm {d}s+\varPsi (\varepsilon ,\delta ). \end{aligned}$$

Due to choice (37)–(39) of $\varepsilon $ and $\varDelta $, we have $\varPsi (\varepsilon ,\delta )\le \xi _*$. Then, applying the Bellman–Gronwall lemma, we deduce $\Vert {x}(t)-{y}(t)\Vert \le \xi _*\exp {(L(\vartheta -t_0))}$, $t\in T$. Therefore, according to (36), inequality (33) holds. Lemma 1 is proved. $\square $

Proof of Theorem 1

Fix a compact functional constraint on the disturbance $\mathbb {V}$ and a number $\zeta > 0$. Let us consider the set $\widetilde{D} \subset C(T, \mathbb {R}^n)$ that consists of all functions $x(\cdot )$ such that

$$\begin{aligned} \Vert x(t)\Vert \le R, \quad \Vert x(t) - x(t^\prime )\Vert \le \varkappa |t - t^\prime |, \quad t, t^\prime \in T, \end{aligned}$$

where R and $\varkappa $ are taken from (3) and (34). Note that the set $\widetilde{D}$ is compact. Hence, since the function $\sigma $ from quality index (4) is continuous, there exists a number $\xi > 0$ such that, for any functions $x(\cdot ), y(\cdot ) \in \widetilde{D}$, if $\Vert x(t) - y(t)\Vert \le \xi $, $t \in T$, then

$$\begin{aligned} |\sigma (x(\cdot )) - \sigma (y(\cdot ))| \le \zeta /3. \end{aligned}$$

By this number $\xi $, let us choose a number $\varepsilon _*> 0$ according to Lemma 1, and put $\varepsilon ^*= \min \{\varepsilon _*, \zeta /3\}$. Let us prove that the assertion of the theorem holds for this value $\varepsilon ^*$.

Fix an initial state $x_0 \in B(R_0)$ and a number $\varepsilon \in (0, \varepsilon ^*]$. According to (8) and (19), there exists a number $\widetilde{\delta } > 0$ such that, for any partition $\varDelta \in \varDelta _T$, ${{\mathbf {D}}(\Delta )}\le \widetilde{\delta }$, and any disturbance realization $v(\cdot ) \in {\mathcal {V}}$, we have

$$\begin{aligned} \sigma \Big ( x\big (\cdot ; x_0, \overline{{\mathbf {U}}}_\varepsilon ^\varDelta , v(\cdot ) \big ) \Big ) \le \varGamma _{\mathbf {CS}}(x_0; \overline{{\mathbf {U}}}_\varepsilon ) + \zeta /3 \le \varGamma ^0_{\mathbf {QS}}(x_0) + 2 \zeta /3. \end{aligned}$$

(48)

Fix a set $V \in \mathbb {V}$. Let $\delta _*$ be chosen by $\varepsilon $ and V according to Lemma 1. Put $\delta ^*= \min \{\delta _*, \widetilde{\delta }\}$. Let a partition $\varDelta \in \varDelta _T$ be such that ${{\mathbf {D}}(\Delta )} \le \delta ^*/3$. We assume that $\varDelta $ satisfies condition (21). Otherwise, we replace $\varDelta $ by the “thinned” partition $\varDelta ^*$ (see Sect. 5.2). In any case, we have a partition that satisfies condition (21) and has the diameter not exceeding the value of $\delta ^*$ (see (31)).

Let $v(\cdot ) \in V$ and $x(\cdot ) = x(\cdot ; x_0, \mathbf {U}^\varDelta _\varepsilon , v(\cdot ))$. Let $y(\cdot )$ be the corresponding motion of the guide. According to (27), we have $y(\cdot ) = x(\cdot ; x_0, \overline{{\mathbf {U}}}^\varDelta _\varepsilon , {{\overline{v}}}(\cdot ))$ for some ${{\overline{v}}}(\cdot ) \in {\mathcal {V}}$, and, therefore, due to (48), we obtain

$$\begin{aligned} \sigma (y(\cdot )) \le \varGamma ^0_{\mathbf {QS}}(x_0) + 2 \zeta /3. \end{aligned}$$

Furthermore, by the choice of $\delta _*$, we have $\Vert x(t) - y(t)\Vert \le \xi $, $t \in T$. Hence, due to the choice of $\xi $, since $x(\cdot ), y(\cdot ) \in \widetilde{D}$, we deduce

$$\begin{aligned} \sigma (x(\cdot )) \le \sigma (y(\cdot )) + \zeta /3. \end{aligned}$$

Thus, we have shown that, for any set $V \in \mathbb {V}$, there exists a number $\delta ^*>0$ such that, for any partition $\varDelta \in \varDelta _T$, ${{\mathbf {D}}(\Delta )} \le \delta ^*$, and any disturbance realization $v(\cdot ) \in V$, the following inequality holds:

$$\begin{aligned} \sigma \Big (x\big (\cdot ; x_0, \mathbf {U}^\varDelta _\varepsilon , v(\cdot )\big )\Big ) \le \varGamma ^0_{\mathbf {QS}}(x_0) + \zeta . \end{aligned}$$

From this fact, taking into account definition (17), we conclude the validity of inequality (32). Theorem 1 is proved. $\square $

Let us give some remarks concerning Theorem 1 and Lemma 1.

1.
For the guarantee optimization problem under a functional constraint on the disturbance, Lemma 1 can be considered as an analog of the estimates from [7, §2.3], which play a key role in establishing the properties of the extremal shift strategies.
2.
Although the construction of the strategy $\mathbf {U}_\varepsilon $ and estimate (32) of its optimality are independent on a set $V\in \mathbb {V}$ (and even on a constraint $\mathbb {V}$), according to Lemma 1, to provide inequality (33) for a given number $\xi > 0$, a partition $\varDelta $ should be chosen on the basis of the specific set V.
3.
Coefficient “3” in condition (21) can be replaced by any other number from $[1,\infty )$. This coefficient can only affect the rate of convergence of the guaranteed result of the corresponding strategy to the optimal guaranteed result when the parameters $\varepsilon $ and ${{\mathbf {D}}(\Delta )}$ are decreasing to zero.

7 Reduction of the Disturbance Reconstruction Problem

In numerical realization of the optimal strategy $\mathbf {U}_\varepsilon $, the rapid growth when $\varepsilon \downarrow 0$ of the dimension of disturbance reconstruction problem (23), (26) can cause difficulties. However, it is known that when the function f from (1) satisfies Property 1 (see [2]), to reconstruct the disturbance, it is sufficient to use any single value of the control instead of the series of “test” controls as in the general case. Therefore, we can simply use the previous step control value. The rest construction of the optimal control strategy remains the same.

Property 1

For any $t \in T$, $x \in B(R)$ and $v, v'\in Q$, if the equality

$$\begin{aligned} f(t,x,u,v) = f(t,x,u,v') \end{aligned}$$

holds for some value $u = u' \in P$, then this equality holds for any value $u \in P$.

Note that Property 1 is valid for any function f that is injective with respect to $v \in Q$ for any fixed $t \in T$, $x \in B(R)$ and $u \in P$. Another example is given by the following particular case of system (1):

$$\begin{aligned} \frac{\,\mathrm {d}x(t)}{\,\mathrm {d}t} = \bar{f}(t, x(t), u(t)) + \bar{g}(t,x(t),u(t)) \bar{h}(t,x(t),v(t)), \end{aligned}$$

(49)

where $\bar{f}: T \times \mathbb {R}^n \times P \rightarrow \mathbb {R}^n$, $\bar{h}: T \times \mathbb {R}^n \times Q \rightarrow \mathbb {R}^m$, and $\bar{g}$ maps $T \times \mathbb {R}^n \times P$ into the space of ($n\times m$)-matrices. Property 1 holds for system (49) if the kernel of the linear operator $\bar{g}(t, x, u): \mathbb {R}^m \rightarrow \mathbb {R}^n$ does not depend on $u \in P$ for any $t \in T$, $x \in B(R)$. Note also that Property 1, formulated in different terms, is considered in [13, 15].

Let us define a strategy with full memory $\widehat{{\mathbf {U}}}_{\varepsilon }^\varDelta {\text {:=}}(\widehat{{\mathbf {U}}}_{\varepsilon i}^\varDelta )_{i\in {0}...{(n_\varDelta -1)}}$ for any value of the accuracy parameter $\varepsilon \in (0, 1)$. Let a partition $\varDelta \in \varDelta _T$ satisfy condition (21). Since, instead of the series of “test” controls, we now use only the previous step control, in accordance with (22), we put

$$\begin{aligned} n_\varepsilon = 1, \quad \tau '_{i0} = \tau '_i = \tau _{i-1}, \quad \tau '_{i1}=\tau _i, \quad i \in {1}...{(n_{\varDelta }- 1)}, \end{aligned}$$

and, therefore, due to (23) and (24), we define

$$\begin{aligned} \widehat{\nu }_i (x^{(i)}(\cdot )) \in {\left\{ \begin{array}{ll} Q, &{} i = 0,\\ \mathop {\mathrm{argmin}}\limits _{v\in Q}\left\| d_{i1}(x^{(i)}(\cdot )) - f(\tau _i, x^{(i)}(\tau _i),{\overline{u}}_{i-1},v)\right\| , &{} i \in {1}...{(n_{\varDelta }-1)}, \end{array}\right. } \end{aligned}$$

(50)

where

$$\begin{aligned} d_{i1}(x^{(i)}(\cdot )) = \frac{x^{(i)}(\tau _i)-x^{(i)}(\tau _{i - 1})}{\tau _i - \tau _{i - 1}}. \end{aligned}$$

By analogy with Sect. 5.1, we consider the following control procedure with the guide. We define a piecewise constant control realization

$$\begin{aligned} u(t) = u_i \in P, \quad t \in [\tau _i, \tau _{i + 1}), \quad i \in {0}...{(n_{\varDelta }- 1)}, \end{aligned}$$

(51)

in the original system and realizations ${{\overline{u}}}(\cdot )$ and ${{\overline{v}}}(\cdot )$ of form (25) in the guide according to the following rule:

$$\begin{aligned}&{\overline{v}}_i {\text {:=}}\widehat{\nu }_i\big (x(\cdot )|_{[t_0, \tau _i]}\big ), \end{aligned}$$

(52)

$$\begin{aligned}&{\overline{u}}_i = \overline{{\mathbf {U}}}_{{\varepsilon } i}^\varDelta \big (y(\cdot )|_{[t_0, \tau _i]} \big ) ({\overline{v}}_i), \end{aligned}$$

(53)

$$\begin{aligned}&u_i = {\overline{u}}_i, \end{aligned}$$

(54)

where $\overline{{\mathbf {U}}}^\varDelta _{\varepsilon } = (\overline{{\mathbf {U}}}^\varDelta _{\varepsilon i})_{i \in {0}...{(n_{\varDelta }- 1)}}$ is fixed $\varepsilon $-optimal counter-control (19), and $y(\cdot )$ is the corresponding motion of the guide.

Thus, by analogy with Sect. 5.2, the control $\widehat{{\mathbf {U}}}^\varDelta = (\widehat{{\mathbf {U}}}^\varDelta _{\varepsilon i})_{i \in {0}...{(n_{\varDelta }- 1)}}$ on the partition $\varDelta $ is defined by

$$\begin{aligned} \widehat{{\mathbf {U}}}^\varDelta _{\varepsilon i}\big (x^{(i)}(\cdot )\big )(t) {\text {:=}}\overline{{\mathbf {U}}}^\varDelta _{\varepsilon i}\big (y^{(i)}(\cdot )\big )({{\overline{v}}}(\tau _i)), \quad t \in [\tau _i, \tau _{i+1}), \quad i \in {0}...{(n_\varDelta -1)}, \end{aligned}$$

(55)

where

$$\begin{aligned} {{\overline{v}}}(t)= & {} \widehat{\nu }_k \big (x^{(i)}(\cdot )|_{[t_0, \tau _k]}\big ), \quad t \in [\tau _k, \tau _{k+1}), \quad k \in {0}...{i}, \nonumber \\ y^{(i)}(t)= & {} x\big (t; x_0, \overline{{\mathbf {U}}}^\varDelta _\varepsilon , {{\overline{v}}}(\cdot )\big ), \quad t \in [t_0, \tau _i]. \end{aligned}$$

(56)

As usual, if a partition $\varDelta \in \varDelta _T$ does not satisfy condition (21), then we should use the control $\widehat{{\mathbf {U}}}_\varepsilon ^{\varDelta ^*}$ on the corresponding “thinned” partition $\varDelta ^*$.

An illustration to the described in this section control procedure with the guide is given in Fig. 2.

Note also that the strategy $\widehat{{\mathbf {U}}}_\varepsilon $ differs from the one proposed in [2, 13]. This new construction follows naturally from the general case (see Sect. 5) and has better convergence estimates.

Theorem 2

Let system (1) satisfy Property 1. Let $\mathbb {V}$ be a compact functional constraint on the disturbance. Then, for any number $\zeta > 0$, there exists a number $\varepsilon ^*\in (0, 1)$ such that, for any initial state $x_0 \in B(R_0)$ and any number $\varepsilon \in (0, \varepsilon ^*]$, the strategy with full memory $\widehat{{\mathbf {U}}}_\varepsilon \in \mathbf {S}$, defined by relations (56), (55), satisfies the inequality

$$\begin{aligned} \varGamma _{\mathbf {S}}(x_0; \widehat{{\mathbf {U}}}_\varepsilon \mid \mathbb {V}) \le \varGamma ^0_{\mathbf {QS}}(x_0) + \zeta . \end{aligned}$$

Theorem 2 is proved by the same scheme as Theorem 1, but the following lemma is used instead of Lemma 1. This lemma establishes a suitable estimate of the closeness between the motions of the original system and guide when Property 1 is satisfied and control procedure (52)–(54) is used.

Lemma 2

Let system (1) satisfy Property 1. Then, for any number $\xi > 0$ and any set $V \subset {\mathcal {V}}$ compact in $L_1(T, \mathbb {R}^q)$, we can specify a number $\delta _*> 0$ such that, for any initial state $x_0 \in B(R_0)$ and any satisfying (21) partition $\varDelta \in \varDelta _T$ with the diameter ${{\mathbf {D}}(\Delta )} \le \delta _*$, the following statement holds. Let the motions $x(\cdot )$ and $y(\cdot )$ of system (1) be generated from the initial state $x_0$ by realizations $u(\cdot )$, $v(\cdot )$ and ${{\overline{u}}}(\cdot )$, ${{\overline{v}}}(\cdot )$, respectively. Let the inclusion $v(\cdot )\in V$ be valid and these realizations satisfy relations (25), (51) and (52), (54) for $i \in {0}...{(n_{\varDelta }- 1)}$. Then the following inequality holds:

$$\begin{aligned} \Vert x(t) - y(t)\Vert \le \xi , \quad t \in T. \end{aligned}$$

(57)

In the proof of the lemma, we use the notation:

$$\begin{aligned}&\mu _{uv} (\delta ){\text {:=}}\max \Big \{ \Vert f(t, x, u, v) - f(t, x, u, v^\prime ) \Vert :\\&\quad t \in T, \ x \in B(R), \ u, u^\prime \in P, \ v, v^\prime \in Q, \ \Vert f(t, x, u^\prime , v) - f(t, x, u^\prime , v^\prime ) \Vert \le \delta \Big \}. \end{aligned}$$

Note that, the inequality

$$\begin{aligned} \Vert f(t, x, u, v) - f(t, x, u, v')\Vert \le \mu _{uv} \big ( \Vert f(t, x, u', v) - f(t, x, u', v')\Vert \big ) \end{aligned}$$

(58)

holds for any $t \in T$, $x \in B(R)$, $u,u'\in P$ and $v,v'\in Q$. Furthermore, if Property 1 is satisfied, then, according to [2, Assertion 1], we have

$$\begin{aligned} \lim _{\delta \downarrow 0} \mu _{uv}(\delta ) = 0. \end{aligned}$$

(59)

Proof of Lemma 2

Fix a number $\xi > 0$ and a set $V \subset {\mathcal {V}}$ compact in $L_1(T, \mathbb {R}^q)$. Choose a number $\xi _*> 0$ from condition (36). Taking into account [2, Assertions 2, 3] and (59), one can specify a number $\delta _*> 0$ such that, for any number $\delta \in (0,\delta _*]$ and any function $v(\cdot ) \in V$, the following inequality holds:

$$\begin{aligned} 2 \varkappa \delta + \int _{T} \mu _{uv}\bigg (4\psi (2\delta ) + \frac{6}{\delta }\int _{s - 2 \delta }^{s + 2 \delta } \mu _v(\Vert v(s) - v(\tau )\Vert )\,\mathrm {d}\tau \bigg )\,\mathrm {d}s \le \xi _*. \end{aligned}$$

(60)

Let us show that the assertion of the lemma holds for this value $\delta _*$.

In accordance with the statement of the lemma, let us assume that an initial state $x_0 \in B(R_0)$, a partition $\varDelta \in \varDelta _T$, realizations $u(\cdot )$, $v(\cdot )$ and ${{\overline{u}}}(\cdot )$, ${{\overline{v}}}(\cdot )$, and the corresponding motions ${x}(\cdot )$ and ${y}(\cdot )$ of system (1) are fixed. Let $\delta {\text {:=}}{{\mathbf {D}}(\Delta )}\le \delta _*$. Let us estimate the value $\Vert x(t) - y(t)\Vert $ for $t \in T$. In view of (25), (51) and (54), we have

$$\begin{aligned} \Vert {x}(t)-{y}(t)\Vert =\bigg \Vert \int _{t_0}^t \big (f(s,{x}(s),{{\overline{u}}}(s),v(s))-f(s,{y}(s),{{\overline{u}}}(s),{{\overline{v}}}(s))\big )\,\mathrm {d}s\bigg \Vert . \end{aligned}$$

In the right-hand side, we add and subtract under the integral sign the quantity $f(s,{x}(s),{{\overline{u}}}(s),{{\overline{v}}}(s))$ (we continue the estimate):

$$\begin{aligned}&\displaystyle \le \left\Vert\int _{t_0}^t \big ( f(s,{x}(s),{{\overline{u}}}(s),v(s)) - f(s,{x}(s),{{\overline{u}}}(s),{{\overline{v}}}(s)) \big ) \,\mathrm {d}s\right\Vert \nonumber \\&\quad \displaystyle +\left\Vert\int _{t_0}^t \big ( f(s,{x}(s),{{\overline{u}}}(s),{{\overline{v}}}(s)) - f(s,{y}(s),{{\overline{u}}}(s),{{\overline{v}}}(s)) \big ) \,\mathrm {d}s\right\Vert {\text {:=}}I_1+I_2. \end{aligned}$$

(61)

Let us estimate $I_1$ in (61). Let $i\in {1}...{(n_{\varDelta }- 1)}$ and $s\in [\tau _{i},\tau _{i+1})$. We have

(62)

Let us estimate the first term in the right-hand side of this inequality. Due to (35), we obtain

$$\begin{aligned}&\displaystyle \bigg \Vert f(s,{x}(s),{\overline{u}}_{i-1},v(s))-\frac{{x}(\tau _{i})-{x}(\tau _{i-1})}{\tau _{i}-\tau _{i-1}}\bigg \Vert \nonumber \\&\quad \displaystyle =\bigg \Vert \int ^{\tau _{i}}_{\tau _{i-1}}\frac{f(s,{x}(s),{\overline{u}}_{i-1},v(s))-f(\tau ,{x}(\tau ), {\overline{u}}_{i-1},v(\tau ))}{\tau _{i}-\tau _{i-1}}\,\mathrm {d}\tau \bigg \Vert \nonumber \\&\quad \displaystyle \le \int ^{\tau _{i}}_{\tau _{i-1}}\frac{\psi (s-\tau )+\mu _v(\Vert v(s)-v(\tau )\Vert )}{\tau _{i}-\tau _{i-1}}\,\mathrm {d}\tau \nonumber \\&\quad \displaystyle \le \psi (2\delta )+\frac{1}{{\mathbf {d}}(\Delta )}\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert )\,\mathrm {d}\tau \nonumber \\&\quad \displaystyle \le \psi (2\delta )+\frac{3}{\delta }\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert )\,\mathrm {d}\tau . \end{aligned}$$

(63)

For the second term in the right-hand side of (62), taking into account (50), (52) and the inclusion $s\in [\tau _{i},\tau _{i+1})$, we derive

$$\begin{aligned}&\Vert d_{i1}({x}(\cdot ))-f(s,{x}(s),{\overline{u}}_{i-1},{{\overline{v}}}(s))\Vert \nonumber \\&\quad =\Vert d_{i1}({x}(\cdot ))-f(s,{x}(s),{\overline{u}}_{i-1},{\overline{v}}(\tau _i))\Vert \nonumber \\&\quad \le \Vert d_{i1}({x}(\cdot ))-f(\tau _i,{x}(\tau _i),{\overline{u}}_{i-1},{\overline{v}}(\tau _i))\Vert +\psi (\delta )\nonumber \\&\quad \le \Vert d_{i1}({x}(\cdot ))-f(\tau _i,{x}(\tau _i),{\overline{u}}_{i-1},v(s))\Vert +\psi (\delta )\nonumber \\&\quad \le \Vert d_{i1}({x}(\cdot ))-f(s,{x}(s),{\overline{u}}_{i-1},v(s))\Vert +2\psi (\delta )\nonumber \\&\quad \displaystyle \le 3 \psi (2\delta ) + \frac{3}{\delta } \int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert )\,\mathrm {d}\tau . \end{aligned}$$

(64)

From (62)–(64), we obtain

Therefore, due to (34) and (58), we deduce

$$\begin{aligned} I_1\le 2\varkappa \delta +\int _{T}\mu _{uv}\bigg (4\psi (2\delta )+\frac{6}{\delta }\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v(s)-v(\tau )\Vert )\,\mathrm {d}\tau \bigg )\,\mathrm {d}s. \end{aligned}$$

(65)

For the integral $I_2$ in (61), using (34), we obtain

$$\begin{aligned} I_2\le \int _{t_0}^t L\Vert {x}(s)-{y}(s)\Vert \,\mathrm {d}s. \end{aligned}$$

(66)

According to (61), (65) and (66), we have

$$\begin{aligned} \Vert {x}(t)-{y}(t)\Vert\le & {} \int _{t_0}^t L\Vert {x}(s)-{y}(s)\Vert \,\mathrm {d}s\\&+\,2\varkappa \delta +\sup _{v^\prime (\cdot )\in V}\bigg \{\int _{T}\mu _{uv}\bigg (4\psi (2\delta )+\frac{6}{\delta }\int _{s-2\delta }^{s+2\delta }\mu _v(\Vert v^\prime (s)-v^\prime (\tau )\Vert )\,\mathrm {d}\tau \bigg )\,\mathrm {d}s\bigg \}\\&\qquad \qquad \qquad \qquad \displaystyle {\text {:=}}\int _{t_0}^t L\Vert {x}(s)-{y}(s)\Vert \,\mathrm {d}s+\varPhi (\delta ). \end{aligned}$$

Due to choice (60) of $\varDelta $, we get $\varPhi (\delta )\le \xi _*$. Then, applying the Bellman–Gronwall lemma, we deduce $\Vert {x}(t)-{y}(t)\Vert \le \xi _*\exp {(L(\vartheta - t_0))}$, $t\in T$. Therefore, according to (36), inequality (57) holds. Lemma 2 is proved. $\square $

8 Examples

In this section, we give examples illustrating the availability for numerical realization of the proposed in the paper (see Sects. 5 and 7) solution of the guarantee optimization problem for system (1), initial condition (2) and quality index (4) under a compact functional constraint on the disturbance $\mathbb {V}$. The most difficult part in this solution is to construct the $\varepsilon $-optimal counter-strategy with full memory $\overline{{\mathbf {U}}}_\varepsilon $. With few exceptions, optimal strategies in guarantee optimization problems (differential games) are hard to calculate. However, there are some classes of the problems for which effective procedures are known for calculating the value of the optimal guaranteed result (the game value) and, as a consequence, for constructing the corresponding optimal strategies. For example, in the so-called linear-convex case, we can apply the upper convex hulls method [1, 4] (see also [9]). We use this method in Examples 2 and 3.

Example 1

The first example shows that Property 1 is essential in Lemma 2 and Theorem 2. Let a motion of a dynamical system be described by the equation

$$\begin{aligned} \frac{\,\mathrm {d}x(t)}{\,\mathrm {d}t} = u(t) v(t), \ \ t \in [0,1], \ \ x(t) \in \mathbb {R}, \ \ u(t) \in \{0, 1\}, \ \ v(t) \in \{-1, 1\}, \end{aligned}$$

(67)

with the initial condition $x(0) = 0$, and let $\gamma = x(1)$ be a quality index. Note that system (67) does not satisfy Property 1. In this problem, the optimal guaranteed result in the class of quasi-strategies (6) is $\varGamma ^0_{\mathbf {QS}}(0) = 0$, and the counter-strategy $\overline{{\mathbf {U}}}_0(v) = 0$ for $v = 1$ and $\overline{{\mathbf {U}}}_0(v) = 1$ for $v = -1$ is optimal. Suppose that a set V from a compact functional constraint $\mathbb {V}$ consists of the only one function $v(t) = 1$, $t \in [0, 1]$. We consider a partition $\varDelta $ of the time interval [0, 1] with the constant step $\delta = {{\mathbf {D}}(\Delta )}$ and define piecewise constant realizations ${u}(\cdot )$, ${{\overline{u}}}(\cdot )$ and ${{\overline{v}}}(\cdot )$ by the following rule:

$$\begin{aligned} {u}_i ={{\overline{u}}}_i = {\left\{ \begin{array}{ll} 1, &{}\quad \text{ if } i \text{ is } \text{ even },\\ 0, &{}\quad \text{ otherwise }, \end{array}\right. } \quad {{\overline{v}}}_i = {\left\{ \begin{array}{ll} -1, &{}\quad \text{ if } i \text{ is } \text{ even },\\ 1, &{}\quad \text{ otherwise }, \end{array}\right. } \quad i \in {0}...{(n_{\varDelta }- 1)}. \end{aligned}$$

One can verify that such ${u}(\cdot )$, ${{\overline{u}}}(\cdot )$ and ${{\overline{v}}}(\cdot )$ satisfy relations (52)–(54). Substituting the realizations ${u}(\cdot )$ and $v(\cdot )$ into system (67), we obtain

$$\begin{aligned} \gamma = x(1) \ge 1/2 - \delta /2. \end{aligned}$$

Thus, in this problem, the control procedure with guide (52)–(54), and, therefore, strategy $\widehat{{\mathbf {U}}}_\varepsilon $ (55), (56), does not guarantee for the quality index $\gamma $ the value $\varGamma ^0_{\mathbf {QS}}(0)$. So, the assertion of Theorem 2 does not hold in this example. The analysis of the corresponding motion of the guide shows that the assertion of Lemma 2 does not hold here either.

Table 1 The results of the numerical simulation in Example 2

Full size table

Example 2

Let a motion of a dynamical system be described by the equations

$$\begin{aligned} {\left\{ \begin{array}{ll} \displaystyle \frac{\,\mathrm {d}x_1(t)}{\,\mathrm {d}t} = u_1(t)( v_1(t)+v_2(t)), &{}\quad \ t \in [0,2],\\ \displaystyle \frac{\,\mathrm {d}x_2(t)}{\,\mathrm {d}t} = u_2(t) v_1(t) v_2(t), &{}\quad \ x(t) = \big ( x_1(t), x_2(t)\big ) \in \mathbb {R}^2, \end{array}\right. } \end{aligned}$$

(68)

and the initial condition $x(0) = (0, 0)$. Let the geometric constraints on the control and disturbance have the form

$$\begin{aligned}&P {\text {:=}}\big \{(u_1, u_2) \in {\mathbb {R}}^2: \ 0.5\le |u_i| \le 1.5, \ i = 1, 2\big \},\\&Q {\text {:=}}\big \{(v_1, v_2) \in \mathbb {R}^2 : \ 1 \le v_1^2 + v_2^2 \le 4\big \}. \end{aligned}$$

Let us consider the quality index

$$\begin{aligned} \gamma {\text {:=}}\sqrt{(x_1(1)-2)^2+(x_2(1)-1)^2+x_1^2(2)+(x_2(2)+2)^2)}. \end{aligned}$$

(69)

Note that the right-hand side of system (68) is not injective with respect to $v = (v_1, v_2)$. But since the system is of form (49), and the corresponding kernel is constant and equal to $\{(0,0)\}$, system (68) satisfies Property 1. Moreover, one can show that system (68) does not satisfy condition (14).

Let a set V from a compact functional constraint on the disturbance $\mathbb {V}$ consist of all functions from [0, 2] to Q that are piecewise constant on the partition of [0, 2] with the constant step 0.05. So, the number of possible switchings of the disturbance is not greater than 40. Note that, for the chosen Q, the set V is compact in $L_1([0, 2],\mathbb {R}^2)$.

In simulation below we use the strategy with full memory $\widehat{{\mathbf {U}}}_\varepsilon $ described in Sect. 7. We construct the corresponding $\varepsilon $-optimal counter-strategy $\overline{{\mathbf {U}}}_\varepsilon $ and the value of the optimal guaranteed result $\varGamma ^0_{\mathbf {QS}}(0, 0)$ on the basis of the upper convex hulls method. Furthermore, we simulate disturbance realizations on the basis of the optimal counter-strategy of the disturbance (the second player), which is also constructed by the upper convex hulls method. The step $\delta $ of the partition $\varDelta $ used in the corresponding control with full memory $\widehat{{\mathbf {U}}}_\varepsilon ^\varDelta $ we vary within the set $\{0.05, 0.01, 0.002, 0.0004\}$.

For the value of optimal guaranteed result in the class of strategies with full memory under the compact functional constraint $\mathbb {V}$, we obtain

$$\begin{aligned} \varGamma ^0_{\mathbf {S}}(0,0 \mid \mathbb {V}) = \varGamma ^0_{\mathbf {QS}}(0,0) \approx 2.8760. \end{aligned}$$

The results of the numerical simulation are presented in Table 1, where $\delta $ is the step of the partition $\varDelta $; $\Vert x(\cdot ) - y(\cdot )\Vert $ is the maximal distance between the motions of the system $x(\cdot )$ and the guide $y(\cdot )$; $\gamma $ is the realized value of quality index (69). The motions of the system and guide for $\delta = 0.01$ and $\delta = 0.0004$ are shown in Figs. 3 and 4, respectively.

Table 2 The results of the numerical simulation in Example 3

Full size table

Example 3

Let us consider the same guarantee optimization problem as in Example 2 but with the geometrical constraints

$$\begin{aligned}&P {\text {:=}}\big \{u\in \mathbb {R}^2: u\in \{(-1.1),(-1,0),(1,0),(1,-1)\}\big \},\\&Q {\text {:=}}\big \{(v_1, v_2) \in \mathbb {R}^2 : |v_1|,|v_2|\in \{0.5,2\}\big \}. \end{aligned}$$

One can show that Property 1 is not fulfilled in this case. Therefore, in the numerical simulations, we use the strategy with full memory $\mathbf {U}_\varepsilon $ described in Sect. 5. In (20), we put $n_\varepsilon = 4$ and choose the whole set P as its $\varepsilon $-net. In (22), we choose $\varepsilon = 0.01$.

For the corresponding value of the optimal guaranteed result, we obtain

$$\begin{aligned} \varGamma ^0_{\mathbf {S}}(0,0 \mid \mathbb {V}) = \varGamma ^0_{\mathbf {QS}}(0,0) \approx 2.8359. \end{aligned}$$

The results of the numerical simulation are presented in Table 2. The motions of the system and guide for $\delta = 0.01$ and $\delta = 0.0004$ are shown in Figs. 5 and 6, respectively.

References

Gomoyunov MI, Kornev DV (2016) On calculating the value of a differential game in the class of counter strategies. Ural Math J 2(1):38–47
Article MATH Google Scholar
Gomoyunov MI, Serkov DA (2017) Control with a guide in the guarantee optimization problem under functional constraints on the disturbance. Proc Steklov Inst Math 299(Suppl 1):49–60
Article MathSciNet MATH Google Scholar
Isaacs R (1965) Differential games. Wiley, New York
MATH Google Scholar
Kornev DV (2012) On numerical solution of positional differential games with nonterminal payoff. Autom Rem Control 73(11):1808–1821
Article MathSciNet MATH Google Scholar
Krasovskii AN, Krasovskii NN (1995) Control under lack of information. Birkhäuser, Boston
Book MATH Google Scholar
Krasovskii NN (1985) Control of a dynamical system. Nauka, Moscow (in Russian)
Krasovskii NN, Subbotin AI (1988) Game-theoretical control problems. Springer, New York
Book Google Scholar
Kryazhimskii AV (1991) The problem of optimization of the ensured result: unimprovability of full-memory strategies. In: Kryazhimskii AV (ed) Constantin caratheodory: an international tribute, vol 1. World Scientific, Teaneck, pp 636–675
Chapter Google Scholar
Lukoyanov NYu, Gomoyunov MI (2019) Differential games on minmax of the positional quality index (submitted to Dyn Games Appl)
Osipov YuS, Kryazhimskii AV (1995) Inverse problems for ordinary differential equations: dynamical solutions. Gordon and Breach Science Publishers, London
MATH Google Scholar
Roxin E (1969) Axiomatic approach in differential games. J Optim Theory Appl 3(3):153–163
Article MathSciNet MATH Google Scholar
Ryll-Nardzewski C (1964) A theory of pursuit and evasion. In: Dresher M, Shapley LS, Tucker AW (eds) Advances in game theory. Princeton University Press, Princeton, pp 113–126
Google Scholar
Serkov DA (2012) Guaranteed control under functionally restricted disturbances. Mat Teor Igr Prilozh 4(2):71–95 (in Russian)
MATH Google Scholar
Serkov DA (2012) Optimal guarantee under the disturbances of Caratheodory type. Vestn Udmurtsk Univ Mat Mekh Komp Nauki 2:74–83 (in Russian)
Article MATH Google Scholar
Serkov DA (2013) Optimal risk control under functionally restricted disturbances. Mat Teor Igr Prilozh 5(1):74–103 (in Russian)
MATH Google Scholar
Serkov DA (2014) On the unimprovability of full-memory strategies in the risk minimization problem. Proc Steklov Inst Math 287(Suppl 1):175–184
Article MathSciNet MATH Google Scholar
Serkov DA (2015) On the unimprovability of full-memory strategies in problems of guaranteed result optimization. Proc Steklov Inst Math 291(Suppl 1):157–172
Article MathSciNet MATH Google Scholar
Subbotin AI, Chentsov AG (1981) Guarantee optimization in control problems. Nauka, Moscow (in Russian)

Download references

Acknowledgements

We thank the referees for their careful reading and their remarks that allowed us to improve the paper.

Author information

Authors and Affiliations

Krasovskii Institute of Mathematics and Mechanics of the Ural Branch of the Russian Academy of Sciences, Ural Branch of Russian Academy of Sciences, S. Kovalevskaya Str., 16, Ekaterinburg, Russia
Mikhail Gomoyunov & Dmitriy Serkov
Ural Federal University, Mira Str., 32, Ekaterinburg, Russia
Mikhail Gomoyunov & Dmitriy Serkov

Authors

Mikhail Gomoyunov
View author publications
You can also search for this author in PubMed Google Scholar
Dmitriy Serkov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mikhail Gomoyunov.

Additional information

This work was supported by the Integrated Program for Fundamental Research of the Ural Branch of the Russian Academy of Sciences (Project No. 18-1-1-10).

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gomoyunov, M., Serkov, D. On a Solution of a Guarantee Optimization Problem Under the Functional Constraints on the Disturbance. Dyn Games Appl 9, 700–723 (2019). https://doi.org/10.1007/s13235-018-0279-1

Download citation

Published: 04 August 2018
Issue Date: 15 September 2019
DOI: https://doi.org/10.1007/s13235-018-0279-1

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

On a Solution of a Guarantee Optimization Problem Under the Functional Constraints on the Disturbance

Abstract

Similar content being viewed by others

Control with a Guide in the Guarantee Optimization Problem under Functional Constraints on the Disturbance

The Linear Constrained Control Problem for Discrete-Time Systems: Regulation on the Boundaries

A Survey on Regularity Conditions for State-Constrained Optimal Control Problems and the Non-degenerate Maximum Principle

1 Introduction

2 Statement of the Problem

2.1 Dynamical System and Quality Index

2.2 Functional Constraint on the Disturbance

3 Guarantee Optimization Without Functional Constraints

3.1 Quasi-Strategies

3.2 Counter-Strategies with Full Memory

3.3 Strategies with Full Memory

3.4 Comparison of Optimal Guaranteed Results

4 Guarantee Optimization Under a Functional Constraint on the Disturbance

5 Construction of Optimal Strategy

5.1 Optimal Control with a Guide

5.2 Optimal Control Strategy with Full Memory

Theorem 1

6 Proof of Theorem 1

Lemma 1

Proof of Lemma 1

Proof of Theorem 1

7 Reduction of the Disturbance Reconstruction Problem

Property 1

Theorem 2

Lemma 2

Proof of Lemma 2

8 Examples

Example 1

Example 2

Example 3

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation