Many of the results derived in this book are concerned with a generally formulated optimization problem. But if a concrete problem with a rich mathematical structure is given, then solutions, or characterizations of solutions, can sometimes be derived directly. In this case one takes advantage of the special structure of the optimization problem and achieves the desired results very quickly.

In this final chapter we present two special optimal control problems and show how they can be treated without the use of general theoretical optimization results. The first problem is a so-called linear quadratic optimal control problem. For the given quadratic objective functional one gets a minimal solution with the aid of a simple quadratic completion without using necessary optimality conditions. The second problem is a time-minimal optimal control problem which can be solved directly by the application of a separation theorem.

9.1 Linear Quadratic Optimal Control Problems

In this section we consider a system of autonomous linear differential equations

$$\displaystyle \begin{aligned} \dot{x}(t) = Ax(t)+Bu(t) \mbox{ almost everywhere on } [0,\hat{T}] \end{aligned} $$
(9.1)

and an initial condition

$$\displaystyle \begin{aligned} x(0)=x^{0} \end{aligned} $$
(9.2)

(where \(\hat {T}{\,>\,}0\) and \(x^{0}{\,\in \,}\mathbb {R}^{n}\) are arbitrarily given). Let A and B be (n, n) and (n, m) matrices with real coefficients, respectively. Let every control \(u\in L_{\infty }^{m}([0,\hat {T}])\) be feasible (i.e. the controls are unconstrained). It is our aim to steer the system (9.1), (9.2) as close to a state of rest as possible at the terminal time \(\hat {T}\). In other words: For a given positive definite symmetric (n, n) matrix G with real coefficients the quadratic form \(x(\hat {T})^{T}Gx(\hat {T})\) should be minimal. Since we want to reach our goal with a minimal steering effort, for a given positive definite symmetric (m, m) matrix R with real coefficients the expression \(\int \limits _{0}^{\hat {T}} u(t)^{T}Ru(t)\, dt\) should be minimized as well. These two goals are used for the definition of the objective functional \(J:L_{\infty }^{m}([0,\hat {T}])\rightarrow \mathbb {R}\) with

$$\displaystyle \begin{aligned} J(u) = x(\hat{T})^{T}Gx(\hat{T}) + \int\limits_{0}^{\hat{T}} u(t)^{T}Ru(t)\, dt \mbox{ for all } u\in L_{\infty}^{m}([0,\hat{T}]). \end{aligned}$$

Under these assumptions the considered linear quadratic optimal control problem then reads as follows:

$$\displaystyle \begin{aligned} \left. \begin{array}{l} \min\limits_{u\in L_{\infty}^{m}([0,\hat{T}])}\ x(\hat{T})^{T}Gx(\hat{T}) + \int\limits_{0}^{\hat{T}} u(t)^{T}Ru(t)\, dt\\[1ex] \mbox{subject to}\\ \dot{x}(t)=Ax(t)+Bu(t) \mbox{ almost everywhere on } [0,\hat{T}],\\ x(0)=x^{0}. \end{array} \right\} \end{aligned} $$
(9.3)

In order to be able to present an optimal control for the problem (9.3) we need two technical lemmas.

Lemma 9.1 (relationship between control and trajectory)

Let P(⋅) be a real (n, n) matrix function which is symmetric and differentiable on \([0,\hat {T}]\). Then it follows for an arbitrary control \(u\in L_{\infty }^{m}([0,\hat {T}])\) and a trajectory x of the initial value problem (9.1), (9.2):

$$\displaystyle \begin{aligned} \begin{array}{rcl} 0 &\displaystyle = &\displaystyle {x^{0}}^{T}P(0)x^{0} - x(\hat{T})^{T}P(\hat{T})x(\hat{T}) + \int\limits_{0}^{\hat{T}}\Big[ 2u(t)^{T}B^{T}P(t)x(t)\\ &\displaystyle &\displaystyle \ \ \ \ \ \ \ \ \ + x(t)^{T}\left( \dot{P}(t)+A^{T}P(t)+P(t)A\right) x(t)\Big] dt. \end{array} \end{aligned} $$

Proof

Let \(u\in L_{\infty }^{m}([0,\hat {T}])\) be an arbitrary control, let x be a corresponding trajectory of the initial value problem (9.1), (9.2), and let P(⋅) be an arbitrary real matrix function which is symmetric and differentiable on \([0,\hat {T}]\). Then it follows:

$$\displaystyle \begin{aligned} \begin{array}{rcl} \frac{d}{dt}\left[ x(t)^{T}P(t)x(t)\right] &\displaystyle = &\displaystyle \dot{x}(t)^{T}P(t)x(t)+x(t)^{T}\left( \dot{P}(t)x(t) +P(t)\dot{x}(t)\right) \\ &\displaystyle = &\displaystyle \big(Ax(t)+Bu(t)\big)^{T}P(t)x(t)\\ &\displaystyle &\displaystyle +x(t)^{T}\left( \dot{P}(t)x(t)+P(t)\left(Ax(t)+Bu(t)\right) \right) \\ &\displaystyle = &\displaystyle x(t)^{T}\left(\dot{P}(t)+A^{T}P(t)+P(t)A\right) x(t)\\ &\displaystyle &\displaystyle +2u(t)^{T}B^{T}P(t)x(t) \mbox{ almost everywhere on } [0,\hat{T}]. \end{array} \end{aligned} $$

With the initial condition (9.2) we get immediately by integration

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle {x(\hat{T})^{T}P(\hat{T})x(\hat{T})-{x^{0}}^{T}P(0)x^{0}}\\ &\displaystyle &\displaystyle \ = \int\limits_{0}^{\hat{T}} \Big[ 2u(t)^{T}B^{T}P(t)x(t) + x(t)^{T}\left( \dot{P}(t)+A^{T}P(t)+P(t)A\right) x(t)\Big]\, dt \end{array} \end{aligned} $$

which implies the assertion. □
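The identity of Lemma 9.1 can be checked numerically in the scalar case n = m = 1. The sketch below uses arbitrarily chosen data (A = 0.5, B = 1, x⁰ = 1, T̂ = 1, the control u(t) = sin t and the matrix function P(t) = 1 + t²; all of these are our own illustrative choices, not taken from the text): it integrates the state equation together with the integrand of the lemma and verifies that the stated expression vanishes.

```python
import math

# Scalar check of Lemma 9.1 (n = m = 1); all data are illustrative choices:
A, B, x0, T_hat = 0.5, 1.0, 1.0, 1.0
u = math.sin                       # an arbitrary essentially bounded control
P = lambda t: 1.0 + t * t          # symmetric (trivially) and differentiable
P_dot = lambda t: 2.0 * t

def rhs(t, y):
    x = y[0]
    # state equation (9.1) and the integrand of Lemma 9.1
    dx = A * x + B * u(t)
    di = 2.0 * u(t) * B * P(t) * x + x * x * (P_dot(t) + 2.0 * A * P(t))
    return (dx, di)

# classical Runge-Kutta integration of (x, integral) on [0, T_hat]
n, t, y = 2000, 0.0, (x0, 0.0)
h = T_hat / n
for _ in range(n):
    k1 = rhs(t, y)
    k2 = rhs(t + h / 2, tuple(y[j] + h / 2 * k1[j] for j in range(2)))
    k3 = rhs(t + h / 2, tuple(y[j] + h / 2 * k2[j] for j in range(2)))
    k4 = rhs(t + h, tuple(y[j] + h * k3[j] for j in range(2)))
    y = tuple(y[j] + h / 6 * (k1[j] + 2 * k2[j] + 2 * k3[j] + k4[j]) for j in range(2))
    t += h

x_T, integral = y
# Lemma 9.1:  0 = x0 P(0) x0 - x(T) P(T) x(T) + integral
residual = x0 * P(0.0) * x0 - x_T * P(T_hat) * x_T + integral
```

Up to the discretization error of the Runge-Kutta scheme, `residual` is zero, reflecting the fact that the integrand is the total derivative of x(t)ᵀP(t)x(t).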

Lemma 9.2 (Bernoulli matrix differential equation)

The (n, n) matrix function P(⋅) with

$$\displaystyle \begin{aligned} P(t) = \bigg[ e^{A(t-\hat{T})}G^{-1}e^{A^{T}(t-\hat{T})} &+\int\limits_{t}^{\hat{T}} e^{A(t-s)}BR^{-1}B^{T}e^{A^{T}(t-s)}\, ds \bigg] ^{-1} \\ &\qquad \qquad \qquad \quad \mathit{\mbox{for all }} t\in [0,\hat{T}] {} \end{aligned} $$
(9.4)

is a solution of the Bernoulli matrix differential equation

$$\displaystyle \begin{aligned} \dot{P}(t)+A^{T}P(t)+P(t)A-P(t)BR^{-1}B^{T}P(t)=0_{(n,n)} \mathit{\mbox{ for all }} t\in [0,\hat{T}] \end{aligned} $$
(9.5)

with the terminal condition

$$\displaystyle \begin{aligned} P(\hat{T})=G. \end{aligned} $$
(9.6)

The matrix function P(⋅) defined by (9.4) is symmetric.

Proof

First we define the (n, n) matrix function Q(⋅) by

$$\displaystyle \begin{aligned} Q(t) = e^{A(t-\hat{T})}G^{-1}e^{A^{T}(t-\hat{T})} +\int\limits_{t}^{\hat{T}} e^{A(t-s)}BR^{-1}B^{T}e^{A^{T}(t-s)}\, ds \mbox{ for all } t\in [0,\hat{T}] \end{aligned}$$

(notice that the matrix exponential function is defined as a matrix series). It is evident that Q(⋅) is a symmetric matrix function. For an arbitrary \(z\in \mathbb {R}^{n}\), \(z\neq 0_{\mathbb {R}^{n}}\), we obtain

$$\displaystyle \begin{aligned} \begin{array}{rcl} z^{T}Q(t)z &\displaystyle = &\displaystyle \underbrace{z^{T}e^{A(t-\hat{T})}G^{-1}e^{A^{T}(t-\hat{T})}z}_{ >\ 0} +\int\limits_{t}^{\hat{T}}\underbrace{z^{T}e^{A(t-s)}BR^{-1}B^{T} e^{A^{T}(t-s)}z}_{\geq\ 0} ds\\ &\displaystyle > &\displaystyle 0 \mbox{ for all } t\in [0,\hat{T}]. \end{array} \end{aligned} $$

Consequently, for every \(t\in [0,\hat {T}]\) the matrix Q(t) is positive definite and therefore invertible, i.e. the matrix function P(⋅) with

$$\displaystyle \begin{aligned} P(t)=Q(t)^{-1} \mbox{ for all } t\in [0,\hat{T}] \end{aligned}$$

is well-defined. Since Q(⋅) is symmetric, P(⋅) is also symmetric.

It is obvious that P(⋅) satisfies the terminal condition (9.6). Hence, it remains to be shown that P(⋅) is a solution of the Bernoulli matrix differential equation (9.5). For this proof we calculate the derivative (notice the implications for arbitrary \(t\in [0,\hat {T}]\): \(Q(t)\cdot Q(t)^{-1}=I \Longrightarrow \dot {Q}(t)Q(t)^{-1}+Q(t)\frac {d}{dt}\left ( Q(t)^{-1}\right ) =0_{(n,n)} \Longrightarrow \frac {d}{dt}\left ( Q(t)^{-1}\right ) =-Q(t)^{-1}\dot {Q}(t)Q(t)^{-1}\))

$$\displaystyle \begin{aligned} \begin{array}{rcl} \dot{P}(t) &\displaystyle = &\displaystyle \frac{d}{dt}\left( Q(t)^{-1}\right) \\ &\displaystyle = &\displaystyle -Q(t)^{-1}\dot{Q}(t)Q(t)^{-1}\\ &\displaystyle = &\displaystyle -Q(t)^{-1}\bigg[ Ae^{A(t-\hat{T})}G^{-1}e^{A^{T}(t-\hat{T})} +e^{A(t-\hat{T})}G^{-1}e^{A^{T}(t-\hat{T})}A^{T}\\ &\displaystyle &\displaystyle +\int\limits_{t}^{\hat{T}}\bigg( Ae^{A(t-s)}BR^{-1} B^{T}e^{A^{T}(t-s)}\\ &\displaystyle &\displaystyle +e^{A(t-s)}BR^{-1}B^{T}e^{A^{T}(t-s)}A^{T}\bigg)\, ds -BR^{-1}B^{T}\bigg] Q(t)^{-1}\\ &\displaystyle = &\displaystyle -Q(t)^{-1}\left[ AQ(t)+Q(t)A^{T}-BR^{-1}B^{T}\right] Q(t)^{-1}\\ &\displaystyle = &\displaystyle -Q(t)^{-1}A-A^{T}Q(t)^{-1}+Q(t)^{-1}BR^{-1}B^{T}Q(t)^{-1}\\ &\displaystyle = &\displaystyle -P(t)A-A^{T}P(t)+P(t)BR^{-1}B^{T}P(t) \mbox{ for all } t\in [0,\hat{T}]. \end{array} \end{aligned} $$

Consequently, P(⋅) satisfies the Bernoulli matrix differential equation (9.5). □
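Lemma 9.2 can be made concrete in the scalar case. The following sketch uses the data A = 3, B = 1, G = 1, R = 1/5, T̂ = 1 of Example 9.4 below, evaluates formula (9.4) (whose integral has an elementary closed form in the scalar case), and checks the terminal condition (9.6) exactly and the Bernoulli equation (9.5) by central differences.

```python
import math

# Scalar check of Lemma 9.2 with the data of Example 9.4:
A, B, G, R, T_hat = 3.0, 1.0, 1.0, 0.2, 1.0

def P(t):
    # formula (9.4); in the scalar case the integral equals
    # (B^2/R) * (1 - e^{2A(t - T_hat)}) / (2A)
    integral = (B * B / R) * (1.0 - math.exp(2.0 * A * (t - T_hat))) / (2.0 * A)
    return 1.0 / (math.exp(2.0 * A * (t - T_hat)) / G + integral)

# Bernoulli equation (9.5): P' + 2AP - P * B * R^{-1} * B * P = 0,
# checked at a few interior points by central differences
h = 1e-4
max_residual = 0.0
for t in [0.1, 0.3, 0.5, 0.7, 0.9]:
    dP = (P(t + h) - P(t - h)) / (2.0 * h)
    residual = dP + 2.0 * A * P(t) - P(t) * B * (1.0 / R) * B * P(t)
    max_residual = max(max_residual, abs(residual))
```

The terminal condition P(T̂) = G holds by construction, since the integral in (9.4) vanishes at t = T̂.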

With the aid of the two preceding lemmas it is now possible to present the optimal control of the linear quadratic problem (9.3).

Theorem 9.3 (feedback control)

The so-called feedback control \(\bar {u}\) given by

$$\displaystyle \begin{aligned} \bar{u}(t)= -R^{-1}B^{T}P(t)x(t) \mathit{\mbox{ almost everywhere on }} [0,\hat{T}] \end{aligned}$$

is the unique optimal control of the linear quadratic control problem (9.3), where the matrix function P(⋅) is given by (9.4).

Proof

In the following let P(⋅) be the matrix function defined by (9.4). Then we have with Lemmas 9.1 and 9.2 for every control \(u\in L_{\infty }^{m}([0,\hat {T}])\) with \(u\neq \bar {u}\):

$$\displaystyle \begin{aligned} \begin{array}{rcl} J(u) &\displaystyle = &\displaystyle x(\hat{T})^{T}Gx(\hat{T})+\int\limits_{0}^{\hat{T}}u(t)^{T} Ru(t)\, dt\\ &\displaystyle = &\displaystyle {x^{0}}^{T}P(0)x^{0}+x(\hat{T})^{T}[G-P(\hat{T})]x(\hat{T})\\ &\displaystyle &\displaystyle +\int\limits_{0}^{\hat{T}}\bigg[ u(t)^{T}Ru(t)+2u(t)^{T}B^{T}P(t)x(t)\\[-0.5ex] &\displaystyle &\displaystyle \ \ \ \ \ \ \ \ \ \ +x(t)^{T}\Big( \dot{P}(t) +A^{T}P(t)+P(t)A\Big) x(t)\bigg]\, dt\\ &\displaystyle &\displaystyle \qquad \qquad \qquad \qquad \qquad \qquad \qquad \quad \mbox{(from Lemma 9.1)}\\ &\displaystyle = &\displaystyle {x^{0}}^{T}P(0)x^{0}+\int\limits_{0}^{\hat{T}} \bigg[ u(t)^{T}Ru(t) +2u(t)^{T}B^{T}P(t)x(t)\\[-0.5ex] &\displaystyle &\displaystyle \qquad \qquad \qquad \qquad +x(t)^{T}P(t)BR^{-1}B^{T}P(t)x(t)\bigg]\, dt\\ &\displaystyle &\displaystyle \qquad \qquad \qquad \qquad \qquad \qquad \qquad \mbox{(from Lemma 9.2)}\\ &\displaystyle = &\displaystyle {x^{0}}^{T}P(0)x^{0} +\int\limits_{0}^{\hat{T}} \Big( u(t)+R^{-1}B^{T} P(t)x(t)\Big) ^{T}R\\ &\displaystyle &\displaystyle \qquad \qquad \qquad \qquad \Big( u(t)+R^{-1}B^{T}P(t)x(t)\Big)\, dt\\ &\displaystyle > &\displaystyle {x^{0}}^{T}P(0)x^{0}\\ &\displaystyle = &\displaystyle J(\bar{u}). \end{array} \end{aligned} $$

Hence \(\bar {u}\) is the only minimal point of the functional J. □
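The chain of equalities above rests on the quadratic completion uᵀRu + 2uᵀBᵀPx + xᵀPBR⁻¹BᵀPx = (u + R⁻¹BᵀPx)ᵀR(u + R⁻¹BᵀPx). This can be observed numerically in the scalar case: the sketch below uses the data of Example 9.4 (A = 3, B = 1, G = 1, R = 1/5, T̂ = 1) together with the arbitrarily chosen control u(t) = cos 5t and x⁰ = 1 (our choices), and checks that J(u) = x⁰P(0)x⁰ + ∫(u + R⁻¹BᵀPx)ᵀR(u + R⁻¹BᵀPx) dt along the corresponding trajectory, so that in particular J(u) > x⁰P(0)x⁰.

```python
import math

# Scalar check of the quadratic-completion identity from the proof of
# Theorem 9.3; data as in Example 9.4, control chosen arbitrarily.
A, B, G, R, T_hat, x0 = 3.0, 1.0, 1.0, 0.2, 1.0, 1.0
u = lambda t: math.cos(5.0 * t)

def P(t):
    # scalar closed form of (9.4), cf. Example 9.4
    return 6.0 / (5.0 + math.exp(6.0 * (t - 1.0)))

def rhs(t, y):
    x = y[0]
    v = u(t) + (1.0 / R) * B * P(t) * x        # u + R^{-1} B^T P x
    return (A * x + B * u(t),                  # state equation (9.1)
            R * u(t) ** 2,                     # integrand of J
            R * v ** 2)                        # completed square

# classical Runge-Kutta integration of (x, both integrals) on [0, 1]
n, t, y = 4000, 0.0, (x0, 0.0, 0.0)
h = T_hat / n
for _ in range(n):
    k1 = rhs(t, y)
    k2 = rhs(t + h / 2, tuple(y[j] + h / 2 * k1[j] for j in range(3)))
    k3 = rhs(t + h / 2, tuple(y[j] + h / 2 * k2[j] for j in range(3)))
    k4 = rhs(t + h, tuple(y[j] + h * k3[j] for j in range(3)))
    y = tuple(y[j] + h / 6 * (k1[j] + 2 * k2[j] + 2 * k3[j] + k4[j]) for j in range(3))
    t += h

x_T, int_uRu, int_square = y
J = G * x_T ** 2 + int_uRu
identity_gap = abs(J - (P(0.0) * x0 ** 2 + int_square))
```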

The optimal control presented in Theorem 9.3 depends on the time variable t and the current state x(t). Such a control is called a closed loop control or a feedback control (see Fig. 9.1).

Fig. 9.1 Feedback control of Theorem 9.3

If the control function depends only on t and not on the state x(t), then it is called an open loop control.

Feedback controls are of special importance for applications. Although feedback controls are also derived from the mathematical model, they make use of the real state of the system which is described mathematically only in an approximate way. Hence, in the case of perturbations which are not included in the mathematical model, feedback controls are often more realistic for the regulation of the system.

Since the matrix function P is analytic and the trajectory x is absolutely continuous, the optimal control \(\bar {u}\) in Theorem 9.3 is an absolutely continuous vector function. In fact, a solution of the linear quadratic optimal control problem lies in a smaller subspace of \(L_{\infty }^{m}([0,\hat {T}])\).

Notice that the proof of Theorem 9.3 could be done with the aid of an optimality condition. Instead of this we use a quadratic completion with Lemmas 9.1 and 9.2 which is simpler from a mathematical point of view.

The linear quadratic control problem (9.3) can be formulated more generally. If one defines the objective functional J by

$$\displaystyle \begin{aligned} J(u)=x(\hat{T})^{T}Gx(\hat{T})&+\int\limits_{0}^{\hat{T}} \left( x(t)^{T}Qx(t)+ u(t)^{T} Ru(t)\right) dt \\ &\qquad \qquad \qquad \qquad \qquad \mbox{for all } u\in L_{\infty}^{m}([0,\hat{T}]) \end{aligned} $$

where Q is a positive definite symmetric (n, n) matrix with real coefficients, then the result of Theorem 9.3 remains almost true for the modified control problem. The only difference is that then the matrix function P(⋅) is a solution of the Riccati matrix differential equation

$$\displaystyle \begin{aligned} \dot{P}(t)+A^{T}P(t)+P(t)A+Q-P(t)BR^{-1}B^{T}P(t)=0_{(n,n)} \mbox{ for all } t\in [0,\hat{T}] \end{aligned}$$

with the terminal condition \(P(\hat {T})=G\).

Example 9.4 (feedback control)

As a simple model we consider the differential equation

$$\displaystyle \begin{aligned} \dot{x}(t)=3x(t)+u(t) \mbox{ almost everywhere on } [0,1] \end{aligned}$$

with the initial condition

$$\displaystyle \begin{aligned} x(0)=x^{0} \end{aligned}$$

where \(x^{0}\in \mathbb {R}\) is arbitrarily chosen. The objective functional J reads as follows:

$$\displaystyle \begin{aligned} J(u)=x(1)^{2}+\frac{1}{5}\int\limits_{0}^{1} u(t)^{2}\, dt \mbox{ for all } u\in L_{\infty}([0,1]). \end{aligned}$$

Then we obtain the function P as

$$\displaystyle \begin{aligned} \begin{array}{rcl} P(t) &\displaystyle = &\displaystyle \left[ e^{3(t-1)}e^{3(t-1)}+5 \int\limits_{t}^{1} e^{3(t-s)}e^{3(t-s)}\, ds \right]^{-1}\\ &\displaystyle = &\displaystyle \left[ e^{6(t-1)}+5\int\limits_{t}^{1} e^{6(t-s)}\, ds \right]^{-1}\\ &\displaystyle = &\displaystyle \left[ e^{6(t-1)}-\frac{5}{6}e^{6(t-1)}+\frac{5}{6}\right]^{-1}\\ &\displaystyle = &\displaystyle \frac{6}{5+e^{6(t-1)}}\mbox{ for all } t\in [0,1]. \end{array} \end{aligned} $$
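The evaluation of the integral in this computation can be cross-checked numerically: the sketch below compares 5∫ₜ¹ e^{6(t−s)} ds with the closed form 5/6 − (5/6)e^{6(t−1)} by a composite midpoint rule and confirms the resulting formula for P.

```python
import math

# Cross-check of the integral evaluation in Example 9.4:
# 5 * Integral_t^1 e^{6(t-s)} ds  should equal  5/6 - (5/6) e^{6(t-1)}.
def integral_numeric(t, n=20000):
    # composite midpoint rule on [t, 1]; e^{6(t-s)} = e^{-6(s-t)}
    h = (1.0 - t) / n
    return 5.0 * h * sum(math.exp(-6.0 * (i + 0.5) * h) for i in range(n))

def integral_closed(t):
    return 5.0 / 6.0 - 5.0 / 6.0 * math.exp(6.0 * (t - 1.0))

def P_from_formula(t):
    # invert e^{6(t-1)} plus the closed-form integral, as in the text
    return 1.0 / (math.exp(6.0 * (t - 1.0)) + integral_closed(t))

sample = [0.0, 0.25, 0.5, 0.75, 0.99]
worst_quadrature_gap = max(abs(integral_numeric(t) - integral_closed(t))
                           for t in sample)
worst_formula_gap = max(abs(P_from_formula(t)
                            - 6.0 / (5.0 + math.exp(6.0 * (t - 1.0))))
                        for t in sample)
```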

Hence, the optimal control \(\bar {u}\) is given by

$$\displaystyle \begin{aligned} \bar{u}(t) & = -5\, \frac{6}{5+e^{6(t-1)}}\, x(t)\\ & = -\, \frac{30}{5+e^{6(t-1)}}\, x(t) \mbox{ almost everywhere on } [0,1]. {} \end{aligned} $$
(9.7)

If we plug the feedback control \(\bar {u}\) in the differential equation, we can determine the trajectory x:

$$\displaystyle \begin{aligned} \begin{array}{rcl} \dot{x}(t) &\displaystyle = &\displaystyle 3 x(t) + \bar{u}(t)\\ &\displaystyle = &\displaystyle 3x(t)-\frac{30}{5+e^{6(t-1)}} x(t)\\ &\displaystyle = &\displaystyle \left( 3-\frac{30}{5+e^{6(t-1)}}\right) x(t). \end{array} \end{aligned} $$

Then we obtain the trajectory x as

$$\displaystyle \begin{aligned} x(t) & = x^{0}\ e^{\,\int\limits_{0}^{t}\left( 3-\frac{30}{5+e^{6(s-1)}} \right) ds}\\ & = x^{0}\ e^{\left( 3s-6(s-1)+\ln (e^{6(s-1)}+5)\right)\big|{}_{0}^{t}} \\ & = x^{0}\ e^{-3t+\ln (e^{6(t-1)}+5)-\ln (e^{-6}+5)}\\ & = \frac{x^{0}}{e^{-6}+5}\ e^{-3t}\ \left( e^{6(t-1)}+5\right) \mbox{ for all } t\in [0,1].{} \end{aligned} $$
(9.8)
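The closed form (9.8) can be validated by direct numerical integration of the closed-loop differential equation. The sketch below (with the arbitrary choice x⁰ = 2) integrates ẋ = (3 − 30/(5 + e^{6(t−1)}))x with the classical Runge-Kutta scheme and compares the result at t = 1 with (9.8).

```python
import math

# Numerical cross-check of the closed-loop trajectory (9.8) from Example 9.4.
x0 = 2.0  # arbitrary initial value (our choice)

def rhs(t, x):
    # closed-loop right-hand side  (3 - 30/(5 + e^{6(t-1)})) x
    return (3.0 - 30.0 / (5.0 + math.exp(6.0 * (t - 1.0)))) * x

def x_closed(t):
    # closed form (9.8)
    return (x0 / (math.exp(-6.0) + 5.0) * math.exp(-3.0 * t)
            * (math.exp(6.0 * (t - 1.0)) + 5.0))

# classical Runge-Kutta on [0, 1]
n, t, x = 1000, 0.0, x0
h = 1.0 / n
for _ in range(n):
    k1 = rhs(t, x)
    k2 = rhs(t + h / 2, x + h / 2 * k1)
    k3 = rhs(t + h / 2, x + h / 2 * k2)
    k4 = rhs(t + h, x + h * k3)
    x += h / 6 * (k1 + 2 * k2 + 2 * k3 + k4)
    t += h

error = abs(x - x_closed(1.0))
```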

If we plug the equation (9.8) in the equation (9.7), we get the optimal control \(\bar {u}\) in the open loop form

$$\displaystyle \begin{aligned} \bar{u}(t)= -\,\frac{30x^{0}}{e^{-6}+5}\,e^{-3t} \mbox{ almost everywhere on } [0,1] \end{aligned}$$

(compare Fig. 9.2). This optimal control is even a smooth function.
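It is worth checking that the feedback form (9.7) and this open loop form agree along the trajectory (9.8); the following sketch does so pointwise on a grid (x⁰ = 2 is an arbitrary choice).

```python
import math

# Agreement of the feedback form (9.7) and the open-loop form of the
# optimal control along the trajectory (9.8) of Example 9.4.
x0 = 2.0  # arbitrary initial value (our choice)

def x_traj(t):
    # trajectory (9.8)
    return (x0 / (math.exp(-6.0) + 5.0) * math.exp(-3.0 * t)
            * (math.exp(6.0 * (t - 1.0)) + 5.0))

def u_feedback(t):
    # feedback form (9.7): -30/(5 + e^{6(t-1)}) * x(t)
    return -30.0 / (5.0 + math.exp(6.0 * (t - 1.0))) * x_traj(t)

def u_open_loop(t):
    # open-loop form derived in the text
    return -30.0 * x0 / (math.exp(-6.0) + 5.0) * math.exp(-3.0 * t)

max_gap = max(abs(u_feedback(t) - u_open_loop(t))
              for t in [i / 100 for i in range(101)])
```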

Fig. 9.2 Illustration of the optimal control \(\bar {u}\) and the trajectory x

9.2 Time Minimal Control Problems

An important problem in control theory is the problem of steering a linear system with the aid of a bounded control from its initial state to a desired terminal point in minimal time. In this section we answer the questions concerning the existence and the characterization of such a time minimal control. As a necessary condition for such an optimal control we derive a so-called weak bang-bang principle. Moreover, we investigate a condition under which a time minimal control is unique.

In this section we consider the system of linear differential equations

$$\displaystyle \begin{aligned} \dot{x}(t)=A(t)x(t)+B(t)u(t) \mbox{ almost everywhere on }[0,\hat{T}] \end{aligned} $$
(9.9)

with the initial condition

$$\displaystyle \begin{aligned} x(0)=x^{0} \end{aligned} $$
(9.10)

and the terminal condition

$$\displaystyle \begin{aligned} x(\hat{T})=x^{1} \end{aligned} $$
(9.11)

where \(\hat {T} > 0\), \(x^{0},x^{1} \in \mathbb {R}^{n}\), A and B are (n, n) and (n, m) matrix functions with real coefficients, respectively, which are assumed to be continuous on \([0,\hat {T}]\), and controls u are chosen from \(L_{\infty }^{m}([0,\hat {T}])\) with \(\| u_{i}\|{ }_{L_{\infty }([0,\hat {T}])}\leq 1\) for all i ∈{1, …, m}. Then we ask for a minimal time \(\bar {T}\in [0,\hat {T}]\) so that the linear system (9.9) can be steered from x 0 to x 1 on the time interval \([0,\bar {T}]\).

If we consider the linear system (9.9) on a time interval [0, T] with \(T\in [0,\hat {T}]\) we use the abbreviation

$$\displaystyle \begin{aligned} U(T) & := \{u\in L_{\infty}^{m}([0,T])\ |\ \mbox{for every }k\in \{1,\ldots ,m\}\mbox{ we have} \\ &\quad \,\, |u_{k}(t)|\leq 1 \mbox{ almost everywhere on } [0,T]\}\\ &\quad \,\,\mbox{ for all } T\in [0,\hat{T}] {} \end{aligned} $$
(9.12)

for the set of all feasible controls with terminal time T.

Definition 9.5 (set of attainability)

For any \(T\in [0,\hat {T}]\) consider the linear system (9.9) on [0, T] with the initial condition (9.10). The set

$$\displaystyle \begin{aligned} \begin{array}{rcl} K(T){} &\displaystyle := &\displaystyle \{x(T)\in \mathbb{R}^{n}\ |\ u\in U(T)\mbox{ and }x \mbox{ satisfies the linear}\\ &\displaystyle &\displaystyle \mbox{ system (9.9) on {$[0,T]$} and the initial condition (9.10)}\} \end{array} \end{aligned} $$

(with U(T) given in (9.12)) is called the set of attainability.

The set of attainability consists of all terminal points to which the system can be steered from \(x^{0}\) at the time T. Since we assume by (9.11) that the system can be steered to \(x^{1}\) we have \(x^{1}\in K(\hat {T})\). Hence, the problem of finding a time minimal control for the linear system (9.9) satisfying the conditions (9.10), (9.11) can be transformed to a problem of the following type: Determine a minimal time \(\bar {T}\in [0,\hat {T}]\) for which \(x^{1}\in K(\bar {T})\) (see Fig. 9.3).

Fig. 9.3 Illustration of the set of attainability with \(T\in (0,\bar {T})\)

Before going further we recall that for an arbitrary \(u\in L_{\infty }^{m}([0,T])\) the solution of the initial value problem (9.9), (9.10) with respect to the time interval \([0,T], \ T\in [0,\hat {T}]\), can be written as

$$\displaystyle \begin{aligned} x(t)=\Phi (t)x^{0}+\Phi (t) \int\limits_{0}^{t} \Phi (s)^{-1}B(s)u(s)\, ds \mbox{ for all } t\in [0,T] \end{aligned}$$

where Φ is the fundamental matrix with

$$\displaystyle \begin{aligned} \dot{\Phi} (t)=A(t)\Phi (t) \mbox{ for all } t\in [0,T],\end{aligned}$$
$$\displaystyle \begin{aligned} \Phi (0)=I \mbox{ (identity matrix)}. \end{aligned}$$

Notice that in the case of a time independent matrix A, the fundamental matrix Φ is given as

$$\displaystyle \begin{aligned}\Phi (t)=e^{At}=\sum_{i=0}^{\infty} A^{i}\frac{t^{i}}{i!} \mbox{ for all } t\in [0,T].\end{aligned}$$

In the following, for reasons of simplicity, we use the abbreviations

$$\displaystyle \begin{aligned}Y(t){} := \Phi^{-1}(t)B(t) \mbox{ for all } t\in [0,T]\end{aligned}$$

and

$$\displaystyle \begin{aligned}R(T){} := \Bigg\{\int\limits_{0}^{T} Y(t)u(t) dt \ \Bigg|\ u\in U(T)\Bigg\} \mbox{ for all } T\in[0,\hat{T}].\end{aligned}$$

The set R(T) is sometimes called the reachable set. A connection between K and R is given by

$$\displaystyle \begin{aligned} K(T) & = \Phi (T)\left( x^{0}+R(T)\right) \\ & = \{\Phi (T)x^{0}+\Phi (T)y\ | \ y\in R(T)\} \mbox{ for all } T\in [0,\hat{T}].\ \ {} \end{aligned} $$
(9.13)

First we investigate properties of the set of attainability.

Lemma 9.6 (properties of the set of attainability)

For every \(T\in [0,\hat {T}]\) the set K(T) of attainability for the initial value problem (9.9), (9.10) with respect to the time interval [0, T] is nonempty, convex and compact.

Proof

We present a proof of this lemma only in a short form. Let some \(T\in [0,\hat {T}]\) be arbitrarily given. Because of the initial condition (9.10) it is obvious that R(T) ≠ ∅. Next we show that the reachable set

$$\displaystyle \begin{aligned}R(T)=\Bigg\{\int\limits_{0}^{T} Y(t)u(t)\, dt \ \Bigg|\ u\in U(T)\Bigg\} \end{aligned}$$

is convex and compact. U(T) is the closed unit ball in \(L_{\infty }^{m}([0,T])\) and therefore weak*-compact. Next we define the linear mapping \(L:L_{\infty }^{m}([0,T])\rightarrow \mathbb {R}^{n}\) with

$$\displaystyle \begin{aligned}L(u)=\int\limits_{0}^{T} Y(t)u(t)\, dt \mbox{ for all }u\in L_{\infty}^{m}([0,T]).\end{aligned}$$

The i-th component of L(u) is \(\int \limits _{0}^{T} \left ( Y(t)u(t)\right ) _{i}\, dt\); since the rows of the continuous matrix function Y belong to \(L_{1}^{m}([0,T])\), each of these component functionals, and hence L itself, is continuous with respect to the weak*-topology in \(L_{\infty }^{m}([0,T])\). Since L is weak*-continuous and linear and the set U(T) is weak*-compact and convex, the image R(T) = L(U(T)) is compact and convex. Because of the equation (9.13) the set K(T) is also compact and convex. □

As a first important result we present an existence theorem for time minimal controls.

Theorem 9.7 (existence of a time minimal control)

If there is a control which steers the linear system (9.9) with the initial condition (9.10) to a terminal state \(x^{1}\) within a time \(\tilde {T}\in [0,\hat {T}]\), then there is also a time minimal control with this property.

Proof

We assume that \(x^{1}\in K(\tilde {T})\). Next we set

$$\displaystyle \begin{aligned} \bar{T} := \inf \{T\in [0,\hat{T}]\ | \ x^{1}\in K(T)\}.\end{aligned}$$

Then we have \(\bar {T}\leq \tilde {T}\), and there is a monotonically decreasing sequence \((T_{i})_{i\in \mathbb {N}}\) with the limit \(\bar {T}\) and a sequence \((u^{i})_{i\in \mathbb {N}}\) of feasible controls with

$$\displaystyle \begin{aligned} x^{1} =: x(T_{i},u^{i})\in K(T_{i})\end{aligned}$$

(let \(x(T_{i},u^{i})\) denote the terminal state at the time \(T_{i}\) with respect to the control \(u^{i}\)). Then it follows

$$\displaystyle \begin{aligned} \begin{array}{rcl} &\displaystyle &\displaystyle {\| x^{1}-x(\bar{T},u^{i}) \|}\\ &\displaystyle &\displaystyle \ = \| x(T_{i},u^{i})-x(\bar{T},u^{i})\|\\ &\displaystyle &\displaystyle \ = \Bigg\| \Phi (T_{i})x^{0}+\Phi (T_{i})\int\limits_{0}^{T_{i}} Y(t)u^{i}(t)\, dt -\Phi (\bar{T})\int\limits_{0}^{T_{i}} Y(t)u^{i}(t)\, dt\\ &\displaystyle &\displaystyle \quad -\Phi (\bar{T})x^{0}-\Phi (\bar{T})\int\limits_{0}^{\bar{T}} Y(t)u^{i}(t)\, dt +\Phi (\bar{T})\int\limits_{0}^{T_{i}} Y(t)u^{i}(t)\, dt \Bigg\|\\ &\displaystyle &\displaystyle \ \leq \big\|(\Phi (T_{i})-\Phi (\bar{T}))x^{0}\big\| +\Bigg\|(\Phi (T_{i})-\Phi (\bar{T}))\int\limits_{0}^{T_{i}}Y(t)u^{i}(t)\,dt\Bigg\|\\ &\displaystyle &\displaystyle \quad +\Bigg\|\Phi (\bar{T})\int\limits_{\bar{T}}^{T_{i}}Y(t)u^{i}(t)\, dt\Bigg\| \end{array} \end{aligned} $$

which implies because of the continuity of Φ

$$\displaystyle \begin{aligned} x^{1} = \lim_{i\rightarrow\infty} x(\bar{T},u^{i}). \end{aligned}$$

Since \(x(\bar {T},u^{i})\in K(\bar {T})\) for all \(i\in \mathbb {N}\) and the set \(K(\bar {T})\) is closed, we get \(x^{1}\in K(\bar {T})\) which completes the proof. □

In our problem formulation we assume that the terminal condition (9.11) is satisfied. Therefore Theorem 9.7 ensures that a time minimal control exists without additional assumptions. For the presentation of a necessary condition for such a time minimal control we need some lemmas given in the following.

Lemma 9.8 (property of the set of attainability)

Let the linear system (9.9) with the initial condition (9.10) be given. Then the set-valued mapping \(K:[0,\hat {T}]\rightarrow 2^{\mathbb {R}^{n}}\) (where K(⋅) denotes the set of attainability) is continuous (with respect to the Hausdorff distance).

Proof

First we prove the continuity of the mapping R. For that proof let \(\bar {T},T\in [0,\hat {T}]\), with \(\bar {T}\neq T\), be arbitrarily chosen. Without loss of generality we assume \(\bar {T}<T\). Then for an arbitrary \(\bar {y}\in R(\bar {T})\) there is a feasible control \(\bar {u}\) with

$$\displaystyle \begin{aligned} \bar{y}=\int\limits_{0}^{\bar{T}}Y(t)\bar{u}(t)\, dt.\end{aligned}$$

For the feasible control u given by

$$\displaystyle \begin{aligned}u(t)=\left\{ \begin{array}{l} \bar{u}(t) \mbox{ almost everywhere on }[0,\bar{T}]\\ (1,\ldots ,1)^{T} \mbox{ for all } t\in (\bar{T},T] \end{array} \right\} \end{aligned}$$

we have

$$\displaystyle \begin{aligned} \int\limits_{0}^{T} Y(t)u(t)\, dt \in R(T).\end{aligned}$$

Consequently we get

$$\displaystyle \begin{aligned} \begin{array}{rcl} d(\bar{y},R(T)) &\displaystyle := &\displaystyle \min_{y\in R(T)} \|\bar{y}-y\|\\ &\displaystyle \leq &\displaystyle \Bigg\|\bar{y}-\int\limits_{0}^{T} Y(t)u(t)\, dt\Bigg\|\\ &\displaystyle = &\displaystyle \Bigg\|\int\limits_{\bar{T}}^{T} Y(t)(1,\ldots ,1)^{T} dt \Bigg\|\\ &\displaystyle \leq &\displaystyle \sqrt{m} \int\limits_{\bar{T}}^{T} |\hspace{-0.1667em} |\hspace{-0.1667em} | Y(t)|\hspace{-0.1667em} |\hspace{-0.1667em} |\, dt \end{array} \end{aligned} $$

and

$$\displaystyle \begin{aligned} \max_{\bar{y}\in R(\bar{T})} d(\bar{y},R(T)) \leq \sqrt{m} \int\limits_{\bar{T}}^{T} |\hspace{-0.1667em} |\hspace{-0.1667em} | Y(t) |\hspace{-0.1667em} |\hspace{-0.1667em} |\, dt\end{aligned}$$

(here ∥⋅∥ denotes the Euclidean norm in \(\mathbb {R}^{n}\) and \(|\hspace{-0.1667em} |\hspace{-0.1667em} |\cdot |\hspace{-0.1667em} |\hspace{-0.1667em} |\) denotes the spectral norm). Similarly one can show

$$\displaystyle \begin{aligned} \max_{y\in R(T)} d(R(\bar{T}),y) \leq \sqrt{m} \int\limits_{\bar{T}}^{T} |\hspace{-0.1667em} |\hspace{-0.1667em} | Y(t) |\hspace{-0.1667em} |\hspace{-0.1667em} |\, dt. \end{aligned}$$

Hence, we obtain for the metric ϱ:

$$\displaystyle \begin{aligned} \begin{array}{rcl} \varrho (R(\bar{T}),R(T)) &\displaystyle := &\displaystyle \max_{\bar{y}\in R(\bar{T})} \ \min_{y\in R(T)} \ \| \bar{y}-y\| \ \ +\ \ \max_{y\in R(T)} \ \min_{\bar{y}\in R(\bar{T})} \ \|\bar{y}-y\| \\ &\displaystyle \leq &\displaystyle 2\sqrt{m}\int\limits_{\bar{T}}^{T} |\hspace{-0.1667em} |\hspace{-0.1667em} | Y(t) |\hspace{-0.1667em} |\hspace{-0.1667em} |\, dt. \end{array} \end{aligned} $$

Since the matrix function Y  is continuous, there is a constant α > 0 with

$$\displaystyle \begin{aligned}|\hspace{-0.1667em} |\hspace{-0.1667em} | Y(t)|\hspace{-0.1667em} |\hspace{-0.1667em} | \leq \alpha \mbox{ for all }t\in [0,\hat{T}].\end{aligned}$$

Then we get

$$\displaystyle \begin{aligned} \varrho (R(\bar{T}),R(T)) \leq 2\alpha\sqrt{m}(T-\bar{T}).\end{aligned}$$

Consequently, the set-valued mapping R is continuous. Since the fundamental matrix Φ is continuous and the images of the set-valued mapping R are bounded sets, we obtain with the equation (9.13) (notice for \(\bar {T},T\hspace{-0.1667em}\in \hspace{-0.1667em} [0,\hat {T}]\) and a constant \(\beta \hspace{-0.1667em} >\hspace{-0.1667em} 0\) the inequality \(\varrho (K(\bar {T}),K(T))\) \(\leq \beta |\hspace{-0.1667em} |\hspace{-0.1667em} |\Phi (\bar {T})-\Phi (T)|\hspace{-0.1667em} |\hspace{-0.1667em} | +|\hspace{-0.1667em} |\hspace{-0.1667em} |\Phi (\bar {T})|\hspace{-0.1667em} |\hspace{-0.1667em} |\,\varrho (R(\bar {T}),R(T))\)) that the mapping K is continuous. □

Lemma 9.9 (property of the set of attainability)

Let the linear system (9.9) with the initial condition (9.10) and some \(\bar {T}\in [0,\hat {T}]\) be given. If \(\bar {y}\) is a point in the interior of the set \(K(\bar {T})\) of attainability, then there is a time \(T\in (0,\bar {T})\) so that \(\bar {y}\) is also an interior point of K(T).

Proof

Let \(\bar {y}\) be an interior point of the set \(K(\bar {T})\) (this implies \(\bar {T}>0\)). Then there is an ε > 0 so that \(B(\bar {y},\varepsilon )\subset K(\bar {T})\) for the closed ball \(B(\bar {y}, \varepsilon )\) around \(\bar {y}\) with radius ε. Now we assume that \(\bar {y}\) is not an interior point of the set K(T) for any \(T\in (0,\bar {T})\). For every \(T\in (0,\bar {T})\) the set \(K(T)\subset \mathbb {R}^{n}\) is closed and convex. Then for every \(T\in (0,\bar {T})\) there is a hyperplane separating the set K(T) and the point \(\bar {y}\) (compare Theorems C.5 and C.3). Consequently, for every \(T\in (0,\bar {T})\) there is a point \(y_{T}\in B(\bar {y},\varepsilon )\) whose distance to the set K(T) is at least ε. But this contradicts the continuity of the set-valued mapping K. □

The next lemma is the key for the proof of a necessary condition for time minimal controls. For the formulation of this result we use the function \(\mbox{sgn}:\mathbb {R}\rightarrow \mathbb {R}\) given by

$$\displaystyle \begin{aligned} \mbox{sgn} (y){} = \left\{ \begin{array}{rl} 1 & \mbox{for } y>0\\ 0 & \mbox{for } y=0\\ -1 & \mbox{for } y<0 \end{array} \right\} . \end{aligned} $$

Lemma 9.10 (property of the set of attainability)

Let the linear system (9.9) with the initial condition (9.10) and some \(\bar {T}\in (0,\hat {T}]\) be given. If \(\bar {x}(\bar {T},\bar {u})\in \partial K(\bar {T})\) for some \(\bar {u}\in U(\bar {T})\), then there is a vector \(\eta \neq 0_{\mathbb {R}^{n}}\) so that for all k ∈{1, …, m}:

$$\displaystyle \begin{aligned} \bar{u}_{k}(t)=\mathit{\mbox{sgn}}[\eta^{T}Y_{k}(t)] \mathit{\mbox{ almost everywhere on }} \{t\in[0,\bar{T}]\; |\; \eta^{T}Y_{k}(t)\neq 0\} \end{aligned}$$

(\(\bar {x}(\bar {T},\bar {u})\) denotes the state at the time \(\bar {T}\) with respect to the control \(\bar {u}\); \(Y_{k}(t)\) denotes the k-th column of the matrix Y(t)).

Proof

Let an arbitrary point \(\bar {y}:=\bar {x}(\bar {T},\bar {u})\in \partial K(\bar {T})\) be given. Since the set \(K(\bar {T})\) is a convex and closed subset of \(\mathbb {R}^{n}\), by a separation theorem (see Theorem C.5) there is a vector \(\bar {\eta }\neq 0_{\mathbb {R}^{n}}\) with the property

$$\displaystyle \begin{aligned} \bar{\eta}^{T}\bar{y} \geq \bar{\eta}^{T}y \mbox{ for all } y\in K(\bar{T}). \end{aligned}$$

Because of

$$\displaystyle \begin{aligned} \bar{\eta}^{T}\bar{y} = \bar{\eta}^{T}\Phi (\bar{T})x^{0} +\bar{\eta}^{T}\Phi (\bar{T})\int\limits_{0}^{\bar{T}}Y(t)\bar{u}(t)\, dt \end{aligned}$$

and

$$\displaystyle \begin{aligned} \bar{\eta}^{T}y=\bar{\eta}^{T}\Phi (\bar{T})x^{0} +\bar{\eta}^{T}\Phi (\bar{T})\int\limits_{0}^{\bar{T}} Y(t)u(t)\, dt \mbox{ for all } y\in K(\bar{T}) \end{aligned}$$

we obtain for \(\eta ^{T}:=\bar {\eta }^{T}\Phi (\bar {T})\)

$$\displaystyle \begin{aligned} \eta^{T}\int\limits_{0}^{\bar{T}}Y(t)\bar{u}(t)\, dt \geq \eta^{T}\int\limits_{0}^{\bar{T}}Y(t)u(t)\, dt \end{aligned} $$
(9.14)

for all feasible controls steering the linear system (9.9) with the initial condition (9.10) to a state in the set \(K(\bar {T})\) of attainability. From the inequality (9.14) we conclude

$$\displaystyle \begin{aligned} \eta^{T}Y(t)\bar{u}(t)\geq\eta^{T}Y(t)u(t) \mbox{ almost everywhere on } [0,\bar{T}]. \end{aligned} $$
(9.15)

For the proof of the implication “(9.14) ⇒ (9.15)” we assume that the inequality (9.15) is not true. Then there is a feasible control u and a set \(M\subset [0,\bar {T}]\) with positive measure so that

$$\displaystyle \begin{aligned} \eta^{T}Y(t)\bar{u}(t) < \eta^{T}Y(t)u(t) \mbox{ almost everywhere on } M. \end{aligned}$$

If one defines the feasible control \(u^{\ast }\) by

$$\displaystyle \begin{aligned} u^{\ast}(t)=\left\{ \begin{array}{rl} \bar{u}(t) & \mbox{almost everywhere on } [0,\bar{T}]\setminus M\\ u(t) & \mbox{almost everywhere on } M \end{array} \right\} , \end{aligned}$$

then it follows

$$\displaystyle \begin{aligned} \begin{array}{rcl} \eta^{T}\int\limits_{0}^{\bar{T}}Y(t)u^{\ast}(t)\, dt &\displaystyle = &\displaystyle \eta^{T}\int\limits_{M}^{}Y(t)u(t)\, dt + \eta^{T}\hspace{-0.1667em} \int\limits_{[0,\bar{T}] \setminus M}^{}\hspace{-0.1667em} Y(t)\bar{u}(t)\, dt\\ &\displaystyle > &\displaystyle \eta^{T}\int\limits_{M}^{}Y(t)\bar{u}(t)\, dt + \eta^{T}\hspace{-0.1667em} \int\limits_{[0,\bar{T}]\setminus M}^{}\hspace{-0.1667em} Y(t)\bar{u}(t)\, dt \\ &\displaystyle = &\displaystyle \eta^{T}\int\limits_{0}^{\bar{T}}Y(t)\bar{u}(t)\, dt \end{array} \end{aligned} $$

which contradicts the inequality (9.14). Hence, the inequality (9.15) is true.

From the inequality (9.15) we get for all k ∈{1, …, m}

$$\displaystyle \begin{aligned} \bar{u}_{k}(t) = \mbox{sgn }[\eta^{T}Y_{k}(t)] \mbox{ almost everywhere on } \{t\in [0,\bar{T}]\; |\; \eta^{T}Y_{k}(t)\neq 0\}. \end{aligned}$$

□

Now we present the afore-mentioned necessary condition for time minimal controls.

Theorem 9.11 (necessary condition for time minimal controls)

Let the linear system (9.9) with the initial condition (9.10) and the terminal condition (9.11) be given. If \(\bar {u}\) is a time minimal control with respect to the minimal terminal time \(\bar {T}\in [0,\hat {T}]\), then there is a vector \(\eta \neq 0_{\mathbb {R}^{n}}\) so that for all k ∈{1, …, m}:

$$\displaystyle \begin{aligned} \bar{u}_{k}(t)=\mathit{\mbox{sgn}}[\eta^{T}Y_{k}(t)] \mathit{\mbox{ almost everywhere on }} \{t\in [0,\bar{T}]\; |\; \eta^{T}Y_{k}(t)\neq 0\}. \end{aligned} $$
(9.16)

Proof

The assertion is obvious for \(\bar {T}=0\). Therefore we assume \(\bar {T}>0\) in the following. We want to show that

$$\displaystyle \begin{aligned} \bar{y}:=\Phi (\bar{T})x^{0}+\Phi (\bar{T})\int\limits_{0}^{\bar{T}} Y(t)\bar{u}(t)\, dt \ \in\ \partial K(\bar{T}). \end{aligned} $$
(9.17)

Suppose that \(\bar {y}\) were an interior point of the set \(K(\bar {T})\) of attainability. Then by Lemma 9.9 there is a time \(T\in (0,\bar {T})\) so that \(\bar {y}\) is also an interior point of the set K(T). But this contradicts the fact that \(\bar {T}\) is the minimal time. Hence, the condition (9.17) is true. An application of Lemma 9.10 completes the proof. □

The statement (9.16) is also called a weak bang-bang principle. If the measure of the set \(\{t\in [0,\bar{T}]\; |\; \eta ^{T} Y_{k}(t)=0\}\) equals 0 for every k ∈{1, …, m}, the statement (9.16) is called a strong bang-bang principle. Theorem 9.11 can therefore also be formulated as follows: every time minimal control satisfies a weak bang-bang principle.

The next example illustrates the applicability of Theorem 9.11.

Example 9.12 (necessary condition for time minimal controls)

We consider the harmonic oscillator mathematically formalized by

$$\displaystyle \begin{aligned} \ddot{y}(t)+y(t)=u(t) \mbox{ almost everywhere on } [0,\hat{T}], \end{aligned}$$
$$\displaystyle \begin{aligned} \| u\|{}_{L_{\infty}([0,\hat{T}])}\leq 1 \end{aligned}$$

where \(\hat {T}>0\) is sufficiently large. An initial condition is not given explicitly. The corresponding linear system of first order reads

$$\displaystyle \begin{aligned} \dot{x}(t)=\underbrace{\left( \begin{array}{rr} 0 & 1 \\ -1 & 0 \end{array} \right)}_{=:\ A} x(t) + \underbrace{\left( \begin{array}{r} 0 \\ 1 \end{array}\right)}_{=:\ B} u(t). \end{aligned}$$

We have

$$\displaystyle \begin{aligned} \Phi (t)= e^{At}=\sum_{i=0}^{\infty} A^{i}\frac{t^{i}}{i!} =\left( \begin{array}{rr} \cos t & \sin t \\ -\sin t & \cos t \end{array}\right) \end{aligned}$$

and

$$\displaystyle \begin{aligned} Y(t)=\Phi (t)^{-1}B=e^{-At}B=\left( \begin{array}{r} -\sin t \\ \cos t \end{array}\right) .\end{aligned}$$

Then we obtain for an arbitrary vector \(\eta \neq 0_{\mathbb {R}^{2}}\)

$$\displaystyle \begin{aligned} \eta^{T}Y(t)=-\eta_{1}\sin t + \eta_{2}\cos t. \end{aligned}$$

Consequently, we get for appropriate numbers \(\alpha \in \mathbb {R}\setminus \{ 0\}\) and \(\delta \in [-\pi ,\pi ]\)

$$\displaystyle \begin{aligned} \eta^{T}Y(t)=\alpha\sin (t+\delta ) \end{aligned}$$

and therefore

$$\displaystyle \begin{aligned} \mbox{sgn}[\eta^{T}Y(t)] = \mbox{sgn}[\alpha\sin (t+\delta )] \end{aligned}$$

(see Fig. 9.4).

Fig. 9.4: Illustration of the time optimal control

Conclusion: If there is a time minimal control \(\bar {u}\), then it fulfills the strong bang-bang principle, and therefore it is unique. The sign of \(\bar{u}\) changes after every π time units.

With a standard result from control theory one can see that the considered linear system is null controllable (i.e., it can be steered to the origin in a finite time). Hence, by Theorem 9.7 there is also a time minimal control \(\bar {u}\) which steers this system into a state of rest, and therefore the preceding results are applicable.
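The switching structure found in Example 9.12 is easy to verify numerically. The following sketch samples \(\eta^{T}Y(t)=-\eta_{1}\sin t+\eta_{2}\cos t\) on a fine grid and checks that consecutive sign changes of the time minimal control lie π apart; the concrete choice η = (1, 2)ᵀ is an arbitrary assumption (any η ≠ 0 works).

```python
import numpy as np

# Hypothetical choice of the vector eta; any eta != 0 gives the same spacing.
eta = np.array([1.0, 2.0])

def Y(t):
    # Y(t) = Phi(t)^{-1} B = e^{-At} B = (-sin t, cos t)^T, as computed in Example 9.12.
    return np.array([-np.sin(t), np.cos(t)])

ts = np.linspace(0.0, 4.0 * np.pi, 100001)
vals = np.array([eta @ Y(t) for t in ts])   # eta^T Y(t) = -eta_1 sin t + eta_2 cos t

# The time minimal control is u(t) = sgn(eta^T Y(t)); locate its switching times.
switch_idx = np.where(np.diff(np.sign(vals)) != 0)[0]
switch_times = ts[switch_idx]
gaps = np.diff(switch_times)

print(np.allclose(gaps, np.pi, atol=1e-3))  # consecutive switches are pi apart
```

Since \(\eta^{T}Y(t)=\alpha\sin (t+\delta )\) with α ≠ 0, the zeros are exactly π apart, which the grid search confirms up to the grid resolution.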

Now we present an example for which the necessary condition for time minimal controls does not give any information.

Example 9.13 (necessary condition for time minimal controls)

We investigate the simple linear system

$$\displaystyle \begin{aligned} \left. \begin{array}{l} \dot{x}_{1}(t)=x_{1}(t)+u(t) \\ \dot{x}_{2}(t)=x_{2}(t)+u(t) \end{array}\right\} \mbox{ almost everywhere on } [0,\hat{T}]\end{aligned} $$

with

$$\displaystyle \begin{aligned} \| u\|{}_{L_{\infty}[0,\hat{T}]}\leq 1\end{aligned} $$

and \(\hat {T}>0\). Here we set

$$\displaystyle \begin{aligned} A=\left( \begin{array}{rr} 1 & 0 \\ 0 & 1 \end{array}\right) =I \ \mbox{ and }\ B=\left(\begin{array}{r} 1 \\ 1 \end{array}\right) .\end{aligned} $$

Then we obtain

$$\displaystyle \begin{aligned} Y(t)=e^{-At}B=e^{-t}\left(\begin{array}{r} 1 \\ 1 \end{array} \right) \end{aligned} $$

and for any vector \(\eta \neq 0_{\mathbb {R}^{2}}\) we get

$$\displaystyle \begin{aligned} \eta^{T}Y(t)=(\eta_{1}+\eta_{2})e^{-t}. \end{aligned} $$

For example, for \(\eta =\left (\begin {array}{r}1 \\ -1\end {array}\right )\) we conclude

$$\displaystyle \begin{aligned} \eta^{T}Y(t)=0 \mbox{ for all } t\in [0,\hat{T}], \end{aligned}$$

and Theorem 9.11 does not give a suitable necessary condition for time minimal controls.
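The degeneracy in Example 9.13 can also be confirmed numerically: for η = (1, −1)ᵀ the function \(\eta^{T}Y(\cdot )\) vanishes identically, so the sign condition (9.16) carries no information. A minimal sketch:

```python
import numpy as np

# For A = I and B = (1, 1)^T of Example 9.13: Y(t) = e^{-At} B = e^{-t} (1, 1)^T.
def Y(t):
    return np.exp(-t) * np.array([1.0, 1.0])

eta = np.array([1.0, -1.0])  # the vector chosen in Example 9.13
ts = np.linspace(0.0, 5.0, 1001)
vals = np.array([eta @ Y(t) for t in ts])

print(np.allclose(vals, 0.0))  # eta^T Y(t) vanishes identically on the grid
```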

Next we investigate the question under which conditions time minimal controls are unique. For this investigation we introduce the notion of normality.

Definition 9.14 (normal linear system)

  1. (a)

    The linear system (9.9) is called normal on [0, T] (with \(T\in [0,\hat {T}]\)), if for every vector \(\eta \neq 0_{\mathbb {R}^{n}}\) the sets

    $$\displaystyle \begin{aligned} G_{k}(\eta )=\{t\in [0,T] \ |\ \eta^{T}Y_{k}(t)=0\} \mathit{\mbox{ with }} k\in\{ 1,\ldots ,m\}\end{aligned}$$

    have the measure 0. Again \(Y_{k}(t)\) denotes the k-th column of the matrix Y(t).

  2. (b)

    The linear system (9.9) is called normal, if for every \(T\in [0,\hat {T}]\) this system is normal on [0, T].

Theorem 9.15 (uniqueness of a time minimal control)

Let the linear system (9.9) with the initial condition (9.10) and the terminal condition (9.11) be given. If \(\bar{u}\) is a time minimal control with respect to the minimal terminal time \(\bar{T}\in [0,\hat{T}]\) and if the linear system (9.9) is normal on \([0,\bar {T}]\), then \(\bar{u}\) is the unique time minimal control.

Proof

By Theorem 9.11 for every time minimal control \(\bar {u}\) there is a vector \(\eta \neq 0_{\mathbb {R}^{n}}\) so that for all k ∈{1, …, m}:

$$\displaystyle \begin{aligned}\bar{u}_{k}(t)=\mbox{sgn}[\eta^{T}Y_{k}(t)] \mbox{ almost everywhere on } [0,\bar{T}]\setminus G_{k}(\eta ).\end{aligned}$$

Then the assertion follows from the normality assumption (notice that in the proof of Lemma 9.10 the vector η depends on the terminal state and not on the control). □

A control \(\bar {u}\) which satisfies the assumptions of Theorem 9.15 fulfills the strong bang-bang principle

$$\displaystyle \begin{aligned} \bar{u}_{k}(t)=\mbox{sgn}[\eta^{T}Y_{k}(t)] \mbox{ almost everywhere on } [0,\bar{T}] \mbox{ for all } k\in\{ 1,\ldots ,m\}.\end{aligned}$$

One obtains an interesting characterization of the concept of normality in the case of an autonomous linear system (9.9) with constant matrix functions A and B.

Theorem 9.16 (characterization of normality)

The autonomous linear system (9.9) with constant matrix functions A and B is normal if and only if for every k ∈{1, …, m} either

$$\displaystyle \begin{aligned} \mathit{\mbox{rank }}(B_{k},AB_{k},\ldots ,A^{n-1}B_{k})=n\end{aligned} $$
(9.18)

or

$$\displaystyle \begin{aligned} \mathit{\mbox{rank }}(A-\lambda I,B_{k})=n\mathit{\mbox{ for all eigenvalues }} \lambda\mathit{\mbox{ of }}A.\end{aligned} $$
(9.19)

Here \(B_{k}\) denotes the k-th column of the matrix B.

Proof

We fix an arbitrary terminal time \(T\in [0,\hat {T}]\). First notice that for every k ∈{1, …, m} and every \(\eta \in \mathbb {R}^{n}\)

$$\displaystyle \begin{aligned} \eta^{T}Y_{k}(t)=\eta^{T}e^{-At}B_{k}\mbox{ for all } t\in [0,T]. \end{aligned} $$

Consequently, the real-valued analytic function \(\eta^{T}Y_{k}(\cdot )\) on [0, T] is either identically 0 or it has a finite number of zeros on this interval. Therefore, the autonomous linear system (9.9) is normal on [0, T] if and only if the following implication is satisfied:

$$\displaystyle \begin{aligned} \eta^{T}e^{-At}B_{k}=0\mbox{ for all }t\in [0,T]\mbox{ and some } k\in\{ 1,\ldots ,m\} \Rightarrow \eta =0_{\mathbb{R}^{n}}.\end{aligned} $$
(9.20)

Next we show that the implication (9.20) is equivalent to the condition (9.18). For this proof we assume that the condition (9.18) is satisfied. Let a vector \(\eta \in \mathbb {R}^{n}\) with

$$\displaystyle \begin{aligned} \eta^{T}e^{-At}B_{k}=0\mbox{ for all }t\in [0,T]\mbox{ and some } k\in\{ 1,\ldots ,m\}\end{aligned} $$

be arbitrarily given. By repeated differentiation and setting “t = 0” we get

$$\displaystyle \begin{aligned} \eta^{T} (B_{k},AB_{k},\ldots ,A^{n-1}B_{k})=0_{\mathbb{R}^{n}}^{T} \mbox{ for some }k\in\{ 1,\ldots ,m\}.\end{aligned} $$

By assumption the system of row vectors of the matrix \((B_{k},AB_{k},\ldots ,A^{n-1}B_{k})\) is linearly independent, and therefore we get \(\eta =0_{\mathbb {R}^{n}}\). Hence, the implication (9.20) is satisfied, i.e. the autonomous linear system (9.9) is normal on [0, T].

Now we assume that the condition (9.18) is not satisfied. This means that for some k ∈{1, …, m} the system of row vectors of the matrix \((B_{k},AB_{k},\ldots ,A^{n-1}B_{k})\) is linearly dependent. Then there is a vector \(\eta \neq 0_{\mathbb {R}^{n}}\) with

$$\displaystyle \begin{aligned} \eta^{T}(B_{k},AB_{k},\ldots ,A^{n-1}B_{k})=0_{\mathbb{R}^{n}}^{T} \end{aligned}$$

which implies

$$\displaystyle \begin{aligned} \eta^{T}B_{k}=\eta^{T}AB_{k}=\cdots =\eta^{T}A^{n-1}B_{k}=0. \end{aligned} $$
(9.21)

The Cayley-Hamilton theorem states that the matrix A satisfies its characteristic equation, i.e.

$$\displaystyle \begin{aligned} A^{n}=\alpha_{0}I+\alpha_{1}A+\cdots +\alpha_{n-1}A^{n-1} \end{aligned}$$

with appropriate coefficients \(\alpha _{0},\alpha _{1},\ldots ,\alpha _{n-1}\in \mathbb {R}\). Then we obtain with (9.21)

$$\displaystyle \begin{aligned} \eta^{T}A^{n}B_{k}=\alpha_{0}\eta^{T}B_{k}+\alpha_{1}\eta^{T}AB_{k} +\cdots +\alpha_{n-1}\eta^{T}A^{n-1}B_{k}=0 \end{aligned}$$

and by induction

$$\displaystyle \begin{aligned} \eta^{T}A^{l}B_{k}=0 \mbox{ for all }l\geq n. \end{aligned} $$
(9.22)

Equations (9.21) and (9.22) imply

$$\displaystyle \begin{aligned} \eta^{T}A^{l}B_{k}=0\mbox{ for all }l\geq 0 \end{aligned}$$

which leads to

$$\displaystyle \begin{aligned} \eta^{T}e^{-At}B_{k}=\eta^{T}\left(\sum_{i=0}^{\infty} A^{i}\frac{(-t)^{i}}{i!}\right) B_{k} =0\mbox{ for all }t\in [0,T]. \end{aligned}$$

Consequently, the implication (9.20) is not satisfied, i.e. the autonomous linear system (9.9) is not normal on [0, T].

Finally we show the equivalence of the two rank conditions (9.18) and (9.19). Let k ∈{1, …, m} be arbitrarily chosen.

Assume that the condition (9.19) is not satisfied, i.e. for some possibly complex eigenvalue λ of A we have

$$\displaystyle \begin{aligned} \mbox{rank }(A-\lambda I,B_{k})\neq n.\end{aligned}$$

Then there is a vector \(z\neq 0_{\mathbb {C}^{n}}\) (complex in general, since λ may be complex) with

$$\displaystyle \begin{aligned} z^{T}(A-\lambda I,B_{k})=0_{\mathbb{C}^{n+1}}^{T}, \end{aligned}$$

i.e.

$$\displaystyle \begin{aligned} z^{T}A=\lambda z^{T} \end{aligned} $$
(9.23)

and

$$\displaystyle \begin{aligned} z^{T}B_{k}=0. \end{aligned} $$
(9.24)

With the equations (9.23) and (9.24) we conclude

$$\displaystyle \begin{aligned} z^{T}AB_{k}=\lambda z^{T}B_{k}=0,\end{aligned}$$

and by induction we get

$$\displaystyle \begin{aligned} z^{T}A^{l}B_{k}=0 \mbox{ for all } l\geq 0.\end{aligned}$$

Hence we have

$$\displaystyle \begin{aligned} \mbox{rank }(B_{k},AB_{k},\ldots ,A^{n-1}B_{k})\neq n.\end{aligned}$$

Conversely, we assume now that the equation (9.18) is not satisfied. Then there is a \(z\neq 0_{\mathbb {R}^{n}}\) with

$$\displaystyle \begin{aligned} z^{T}B_{k}=0,\ z^{T}AB_{k}=0,\ldots ,\ z^{T}A^{n-1}B_{k}=0. \end{aligned}$$

Again with the Cayley-Hamilton theorem we conclude immediately

$$\displaystyle \begin{aligned} z^{T}A^{l}B_{k}=0 \mbox{ for all }l\geq 0.\end{aligned}$$

Consequently, the linear subspace

$$\displaystyle \begin{aligned} S:=\{ \tilde{z}\in \mathbb{R}^{n}\ |\ \tilde{z}^{T}A^{l}B_{k}=0 \mbox{ for all } l\geq 0\} \end{aligned}$$

has dimension ≥ 1. Since the set S is invariant under \(A^{T}\) (i.e. \(A^{T}S \subset S\)), an eigenvector \(\bar {z}\) of \(A^{T}\) belongs to S. Hence, there is an eigenvalue λ of \(A^{T}\), which is also an eigenvalue of A, so that

$$\displaystyle \begin{aligned} A^{T}\bar{z}=\lambda \bar{z} \end{aligned}$$

or alternatively

$$\displaystyle \begin{aligned} \bar{z}^{T}(A-\lambda I)=0_{\mathbb{R}^{n}}^{T}. \end{aligned} $$
(9.25)

Because of \(\bar {z}\in S\) we obtain with l = 0

$$\displaystyle \begin{aligned} \bar{z}^{T}B_{k}=0. \end{aligned} $$
(9.26)

Equations (9.25) and (9.26) imply

$$\displaystyle \begin{aligned} \mbox{rank }(A-\lambda I,B_{k})\neq n\mbox{ for some eigenvalue }\lambda\mbox{ of }A. \end{aligned}$$

This completes the proof. □
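The Cayley-Hamilton identity \(A^{n}=\alpha_{0}I+\alpha_{1}A+\cdots +\alpha_{n-1}A^{n-1}\), used twice in this proof, is easy to check numerically. The sketch below uses the harmonic-oscillator matrix of Example 9.12 as (hypothetical) test data and reads the coefficients α<sub>i</sub> off the characteristic polynomial.

```python
import numpy as np

A = np.array([[0.0, 1.0], [-1.0, 0.0]])  # matrix of Example 9.12, n = 2
n = A.shape[0]

# np.poly(A) returns [1, c_{n-1}, ..., c_0] for the characteristic polynomial
# lambda^n + c_{n-1} lambda^{n-1} + ... + c_0; the text writes
# A^n = alpha_0 I + ... + alpha_{n-1} A^{n-1}, i.e. alpha_i = -c_i.
coeffs = np.poly(A)
alphas = -coeffs[1:][::-1]               # alpha_0, alpha_1, ..., alpha_{n-1}

rhs = sum(alphas[i] * np.linalg.matrix_power(A, i) for i in range(n))
print(np.allclose(np.linalg.matrix_power(A, n), rhs))  # Cayley-Hamilton holds
```

For this A the characteristic polynomial is λ² + 1, so A² = −I, which the check confirms.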

In control theory the condition

$$\displaystyle \begin{aligned} \mbox{rank }(B,AB,\ldots ,A^{n-1}B)=n \end{aligned}$$

is called the Kalman condition. It is obvious that the condition

$$\displaystyle \begin{aligned}\mbox{rank }(B_{k},AB_{k},\ldots ,A^{n-1}B_{k})=n \mbox{ for all }k\in\{ 1,\ldots ,m\}\end{aligned}$$

which is given in Theorem 9.16 implies the Kalman condition. Moreover, in control theory the condition

$$\displaystyle \begin{aligned} \mbox{rank }(A-\lambda I,B)=n\mbox{ for all eigenvalues }\lambda \mbox{ of }A \end{aligned}$$

is called the Hautus condition, which is implied by the condition

$$\displaystyle \begin{aligned}\mbox{rank } (A-\lambda I,B_{k}) = n\mbox{ for all }k\in \{ 1,\ldots ,m\} \mbox{ and all eigenvalues }\lambda\mbox{ of }A.\end{aligned}$$

One can show with the same arguments as in the proof of Theorem 9.16 that the Kalman and Hautus conditions are equivalent. In control theory one proves that the Kalman condition (or the Hautus condition) characterizes the controllability of an autonomous linear system, i.e. in this case there is an unconstrained control which steers the autonomous linear system from an arbitrary initial state to an arbitrary terminal state in finite time.
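Both rank conditions are straightforward to test numerically. The following sketch implements them with NumPy; the harmonic oscillator from Example 9.12 serves as (hypothetical) test data and satisfies both conditions.

```python
import numpy as np

def kalman_ok(A, B):
    """Kalman condition: rank (B, AB, ..., A^{n-1} B) = n."""
    n = A.shape[0]
    K = np.hstack([np.linalg.matrix_power(A, i) @ B for i in range(n)])
    return np.linalg.matrix_rank(K) == n

def hautus_ok(A, B):
    """Hautus condition: rank (A - lambda I, B) = n for all eigenvalues lambda of A."""
    n = A.shape[0]
    return all(
        np.linalg.matrix_rank(np.hstack([A - lam * np.eye(n), B.astype(complex)])) == n
        for lam in np.linalg.eigvals(A)
    )

# Harmonic oscillator of Example 9.12
A = np.array([[0.0, 1.0], [-1.0, 0.0]])
B = np.array([[0.0], [1.0]])
print(kalman_ok(A, B), hautus_ok(A, B))  # both conditions hold
```

Note that the Hautus test must work over the complex numbers, since the eigenvalues of A (here ±i) may be complex.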

The following example shows that the Kalman condition (or the Hautus condition) does not imply the condition (9.18) (and (9.19), respectively).

Example 9.17 (Kalman condition)

The following autonomous linear system satisfies the Kalman condition but it is not normal:

$$\displaystyle \begin{aligned} \left. \begin{array}{l} \dot{x}_{1}(t)=-x_{1}(t)+u_{1}(t)\\ \dot{x}_{2}(t)=-2x_{2}(t)+u_{1}(t)+u_{2}(t) \end{array}\right\} \mbox{ almost everywhere on } [0,\hat{T}] \end{aligned}$$

with some \(\hat {T}>0\). Here we set

$$\displaystyle \begin{aligned}A=\left( \begin{array}{rr} -1 & 0\\0 & -2\end{array}\right) \ \ \mbox{and}\ \ B=\left( \begin{array}{rr} 1 & 0\\1 & 1\end{array} \right) .\end{aligned}$$

Then we have

$$\displaystyle \begin{aligned} B_{1}=\left( \begin{array}{r}1\\1\end{array}\right) ,\ AB_{1}=\left( \begin{array}{rr} -1\\-2\end{array}\right) ,\end{aligned}$$
$$\displaystyle \begin{aligned} B_{2}=\left( \begin{array}{r}0\\1\end{array}\right) ,\ AB_{2}=\left( \begin{array}{r}0\\-2\end{array}\right) .\end{aligned}$$

The matrix \((B_{2},AB_{2})\) has rank 1, and therefore the linear system is not normal. On the other hand we have

$$\displaystyle \begin{aligned} \mbox{rank }(B,AB)=2,\end{aligned}$$

i.e. the Kalman condition is satisfied.
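The ranks claimed in Example 9.17 can be verified with a few lines of NumPy: the full controllability matrix has rank 2, while the single column \(B_{2}\) fails the column-wise condition (9.18).

```python
import numpy as np

def ctrb(A, B):
    """Controllability matrix (B, AB, ..., A^{n-1} B)."""
    n = A.shape[0]
    return np.hstack([np.linalg.matrix_power(A, i) @ B for i in range(n)])

A = np.array([[-1.0, 0.0], [0.0, -2.0]])
B = np.array([[1.0, 0.0], [1.0, 1.0]])

print(np.linalg.matrix_rank(ctrb(A, B)))          # 2: Kalman condition holds
print(np.linalg.matrix_rank(ctrb(A, B[:, [0]])))  # 2: column B_1 satisfies (9.18)
print(np.linalg.matrix_rank(ctrb(A, B[:, [1]])))  # 1: column B_2 violates (9.18)
```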

Exercises

  1. (9.1)

    Consider the differential equation

    $$\displaystyle \begin{aligned} \dot{x}(t)=2x(t)-3u(t) \mbox{ almost everywhere on } [0,2]\end{aligned}$$

    with the initial condition

    $$\displaystyle \begin{aligned} x(0)=x^{0} \end{aligned}$$

    for an arbitrarily chosen \(x^{0}\in \mathbb {R}\). Determine an optimal control \(\bar {u}\in L_{\infty }([0,2])\) as a minimal point of the objective functional \(J:L_{\infty }([0,2]) \rightarrow \mathbb {R}\) with

    $$\displaystyle \begin{aligned} J(u)=\frac{1}{2} x(1)^{2}+2 \int\limits_{0}^{2}u(t)^{2}\, dt \mbox{ for all } u\in L_{\infty}([0,2]). \end{aligned}$$
  2. (9.2)

    ([51, p. 132–133]) Let the initial value problem

    $$\displaystyle \begin{aligned} \dot{x}(t)=u(t) \mbox{ almost everywhere on } [0,1], \end{aligned}$$
    $$\displaystyle \begin{aligned} x(0)=1 \end{aligned}$$

    be given. Determine an optimal control u ∈ L ([0, 1]) for which the objective functional \(J: L_{\infty }([0,1])\rightarrow \mathbb {R}\) with

    $$\displaystyle \begin{aligned} J(u)=\int\limits_{0}^{1}\left( u(t)^{2}+x(t)^{2}\right) dt \mbox{ for all } u\in L_{\infty}([0,1]) \end{aligned}$$

    becomes minimal.

  3. (9.3)

    Consider the linear differential equation of n-th order

    $$\displaystyle \begin{aligned} y^{(n)}(t)+a_{n-1}y^{(n-1)}(t)+\cdots +a_{0}y(t)=u(t) \mbox{ almost everywhere on } [0,\hat{T}] \end{aligned} $$

    where \(\hat {T}>0\) and \(a_{0},\ldots ,a_{n-1}\in \mathbb {R}\) are given constants. The control u is assumed to be an \(L_{\infty }([0,\hat {T}])\) function. Show that the system of linear differential equations of first order which is equivalent to this differential equation of n-th order satisfies the Kalman condition.

  4. (9.4)

    ([216, p. 22–24]) Let the system of linear differential equations

    $$\displaystyle \begin{aligned} \dot{x}(t)=Ax(t)+Bu(t) \mbox{ almost everywhere on } [0,\hat{T}] \end{aligned}$$

    with

    $$\displaystyle \begin{aligned} A=\left( \begin{array}{rrrr} 0 & 1 & 0 & 0\\ -\alpha & 0 & 0 & 0\\ 0 & 0 & 0 & 1\\ 0 & 0 & 0 & 0 \end{array}\right) \ \mbox{ and }\ B=\left( \begin{array}{r} 0\\ -\beta\\ 0\\ \gamma \end{array}\right) \end{aligned}$$

    be given where \(\hat {T}>0\), α > 0, β > 0 and γ > 0 are constants. It is assumed that \(u\in L_{\infty }([0,\hat {T}])\). Show that this system satisfies the Hautus condition.

  5. (9.5)

    For the linear system in exercise (9.4) assume in addition that the terminal time \(\hat {T}\) is sufficiently large. Moreover, let the initial condition

    $$\displaystyle \begin{aligned} x(0)=x^{0} \end{aligned}$$

    with \(x^{0}\in \mathbb {R}^{4}\) and the terminal condition

    $$\displaystyle \begin{aligned} x(\hat{T})=0_{\mathbb{R}^{4}} \end{aligned}$$

    be given. For the control u we assume

    $$\displaystyle \begin{aligned} \| u\|{}_{L_{\infty}([0,\hat{T}])}\leq 1. \end{aligned}$$

    It can be proved with a known result from control theory that this system can be steered from x 0 to \(0_{\mathbb {R}^{4}}\) in finite time. Show then that a time minimal control exists which is unique, and give a characterization of this time minimal control.