Minimum energy for linear systems with finite horizon: a non-standard Riccati equation

Acquistapace, P.; Gozzi, F.

doi:10.1007/s00498-017-0204-y

Minimum energy for linear systems with finite horizon: a non-standard Riccati equation

Original Article
Published: 20 October 2017

Volume 29, article number 19, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Mathematics of Control, Signals, and Systems Aims and scope Submit manuscript

Minimum energy for linear systems with finite horizon: a non-standard Riccati equation

Download PDF

P. Acquistapace¹ &
F. Gozzi²

303 Accesses
1 Citation
Explore all metrics

Abstract

This paper deals with a non-standard infinite dimensional linear quadratic control problem arising in the physics of non-stationary states (see, for example, Bertini et al. J Statist Phys 116:831–841, 2004): finding the minimum energy to drive a fixed stationary state $\bar{x}=0$ into an arbitrary non-stationary state x. The Riccati equation (RE) associated with this problem is not standard since the sign of the linear part is opposite to the usual one, thus preventing the use of the known theory. Here we consider the finite horizon case when the leading semigroup is exponentially stable. We prove that the linear selfadjoint operator P(t), associated with the value function, solves the above-mentioned RE (Theorem 4.12). Uniqueness does not hold in general, but we are able to prove a partial uniqueness result in the class of invertible operators (Theorem 4.13). In the special case where the involved operators commute, a more detailed analysis of the set of solutions is given (Theorems 4.14, 4.15 and 4.16). Examples of applications are given.

The turnpike property and the longtime behavior of the Hamilton–Jacobi–Bellman equation for finite-dimensional LQ control problems

Article 16 June 2022

On the solvability of the Yakubovich linear-quadratic infinite horizon minimization problem

Article 17 December 2019

On the Asymptotic Stability for Nonlinear Oscillators with Time-Dependent Damping

Article 23 October 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

This paper is devoted to the study of a family of non-standard linear quadratic finite horizon minimum energy problems in Hilbert spaces: finding the minimum energy to drive a dynamical system from a fixed equilibrium state 0 (at time $t=t_0$) into an arbitrary non-equilibrium state x (at time $t=t_1$). These problems arise (in particular when $t_0 \rightarrow -\infty $ and $t_1=0$) in the control representation of the rate function for a class of large deviation problems (see, for example, [12] and the references quoted therein; see also [18, Chapter 8] for an introduction to the subject); it is motivated by applications in the physics of non-equilibrium states, and in this context, it has been studied in various papers, see, for example, [3,4,5,6,7].

In such papers the state equation is possibly nonlinear and the energy function can be state dependent. One of the main goals, formulated, for example, in [6] in the infinite horizon case, is then to show that the value function is the unique (or maximal/minimal) solution of the associated Hamilton–Jacobi–Bellman (HJB) equation. Our goal is exactly this one. Due to the difficulty of the problem we restrict ourselves to study the linear quadratic case: hence solving the HJB equation reduces to solve a Riccati equation (RE). In this paper, as a first step, we consider the finite horizon problem which we describe in the next subsection together with our main results.

1.1 Description of the problem and of the main results

To better clarify our results we state, roughly and informally, the mathematical problem (see Sect. 2.2 for a precise description). The state space X and the control space U are both real separable Hilbert spaces. We take the linear controlled system in X

$$\begin{aligned} \left\{ \begin{array}{ll} y'(s)=Ay(s)+Bu(s), \quad s\in [0,t], \\ y(0) = 0, \end{array}\right. \end{aligned}$$

(1)

where $A: D(A) \subset X \rightarrow X$ generates an exponentially stable strongly continuous semigroup and $B:U\rightarrow X$ is a linear bounded operator (see Assumption 2.1 below). Given a point $x \in X$ we consider the set $\mathcal{U}_{[0,t]}(0,x) $ of all control strategies $u(\cdot )$ that drive the system from the equilibrium state 0 (at time $s=0$) into an arbitrary non-equilibrium state x (at time $s=t$). It is well known (see Sect. 2.2, Theorem 2.7-(i)) that the set $\mathcal{U}_{[0,t]}(0,x) $ is non-empty if and only if $x \in R(Q_t^{1/2})$ where $Q_t$ is the so-called controllability operator (see Definition 2.3).

We want to minimize the “energy-like” cost functional

$$\begin{aligned} J_{[0,t]}(u) =\frac{1}{2} \int _{0}^t \Vert u(s)\Vert ^2_U\, \mathrm {d}s. \end{aligned}$$

(2)

The value function V(t, x) is defined as

$$\begin{aligned} V(t,x) = \inf _{u\in \mathcal{U}_{[0,t]}(0,x)} J_{[0,t]}(u), \end{aligned}$$

(3)

and it is finite only when $x \in R(Q_t^{1/2})$. As the problem is linear quadratic, V is a quadratic form in the variable x, i.e., $V(t,x)= \langle R(t)x, x\rangle _X$ for some symmetric operator-valued function $R(\cdot )$. Hence, we can consider the associated Riccati equation (RE) in X (with unknown $R(\cdot )$) which is, formally,

$$\begin{aligned} \frac{d }{d t}\langle R(t)x,y\rangle _X=-\langle Ax, R(t)y\rangle _X - \langle R(t)x, Ay\rangle _X -\langle B^*R(t) x, B^*R(t)y \rangle _U \end{aligned}$$

(4)

for every $x,y \in D(A)\cap D(R(t))$, with the initial condition $R(0)=+\infty $. Note that for each $t>0$ the operator R(t) is unbounded (because $V(t,\cdot )$ is defined only in $R(Q_t^{1/2})$). A way to overcome this problem, which we use in this paper, is to rewrite the problem in a space where the solution of the Riccati equation is a bounded operator. We are able to do that when a null controllability assumption hold (see Assumption 2.4). Indeed in such case (see Proposition C.2-(ii)) the spaces $R(Q_t^{1/2})$ are constant in time for sufficiently large t, hence they are all equal to a given Hilbert space that we call H (see Sect. 4.1). Then, we rewrite (4) in H so that its unknown (that we call $P(\cdot )$) becomes a bounded operator, see Sect. 4.3 for explanations.

Note that the sign of the linear part of (4) (the first two terms of the right-hand side) is opposite to the usual one (see, for example, for minimum energy problem in Hilbert spaces, [12, 15, 16, 19, 24, 29]). This does not allow us to approach (4) using the standard method (described, for example, in [2, pp. 390–394 and 479–486], see also [24, p. 1018]), which consists in solving the RE using a fixed point theorem and a suitable a priori estimate. For forward RE like ours this is possible when the sign of the linear part is positive (in order to get a suitable semigroup generation property^{Footnote 1}) and the quadratic term is negative (in order to get the a priori estimate).

On the other hand the opposite sign of the linear part comes from the nature of the motivating problem: to look at the minimum energy path from equilibrium to non-equilibrium states (see [6]), which is the opposite direction of the standard one considered, for example, in [8, 9, 24, 28], (see also the books [2, 10, 11]). This means that the value function depends on the final point, while in the above-quoted problems it depends on the initial one (see also Remark 4.1 on this). Therefore, we are driven to use a different approach that exploits the structure of the problem; we partially borrow some ideas from [24] and from^{Footnote 2} [21] and [25]. The main idea comes from the fact that the candidate solution of the RE associated with the value function V is the pseudoinverse of the unique solution of a Lyapunov linear equation, which is easier and is studied in Sect. 3 providing an existence and uniqueness result in Proposition 3.3.

We list now our main results. We show, under the above-mentioned null controllability Assumption 2.4, that the value function solves the associated RE (Theorem 4.12) and that a partial uniqueness holds (Theorem 4.13). In the special case when A is selfadjoint and A and $BB^*$ commute we are able to prove that the spaces $R(Q_t^{1/2})$ all coincide for every $t>0$ (Proposition C.2-(iii)), hence we do not need Assumption 2.4 and, moreover, we can go deeper, finding more insights on the structure of the family of solutions (Theorems 4.14, 4.15 and 4.16).

1.2 Plan of the paper

Section 2 is devoted to the presentation of our finite horizon minimum energy problem: after the description of our assumptions (Sect. 2.1) we provide the general formulation of the problem in Sect. 2.2.

Section 3 is devoted to the study of the associated Lyapunov equation, a key tool for the analysis of our RE. The main result of this section (Proposition 3.3) is more general than what we found in the literature and is then completely proved.

Section 4 is devoted to the analysis of the RE and to the presentation of the main results. It is divided in 5 subsections.

In the first (Sect. 4.1), we study the properties of the space H which seems the good one where to study the RE.
Sect. 4.2 concerns the study of the regularity properties of V.
In Sect. 4.3 we prove that V solves the RE (Theorem 4.12).
In Sect. 4.4 we present our partial uniqueness result (Theorem 4.13).
In Sect. 4.5 we refine our results in the special case of selfadjoint commuting operators.

Finally, Sect. 5 contains two significant examples. At the end there is an Appendix divided into 4 parts. In the first three we collect some preliminary results on pseudoinverses (A), commuting operators (B), and controllability operators (C). In the last one (D) we collect the proofs of several lemmas and propositions.

2 Minimum energy problems

2.1 Assumptions

Let $-\infty<s<t<+\infty $. Consider the abstract linear equation

$$\begin{aligned} \left\{ \begin{array}{l} y'(r)=Ay(r)+Bu(r), \quad r\in \, ]s,t] \\ y(s) = z \in X, \end{array} \right. \end{aligned}$$

(5)

under the following assumption.

Assumption 2.1

(i)
X, the state space, and U, the control space, are real separable Hilbert spaces;
(ii)
$A:D(A)\subseteq X \rightarrow X$ is the generator of an exponentially stable $C_0$-semigroup in X: i.e., there exist $M>0$ and $\omega >0$ such that
$$\begin{aligned} \Vert e^{t A}\Vert _{\mathcal{L}(X)} \le M e^{-\omega t} \qquad \forall t\ge 0. \end{aligned}$$
(6)
(iii)
$B\in \mathcal{L}(U,X)$, where $\mathcal{L}(U,X)$ is the space of bounded linear operators from U to X;
(iv)
$u \in L^2(s,t;U)$.

We recall the following well-known result (see, for example, [23, p. 106, Corollary 2.2 and Definition 2.3]).

Proposition 2.2

For $-\infty<s<t<+\infty $, $z\in X$ and $u\in L^2(s,t;U)$, the mild solution of (5), defined by

$$\begin{aligned} y(r;s,z,u) = e^{(r-s)A}z + \int _s^r e^{(r-\sigma )A}Bu(\sigma ) \, \mathrm {d}\sigma , \quad r\in [s,t], \end{aligned}$$

(7)

is in C([s, t], X).

In the sequel we will always assume that Assumption 2.1 holds, without mentioning it. Moreover, to prove most of the results of the paper we will also need the assumption below. We state it now, and we will say explicitly when we will use it. Before all we need to define the so-called controllability operator.

Definition 2.3

For $t\ge 0$ set

$$\begin{aligned} Q_t z = \int _0^t e^{rA} B B^* e^{rA^*}z \, \mathrm {d}r, \qquad z\in X, \end{aligned}$$

(8)

and, for $t=+\infty $,

$$\begin{aligned} Q_\infty z = \lim _{t \rightarrow +\infty } Q_t z= \int _0^{+\infty } e^{rA} B B^* e^{rA^*}z \, \mathrm {d}r, \qquad z\in X. \end{aligned}$$

(9)

Note that $Q_\infty $ is well defined by Assumption 2.1-(ii).

Assumption 2.4

There exists $T_0>0$ such that^{Footnote 3}

$$\begin{aligned} R(e^{T_0A})\subseteq R\left( Q_{T_0}^{1/2}\right) , \end{aligned}$$

(10)

It is well known (see, for example, [14, Appendix D]) that this assumption is equivalent to assume null controllability at time $T_0$ for the system (5): this means that for each $z\in X$ there exists a control $u\in L^2(0,T_0;U)$ such that the solution of (5) with $[s,t]=[0,T_0]$ vanishes at time $T_0$.

Remark 2.5

We have supposed in Assumption 2.1 that the semigroup $\{e^{tA}\}_{t\ge 0}$ is exponentially stable, also in view of a future study of the infinite horizon case. Anyway, we believe that some of the main results of this paper should hold even in the absence of exponential stability. This will be the objective of future work.
When Assumption 2.4 holds we know (see Proposition C.2(ii)) that the spaces $R(Q_t^{1/2})$ are all equal for $t \ge T_0$; this is a key tool for our analysis. In the case when A is selfadjoint and commutes with $BB^*$ (see Sect. 4.5) the framework is simpler; in particular, we can prove that the spaces $R(Q_t^{1/2})$ are all equal for $t >0$ (Proposition C.2(iii)) without using the null controllability Assumption 2.4. This allows us to prove Theorems 4.14, 4.15 and 4.16, without using such assumption.

2.2 General formulation

Given a time interval $[s,t]\subset {\mathbb R}$, an initial state $z\in X$ and a control $u\in L^2(s,t;U)$ we consider the state equation (5) and its mild solution $y(\cdot ; s,x,u)$, given by (7). We define the class of controls $u(\cdot )$ bringing the state $y(\cdot )$ from a fixed $z\in X$ at time s to a given target $x\in X$ at time t:

$$\begin{aligned} \mathcal{U}_{[s,t]}(z,x) {\mathop {=}\limits ^{\text {def}}}\left\{ u\in L^2(s,t;U) \; : \; y(t;s,z,u)=x \right\} . \end{aligned}$$

(11)

We recall our cost functional, namely the energy:

$$\begin{aligned} J_{[s,t]}(u) = \frac{1}{2} \int _s^t \Vert u(r)\Vert _U^2\, \mathrm {d}r. \end{aligned}$$

(12)

The minimum energy problem at (s, t; z, x) is the problem of minimizing the functional $J_{[s,t]}(u)$ over all $u \in \mathcal{U}_{[s,t]}(z,x)$. The value function of this control problem (the minimum energy) is

$$\begin{aligned} V_1(s,t;z,x){\mathop {=}\limits ^{\text {def}}}\inf _{u\in \mathcal{U}_{[s,t]}(z,x)} J_{[s,t]}(u). \end{aligned}$$

(13)

with the agreement that the infimum over the empty set is $+\infty $. The following easy proposition, straightforward consequence of (7), allows to reduce the number of variables.

Proposition 2.6

We have, for all $-\infty<s<t<+\infty $,

$$\begin{aligned} \begin{array}{lcl} u(\cdot )\in \mathcal{U}_{[s,t]}(z,x)&{} \iff &{} u(\cdot +t)\in \mathcal{U}_{[s-t,0]}(0,x-e^{(t-s)A}z) \\ &{} \iff &{} u(\cdot +s)\in \mathcal{U}_{[0,t-s]}(0,x-e^{(t-s)A}z).\end{array} \end{aligned}$$

(14)

and then

$$\begin{aligned} V_1(s,t;z,x)=V_1(s-t,0;0,x-e^{(t-s)A}z) =V_1(0,t-s;0,x-e^{(t-s)A}z). \end{aligned}$$

$\square $

From now on we will set, for simplicity of notation,

$$\begin{aligned} V(t,x) := V_1(0,t;0,x) = \inf _{u\in \mathcal{U}_{[0,t]}(0,x)} J_{[0,t]}(u) \qquad \forall t\in \,]0,+\infty [\,,\quad \forall x\in X. \end{aligned}$$

(15)

Now we look at the set where V is finite: this is the reachable set in the interval [0, t], starting from 0, defined as

$$\begin{aligned} \mathcal{R}_{[0,t]}^0:= \left\{ x \in X:\ \mathcal{U}_{[0,t]}(0,x) \ne \emptyset \right\} . \end{aligned}$$

(16)

Defining the operator

$$\begin{aligned} \mathcal{L}_{t}:L^2(0,t;U) \rightarrow X, \qquad \mathcal{L}_{t}u=\int _0^{t} e^{(t-\tau )A} B u(\tau )\, \mathrm {d}\tau , \end{aligned}$$

it is clear that

$$\begin{aligned} \mathcal{R}_{[0,t]}^0:= \mathcal{L}_{t}\left( L^2(0,t;U)\right) ; \end{aligned}$$

(17)

hence, the set where V is finite is just $R\left( \mathcal{L}_{t}\right) $.

We now recall a fundamental and well-known result, which establishes the relationship between the family of operators $\{Q_t, \;t\in [0,+\infty ]\}$ and our minimum energy problem (see, for example, [29, Theorem 2.3, p. 210]).

Theorem 2.7

Let $x\in X$ and $t>0$.

(i)
The set $\mathcal{U}_{[0,t]}(0,x)$ is non-empty if and only if $x \in R({Q}^{1/2}_{t})$. In particular, we have
$$\begin{aligned} \mathcal{L}_{t}\left( L^2(0,t;U)\right) =\mathcal{R}^0_{[0,t]}=R(Q_{t}^{1/2}) \qquad \forall t\ge 0\,. \end{aligned}$$
(18)
(ii)
If $x \in R({Q}^{1/2}_{t})$, there is exactly one minimizing strategy $\hat{u}_{t,x}$ for the functional $J_{[0,t]}$ over $\mathcal{U}_{[0,t]}(0,x)$, and, moreover,
$$\begin{aligned} V(t,x) =J_{[0,t]}(\hat{u}_{t,x}) =\frac{1}{2} \Vert Q^{-1/2}_t x\Vert _X^2, \end{aligned}$$
(19)
where, for $t>0$, ${ Q}^{-1/2}_t: R(Q_t^{1/2})\rightarrow [\ker { Q}_t^{1/2}]^\perp $ is the pseudoinverse of $Q^{1/2}_t$.
(iii)
If $x \in R(Q_t)$, then $V(t,x) =\frac{1}{2} \langle Q^{-1}_t x,x \rangle _X$, where $Q_t^{-1}: R({Q}_t)\rightarrow [\ker {Q}_t]^\perp $ is the pseudoinverse of $Q_t$.

Since V is quadratic, the HJB equation associated with our problem becomes a differential Riccati equation, namely (4). Our main aim is then to prove that the linear symmetric operator R associated with V is a solution of such Riccati equation and prove a kind of uniqueness result. We will do this in Sect. 4.

Remark 2.8

It is possible to extend the above minimum energy problem to the case when $s=-\infty $ or when $t=+\infty $. The energy functional becomes then an integral over a half-line. In the first case we have to take the initial datum $z=0$ and, properly defining the mild solutions in the left half-line (requiring that (7) is satisfied for all $r\ge s =-\infty $), we have to define the set of control strategies as follows:

$$\begin{aligned} \mathcal{U}_{[-\infty ,t]}(0,x) {\mathop {=}\limits ^{\text {def}}}\left\{ u\in L^2(-\infty ,t;U) \; : \; y(t;-\infty ,0,u)=x \right\} . \end{aligned}$$

In the second case the problem is trivial. Indeed formally one should define

$$\begin{aligned} \mathcal{U}_{[s,+\infty ]}(z,x) {\mathop {=}\limits ^{\text {def}}}\left\{ u\in L^2(s,+\infty ;U) \; : \; \lim _{t\rightarrow + \infty } y(t;s,z,u)=x \right\} . \end{aligned}$$

However, it is easy to show that, due to the exponential stability of $\{e^{tA}\}$, for every $u\in L^2(s,+\infty ;U)$ we have $\lim _{t\rightarrow + \infty } y(t;s,z,u)=0$, so that the class $\mathcal{U}_{[s,+\infty ]}(z,x)$ is empty unless $x=0$; in this case the optimal control strategy is clearly $u \equiv 0$.

In a subsequent paper we will study the infinite horizon problem when the starting time is $- \infty $ and the arrival time is 0: the value function of this problem is formally $V_1(-\infty ,0;0,x)=V(+\infty , x)$. Some results about it will be also given in the present paper. For simplicity we will use the notation

$$\begin{aligned} V_\infty (x):=V_1(-\infty ,0;0,x). \end{aligned}$$

In Proposition 4.8 we will prove that

$$\begin{aligned} V_\infty (x) = \lim _{t\rightarrow +\infty } V(t,x)=\inf _{t>0} V(t,x). \end{aligned}$$

$\square $

3 The Lyapunov equation

We want now to show that the function $t\rightarrow Q_t$, from $[0,+\infty )$ to $\mathcal{L}(X)$, solves a suitable Lyapunov equation. To this purpose we prove first the following lemma.

Lemma 3.1

(i):: If $x\in D(A^*)$, then for every $t \in [0,+\infty ]$ we have $x \in D(AQ_t)$ and
$$\begin{aligned} AQ_t x= & {} e^{tA}BB^* e^{tA^*}x - BB^*x \nonumber \\&- Q_tA^*x \qquad \forall x\in D(A^*), \quad \forall t \in [0,+\infty [\,, \end{aligned}$$
(20)

$$\begin{aligned} AQ_\infty x= & {} - BB^*x - Q_\infty A^*x \qquad \forall x\in D(A^*). \end{aligned}$$
(21)
(ii):: For every $t\in [0,+\infty ]$ we have $D(A^*) \subseteq D((AQ_t)^*)\subseteq D(AQ_t)$, they all are dense in X, and
$$\begin{aligned} (AQ_t) x= & {} e^{tA}BB^* e^{tA^*}x - (AQ_t)^*x \nonumber \\&- BB^*x \quad \forall x\in D((AQ_t)^*), \quad \forall t \in [0,+\infty [\,. \end{aligned}$$
(22)

$$\begin{aligned} (AQ_\infty ) x= & {} - (AQ_\infty )^*x - BB^*x \quad \forall x\in D((AQ_\infty )^*). \end{aligned}$$
(23)
(iii):: For every $t\in \,]0,+\infty ]$, if $x\in Q_t(D((AQ_t)^*))$, then $Ax \in [\ker Q_t]^\perp $.
(iv):: For every $t\in \,]0,+\infty ]$, $Q_t(D(A^*))$ is dense in $[\ker Q_t]^\perp $.

Proof

(i):

Let $x\in D(A^*)$. Since A generates an exponentially stable semigroup, it is also invertible. Then, we can write, integrating by parts:

$$\begin{aligned} Q_tx= & {} \int _0^t e^{rA}BB^*e^{rA^*}x\,\mathrm {d}r \\= & {} A^{-1} \left[ e^{rA}BB^* e^{rA^*}x \right] _0^t - A^{-1}\int _0^t e^{rA}BB^*e^{rA^*}A^*x\,\mathrm {d}r \\= & {} A^{-1} \left[ e^{tA}BB^* e^{tA^*}x - BB^*x - Q_t A^*x\right] , \end{aligned}$$

and (i) follows.

(ii):

The first inclusion follows from the definition of the adjoint’s domain. Indeed if $y\in D(A^*)$, then

$$\begin{aligned} \exists c>0 \; \hbox {such that } \; |\left\langle Ax,y \right\rangle _X |\le c\Vert x\Vert _X \quad \forall x \in D(A). \end{aligned}$$

In such formula, choosing any $z \in X$ and setting $x= Q_t z$ we get

$$\begin{aligned} |\left\langle AQ_tz ,y \right\rangle _X |\le c\Vert Q_t z\Vert _X \le c\Vert Q_t\Vert \Vert z\Vert _X\quad \forall z \in D(AQ_t). \end{aligned}$$

This implies $y \in D((AQ_t)^*)$.

Concerning the second inclusion, if $x\in D((AQ_t)^*)$ we can write for each $y\in D(A^*)$, by (20), for some $c_t>0$,

$$\begin{aligned} |\langle Q_tx,A^*y\rangle _X|= & {} |\langle x,Q_tA^*y\rangle _X| \\= & {} |\langle x, -AQ_ty + e^{tA}BB^* e^{tA^*}y - BB^*y\rangle _X| \\= & {} |\langle -(AQ_t)^*x + e^{tA}BB^* e^{tA^*}x - BB^*x, y\rangle _X| \le c_t \Vert y\Vert _X\,, \end{aligned}$$

so that $Q_tx\in D(A)$, i.e., $x\in D(AQ_t)$, and (22) holds. This proves the claim for $t\in [0,+\infty [$. For the case $t=+\infty $ we argue in a similar way: let $x\in D((AQ_\infty )^*)$; then for each $y\in D(A^*)$, by (21) we have, for some $c_\infty >0$,

$$\begin{aligned} |\langle Q_\infty x,A^*y\rangle _X|= & {} |\langle x,Q_\infty A^*y\rangle _X| \\= & {} |\langle x, -AQ_\infty y - BB^*y\rangle _X| \\= & {} |\langle -(AQ_\infty )^*x - BB^*x, y\rangle _X| \le c_\infty \Vert y\Vert _X\,, \end{aligned}$$

so that $Q_\infty x\in D(A)$, i.e., $x\in D(AQ_\infty )$, and (23) holds.

(iii):

By assumption we have $x=Q_t z$ with $z\in D((AQ_t)^*)$. Let $w\in \ker Q_t$. By the proof of point (ii) of Proposition C.1 we get $B^*e^{sA^*}w=0$ for all $s\in [0,t]$. Moreover, w belongs obviously to $D(AQ_t)$ (with $AQ_t w =A\,0 =0$). Hence, we have by (22), when $t<+\infty $

$$\begin{aligned} \langle Ax,w\rangle _X= & {} \langle AQ_t z, w\rangle _X = \langle B^*e^{tA^*}z,B^*e^{tA^*}w\rangle _X \\&-\langle B^*z,B^*w\rangle _X - \langle z,AQ_t w\rangle _X = 0. \end{aligned}$$

As a consequence, $Ax\in [\ker Q_\infty ]^\perp $. Similarly, if $t=+\infty $, we have for every $w\in \ker Q_\infty $, by (23),

$$\begin{aligned} \langle Ax,w\rangle _X = \langle AQ_\infty z, w\rangle _X = -\langle B^*z,B^*w\rangle _X - \langle z,AQ_\infty w\rangle _X = 0, \end{aligned}$$

and the claim follows.

(iv):

We just consider the case $t=+\infty $, since the case $0<t <+\infty $ is quite similar. Fix $x\in [\ker Q_\infty ]^\perp $. As $[\ker Q_\infty ]^\perp = \overline{R(Q_\infty )}$ there is a sequence of elements $x_n\in R(Q_\infty )$ such that $x_n \rightarrow x$. Hence, there exists $\{z_n\}\subset X$ such that $Q_\infty z_n \rightarrow x$ in X. Since $D(A^*)$ is dense in X, for each $n\in {\mathbb N}^+$ we can find $y_n \in D(A^*)$ such that $\Vert y_n-z_n\Vert _X<1/n$, so that $Q_\infty y_n \rightarrow x$ in X, too.

For the last statement, observe first that, since $D(A^*) \subseteq D((AQ_\infty )^*)$, then $Q_\infty (D(A^*)) \subseteq Q_\infty (D((AQ_\infty )^*))$, so the latter is dense in $[\ker Q_\infty ]^\perp $, too.

$\square $

Definition 3.2

A map $Q(\cdot ):[0,+\infty ) \rightarrow \mathcal {L}(X)$ is a solution of the differential Lyapunov equation

$$\begin{aligned} \left\{ \begin{array}{l} Q'(t) = AQ(t) + Q(t)A^* + BB^*, \quad t>0,\\ Q(0)=0, \end{array} \right. \end{aligned}$$

(24)

if:

for each $t\ge 0$ the operator Q(t) is positive and selfadjoint and $Q(0)=0$;
for each $t\ge 0$ and $x \in D(A^*)$ we have $Q_t x \in D(A)$;
for each $x \in D(A^*)$ the map $t\rightarrow Q(t)x$ is differentiable and
$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}Q_{(t)}x = AQ_{(t)}x + Q_{(t)}A^*x + BB^*x \qquad \forall t>0. \end{aligned}$$

Similarly an operator $Q\in \mathcal {L}(X)$ is a solution of the algebraic Lyapunov equation

$$\begin{aligned} AQ + QA^* + BB^*=0 \end{aligned}$$

(25)

if:

Q is positive and selfadjoint;
for each $x \in D(A^*)$ we have $Q x \in D(A)$;
for each $x \in D(A^*)$
$$\begin{aligned} AQx + QA^*x + BB^*x=0. \end{aligned}$$

Proposition 3.3

The operator $Q_t$ defined by (8) is a solution of the differential Lyapunov equation (24). Similarly the operator $Q_\infty $ solves the algebraic Lyapunov equation (25).

Moreover, for all $t\ge 0$, let Q(t) be positive and selfadjoint, and such that it solves the Lyapunov equation (24) in weak sense, i.e., $Q(0)=0$, the map $t\mapsto \left\langle Q(t) x,y\right\rangle _X$ is differentiable for every $x,y \in D(A^*)$ and

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\left\langle Q(t)x,y\right\rangle _X = \left\langle Q(t)x,A^*y\right\rangle _X +\left\langle A^*x, Q(t)y\right\rangle _X + \left\langle B^*x,B^*y\right\rangle _U \qquad \forall t>0. \end{aligned}$$

Then, $Q(t)=Q_t$ for all $t\ge 0$.

Similarly let Q be positive and selfadjoint, and such that it solves the Lyapunov equation (25) in weak sense, i.e., for all $x,y \in D(A^*)$

$$\begin{aligned} \left\langle Qx,A^*y\right\rangle _X +\left\langle A^*x, Qy\right\rangle _X + \left\langle B^*x,B^*y\right\rangle _U =0 . \end{aligned}$$

Then, $Q=Q_\infty $.

Proof

We give the proof for the reader’s convenience since we did not find it in the literature. Indeed, in [11, Theorem 5.1.3], in [2, part II, Chapter 1, Theorem 2.4] and in [14, Appendix D] only the algebraic Riccati equation is considered, and it is shown that it has a positive operator-valued solution if and only if the semigroup generated by A is exponentially stable.

Consider first the differential Lyapunov equation (24). By the definition of $Q_t$ we obviously have

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}Q_tx = e^{tA}BB^* e^{tA^*} x \qquad \forall x\in X. \end{aligned}$$

Then, the existence result follows from Lemma 3.1-(i).

Concerning uniqueness, we observe that, if $Q_1(t)$ and $Q_2(t)$ are two functions with values in the space of bounded, selfadjoint, positive operators, and they both solve (24) in weak sense, then the difference $Q(t):=Q_1(t)-Q_2(t)$ satisfies the homogeneous equation

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\left\langle Q(t)x,y\right\rangle _X = \left\langle Q(t)x,A^*y\right\rangle _X +\left\langle A^*x, Q(t)y\right\rangle _X \qquad \forall t>0, \end{aligned}$$

with $Q(0)=0$. Now take any $x \in D(A^*)$ and $t_0>0$ and observe that, by simple computations,

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle Q(t)e^{(t_0-t)A^*}x,e^{(t_0-t)A^*}x \rangle _X=0, \end{aligned}$$

so that it must be

$$\begin{aligned} \langle Q(t_0)x,x \rangle _X= \langle Q(0)e^{t_0 A^*}x,e^{t_0 A^*}x \rangle _X=0. \end{aligned}$$

Since $Q(t_0)$ is selfadjoint, we can use polarization to get $Q(t_0)=0$ for every $t_0>0$ and so the claim.

Now we look at the algebraic Lyapunov equation (25). From (21) it follows that $Q_\infty $ solves (25). To show uniqueness, similarly for the case of the differential Lyapunov equation, we observe that, if $Q_1$ and $Q_2$ are two bounded, selfadjoint, positive operators which solve (25) in weak sense, then the difference $Q:=Q_1-Q_2$ satisfies the homogeneous equation

$$\begin{aligned} \left\langle Qx,A^*y\right\rangle _X +\left\langle A^*x, Qy\right\rangle _X =0 . \end{aligned}$$

Hence, for any $x \in D(A^*)$ as before we deduce

$$\begin{aligned} \frac{d}{dt}\langle Qe^{tA^*}x,e^{tA^*}x \rangle _X=0, \end{aligned}$$

so that it must be, since A is of negative type,

$$\begin{aligned} \langle Qx,x \rangle _X=\lim _{t \rightarrow + \infty } \langle Qe^{t A^*}x,e^{t A^*}x \rangle _X=0. \end{aligned}$$

As above, since Q is selfadjoint, we use polarization getting $Q=0$ and so $Q_1=Q_2$. $\square $

Remark 3.4

If A is selfadjoint and commutes with $BB^*$ (see Sect. 5.2 for a typical example of this case), then, by Proposition C.1-(v), it also commutes with $Q_t$, $t \in [0,+\infty ]$. Moreover, by the Lyapunov equation (24) we have, for all $x \in D(A)$,

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}Q_t x=2AQ_t x + BB^*x \end{aligned}$$

and, by (25), we have, for all $x \in D(A)$,

$$\begin{aligned} 2A Q_\infty x =-BB^* x. \end{aligned}$$

Indeed, this last equality holds for all $x \in X$, as it follows from (59). Finally, from the last one we easily get, for all $y \in R(Q_\infty )\subseteq D(A)$, taking $x=Q_\infty ^{-1}y$,

$$\begin{aligned} 2A y =- BB^* Q_\infty ^{-1}y. \end{aligned}$$

$\square $

4 The Riccati equation

From Theorem 2.7 above we know that the value function $V(t,\cdot )$ is finite only in the set $R(Q_t^{1/2})$ and is given by $V(t,x)=\frac{1}{2}\Vert Q_t^{-1/2}x\Vert ^2_X$. Moreover, for points $x\in R(Q_t)$ we can write $V(t,x)=\frac{1}{2}\langle Q_t^{-1}x,x \rangle _X$. So V is a quadratic form on X, defined, however, only for $x\in R(Q_t^{1/2})$; thus, we expect that the associated operator $Q_t^{-1}$ solves^{Footnote 4} our Riccati equation (4), which we rewrite here for the reader’s convenience:

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle R(t)x,y \rangle _X = -\langle Ax, R(t)y\rangle _X - \langle R(t)x,Ay\rangle _X - \langle B^*R(t) x,B^*R(t)y \rangle _U\,, \quad t>0, \end{aligned}$$

(26)

for every $x,y\in D(A)\cap D(R(t))$, with the initial condition $R(0^+)=+\infty $. This is indeed the case when the null controllability Assumption 2.4 holds, as we will prove later (see Theorem 4.12 below). Note that the initial condition has to be interpreted in the following sense: for every positive sequence $t_n\rightarrow 0^+$ we have

$$\begin{aligned} \lim _{n\rightarrow +\infty } \left\langle R(t_n)x,x\right\rangle _X=+\infty , \quad \forall x \in \bigcap _{n\in {\mathbb N}}D(R(t_n)), \; x \ne 0. \end{aligned}$$

(27)

Note, moreover, that we cannot expect uniqueness of the RE without any initial condition as, obviously, $R\equiv 0$ is a solution.

Equation (26) is difficult for several reasons: the infinite initial condition (arising also in [24]), the negative sign of the linear part (which does not arise in [24]) and the unboundedness of the expected solution (which is also not present in [24]). In particular, the difference due to the negative sign is substantial: even in the simplest diagonal case (see Sect. 5.2) there is no semigroup associated with the linear part of (26) on the whole space X, so that the equation cannot be rewritten in mild form as usual (see, for example, [29, Theorem 4.1, p. 234]).

Note that, if we change the sign of the linear part, then we are exactly in the case treated by [24], and the solution, when the null controllability Assumption 2.4 holds, just coincides with the operator-valued function on X given, formally, by $e^{tA^*}Q_t^{-1}e^{tA}$.

Note, moreover, that, in the case when A is selfadjoint and commutes with $BB^*$, the null controllability Assumption 2.4 is not needed to get the main results (Theorems 4.14, 4.15 and 4.16) since, in this case, thanks to Proposition C.2-(iii), the spaces $R(Q_t^{1/2})$ are all equal for every $t>0$. This fact (which is false in the general case without the null controllability Assumption 2.4, see Example C.5) is enough to prove Theorems 4.14, 4.15 and 4.16.

Remark 4.1

We observe that performing a time inversion in the state equation, or in the RE, does not change the difficulty of the problem, which lies in the fact that the equation is forward and the linear part is negative. Of course this is not true if A generates not just a $C_0$-semigroup but a $C_0$-group (this includes the case of bounded A). We do not want to assume this, since our examples, in particular the diagonal one (which arises in our motivating application to physics, see [6] and Sect. 5.2), do not possess such property. $\square $

4.1 The space H and its properties

In order to study equation (26) it will be useful to rewrite it in a different form and in a different space, which we call H: under the null controllability assumption (Assumption 2.4) it is, for $t\ge T_0$, the reachable set of the control system (5), hence the set where the value function V of (15) is well defined. Then, we define

$$\begin{aligned} H=R(Q_\infty ^{1/2}). \end{aligned}$$

(28)

Of course it holds

$$\begin{aligned} H \subseteq \overline{R(Q_\infty ^{1/2})}= [\ker Q_\infty ^{1/2}]^\perp =[\ker Q_\infty ]^\perp . \end{aligned}$$

The inclusion is in general proper. Define in H the inner product

$$\begin{aligned} \langle x, y\rangle _H = \langle Q_\infty ^{-1/2}x,Q_\infty ^{-1/2}y \rangle _X \qquad \forall x,y\in H. \end{aligned}$$

(29)

We provide now some useful results on the space H which will form the ground for our main results. We divide them into six Lemmas, whose proofs are collected in Appendix D. The first three (which do not need the null controllability Assumption 2.4) concern the structure of the space H and the behavior in H of the operators $Q_t$.

Lemma 4.2

(i)
The space H introduced in (28), endowed with the inner product (29), is a Hilbert space continuously embedded into X.
(ii)
The space $R(Q_\infty )$ is dense in H.
(iii)
The operator $Q_\infty ^{-1/2}$ is an isometric isomorphism from H to $[\ker Q_\infty ^{1/2}]^\perp $, and in particular
$$\begin{aligned} \Vert Q_\infty ^{-1/2}x\Vert _X = \Vert x\Vert _H \qquad \forall x\in H. \end{aligned}$$
(30)
(iv)
We have
$$\begin{aligned} \Vert Q_\infty ^{1/2}\Vert _{\mathcal{L}(X)}=\Vert Q_\infty ^{1/2}\Vert _{\mathcal{L}(H)}. \end{aligned}$$
(v)
For every $F \in \mathcal{L}(X)$ such that $R(F) \subseteq H$ we have $Q_\infty ^{-1/2}F \in \mathcal{L}(X)$, so that $F\in \mathcal{L}(X,H)$.

Lemma 4.3

For $0<t\le +\infty $ let $Q_t$ be the operator defined by (8). Then, if $t\in [T_0, + \infty ]$ the space $Q_t(D(A^*))$ is dense in H. In particular, $D(A)\cap H$ is dense in H.

Lemma 4.4

For $0<t\le +\infty $ let $Q_t$ be the operator defined by (8). Then,

(i)
$\langle z, Q_t^{-1/2}w\rangle _X = \langle Q_t^{-1/2}z, w\rangle _X$ for all $z,w\in R(Q_t^{1/2})$;
(ii)
$\langle Q_\infty ^{1/2}x, y\rangle _H = \langle x, Q_\infty ^{1/2}y\rangle _H$ for all $x,y\in H$;
(iii)
$\langle Q_\infty x, y\rangle _H = \langle x, Q_\infty y\rangle _H$ for all $x,y\in H$.

Suppose now that Assumption 2.4 holds. The next two lemmas deal with the operators $Q_t^{1/2}Q_\infty ^{-1/2}$ and $Q_t^{-1/2}Q_\infty ^{1/2}$. By Proposition C.2-(ii) and using the definition of H, we have

$$\begin{aligned} H = R(Q_\infty ^{1/2}) = R(Q_t^{1/2}) \qquad \forall t\ge T_0, \end{aligned}$$

so that $Q_t^{1/2}Q_\infty ^{-1/2}$ is well defined from H into H.

Lemma 4.5

Under Assumption 2.4, for fixed $t\ge T_0$ the operator $Q_t^{1/2}Q_\infty ^{-1/2}:H\rightarrow H$ is an isomorphism, with inverse $Q_\infty ^{1/2}Q_t^{-1/2}$. Similarly for fixed $t\ge T_0$ the operator $Q_t^{-1/2}Q_\infty ^{1/2}:H\rightarrow H$ is an isomorphism, with inverse $Q_\infty ^{-1/2}Q_t^{1/2}$.

Similarly, we have:

Lemma 4.6

Under Assumption 2.4, for fixed $t\ge T_0$ the operator $Q_t^{-1/2}Q_\infty ^{1/2}:X\rightarrow X$ is an isomorphism on the closed subspace $[\ker Q_\infty ]^\perp = \overline{R(Q_\infty ^{1/2})}=[\ker Q_\infty ^{1/2}]^\perp $, with

$$\begin{aligned}{}[Q_\infty ^{-1/2}Q_t^{1/2}]Q_t^{-1/2}Q_\infty ^{1/2}x=P_{[\ker Q_\infty ]^\perp }x \qquad \forall x\in X. \end{aligned}$$

The last lemma, which does not need the null controllability Assumption 2.4, describes the adjoint in H of an operator $L\in \mathcal{L}([\ker Q_\infty ]^\perp )\cap \mathcal{L}(H)$.

Lemma 4.7

Let $L\in \mathcal{L}([\ker Q_\infty ]^\perp )\cap \mathcal{L}(H)$. Then,

$$\begin{aligned} \langle Lx,y\rangle _H = \langle x, Q_\infty L^* Q_\infty ^{-1}y\rangle _H \qquad \forall x\in H, \quad \forall y\in R(Q_\infty ), \end{aligned}$$

where $L^*\in \mathcal{L}([\ker Q_\infty ]^\perp )$ is the adjoint of the operator L in $[\ker Q_\infty ]^\perp $.

To avoid confusion, for any $L\in \mathcal{L}(H)$ we will denote by $L^{*H}$ the adjoint of L in H, i.e., $L^{*H}=Q_\infty L^* Q_\infty ^{-1}$. Moreover, for a subspace V of H we will write $V^{*H}$ for the topological dual of V when H is identified with its dual.

We remark that, under Assumption 2.4, if $y\in R(Q_\infty )= Q_\infty ^{1/2}(H)$ we have $Q_\infty ^{-1/2}y \in H$ and, by Lemma 4.5, $Q_\infty ^{1/2}Q_t^{-1/2}Q_\infty ^{-1/2}y \in H$. Thus, $Q_\infty Q_t^{-1/2}Q_\infty ^{-1/2}y \in Q_\infty ^{1/2}(H)=R(Q_\infty )$. Consequently, under Assumption 2.4 we may write, by Lemma 4.7,

$$\begin{aligned}{}[Q_\infty ^{1/2}Q_t^{-1/2}]^{*H}y= & {} Q_\infty [Q_\infty ^{1/2}Q_t^{-1/2}]^*Q_\infty ^{-1}y \nonumber \\= & {} Q_\infty Q_t^{-1/2}Q_\infty ^{-1/2}y \qquad \forall y\in R(Q_\infty ). \end{aligned}$$

(31)

4.2 Properties of the value function

We now state the main properties of the value function V(t, x) defined by (15). The proofs are in Appendix D.

Proposition 4.8

The value function V given by (15) has the following properties:

(i)
For every $t_0 >0$ and $x \in R(Q_{t_0}^{1/2})$, the function $V(\cdot ,x)$ is decreasing in $\,]t_0,+ \infty [\,$.
(ii)
For every $t>0$ the function $V(t,\cdot )$ is quadratic with respect to $x\in R(Q_t^{1/2})$, i.e., there exists a linear positive selfadjoint operator
$$\begin{aligned} P_V(t): R(Q_{t}^{1/2})\subseteq H \rightarrow [R(Q_{t}^{1/2})]^{*H} \supseteq H \end{aligned}$$
such that
$$\begin{aligned} V(t,x)=\frac{1}{2} \langle P_V(t) x,x \rangle _{[R(Q_{t}^{1/2})]^{*H},R(Q_{t}^{1/2})} \qquad \forall t>0, \quad \forall x\in R(Q_{t}^{1/2}); \end{aligned}$$
(32)
moreover, we have
$$\begin{aligned} P_V(t) = [Q_\infty ^{1/2}Q_t^{-1/2}]^{*H}Q_\infty ^{1/2}Q_t^{-1/2} \qquad \forall t>0. \end{aligned}$$
(33)
(iii)
Assume now that Assumption 2.4 holds. Then,
1. (a)
  the operator $P_V(t)$ belongs to $\mathcal {L}(H)$ and
  $$\begin{aligned} V(t,x)=\frac{1}{2} \langle P_V(t) x,x \rangle _H \qquad \forall t\ge T_0, \quad \forall x\in H; \end{aligned}$$
  (34)
  in particular,
  $$\begin{aligned} P_V(t)x = Q_\infty Q_t^{-1}x \qquad \forall x\in R(Q_t), \quad \forall t\ge T_0. \end{aligned}$$
  (35)
  In addition
  $$\begin{aligned} \Vert P_V(\tau )\Vert _{\mathcal{L}(H)}\le \Vert P_V(t)\Vert _{\mathcal{L}(H)}\le \Vert P_V(T_0)\Vert _{\mathcal{L}(H)} < +\infty \qquad \forall \tau \ge t\ge T_0. \end{aligned}$$
  (36)
2. (b)
  The map $(t,x) \mapsto V(t,x)$ from $[T_0,+\infty [\,\times H$ to ${\mathbb R}$ is continuous, uniformly on $[T_0,+\infty [\,\times B_H(0,R)$ for every $R>0$; moreover, the map $t\mapsto P_V(t)$ from $[T_0,+\infty [\,$ to $\mathcal{L}(H)$ is continuous.
3. (c)
  Finally, we have
  $$\begin{aligned} \lim _{t\rightarrow +\infty } V(t,x)= \frac{1}{2} \Vert x\Vert _H^2 \qquad \forall x\in H. \end{aligned}$$
  (37)

Remark 4.9

Equations (33) and (35) show that the operator $Q_\infty Q_t^{-1}$, defined on $R(Q_t)$, has in fact an extension to all of H, given by $[Q_\infty ^{1/2}Q_t^{-1/2}]^{*H}Q_\infty ^{1/2}Q_t^{-1/2}$, i.e., P(t). $\square $

4.3 The value function solves the Riccati equation

We want now to show that the operator $P_V(t)$, given by (33) or (35), satisfies for $t\ge T_0$ the Riccati equation (26). To do this we first rewrite it in the space H. The unknown is now, for all $t\in [0,T]$, an operator $P(t) \in \mathcal{L}(H)$ which is, formally, $Q_\infty R(t)$ where R is the unknown of (26), while the equation is

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle P(t)x,y \rangle _H= & {} -\langle Ax, Q_\infty ^{-1}P(t)y\rangle _X - \langle Q_\infty ^{-1}P(t)x,Ay\rangle _X\nonumber \\&- \langle B^*Q_\infty ^{-1}P(t)x,B^*Q_\infty ^{-1}P(t)y \rangle _U. \end{aligned}$$

(38)

Note that the term in the left-hand side is written using the inner product of the space H, while the first two in the right-hand side are written with the inner product in X: they could be written in H, too, but at the price of requiring more regularity on the points x, y (since Ax, Ay in this case should belong to H).

Definition 4.10

Let $0<t_0<+\infty $.

(i)
An operator-valued function $P:[t_0,+\infty [\,\rightarrow \mathcal{L}_+(H)$ is a solution of the Riccati equation (38) if it is strongly continuous and for every $t\ge t_0$ there is a set $D_P(t) \subset H$, dense in H, such that for every $x,y\in D_P(t)$ there exists $\left\langle \frac{\mathrm {d}}{\mathrm {d}t}P(t)x,y\right\rangle _H$, all terms of (38) make sense and the equation holds.
(ii)
A function R, defined on $[t_0,+\infty [\,$ with values in the set of closed, densely defined, unbounded, positive operators in X, is a solution of the Riccati equation (26) if for every $t\ge t_0$ there is a set $D_R(t) \subset X$, dense in $[\ker Q_\infty ]^\perp $, such that for every $x,y\in D_R(t)$ there exists $\left\langle \frac{\mathrm {d}}{\mathrm {d}t}R(t)x,y\right\rangle _X$, all terms of (26) make sense and the equation holds.

Remark 4.11

(i):

In the above definition the domain $D_P(t)$ varies with time, since its natural choice (see the next theorem) is $D(A)\cap R(Q_t)$ which may change with time; similarly for the domain $D_R(t)$. Moreover, $D_R(t)$ is assumed to be dense in $[\ker Q_\infty ]^\perp $ and not in X, since its natural choice (see the next theorem) is $R(Q_t)$ which is indeed dense in $[\ker Q_\infty ]^\perp $ and not in X, in general.

(ii):

Note that we wrote equation (38) without the initial condition: the reason is that we are interested to study all solutions of such equation, also in view of the study of the infinite horizon case, where the initial condition disappears. Clearly, looking at our original minimum energy problem (see Theorem 2.7-(iii)), the natural condition for (38) (respectively (26)) is $P(0^+)=+\infty $ (respectively $R(0^+)=+\infty $); this condition, more precisely, reads as follows: for every positive sequence $t_n \rightarrow 0^+$,

$$\begin{aligned} \exists \lim _{n\rightarrow +\infty } \langle P(t_n)x,x \rangle _H =+\infty \qquad \forall x\in H, \; x \ne 0, \end{aligned}$$

(while the one for R(t) is given in (27)). $\square $

We present now our existence result.

Theorem 4.12

Suppose that Assumptions 2.1 and 2.4 hold. Then, the operator $P_V(t)$ given by (33) is a solution of (38) on $[T_0,+\infty [\,$, with the set $D_{P_V}(t)$ given by $D(A)\cap R(Q_t)$ for all $t\ge T_0$.

Moreover, the operator $R_V(t)=Q_t^{-1}$ is a solution of (26) on $[T_0,+\infty [\,$, with the set $D_{R_V}(t)$ given by $D(A)\cap R(Q_t)$ for all $t\ge T_0$.

Proof

Fix $t\ge T_0$ and $x\in R(Q_t)$; then, $P_V(t)x\in R(Q_\infty )$ since, using (35), $P_V(t)x = Q_\infty Q_t^{-1}x$ for all $x \in R(Q_t)$. Moreover, from the definition of $Q_t$ and Assumption 2.4 it follows that $[\ker Q_t]^\perp $ is constant in t for $t\ge T_0$, so that $Q_s^{-1}Q_s$ reduces to the identity on $[\ker Q_t]^\perp $ for s and t larger than $T_0$. Hence, for $h \ne 0$ sufficiently small we can write for $x,y\in R(Q_t)$

$$\begin{aligned} \left\langle \frac{P_V(t+h)-P_V(t)}{h}\,x,y \right\rangle _H= & {} \left\langle \frac{P_V(t+h)- Q_\infty Q_{t+h}^{-1}Q_{t+h}Q_t^{-1}}{h}\,x,y \right\rangle _H \\= & {} \left\langle \frac{P_V(t+h)[I-Q_{t+h}Q_t^{-1}]}{h}\,x,y \right\rangle _H \\= & {} \left\langle P_V(t+h)\frac{[Q_t-Q_{t+h}]}{h}\,Q_t^{-1}x,y \right\rangle _H \\= & {} \left\langle Q_\infty ^{-1/2}\frac{[Q_t-Q_{t+h}]}{h}\,Q_t^{-1}x,Q_\infty ^{-1/2}P_V(t+h)y \right\rangle _X\,. \end{aligned}$$

Now we easily deduce, since $Q_\infty ^{-1/2}e^{tA} \in \mathcal{L}(X)$ by Assumption 2.4,

$$\begin{aligned} Q_\infty ^{-1/2}\frac{[Q_t-Q_{t+h}]}{h}\,Q_t^{-1}x= & {} -\frac{1}{h} \int _t^{t+h}Q_\infty ^{-1/2}e^{sA}BB^*e^{sA^*}Q_t^{-1}x \,\mathrm {d}s \\\rightarrow & {} - Q_\infty ^{-1/2}e^{tA}BB^*e^{tA^*}Q_t^{-1}x\quad \text {in } X \text { as } h\rightarrow 0^+, \end{aligned}$$

so that, since $t\mapsto Q_\infty ^{-1/2}P_V(t)y$ is continuous by Proposition 4.8 (iii)(b), we readily obtain

$$\begin{aligned} \lim _{h\rightarrow 0} \left\langle \frac{P_V(t+h)-P_V(t)}{h}\,x,y \right\rangle _H= & {} - \langle Q_\infty ^{-1/2}e^{tA}BB^*e^{tA^*} Q_t^{-1}x,Q_\infty ^{-1/2}P_V(t)y\rangle _X \\= & {} - \langle P_V(t)e^{tA}BB^*e^{tA^*} Q_t^{-1}x,y\rangle _H\,. \end{aligned}$$

This shows that

$$\begin{aligned}&\exists \frac{\mathrm {d}}{\mathrm {d}t}\langle P_V(t)x,y\rangle _H \nonumber \\&\quad = - \langle e^{tA}BB^*e^{tA^*} Q_t^{-1}x,P_V(t)y\rangle _H \qquad \forall x,y\in R(Q_t), \qquad \forall t \in [T_0,+\infty [\,.\nonumber \\ \end{aligned}$$

(39)

Finally, using Proposition 3.3, for all $x,y\in R(Q_t)\cap D(A)$ we can compute for every $t\in [T_0, +\infty [\,$:

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle P_V(t)x,y\rangle _H= & {} - \langle Q_\infty ^{-1/2}e^{tA}BB^*e^{tA^*} Q_t^{-1}x,Q_\infty ^{-1/2}P_V(t)y\rangle _X \\= & {} - \langle Q_\infty ^{-1/2}e^{tA}BB^*e^{tA^*} Q_t^{-1}x,Q_\infty ^{-1/2}[Q_\infty Q_t^{-1}]y\rangle _X \\= & {} - \langle [AQ_t + (AQ_t)^* + BB^*] Q_t^{-1}x,Q_t^{-1}y\rangle _X \\= & {} - \langle Ax,Q_t^{-1}y\rangle _X - \langle Q_t^{-1}x,Ay\rangle _X - \langle B^*Q_t^{-1}x,B^*Q_t^{-1}y\rangle _U \\= & {} - \langle Ax,Q_\infty ^{-1}P_V(t)y\rangle _X- \langle Q_\infty ^{-1}P_V(t)x,Ay\rangle _X \\&- \langle B^*Q_\infty ^{-1}P_V(t)x,B^*Q_\infty ^{-1} P_V(t)y\rangle _U \,. \end{aligned}$$

This completes the proof of the first statement. The proof of the second one is completely similar and we omit it. $\square $

4.4 A partial uniqueness result

We are not able to prove a satisfactory uniqueness result; here is our statement which establishes uniqueness in a restricted class of solutions.

Theorem 4.13

Suppose that Assumptions 2.1 and 2.4 hold. Let $P_V(t)$ be defined by (33). Let S(t) be an operator defined in $[T_0,+\infty [\,$, with the following properties:

(i)
$S(t)\in \mathcal{L}(H)$, $S(t)=S(t)^{*H}$, $\exists S(t)^{-1}\in \mathcal{L}(H)$ and the maps $t\mapsto S(t)$, $t\mapsto S(t)^{-1}$ are strongly continuous;
(ii)
$S(t)^{-1}(Q_\infty (D(A^*))) \subseteq D((AQ_\infty )^*)$ for every $t\in [T_0,+\infty [\,$;
(iii)
for every $x \in S(t)^{-1}(Q_\infty (D(A^*)))$ and $t \ge T_0$ the map
$$\begin{aligned} h \mapsto \frac{1}{h}(S(t+h)S(t)^{-1}-I)x \end{aligned}$$
is bounded in a neighborhood of 0;
(iv)
for every $x,y\in S(t)^{-1}(R(Q_\infty ))\cap D(A)$ the following equation holds:
$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)x,y \rangle _H= & {} -\langle Ax, Q_\infty ^{-1}S(t)y\rangle _X - \langle Q_\infty ^{-1}S(t)x,Ay\rangle _X\\&- \langle B^*Q_\infty ^{-1}S(t)x,B^*Q_\infty ^{-1}S(t)y \rangle _U\,; \end{aligned}$$
(v)
there exists $t_0\ge T_0$ such that $S(t_0)= P_V(t_0)$.

Then, $S(t) \equiv P_V(t)$ in $[T_0,+\infty [\,$.

Proof

For fixed $t\ge T_0$, the above equation holds in particular for every $x,y\in S(t)^{-1}(Q_\infty (D(A^*)))$. Set now $\xi =S(t)x$, $\eta =S(t)y$: then we have $\xi ,\eta \in Q_\infty (D(A^*))$ and, replacing x and y into (iv) above, we get

$$\begin{aligned}&\left[ \frac{\mathrm {d}}{\mathrm {d}\tau }\langle S(\tau )S(t)^{-1}\xi ,S(t)^{-1}\eta \rangle _H\right] _{\tau =t} \\&\quad = -\langle AS(t)^{-1}\xi , Q_\infty ^{-1}\eta \rangle _X - \langle Q_\infty ^{-1}\xi ,AS(t)^{-1}\eta \rangle _X - \langle B^*Q_\infty ^{-1}\xi ,B^*Q_\infty ^{-1}\eta \rangle _U\,. \end{aligned}$$

Now we want to compute, whenever possible, $\frac{\mathrm {d}}{\mathrm {d}t}\left\langle S(t)^{-1}\xi ,\eta \right\rangle _H\,$. We have

$$\begin{aligned}&\left\langle \frac{S(t+h)^{-1}- S(t)^{-1}}{h}\xi ,\eta \right\rangle _H = -\left\langle S(t+h)^{-1}\left[ \frac{S(t+h)- S(t)}{h}\right] S(t)^{-1}\xi ,\eta \right\rangle _H \\&\quad = - \left\langle \left[ \frac{S(t+h)- S(t)}{h}\right] S(t)^{-1}\xi ,[S(t+h)^{-1}-S(t)^{-1}]\eta \right\rangle _H \\&\qquad - \left\langle \left[ \frac{S(t+h)- S(t)}{h}\right] S(t)^{-1}\xi ,S(t)^{-1}\eta \right\rangle _H. \end{aligned}$$

The second term clearly converges to

$$\begin{aligned} \left[ \frac{\mathrm {d}}{\mathrm {d}\tau }\langle S(\tau )S(t)^{-1}\xi ,S(t)^{-1}\eta \rangle _H\right] _{\tau =t}\,, \end{aligned}$$

whereas the first term goes to 0: indeed its first factor is bounded by assumption (iii), while the second one goes to 0 in view of the strong continuity of assumption (i). Hence, we have, by (iv),

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\left\langle S(t)^{-1}\xi ,\eta \right\rangle _H= & {} \langle AS(t)^{-1}\xi , Q_\infty ^{-1}\eta \rangle _X + \langle Q_\infty ^{-1}\xi ,AS(t)^{-1}\eta \rangle _X \\&+\langle B^*Q_\infty ^{-1}\xi ,B^*Q_\infty ^{-1}\eta \rangle _U\,. \end{aligned}$$

Now set $u_1=Q_\infty ^{-1}\xi $, $v_1=Q_\infty ^{-1}\eta $: by definition of pseudoinverses, we have $u_1,v_1\in [\ker Q_\infty ]^\perp $ and, since $\xi ,\eta \in Q_\infty (D(A^*))$ there exist $u_0,v_0\in \ker Q_\infty $ such that $u:=u_1+u_0$, $v:=v_1+v_0$ belong to $D(A^*)$ and, of course, $Q_\infty u = \xi $ and $Q_\infty v = \eta $. The above equation then becomes

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)^{-1}Q_\infty u,Q_\infty v\rangle _H= & {} \langle AS(t)^{-1}Q_\infty u, v_1\rangle _X +\langle u_1,AS(t)^{-1}Q_\infty v\rangle _X \\&+\langle B^*u_1,B^*v_1 \rangle _U \,. \end{aligned}$$

Observe now that, by Proposition C.1-(ii), we have $B^*u_0=B^*v_0=0$. In addition, using assumption (ii) and the fact that $H\subseteq [\ker Q_\infty )]^\perp $, we get $S(t)^{-1}Q_\infty u =S(t)^{-1}\xi \in S(t)^{-1}(Q_\infty (D(A^*))) \subseteq D((AQ_\infty )^*)$ so that, by Lemma 3.1-(iii), $AS(t)^{-1}Q_\infty u \in [\ker Q_\infty )]^\perp $. Hence, we may write, for all $u,v\in D(A^*)$,

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)^{-1}Q_\infty u,Q_\infty v\rangle _H= & {} \langle AS(t)^{-1}Q_\infty u, v\rangle _X +\langle u,AS(t)^{-1}Q_\infty v\rangle _X \\&+\langle B^*u,B^*v \rangle _U\,, \end{aligned}$$

i.e.,

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)^{-1}Q_\infty u,v\rangle _X= & {} \langle S(t)^{-1}Q_\infty u, A^*v\rangle _X +\langle A^*u,S(t)^{-1}Q_\infty v\rangle _X \\&+\langle B^*u,B^*v \rangle _U, \ \ \forall u,v\in D(A^*). \end{aligned}$$

This proves that $S(t)^{-1}Q_\infty $ solves the Lyapunov differential equation (24) in weak sense. Note that $S(t)^{-1}Q_\infty \in \mathcal{L}(X)$, since, using also Lemma 4.2-(iv),

$$\begin{aligned} \Vert S(t)^{-1}Q_\infty x\Vert _X= & {} \Vert Q^{1/2}_\infty S(t)^{-1}Q_\infty x\Vert _H\le \Vert Q^{1/2}_\infty \Vert _{\mathcal{L}(X)}\Vert S(t)^{-1}\Vert _{\mathcal{L}(H)}\Vert Q_\infty x\Vert _H \\\le & {} \Vert Q^{1/2}_\infty \Vert _{\mathcal{L}(X)}^2 \Vert S(t)^{-1}\Vert _{\mathcal{L}(H)} \Vert x\Vert _X\,, \end{aligned}$$

and it is selfadjoint, too, in view of

$$\begin{aligned} \langle S(t)^{-1}Q_\infty x,y\rangle _X= & {} \langle S(t)^{-1}Q_\infty x,Q_\infty y\rangle _H = \langle Q_\infty x,S(t)^{-1}Q_\infty y\rangle _H \\= & {} \langle x,S(t)^{-1}Q_\infty y\rangle _X \ \ \forall x,y\in X. \end{aligned}$$

Now we recall that by (35) it follows that $P_V(t_0)^{-1} x = Q_{t_0}Q_\infty ^{-1}x$ for every $x\in R(Q_\infty )$; then from the assumption $S(t_0)= P_V(t_0)$ we deduce

$$\begin{aligned} S(t_0)^{-1}Q_\infty z = P_V(t_0)^{-1}Q_\infty z= Q_{t_0}z \quad \forall z\in X. \end{aligned}$$

Hence, the operators $S(t)^{-1}Q_\infty $ and $Q_t$ solve the Lyapunov equation and coincide for $t=t_0$; thus, they must coincide in $[T_0, +\infty [$:

$$\begin{aligned} S(t)^{-1}Q_\infty = Q_t \qquad \forall t\ge T_0. \end{aligned}$$

Thus, for $x \in R(Q_\infty )$, i.e., $x= Q_\infty z$ with $z \in [\ker Q_\infty ]^\perp $, we may write

$$\begin{aligned} S(t)^{-1}x = S(t)^{-1} Q_\infty z = Q_t z=Q_t Q_\infty ^{-1}x = P_V(t)^{-1}x. \end{aligned}$$

By density, we get $S(t)^{-1}x = P_V(t)^{-1}x$ for every $x\in H$, and finally $S(t)z\equiv P_V(t)z$ for every $z\in H$. $\square $

4.5 The selfadjoint commuting case

We consider now the case where A is selfadjoint and commutes with $BB^*$. As a consequence, A commutes with $Q_\infty $ and is selfadjoint in H, too. More specifically, from Proposition C.1-(v) we know that $BB^*Q_\infty ^{-1}=-2A$; hence, in (38) the term

$$\begin{aligned} -\left\langle B^*Q_\infty ^{-1}P(t)x,B^*Q_\infty ^{-1}P(t)y \right\rangle _U= -\left\langle BB^*Q_\infty ^{-1}P(t)x,Q_\infty ^{-1}P(t)y \right\rangle _X \end{aligned}$$

can be simply rewritten as $2 \left\langle AP(t)x,Q_\infty ^{-1}P(t)y \right\rangle _X\,$; if in addition $AP(t)x\in H$, it just becomes $ \left\langle AP(t)x,P(t)y \right\rangle _H\,$. Similarly, if $Ax, Ay \in H$, in (38) the terms $\left\langle Ax, Q_\infty ^{-1} P(t)y\right\rangle _X$ and $\left\langle Q_\infty ^{-1}P(t)x,Ay\right\rangle _X$ can be rewritten as $\left\langle Ax, P(t)y\right\rangle _H$ and $\left\langle P(t)x,Ay\right\rangle _H\,$. Hence, in this case, we can rewrite (38) as

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\left\langle P(t)x,y \right\rangle _H = -\left\langle Ax, P(t)y\right\rangle _H - \left\langle P(t)x,Ay\right\rangle _H + 2 \left\langle AP(t)x,P(t)y \right\rangle _H. \end{aligned}$$

(40)

which makes sense for $x,y \in \overline{D}_P(t)$, where

$$\begin{aligned} \overline{D}_P(t):=\left\{ z \in D(A): \ Az \in H,\ P(t)z \in D(A),\ AP(t)z \in H \right\} . \end{aligned}$$

We give now three results about the solutions to this equation. Such result are proved without using the null controllability Assumption 2.4, mainly thanks to the fact, proved in Proposition C.2-(iii) that, in this case, we always have $R(Q_t) = R(Q_\infty )$ and $R(Q_t^{1/2}) = R(Q_\infty ^{1/2})$ for every $t>0$.

The first result (Theorem 4.14) is an existence result. The subsequent ones (Theorems 4.15 and 4.16) are uniqueness-type results.

Theorem 4.14

Assume that A is selfadjoint in X and commutes with $BB^*$; let $K\in \mathcal{L}(H)$ be selfadjoint in H, nonnegative, such that $AKA^{-1}\in \mathcal{L}(H)$. Let, moreover, $T_1\ge T_0$ be such that $(I-e^{tA}Ke^{tA})$ is invertible for each $t>T_1$. Then, $(I-e^{tA}Ke^{tA})^{-1}$ solves (40) in $\,]T_1,+\infty [\,$.

Proof

It is clear that $T_1$ exists, since $e^{tA}$ is of negative type. Consider the set

$$\begin{aligned} D=\{z\in D(A)\cap H: \ Az\in H\}; \end{aligned}$$

it is dense in H, since it contains $Q_\infty (D(A))=Q_\infty (D(A^*))$, which is dense in H by Lemma 4.3: indeed, if $x\in Q_\infty (D(A))$, then $x=Q_\infty z$ with $z\in D(A)$, so that $Ax=AQ_\infty z= Q_\infty Az\in H$. Then, setting $\overline{P}(t)=(I-e^{tA}Ke^{tA})^{-1}$, we can write for $x,y\in D$ and $t>T_1$

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t} \langle \overline{P}(t)x,y\rangle _H= & {} \frac{\mathrm {d}}{\mathrm {d}t} \langle (I-e^{tA}Ke^{tA})^{-1}x,y\rangle _H \\&= -\langle (I-e^{tA}Ke^{tA})^{-1} (-Ae^{tA}Ke^{tA}\\&\quad -e^{tA}KAe^{tA})(I-e^{tA}Ke^{tA})^{-1} x,y\rangle _H \\&= - \langle (I-e^{tA}Ke^{tA})^{-1} (-Ae^{tA}Ke^{tA}\\&\quad +A-e^{tA}Ke^{tA}A+A-2A)(I-e^{tA}Ke^{tA})^{-1} x,y\rangle _H \\&= - \langle (I-e^{tA}Ke^{tA})^{-1} (-A(e^{tA}Ke^{tA}-I)\\&\quad -(e^{tA}Ke^{tA}-I)A-2A)(I-e^{tA}Ke^{tA})^{-1} x,y\rangle _H \\&= - \langle (I-e^{tA}Ke^{tA})^{-1} Ax,y\rangle _H - \langle A(I-e^{tA}Ke^{tA})^{-1} x,y\rangle _H\\&\quad + \langle 2A(I-e^{tA}Ke^{tA})^{-1} x, (I-e^{tA}Ke^{tA})^{-1}y\rangle _H \\&= -\langle Ax, \overline{P}(t)y\rangle _H -\langle \overline{P}(t)x,Ay\rangle _H+\langle 2A\overline{P}(t)x,\overline{P}(t)y\rangle _H. \end{aligned}$$

This shows that $\overline{P}$ solves (40) with $\overline{D}_{\overline{P}}(t)=D$ for every $t>T_1$. Note that

$$\begin{aligned} \overline{P}(t) = (I-e^{(t-T_1)A}Le^{(t-T_1)A})^{-1}, \qquad L=e^{T_1A}Ke^{T_1A}=I-\overline{P}(T_1)^{-1}. \end{aligned}$$

$\square $

The second statement is a uniqueness result.

Theorem 4.15

Assume that A is selfadjoint in X and commutes with $BB^*$. Let, moreover, $S:[T^*,+\infty [\,\rightarrow \mathcal{L}(H)$ be a strongly continuous, selfadjoint, nonnegative operator, such that:

(i):: S(t) is invertible for $t\ge T^*$ and $S(T^*)=S^*$;
(ii):: S(t) solves (40) in $\,]T^*,+\infty [\,$. Then, $S(t)=(I-e^{(t-T^*)A}Le^{(t-T^*)A})^{-1}$ for every $t\ge T^*$, where $L=I-(S^*)^{-1}$.

Proof

Set again $D=\{z\in D(A)\cap H: \ Az\in H\}$ and define

$$\begin{aligned} U(t):= S(t)^{-1}, \qquad t\ge T^*. \end{aligned}$$

Obviously, $U(T^*)=(S^*)^{-1}$. Moreover, for every $t>T^*$ and x, y in the set $S(t)(\overline{D}_S(t))$, which is dense in H, we have by (ii)

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t} \langle U(t)x,y\rangle _H= & {} -\left[ \frac{\mathrm {d}}{\mathrm {d}t} \langle S(t)\xi ,\eta \rangle _H\right] _{\xi =U(t)x,\ \eta =U(t)y} \\= & {} \langle A\xi ,S(t)\eta \rangle _H +\langle S(t)\xi ,A\eta \rangle _H- 2\langle AS(t)\xi ,S(t)\eta \rangle _H \\= & {} \langle AU(t)x,y\rangle _H +\langle x,AU(t)y\rangle _H- 2\langle Ax,y\rangle _H \\= & {} \langle U(t)x,Ay\rangle _H +\langle Ax,U(t)y\rangle _H- 2\langle Ax,y\rangle _H \,. \end{aligned}$$

This is a linear equation, governed by the semigroup $P \mapsto e^{tA}Pe^{tA}$: by the variation of constants formula we have for each $x,y\in S(t)(\overline{D}_S(t))$

$$\begin{aligned} U(t)x= & {} e^{(t-T^*)A}U(T^*)e^{(t-T^*)A}x - 2\int _{T^*}^t e^{(t-s)A}Ae^{(t-s)A}x\,\mathrm {d}s \\= & {} e^{(t-T^*)A}(1-L)e^{(t-T^*)A}x +\int _{T^*}^t \frac{\mathrm {d}}{\mathrm {d}s} e^{2(t-s)A}x \,\mathrm {d}s \\= & {} e^{(t-T^*)A}(1-L)e^{(t-T^*)A}x -e^{2(t-T^*)A}x+x = (I-e^{(t-T^*)A}Le^{(t-T^*)A})x. \end{aligned}$$

By density this shows that

$$\begin{aligned} S(t)^{-1} = U(t)=I-e^{(t-T^*)A}Le^{(t-T^*)A}, \end{aligned}$$

which is our claim. $\square $

In the next result we look at non-invertible solutions obtained through projections.

Theorem 4.16

Assume that A is selfadjoint in X and commutes with $BB^*$; let $S:[T^*,+\infty [\,\rightarrow \mathcal{L}(H)$ be a strongly continuous, selfadjoint, nonnegative operator, which solves (40) in $\,]T^*,+\infty [\,$. Let, moreover, $P\in \mathcal{L}(H)$ be an orthogonal projection, such that $AP=PA$ and $S(t)P(D(A)\cap H) \subseteq D(A)$ for every $t>T^*$.

Then, PS(t)P solves (40) in $\,]T^*,+\infty [\,$ if and only if $S(t)P(D(A)\cap H)\subseteq R(P)$ for every $t>T^*$.

Proof

We start by observing that the existence of a projection P in H such that $AP=PA$ implies that A maps $D(A)\cap H$ into H: indeed if $z\in D(A)\cap H$ we have $Az = APz+A(I-P)z = PAz + (I-P)Az$ and both terms of the last member belong to H.

Suppose that

$$\begin{aligned} AP=PA, \qquad S(t)P(D(A)) \subseteq D(A)\cap R(P). \end{aligned}$$

(41)

As S(t) solves (40), we have for $x,y\in \overline{D}_S(t)$

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)x,y \rangle _H = -\langle Ax, S(t)y\rangle _H - \langle S(t)x,Ay\rangle _H + 2 \langle AS(t)x,S(t)y \rangle _H\,, \end{aligned}$$

(42)

where, as we know,

$$\begin{aligned} \overline{D}_S(t):=\{ z \in D(A)\cap H: \ Az \in H,\ S(t)z \in D(A),\ AS(t)z \in H \}. \end{aligned}$$

Now, if $z\in D$ (i.e., $z\in D(A)\cap H$ and $Az\in H$), then by (41), $Pz\in D(A)$ with $APz=PAz\in H$, and in addition $S(t)Pz\in R(P)\cap D(A)$, so that $AS(t)Pz=APS(t)Pz=PAS(t)Pz\in H$. Thus, $Pz\in D_S(t)$ for each $t>T^*$ and $z\in D$. Hence, setting

$$\begin{aligned} \overline{D}_{PSP}(t) = D \qquad \forall t>T^*, \end{aligned}$$

and replacing in (42) x, y by Px, Py, we have for every $x,y\in \overline{D}_{PSP}(t)$ and $t>T^*$

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)Px,Py \rangle _H= & {} -\langle APx, S(t)Py\rangle _H - \langle S(t)Px,APy\rangle _H \\&+ 2 \langle AS(t)Px,S(t)Py \rangle _H\,, \end{aligned}$$

i.e.,

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle PS(t)Px,y \rangle _H= & {} -\langle Ax, PS(t)Py\rangle _H - \langle PS(t)Px,Ay\rangle _H \\&+ 2 \langle AS(t)Px,S(t)Py \rangle _H\,. \end{aligned}$$

Now we remark that $S(t)Px=PS(t)Px$ and $S(t)Py=PS(t)Py$; hence, we obtain, for every $x,y\in \overline{D}_{PSP}(t)$ and $t>T^*$,

$$\begin{aligned} \frac{\mathrm {d}}{\mathrm {d}t}\langle PS(t)Px,y \rangle _H= & {} -\langle Ax, PS(t)Py\rangle _H - \langle PS(t)Px,Ay\rangle _H \\&+\, 2 \langle APS(t)Px,PS(t)Py \rangle _H\,. \end{aligned}$$

This shows that PS(t)P solves (40) in $\,]T^*,+\infty [\,$.

Suppose conversely that $P\in \mathcal{L}(H)$ is an orthogonal projection, such that $AP=PA$, $S(t)P(D(A)\cap H)\subseteq D(A)$ for every $t>T^*$ and PS(t)P solves (40) in $\,]T^*,+\infty [\,$. Assume by contradiction that for some $t>T^*$ there exists $v\in S(t)P(D(A)\cap H)\setminus R(P)$: we can write $v=S(t)Pz$ with $z\in D(A)\cap H$. Then, $w=(I-P)S(t)Pz$ belongs to $D(A)\cap R(P)^\perp $, $w\ne 0$ and $Aw=(I-P)AS(t)Pz \in R(P)^\perp $. As $\overline{D}_{PSP}(t)$ is dense in H, there exists $\{z_n\}\subset \overline{D}_{PSP}(t)$ such that $z_n \rightarrow z$ in H; then $w_n=(I-P)S(t)Pz_n \rightarrow w$ in H and consequently $w_n\ne 0$ for sufficiently large n.

Now by assumption we have for every $x,y\in \overline{D}_{PSP}(t)$

$$\begin{aligned}&\frac{\mathrm {d}}{\mathrm {d}t}\langle PS(t)Px,y \rangle _H +\langle Ax, S(t)y\rangle _H + \langle PS(t)Px,Ay\rangle _H \\&\quad - 2 \langle APS(t)Px,PS(t)Py \rangle _H = 0, \end{aligned}$$

whereas for every $x,y\in \overline{D}_S(t)$ it holds

$$\begin{aligned}&\frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)x,y \rangle _H +\langle Ax, S(t)y\rangle _H + \langle S(t)x,Ay\rangle _H \\&\quad - 2 \langle APS(t)Px,PS(t)Py \rangle _H = 0. \end{aligned}$$

We may choose $x=y=z_n$ in the first equation and $x=y=Pz_n$ in the second one: indeed, as $z_n\in \overline{D}_{PSP}(t)$, we have $Pz_n\in D(A)\cap H$ and $APz_n=PAz_n\in H$; hence, $S(t)Pz_n\in D(A)$ and consequently, as remarked at the beginning of the proof, $AS(t)Pz_n\in H$: this shows that $Pz_n\in \overline{D}_S(t)$. Thus, we get

$$\begin{aligned}&\frac{\mathrm {d}}{\mathrm {d}t}\langle PS(t)Pz_n,z_n \rangle _H +\langle Az_n, PS(t)Pz_n\rangle _H + \langle PS(t)Pz_n,Az_n\rangle _H \\&\quad - 2 \langle APS(t)Pz_n,PS(t)Pz_n \rangle _H = 0 \end{aligned}$$

and

$$\begin{aligned}&\frac{\mathrm {d}}{\mathrm {d}t}\langle S(t)Pz_n,Pz_n \rangle _H +\langle APz_n, S(t)Pz_n\rangle _H + \langle S(t)Pz_n,APz_n\rangle _H \\&\quad - 2 \langle AS(t)P^2z_n,S(t)P^2z_n \rangle _H = 0. \end{aligned}$$

The second equation can be rewritten as

$$\begin{aligned}&\frac{\mathrm {d}}{\mathrm {d}t}\langle PS(t)Pz_n,z_n \rangle _H+\langle Az_n, PS(t)Pz_n\rangle _H + \langle PS(t)Pz_n,Az_n\rangle _H \\&\quad - 2 \langle AS(t)Pz_n,S(t)Pz_n \rangle _H = 0. \end{aligned}$$

Subtracting the second equation from the first one, we get

$$\begin{aligned} \langle AS(t)Pz_n,S(t)Pz_n \rangle _H - \langle APS(t)Pz_n,PS(t)Pz_n \rangle _H = 0. \end{aligned}$$

On the other hand

$$\begin{aligned} 0= & {} \langle AS(t)Pz_n,S(t)Pz_n \rangle _H - \langle APS(t)Pz_n,PS(t)Pz_n \rangle _H\\= & {} \langle A(I-P)S(t)Pz_n,S(t)Pz_n \rangle _H + \langle APS(t)Pz_n,(I-P)S(t)Pz_n \rangle _H \\= & {} \langle A(I-P)S(t)Pz_n,(I-P)S(t)Pz_n \rangle _H +\langle PAS(t)Pz_n,(I-P)S(t)Pz_n \rangle _H \\= & {} \langle Aw_n,w_n \rangle _H + 0 = \langle Aw_n,w_n \rangle _H. \end{aligned}$$

Now we recall that A is of negative type and selfadjoint in H: thus, since $w_n\ne 0$,

$$\begin{aligned} \langle Aw_n,w_n \rangle _H = - \langle (-A)w_n,w_n \rangle _H = - \Vert (-A)^{1/2}w_n\Vert _H^2 <0: \end{aligned}$$

this is a contradiction. $\square $

5 Examples

5.1 Delay state equation

Consider the following linear controlled delay equation

$$\begin{aligned} \left\{ \begin{array} [c]{l} x'(t) =a_0x(t)+ a_1x(t-d)+ b_0 u(t) ,\quad t\in [ 0,T] \\ x(0)=x_0, \quad x(s)=x_1(s), \; s \in [-d,0[\,, \end{array} \right. \end{aligned}$$

(43)

where the initial datum $(x_0,x_1)$ is in ${\mathbb R}\times L^2(-d,0;{\mathbb R})$, the control u belongs to $L^2(0,T;{\mathbb R})$, and the coefficients $a_0$, $a_1$, $b_0$ are real numbers with $a_1\ne 0$ and $b_0\ne 0$ to avoid degeneracy. We call $x(\cdot \,;(x_0,x_1),u)$ the unique solution which always exists (see, for example, [2, Chapter 4]). Using a standard approach (see, for example, again [2, Chapter 4]), we reformulate equation (43) as an abstract differential equation in the Hilbert space $\mathcal{H}={\mathbb R}\times L^2(-d,0;{\mathbb R})$. To this end we introduce the operator $A : \mathcal{D}(A) \subset \mathcal{H}\rightarrow \mathcal{H}$ as follows:

$$\begin{aligned} \left\{ \begin{array}{l} \mathcal{D}(A)=\left\{ (x_0,x_1)\in \mathcal{H}:\ y_1\in W^{1,2}([-d,0],{\mathbb R}),\ x_1(0)=x_0 \right\} , \\ A(x_0 ,x_1 )= ( a_0 x_0+ a_1x_1(-d), x_1'). \end{array} \right. \end{aligned}$$

(44)

We denote by $e^{tA}$ the $C_0$-semigroup generated by A: for $x=(x_0,x_1)\in \mathcal{H}$,

$$\begin{aligned} e^{tA} \left( x_0, x_1\right) = \left( x(t;(x_0,x_1),0),x(t+\cdot ;(x_0,x_1),0)\right) \in \mathcal{H}. \end{aligned}$$

(45)

The control operator B is bounded and defined as

$$\begin{aligned} B:{\mathbb R}\rightarrow \mathcal{H},\qquad Bu=(b_0 u,0), \quad u\in {\mathbb R}. \end{aligned}$$

(46)

In this setup, equation (43) is equivalent (in the sense that the first component of y is the solution of (43)) to the equation in $\mathcal{H}$:

$$\begin{aligned} y'(t)=Ay(t)+Bu(t),\quad y(0)=(x_0,x_1) \in \mathcal{H}. \end{aligned}$$

For this system the null controllability Assumption 2.4 holds for any $T_0>r$, see, for example, [13, Theorem 10.2.3] or [22]. Hence, Theorems 4.12 and 4.13 hold in this case.

Now we compute the adjoints and the controllability operator. We denote by $A^*$ the adjoint operator of A:

$$\begin{aligned} \left\{ \begin{array}{l} \mathcal{D}(A^*)=\left\{ (x_0,x_1)\in \mathcal{H}:\ y_1\in W^{1,2}([-d,0],{\mathbb R}),\ x_1(-d)=a_1x_0 \right\} , \\ A^*(x_0 ,x_1 )=( a_0 x_0 +x_1(0), -x_1'). \end{array} \right. \end{aligned}$$

(47)

Similarly, denoting by $e^{tA^*}=(e^{tA})^*$ the $C_0$-semigroup generated by $A^*$, we have for $(x_0,x_1)\in \mathcal{H}$

$$\begin{aligned} e^{tA^*} (x_0,x_1)= \left( x(t;(y_0,y_1),0),x(t+\cdot ;(y_0,y_1),0)\right) \in \mathcal{H}\end{aligned}$$

(48)

where

$$\begin{aligned} y_0=x_0, \quad and \quad y_1(r)=a_1^{-1}x_1(-d-r), \quad r \in \,]-d,0]. \end{aligned}$$

(49)

The adjoint of the control operator is

$$\begin{aligned} B^*:\mathcal{H}\rightarrow {\mathbb R},\qquad B^*(x_0,x_1)=b_0 x_0, \quad \forall (x_0,x_1)\in \mathcal{H}. \end{aligned}$$

(50)

It follows that

$$\begin{aligned} BB^*e^{tA^*}(x_0,x_1)= b_0^2 \left( x(t;(y_0,y_1),0),0 \right) \end{aligned}$$

where $(y_0,y_1)$ is as in (49). Hence, by linearity of (43) we can write

$$\begin{aligned} e^{tA}BB^*e^{tA^*}(x_0,x_1)= b_0^2 x(t;(y_0,y_1),0) \left( g(t),g(t+\cdot ) \right) \end{aligned}$$

where again $(y_0,y_1)$ is as in (49) and $g(t)=x(t;(1,0),0)$ (which is a given piecewise polynomial function that may be computed recursively). We can then finally write, for $(x_0,x_1)\in \mathcal{H}$,

$$\begin{aligned} Q_t (x_0,x_1)= b_0^2 \left( \int _0^t x(s;(y_0,y_1),0) g(s)\mathrm {d}s, \int _0^t x(s;(y_0,y_1),0) g(s+\cdot )\mathrm {d}s\right) \in \mathcal{H}\end{aligned}$$

(51)

where $(y_0,y_1)$ is as in (49). It is not obvious to compute $R(Q_t)$ and $R(Q_t^{1/2})$. However, we can at least say that $R(Q_t) \subseteq D(A)$: indeed the boundary condition $x_0=x_1(0)$ is obviously satisfied for all elements of $R(Q_t)$ by continuity of translations in $L^2$; on the other hand the second element of $Q_t(x_0,x_1)$ belongs to $W^{1,2}([-d,0],{\mathbb R})$ by direct verification simply using the continuity of $x(s;(y_0,y_1),0)$.

Hence, the sets $D_P(t)$ and $D_R(t)$ in Theorem 4.12 are equal to $R(Q_t)$ in this case.

5.2 Diagonal cases

Let $\{e_n\}_{n\in {\mathbb N}}$ be a complete orthonormal system in the Hilbert space X, and let $\{\lambda _n\}_{n\in {\mathbb N}}$ be a strictly increasing sequence of strictly positive numbers such that $\lambda _n \rightarrow +\infty $ as $n\rightarrow +\infty $. We define on the space X the semigroup

$$\begin{aligned} S(t)=\sum _{n\in {\mathbb N}} e^{-\lambda _nt}\langle x,e_n\rangle _X\, e_n\, ,\quad t\ge 0. \end{aligned}$$

It is easily verified that S is an analytic semigroup of negative type $-\omega $, where $\omega =\min _{n\in {\mathbb N}} \lambda _n=\lambda _0>0$, with norm $\Vert S_(t)\Vert _{\mathcal{L}(X)} = e^{-\omega t}$. Its generator is the self-adjoint, dissipative, densely defined operator $A:D(A)\subset X\rightarrow X$, given by

$$\begin{aligned} \left\{ \begin{array}{l} D(A) = \left\{ x\in X: \sum _{n\in {\mathbb N}} \lambda _n^2 \langle x,e_n \rangle _X^2 <+\infty \right\} \\ Ax= -\sum _{n\in {\mathbb N}} \lambda _ n\langle x,e_n \rangle _X \, e_n \end{array} \right. \end{aligned}$$

(52)

(see [29, pp. 178 and 198]). Note that $0\in \rho (A)$ and that $A^{-1}$ is selfadjoint and compact.

As A is dissipative, the fractional powers $(-A)^\alpha $ of $-A$ are well defined (see [2, Proposition 6.1, p. 113]).

Concerning the operator B, we assume that $B:U\rightarrow X$ is such that $BB^*$ is diagonal in X:

$$\begin{aligned} BB^*e_n = b_ne_n \quad \forall n\in {\mathbb N}, \end{aligned}$$

with $b_n\ge 0$ for all $n\in {\mathbb N}$. By Assumption 2.1 B is bounded; hence, the sequence $\{b_n\}$ must be bounded, too. However, here we generalize a bit the setting, allowing $BB^*$ to be unbounded. Since $S(t)=e^{tA}$ commutes with $BB^*$ we have, see (59),

$$\begin{aligned} Q_t x= & {} \int _0^t e^{2sA}BB^*x \,\mathrm {d}s = \frac{1}{2} A^{-1}(e^{2tA}-I)BB^*x, \quad \forall t>0, \qquad \\ Q_\infty x= & {} -\frac{1}{2} A^{-1}BB^*x; \end{aligned}$$

in particular, for $t>0$,

$$\begin{aligned} Q_t e_n = \frac{1}{2\lambda _n}(1-e^{-2\lambda _nt}) b_n e_n, \quad Q_\infty e_n = \frac{1}{2\lambda _n} b_n e_n \qquad \forall n\in {\mathbb N}. \end{aligned}$$

(53)

Thus, if $BB^*$ is possibly unbounded, we need to assume

$$\begin{aligned} \sup _{n\in {\mathbb N}} \frac{b_n}{\lambda _n} <+\infty \end{aligned}$$

(54)

in order that $Q_t,Q_\infty \in \mathcal{L}(X)$ for all $t>0$. The null controllability holds for a given $t>0$ if and only if there exists $c_t>0$ such that

$$\begin{aligned} \Vert S(t)x\Vert _X^2\le c_t \langle Q_t x,x\rangle _X \qquad \forall x\in X. \end{aligned}$$

This is equivalent to

$$\begin{aligned} e^{-2\lambda _n t} \le c_t \frac{b_n}{2\lambda _n}(1-e^{-2\lambda _nt}) \qquad \forall n\in {\mathbb N}. \end{aligned}$$

Hence, Assumption 2.4 holds for every $T_0>0$ if and only if $b_n>0$ for every $n\in {\mathbb N}$ and

$$\begin{aligned} \sup _{n \in {\mathbb N}} \frac{2\lambda _n}{b_n(e^{2\lambda _n t}-1)}< + \infty \qquad \forall t>0. \end{aligned}$$

Now, we look at $R(Q_\infty )$ and $R(Q_\infty ^{1/2})$ (observe that, by Proposition C.2-(iii), these are equal to $R(Q_t)$ and $R(Q_t^{1/2})$ for all $t>0$). By (53) it is clear that $R(Q_\infty )\subseteq R(BB^*)$ and $R(Q_\infty ^{1/2})\subseteq \overline{R(BB^*)}$.

If $b_n\ne 0$ only for a finite number of $n\in {\mathbb N}$ then, clearly, $R(Q_\infty )=R(Q_\infty ^{1/2})=R(BB^*)\subseteq D(A)$. In this case the RE is substantially finite dimensional: the function $t\rightarrow Q_t^{-1}$ is a solution on $D_P(t)=D_R(t)=R(BB^*)$ and, by Theorem 4.16, $P Q_t^{-1}P$ is a solution for every projection generated by some elements of the basis $\{e_n\}$.
If $b_n\ne 0$ for every $n\in {\mathbb N}_1$, where ${\mathbb N}_1$ is an infinite subset of ${\mathbb N}$, then, clearly,
$$\begin{aligned} R(Q_\infty ) = \left\{ z\in R(BB^*): \ \left\{ \frac{\lambda _n}{b_n}\langle z,e_n\rangle _X\right\} _{n\in {\mathbb N}_1} \in \ell ^2\right\} . \end{aligned}$$
In this case the RE is infinite dimensional. Again the function $t\rightarrow Q_t^{-1}$ is a solution on $D_P(t)=D_R(t)=D(A)\cap R(Q_\infty )$ and, by Theorem 4.16, $P Q_t^{-1}P$ is a solution for every projection generated by some elements of the basis $\{e_n\}$.

We now look closely at the second case above, when ${\mathbb N}_1={\mathbb N}$. First, if $BB^*$ is bounded, i.e., $b=\{b_n\}_{n\in {\mathbb N}}\in \ell ^\infty $, we have $R(Q_\infty )\subseteq D(A)$ and, similarly $R(Q_\infty ^{1/2})\subseteq D(A^{1/2})$. On the other hand, if, for some $\delta >0$, we have $b_n \ge \delta $ for all $n\in {\mathbb N}$, then $R(Q_\infty )\supseteq D(A)$ and $R(Q_\infty ^{1/2})\supseteq D(A^{1/2})$.

Thus, if both $BB^*$ and $(BB^*)^{-1}$ are bounded we have $R(Q_\infty )= D(A)$ and $R(Q_\infty ^{1/2})=D(A^{1/2})$.

Finally, if $b_n=\lambda _n^\alpha $ for every $n\in {\mathbb N}$, with $\alpha \in {\mathbb R}$, then $R(Q_\infty )=D(A^{1-\alpha })$ and $R(Q_\infty ^{1/2})=D(A^{\frac{1-\alpha }{2}})$.

Now we consider a special case which fits into the application studied, for example, in [6] in the case of the Landau–Ginzburg model. We take $X=H^{-1}(0,\pi ;{\mathbb R})$ and A the Laplacian in X with Dirichlet boundary conditions. We also take $U=X$ and $B=I$. Using what said just above we see that $R(Q_\infty )=D(A)=H^{1}_0(0,\pi ;{\mathbb R})$ and $H=R(Q_\infty ^{1/2})=D(A^{1/2})=L^{2}(0,\pi ;{\mathbb R})$.

Notes

More precisely in such case the linear part generates a semigroup (namely $P\mapsto e^{tA}Pe^{tA^*}$) which is not a group: such semigroup property is then lost when the sign changes.
We thank prof. R. Vinter for providing us these references.
From now on we will denote by R(F) the image of the operator F.
See Definition 4.10 for the formal definition of solution.
In this context the operator AK may be defined on a set Y strictly larger than D(A) and in this case, in addition, the operator KA can be extended to all of Y. An obvious example of this situation occurs when K is a resolvent of A.
We are indebted to Giorgio Fabbri for this example.

References

Barucci E, Gozzi F (2001) Technology adoption and accumulation in a vintage capital model. J Econ 74:1–38
Article MATH Google Scholar
Bensoussan A, Da Prato G, Delfour MC, Mitter SK (2007) Representation and control of infinite dimensional systems, 2nd edn. Birkhäuser, Boston
Book MATH Google Scholar
Bertini L, De Sole A, Gabrielli D, Jona-Lasinio G, Landim C (2001) Fluctuations in stationary nonequilibrium states of irreversible processes. Phys Rev Lett 87:040601
Article MathSciNet Google Scholar
Bertini L, De Sole A, Gabrielli D, Jona-Lasinio G, Landim C (2002) Macroscopic fluctuation theory for stationary non equilibrium state. J Stat Phys 107:635–675
Article MathSciNet MATH Google Scholar
Bertini L, De Sole A, Gabrielli D, Jona-Lasinio G, Landim C (2003) Large deviations for the boundary driven simple exclusion process. Math Phys Anal Geom 6:231–267
Article MathSciNet MATH Google Scholar
Bertini L, De Sole A, Gabrielli D, Jona-Lasinio G, Landim C (2004) Minimum dissipation principle in stationary non-equilibrium states. J Stat Phys 116:831–841
Article MathSciNet MATH Google Scholar
Bertini L, De Sole A, Gabrielli D, Jona-Lasinio G, Landim C (2011) Action functional and quasi-potential for the Burgers equation in a bounded interval. Commun Pure Appl Math 64:649–696
Article MathSciNet MATH Google Scholar
Curtain RF (1984) Linear-quadratic control problem with fixed endpoints in infinite dimensions. J Optim Theor Appl 44:55–74
Article MathSciNet MATH Google Scholar
Curtain RF, Pritchard AJ (1976) The infinite dimensional Riccati equation for systems defined by evolution operators. SIAM J Control Optim 14:951–983
Article MathSciNet MATH Google Scholar
Curtain RF, Pritchard AJ (1978) Infinite dimensional linear systems theory. Springer, Berlin
Book MATH Google Scholar
Curtain RF, Zwart HJ (1995) An introduction to infinite-dimensional linear systems theory. Springer, New York
Book MATH Google Scholar
Da Prato G, Pritchard AJ, Zabczyk J (1991) On minimum energy problems. SIAM J Control Optim 29(1):209–221
Article MathSciNet MATH Google Scholar
Da Prato G, Zabczyk J (1996) Ergodicity for infinite dimensional systems. Cambridge University Press, Cambridge
Book MATH Google Scholar
Da Prato G, Zabczyk J (2014) Stochastic equations in infinite dimensions, 2nd edn. Cambridge University Press, Cambridge
Book MATH Google Scholar
Emirsajlow Z (1989) A feedback for an infinite-dimensional linear-quadratic control problem with a fixed terminal state. IMA J Math Control Inf 6:97–117
Article MathSciNet MATH Google Scholar
Emirsajlow Z (1993) Feedback approximation of the minimum energy control for linear infinite-dimensional systems with zero terminal state. J Optim Theory Appl 78:337–363
Article MathSciNet MATH Google Scholar
Engel K-J, Nagel R (1999) One parameter semigroups for linear evolution equations. Springer, New York
MATH Google Scholar
Feng J, Kurtz T (2006) Large deviations for stochastic processes. Mathematical Surveys and Monographs, AMS, Providence
Book MATH Google Scholar
Gozzi F, Loreti P (1999) Regularity of the minimum time function and minimum energy problems: the linear case. SIAM J Control Optim 37:1195–1221
Article MathSciNet MATH Google Scholar
Lunardi A (1995) Analytic semigroups and optimal regularity in parabolic problems. Birkhäuser Verlag, Basel
Book MATH Google Scholar
Moore BC (1981) Principal component analysis in linear systems: controllability, observability, and model reduction. IEEE Trans Autom Control 26(1):17–32
Article MathSciNet MATH Google Scholar
Olbrot AW, Pandolfi L (1988) Null controllability of a class of functional differential systems. Int J Control 47:193–208
Article MathSciNet MATH Google Scholar
Pazy A (1983) Semigroups of linear operators and applications to partial differential equations. Springer, New York
Book MATH Google Scholar
Priola E, Zabczyk J (2003) Null controllability with vanishing energy. SIAM J Control Optim 42:1013–1032
Article MathSciNet MATH Google Scholar
Scherpen JMA (1993) Balancing for nonlinear systems. Syst Control Lett 2:143–153
Article MathSciNet MATH Google Scholar
Staffans O (2005) Well-posed linear systems. Cambridge University Press, Cambridge
Book MATH Google Scholar
Weiss G, Staffans O, Tucsnak M (2001) Well-posed linear systems—a survey with emphasis on conservative systems. Int J Appl Math Comput Sci 11:7–33
MathSciNet MATH Google Scholar
Willems JC (Dec. 1971) Least squares stationary optimal control and the algebraic Riccati equation. IEEE Trans Autom Control AC-16, No. 6
Zabczyk J (1995) Mathematical control theory: an introduction. Birkhäuser, Boston
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Matematica, Università di Pisa, Pisa, Italy
P. Acquistapace
Dipartimento di Economia e Finanza, Università LUISS - Guido Carli, Roma, Italy
F. Gozzi

Authors

P. Acquistapace
View author publications
You can also search for this author in PubMed Google Scholar
F. Gozzi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to P. Acquistapace.

Appendices

Appendix

A: Pseudoinverses

We recall here two well-known results of functional analysis that will be very useful in the sequel.

Given a linear operator $F: X \rightarrow Y$, where X and Y are Hilbert spaces, we define, as in [29, p. 209] (see also [14, p. 429]), the pseudoinverse $F^{-1}$ of F as the linear operator

$$\begin{aligned} F^{-1}: D(F^{-1})\subset Y \rightarrow X, \end{aligned}$$

with domain $D(F^{-1})=R(F)$, where $F^{-1}y$ is the element of $F^{-1}(\{y\})$ with minimal norm. Note that $R(F^{-1})= (\ker F)^\perp $.

We have the following result, taken from [14, Proposition B.1, p.429].

Proposition A.1

Let $E,E_1,E_2$ be three Hilbert spaces, let $A_1: E_1 \rightarrow E$, $A_2:E_2 \rightarrow E$ be linear operators, let $A_1^*:E \rightarrow E_1$ and $A_2^*:E \rightarrow E_2$ be their adjoints and finally let $A_1^{-1}:D(A_1^{-1})\subseteq E_1 \rightarrow E$, $A_2^{-1}:D(A_2^{-1})\subseteq E_2 \rightarrow E$ be the respective pseudoinverses. Then, we have:

(i)
$R(A_{1})\subseteq R(A_{2})$ if and only if there exists a constant $k>0$ such that
$$\begin{aligned} \Vert A_{1}^{ \star }x\Vert _{E_1}\le k\Vert A_{2}^{ \star }x\Vert _{E_2} \qquad \forall x \in E. \end{aligned}$$
(ii)
If
$$\begin{aligned} \Vert A_{1}^{ \star }x\Vert _{E_1}=\Vert A_{2}^{\star }x\Vert _{E_2} \qquad \forall x \in E, \end{aligned}$$
then $R(A_{1})$=$R(A_{2})$, $R(A_{1}^{-1})=R(A_{2}^{-1})$ and
$$\begin{aligned} \Vert A_{1}^{-1}x\Vert _{E_1}=\Vert A_{2}^{-2}x\Vert _{E_2}\qquad \forall x \in R(A_{1}). \end{aligned}$$

B: Some properties of commuting operators

Given a real separable Hilbert space X, let $A:D(A)\subseteq X \rightarrow X$ be a generator of a strongly continuous semigroup $e^{tA}$ and, for any $\lambda \in \rho (A)$, denote by $R(\lambda ,A)$ the resolvent operator $(\lambda - A)^{-1}$.

Definition B.1

Consider an operator $K \in \mathcal{L}(X)$. We say that K commutes with A if, for all $x \in D(A)$ we have $Kx \in D(A)$ and $AKx=KAx$. In particular, this means that K maps D(A) into itself.^{Footnote 5}

The following result is known, but for the reader’s convenience, we provide the complete proof as we could not find it in the literature.

Lemma B.2

Let X be a Hilbert space, $K\in \mathcal{L}(X)$, and $A:D(A)\subseteq X \rightarrow X$ be the generator of a strongly continuous semigroup $e^{tA}$. The following statements are equivalent:

(i)
There exists $\lambda _0 \in \rho (A)$ such that $R(\lambda _0,A)K=KR(\lambda _0,A)$.
(ii)
For every $\lambda \in \rho (A)$ it holds $R(\lambda ,A)K=KR(\lambda ,A)$.
(iii)
K commutes with A.
(iv)
$K^*$ commutes with $A^*$.
(v)
For all $t>0$ we have $e^{tA}K=Ke^{tA}$.

Proof

(i) $\Longleftrightarrow $ (ii).

We only prove (i) $\Longrightarrow $ (ii), as the other direction is obvious. Let $\lambda \in \rho (A)$. Then,
$$\begin{aligned} KR(\lambda ,A) - R(\lambda ,A)K= & {} K[R(\lambda ,A)-R(\lambda _0,A)] + [KR(\lambda _0,A)\\&-R(\lambda _0,A)K]+[R(\lambda _0,A)-R(\lambda ,A)]K, \end{aligned}$$
and so, using the so-called resolvent identity, and the fact that K commutes with $R(\lambda _0,A)$,
$$\begin{aligned} KR(\lambda ,A) - R(\lambda ,A)K= & {} K(\lambda _0- \lambda )R(\lambda ,A)R(\lambda _0,A)\\&-(\lambda _0- \lambda )R(\lambda ,A)R(\lambda _0,A)K \\= & {} (\lambda _0- \lambda )R(\lambda _0,A)[KR(\lambda ,A) - R(\lambda ,A)K]. \end{aligned}$$
Then, it follows that
$$\begin{aligned}{}[I - (\lambda _0- \lambda )R(\lambda _0,A)][KR(\lambda ,A) - R(\lambda ,A)K]=0. \end{aligned}$$
Since $I - (\lambda _0- \lambda )R(\lambda _0,A)=(\lambda - A)R(\lambda _0,A)$ we can apply $R(\lambda ,A)$ to both sides of the above equality which, thanks to the injectivity of $R(\lambda ,A)$, is equivalent to
$$\begin{aligned} R(\lambda _0,A)[KR(\lambda ,A) - R(\lambda ,A)K]=0; \end{aligned}$$
this, using the injectivity of $R(\lambda _0,A)$, gives the claim.
(ii) $\Longrightarrow $ (iii).

For sufficiently large $n \in {\mathbb N}$, consider $A_n:= nAR(n,A)=-n + n^2 R(n,A)$, the Yosida approximants of A. By (iii) we immediately have $KA_n x=A_nKx$ for all $x \in X$. Let now $x \in D(A)$. By the properties of Yosida approximants [23] we have $A_nx \rightarrow Ax$ and $x_n:=nR(n,A)Kx \rightarrow Kx$ as $n \rightarrow + \infty $.

Now $Ax_n=A_nKx=KA_nx\rightarrow KAx$ as $n \rightarrow + \infty $. Since A is closed, we have $Kx \in D(A)$ and $AKx=KAx$, which is the claim.
(iii) $\Longrightarrow $ (ii).

Let $\lambda \in \rho (A)$. We have, for $x \in X$,
$$\begin{aligned} (\lambda -A) [R(\lambda ,A)K-KR(\lambda ,A)]x =Kx - \lambda K R(\lambda ,A)x + AK R(\lambda ,A)x \end{aligned}$$
and, since A commutes with K,
$$\begin{aligned} =Kx - K\lambda R(\lambda ,A)x + KA R(\lambda ,A)x =K[I-\lambda R(\lambda ,A)+A R(\lambda ,A)]x=0. \end{aligned}$$
By the injectivity of $\lambda -A$ this implies that $R(\lambda ,A)Kx-KR(\lambda ,A)x=0$.
(iii) $\Longleftrightarrow $ (iv).

We only prove (iii) $\Longrightarrow $ (iv), as the other direction follows simply by taking the adjoints and using the relations $A^{**}=A$ and $K^{**}=K$.

Let $x\in D(A)$ and $y \in D(A^*)$. Then,
$$\begin{aligned} \left\langle KAx,y\right\rangle =\left\langle Ax,K^*y\right\rangle \end{aligned}$$
and also, since K commutes with A,
$$\begin{aligned} \left\langle KAx,y\right\rangle =\left\langle AKx,y\right\rangle =\left\langle Kx,A^*y\right\rangle =\left\langle x,K^*A^*y\right\rangle . \end{aligned}$$
From the two above it follows that
$$\begin{aligned} |\left\langle Ax,K^*y\right\rangle |=|\left\langle x,K^*A^*y\right\rangle | \le C |x| \end{aligned}$$
for some $C>0$. This means that $K^*y\in D(A^*)$ and that
$$\begin{aligned} A^*K^*y=(KA)^*y=(AK)^*y =K^*A^*y \end{aligned}$$
which is the claim.
(ii) $\Longrightarrow $ (v).

We know that, for all $x \in X$, $A_nK x=KA_n x$ and $e^{tA_n}x=\sum _{p=0}^{+\infty }\frac{t^pA_n^p}{p!}x$; hence, we get
$$\begin{aligned} Ke^{tA_n}x=K\sum _{p=0}^{+\infty }\frac{t^pA_n^p }{p!}x =\sum _{p=0}^{+\infty }K\frac{t^pA_n^p }{p!}x =\sum _{p=0}^{+\infty }\frac{t^pA_n^p }{p!}Kx =e^{tA_n}Kx. \end{aligned}$$
We now let $n \rightarrow +\infty $ and use the fact that, by the properties of Yosida approximants [23], $e^{tA_n}x \rightarrow e^{tA}x$ for all $x \in X$, as $n \rightarrow + \infty $. This implies the claim.
(v) $\Longrightarrow $ (ii).

We know [23] that, for all sufficiently large $\lambda \in \rho (A)$ and for all $x \in X$,
$$\begin{aligned} R(\lambda , A)x=\int _0^{+\infty }e^{-\lambda t} e^{tA}x\, \mathrm {d}t. \end{aligned}$$
Then,
$$\begin{aligned} KR(\lambda , A)x= & {} K\int _0^{+\infty }e^{-\lambda t} e^{tA}x\, \mathrm {d}t = \int _0^{+\infty }K e^{-\lambda t} e^{tA}x\, \mathrm {d}t \\= & {} \int _0^{+\infty } e^{-\lambda t} e^{tA}Kx\, \mathrm {d}t = R(\lambda , A)Kx \end{aligned}$$
and the claim follows.

$\square $

We will need also the following result on pseudoinverses.

Lemma B.3

Let E be a Hilbert space and let $A_1, A_2 \in \mathcal{L}(E)$ be such that $A_1A_2=A_2A_1$, $A_2$ is selfadjoint and $R(A_1)\subseteq {R(A_2)}$. Then, denoting by $A_2^{-1}$ the pseudoinverse of $A_2\,$, the two operators

$$\begin{aligned} A_1 A_2^{-1}:R(A_2) \rightarrow E \qquad \text {and} \qquad A_2^{-1}A_1 :E \rightarrow E \end{aligned}$$

coincide on $R(A_2)$; hence, in particular, $A_1 A_2^{-1}$ can be extended to all of E.

Proof

Take $z \in R(A_2)$ and set

$$\begin{aligned} v:=A_1 A_2^{-1}z, \qquad u:=A_2^{-1}A_1z. \end{aligned}$$

Applying $A_2$ we get

$$\begin{aligned} A_2v=A_2A_1 A_2^{-1}z=A_1 A_2A_2^{-1}z=A_1 z, \qquad A_2u=A_2 A_2^{-1}A_1z=A_1 z \end{aligned}$$

where in the first equality we have used the commuting assumption. This means that $A_2(u-v)=0$, i.e., $u-v\in \ker A_2\,$. Now by the definition of pseudoinverse we have $u \in (\ker A_2)^\perp $, while $v\in R(A_1)\subseteq R(A_2) \subseteq (\ker A_2)^\perp $, since $A_2$ is selfadjoint. Hence, it must be $u-v=0$ and the result follows. $\square $

C: Controllability operators and minimum energy

Following [29, p. 209], we collect some basic properties of the controllability operators $Q_t$ defined in (8):

Proposition C.1

Let $Q_t$ be defined by (8).

(i)
The operator $Q_t$ is linear, bounded, selfadjoint and nonnegative.
(ii)
For $0\le s \le t \le +\infty $ it holds
$$\begin{aligned} \ker Q_t \subseteq \ker Q_s \subseteq \ker B^*=\ker BB^* \end{aligned}$$
(55)
and each inclusion becomes an equality when $BB^*$ and A commute.
(iii)
For $0\le s \le t \le +\infty $,
$$\begin{aligned}{}[\ker Q_t]^\perp \supseteq [\ker Q_s]^\perp \supseteq [\ker B^*]^\perp = [\ker BB^*]^\perp , \end{aligned}$$
(56)
so that
$$\begin{aligned} \overline{R(Q_t)} \supseteq \overline{R(Q_s)} \supseteq \overline{R(B)}=\overline{R(BB^*)}, \end{aligned}$$
(57)
and each inclusion becomes an equality when $BB^*$ and A commute.
(iv)
For $0\le t \le \tau \le +\infty $ we have
$$\begin{aligned} Q_\tau = Q_t + e^{tA}Q_{\tau -t}e^{tA^*}. \end{aligned}$$
(58)
(v)
Finally, if A is selfadjoint, and A and $BB^*$ commute, we have for all $x \in X$
$$\begin{aligned} Q_t x= \frac{1}{2} A^{-1}\left( e^{2tA}- I\right) BB^* x, \quad t>0; \qquad \qquad Q_\infty x= -\frac{1}{2} A^{-1}BB^* x. \end{aligned}$$
(59)
This, in particular, implies that for every $t\in [0,+\infty ]$ the operator $Q_t$ commutes with A and that
$$\begin{aligned} 2Ay =-BB^*Q_\infty ^{-1}y \qquad \forall y\in R(Q_\infty ) \subseteq D(A). \end{aligned}$$
(60)

Proof

The statement (i) is immediate by definition of $Q_t$.

We prove now (ii). Indeed, for every $t \in [0,+\infty ]$, since $Q_t$ is selfadjoint we have

$$\begin{aligned} \begin{array}{rrcll} Q_t x=0 &{} \Longleftrightarrow &{} \langle Q_t x,x \rangle _X =0 &{} \Longleftrightarrow &{} \displaystyle \int _0^t \Vert B^*e^{rA^*}x\Vert _U^2\,\mathrm {d}r=0 \\ &{} &{} &{} \Longleftrightarrow &{} \Vert B^*e^{rA^*}x\Vert _U=0 \quad \text {for a.e. } r \in [0,t]. \end{array} \end{aligned}$$

The above immediately gives $\ker Q_t \subseteq \ker Q_s $ when $s \le t$. Moreover, since $r \rightarrow \Vert B^*e^{rA^*}x\Vert _U$ is continuous, this function is identically 0, so the last implies $\Vert B^*x\Vert _U=0$. Finally, since $BB^*$ is selfadjoint then $BB^*x=0$ is equivalent to $B^*x=0$.

If A and $BB^*$ commute then, by Lemma B.2 also $A^*$ and $BB^*$ commute and so also $e^{tA^*}$ and $BB^*$. It follows that, if $BB^*x=0$ then, for all $t>0$,

$$\begin{aligned} \left\langle Q_t x,x\right\rangle _X=\int _0^t\left\langle BB^*e^{rA^*}x,e^{rA^*}x\right\rangle _X\,\mathrm {d}r =\int _0^t\left\langle e^{rA^*}BB^*x,e^{rA^*}x\right\rangle _X\,\mathrm {d}r=0, \end{aligned}$$

which gives the claim.

Concerning (iii), (56), as well as (57), easily follow from (55).

The statement (iv) follows by writing

$$\begin{aligned} Q_\tau x=\int _0^\tau e^{rA}BB^*e^{rA^*}\, \mathrm {d}r= \int _0^t e^{rA}BB^*e^{rA^*}\mathrm {d}r + \int _t^\tau e^{rA}BB^*e^{rA^*}\, \mathrm {d}r \end{aligned}$$

and then changing variable in the second integral.

The statement (v) follows since in this case, by Lemma B.2, $e^{tA}$ commutes with $BB^*$. Hence,

$$\begin{aligned} Q_t x=\int _0^t e^{rA}BB^*e^{rA}x\, \mathrm {d}r= \int _0^t e^{2rA}BB^*x \,\mathrm {d}r \end{aligned}$$

and (59) follows by standard integration of semigroups. Concerning the commutativity of $Q_t$ and A we first observe that, by (59) we have $R(Q_t) \subseteq D(A)$ for $t\in \, ]0,+\infty ]$. Moreover, still by (59), we have, by direct computations

$$\begin{aligned} 2AQ_tx=2Q_tA x =(e^{2tA}-I)BB^*x, \qquad 2AQ_\infty x=2Q_\infty A x =-BB^*x \end{aligned}$$

for all $x \in D(A)$. Finally, for any given $y\in R(Q_\infty )$ we set $x:=Q_\infty ^{-1}y\in [\ker Q_\infty ]^\perp \subseteq X$ and we write, using the last formula

$$\begin{aligned} 2AQ_\infty Q_\infty ^{-1}y =-BB^*Q_\infty ^{-1}y, \end{aligned}$$

which, by the properties of the pseudoinverses, gives (60). $\square $

Finally, we provide the following partly well-known result, concerning the images of the controllability operators.

Proposition C.2

Assume that Assumption 2.1 holds.

(i)
If $0<t<\tau <+ \infty $ then $R(Q_t^{1/2}) \subseteq R(Q_\tau ^{1/2})\subseteq R(Q_\infty ^{1/2})$.
(ii)
If, in addition, the system (5) is null controllable at time $T_0$, i.e., Assumption 2.4 holds, then $R(Q_t^{1/2}) = R(Q_\infty ^{1/2})$ for all $t\in [T_0,+\infty [$.
(iii)
If A is selfadjoint and commutes with $BB^*$ then, without assuming null controllability, for all $t\in \, ]0,+\infty [\,$, the equalities $R(Q_t) = R(Q_\infty )$ and $R(Q_t^{1/2}) = R(Q_\infty ^{1/2})$ hold.

Proof

The results (i) and (ii) concerning the images of the operators $Q_t^{1/2}$, $t\in [0,+\infty ]$ are well known: see, for example, for point (i) the proof of Theorem 2.2 in Part IV, Chapter 2 of [29]; for point (ii) the proof of Theorem 2.2 in [12].

We now prove (iii). For all $0<t<\tau \le +\infty $ we have, using (58), the selfadjointness of A and the commutativity,

$$\begin{aligned} Q_\tau = Q_t + e^{2tA}Q_{\tau -t}=Q_t +Q_{\tau -t} e^{2tA}; \end{aligned}$$

(61)

hence, if $\tau =+\infty $ we get, for all $x \in H$ and $t\ge 0$,

$$\begin{aligned} Q_t x=Q_\infty x -Q_{\infty } e^{2tA}x=Q_\infty (x- e^{2tA}x). \end{aligned}$$

(62)

Thus, we immediately get $R(Q_t)\subseteq R(Q_\infty )$ for all $t\ge 0$. On the other hand we have, for $x \in H$ and $t\ge 0$,

$$\begin{aligned} Q_\infty x=Q_t x - e^{2tA}Q_{\infty } x, \end{aligned}$$

which implies, for all $x \in H$ and $t\ge 0$,

$$\begin{aligned} \Vert Q_\infty x \Vert _X\le \Vert Q_t x\Vert _X + Me^{-2\omega t}\Vert Q_{\infty }x\Vert _X\,. \end{aligned}$$

Let $T_1\ge 0$ be such that $Me^{-2\omega T_1}=1$. Then, for all $t>T_1$ the above implies

$$\begin{aligned} \Vert Q_\infty x \Vert _X\le \frac{1}{1-Me^{-2\omega t}}\Vert Q_t x\Vert _X\,. \end{aligned}$$

Using Proposition A.1-(i) this implies that $R(Q_t)=R(Q_\infty )$ for all $t>T_1$. If $T_1=0$ the claim is proved. If $T_1>0$ take $t\le T_1$. We have, taking $\tau =2t$ in (61),

$$\begin{aligned} Q_{2t}x= Q_t x+ Q_{t} e^{2tA}x=Q_t (x+ e^{2tA}x), \quad x \in H. \end{aligned}$$

(63)

This implies that $R(Q_{2t})\subseteq R(Q_t)$. Iterating this argument we see that it must be $R(Q_{2^k t})\subseteq R(Q_t)$ for all $k \in {\mathbb N}$. Taking $\bar{k}$ such that $2^{\bar{k}} t>T_1$ we then get $R(Q_\infty )=R(Q_{2^{\bar{k}} t})\subseteq R(Q_t)$. This proves the claim.

Concerning the last statement we observe that, by (62) and since $e^{tA}$ commutes with $Q^{1/2}_{\infty }$, too, we may write

$$\begin{aligned} \Vert Q^{1/2}_\infty x \Vert _X^2= & {} \left\langle Q_\infty x, x\right\rangle _X= \left\langle Q_t x,x\right\rangle _X - \left\langle e^{2tA}Q_{\infty } x,x\right\rangle _X \\= & {} \left\langle Q_t x,x\right\rangle _X - \left\langle e^{tA}Q^{1/2}_{\infty } x,e^{tA}Q^{1/2}_{\infty } x\right\rangle _X \\= & {} \Vert Q^{1/2}_t x \Vert _X^2 +\Vert e^{tA}Q^{1/2}_{\infty } x\Vert _X^2 \le \Vert Q^{1/2}_t x \Vert _X^2 +Me^{-\omega t}\Vert Q^{1/2}_{\infty } x\Vert _X^2. \end{aligned}$$

Hence, taking $T_2$ such that $Me^{-\omega T_2}=1$ (i.e., $T_2=2T_1$), for $t>T_2$ we get

$$\begin{aligned} \Vert Q^{1/2}_\infty x \Vert _X^2 \le \frac{1}{1-Me^{-\omega t}}\Vert Q^{1/2}_t x \Vert _X^2 \end{aligned}$$

which gives $R(Q^{1/2}_\infty )\subseteq R(Q^{1/2}_t )$, and hence $R(Q^{1/2}_\infty )= R(Q^{1/2}_t )$, for $t>T_2$. If $T_2=0$ the claim follows. Otherwise, using (63), we have

$$\begin{aligned} \Vert Q^{1/2}_{2t} x \Vert _X^2= & {} \left\langle Q_{2t} x, x\right\rangle _X= \left\langle Q_t x,x\right\rangle _X + \left\langle e^{2tA}Q_{t} x,x\right\rangle _X \\= & {} \left\langle Q_t x,x\right\rangle _X + \left\langle e^{tA}Q^{1/2}_{t} x,e^{tA}Q^{1/2}_{t} x\right\rangle _X \\= & {} \Vert Q^{1/2}_t x \Vert _X^2 +\Vert e^{tA}Q^{1/2}_{t} x\Vert _X^2 \le \Vert Q^{1/2}_t x \Vert _X^2 +Me^{-\omega t}\Vert Q^{1/2}_{t} x\Vert _X^2. \end{aligned}$$

Hence, arguing as above we get $R(Q^{1/2}_\infty )=R(Q^{1/2}_t )$ for $t>0$. $\square $

We have the following result about the optimal pairs when $x \in R(Q_t)$.

Proposition C.3

Let $x \in R(Q_t)$. Let $(\hat{y}_{t,x},\hat{u}_{t,x})$ be the optimal pair in $[-t,0]$. Then, we have

$$\begin{aligned} \hat{u}_{t,x}(r) = B^* e^{-rA^*} { Q}^{-1}_{t} x \quad \forall r\in [-t,0], \end{aligned}$$

(64)

with ${ Q}^{-1}_t$ defined as in Theorem 2.7 (iii). Moreover, the corresponding optimal state $\hat{y}$ satisfies

$$\begin{aligned} \hat{y}_{t,x}(r)=Q_{t+r}e^{-rA^*}Q_t^{-1}x, \qquad r \in [-t,0]; \end{aligned}$$

(65)

hence, the optimal pair satisfies the feedback formula

$$\begin{aligned} \hat{u}_{t,x}(r)=B^* Q_{t+r}^{-1}\hat{y}_{t,x}(r), \qquad r \in \,]-t,0], \end{aligned}$$

(66)

and, formally, $\hat{y}_{t,x}$ is a solution of the backward closed-loop equation (BCLE)

$$\begin{aligned} y'(r)=(A+BB^* Q_{t+r}^{-1}) y(r) , \qquad r \in \, ]-t,0] \end{aligned}$$

(67)

with final condition $y(0)=x$.

Proof

Formula (64) follows from [29, Theorem 2.3-(iii), page 210].

Formula (65) follows by inserting (64) into the state equation:

$$\begin{aligned} \hat{y}_{t,x}(r)= & {} \int _{-t}^r e^{(r-s)A}B \hat{u}_{t,x}(s)\,\mathrm {d}s = \int _{-t}^r e^{(r-s)A}B B^* e^{-sA^*} { Q}^{-1}_{t} x \,\mathrm {d}s \\= & {} \left( \int _{-t}^r e^{(r-s)A}B B^* e^{(r-s)A^*}\, \mathrm {d}s \right) e^{-rA^*} { Q}^{-1}_{t} x =Q_{t+r}e^{-rA^*} Q^{-1}_{t} x; \end{aligned}$$

moreover, formula (66) follows by simply observing that $\hat{y}_{t,x}(r) \in R(Q_{t+r})$ for every $r \in [-t,0]$, and using (64)–(65).

Inserting (66) into the state equation (5) we see that, formally, $\hat{y}_{t,x}$ is a solution of the BCLE (67). $\square $

Remark C.4

It is not hard to show that the above result holds true also in the case when $t=+\infty $. So we have for the optimal pair the representations

$$\begin{aligned} \hat{u}_{\infty ,x}(r)= & {} B^* e^{-rA^*} { Q}^{-1}_{\infty } x \qquad r\in \, ]-\infty ,0], \end{aligned}$$

(68)

$$\begin{aligned} \hat{y}_{\infty ,x}(r)= & {} Q_{\infty }e^{-rA^*}Q_\infty ^{-1}x, \qquad r \in \,]-\infty ,0]; \end{aligned}$$

(69)

and the feedback formula

$$\begin{aligned} \hat{u}_{\infty ,x}(r)=B^* Q_{\infty }^{-1}\hat{y}_{\infty ,x}(r), \qquad r \in \,]-\infty ,0]. \end{aligned}$$

(70)

Thus, formally, $\hat{y}_{\infty ,x}$ is a solution of the backward closed-loop equation (BCLE)

$$\begin{aligned} y'(r)=(A+BB^* Q_{\infty }^{-1}) y(r) , \qquad r \in \,]-\infty ,0] \end{aligned}$$

(71)

with final condition $y(0)=x$. Using the Lyapunov equation (25) proved in Proposition 3.3, the above (71) can be simplified as

$$\begin{aligned} y'(r)=-Q_{\infty }A^*Q_{\infty }^{-1} y(r) , \qquad r \in \,]-\infty ,0]. \end{aligned}$$

(72)

Hence, if $A^*$ commutes with $Q_\infty $ (e.g., when A is selfadjoint, and A and $BB^*$ commute), then the BCLE (71) becomes

$$\begin{aligned} y'(r)=-A^* y(r) , \qquad r \in \,]-\infty ,0]. \end{aligned}$$

which is well posed and is solved by the optimal trajectory. The same argument can be used in the finite horizon case of Proposition C.3 to rewrite (67), but due to the presence of $Q_t^\prime $ in the Lyapunov equation (24), the result is not so useful. $\square $

We now give a counterexample^{Footnote 6} in the case where the null controllability Assumption 2.4 does not hold.

Example C.5

Let us consider the Hilbert spaces $X=L^2(0,1)$ and $U= \mathbb {R}$. The operator

$$\begin{aligned} \left\{ \begin{array}{l} D(A)= \{ f\in H^1(0,1): f(0)=0 \} \\ Af=-f' \end{array} \right. \end{aligned}$$

is the infinitesimal generator in $L^2(0,1)$ of the $C_0$-semigroup (see, for example, [17, Chapter I, Section 4.c], or [1]):

$$\begin{aligned} (e^{tA}f)(s) := \left\{ \begin{array}{l@{\quad }ll} f(s-t) &{} \text {if }&{} s > t \\ 0 &{} \text {if }&{} s\le t,\end{array}\right. \qquad s\in \, ]0,1[\,, \quad t\ge 0, \end{aligned}$$

or, in other words,

$$\begin{aligned} (e^{tA}f)(\cdot )= f(\cdot -t)\, \chi _{[t,1]}(\cdot ). \end{aligned}$$

Next, let $B:\mathbb {R} \rightarrow L^2(0,1)$ be defined by

$$\begin{aligned} B(a) := a \, \chi _{[0,1/4]}(\cdot ). \end{aligned}$$

Consider the state equation

$$\begin{aligned} \left\{ \begin{array}{l} y'(s) = A y(s) + B u(s)\\ y(0)=0. \end{array}\right. \end{aligned}$$

For any fixed $t\in \,]0,1]$, we have $f\in Q_t^{1/2}(L^2(0,1))$ if and only if there exists $u\in L^2(0,t)$ such that

$$\begin{aligned} f = \int _0^t e^{(t-r)A}Bu(r)\, \mathrm {d}r. \end{aligned}$$

By the definition of B and the explicit form of $e^{tA}$ we easily get

$$\begin{aligned} f(\cdot ) = \int _0^t u(r) \, \chi _{[0,1/4]}(\cdot - t+r) \chi _{[t-r,1]}(\cdot )\,\mathrm {d}r = \int _0^t u(r) \, \chi _{[t-r,1\wedge (t-r+1/4)]}(\cdot )\, \mathrm {d}r. \end{aligned}$$

Now fix $t=1/4$: then if $f\in Q_{1/4}^{1/2}(L^2(0,1))$ it holds

$$\begin{aligned} f(s) = \int _0^{1/4} u(r) \, \chi _{[1/4-r,1/2-r]}(s)\, \mathrm {d}r, \end{aligned}$$

so that necessarily $f(s)=0$ for all $s\in \,]1/2,1]$. On the other hand, take $t=1$ and $u\equiv 1\in L^2(0,1)$; then if $f = \int _0^1 e^{(1-r)A}Bu(r)\, \mathrm {d}r$ we have in particular $f\in Q_1^{1/2}(L^2(0,1))$ and

$$\begin{aligned} f(s) = \int _0^1 \chi _{[1-r,1\wedge (5/4-r)]}(s)\,\mathrm {d}r = \int _0^1 \chi _{[1-s,1\wedge (5/4-s)]}(r)\,\mathrm {d}r = s\wedge \frac{1}{4} \quad \forall s\in [0,1]. \end{aligned}$$

This shows that f cannot belong to $Q_{1/4}^{1/2}(L^2(0,1))$, i.e.,

$$\begin{aligned} Q_1^{1/2}(L^2(0,1)) \nsubseteq Q_{1/4}^{1/2}(L^2(0,1)), \end{aligned}$$

and in particular, the system cannot be null controllable at any $T\in \,]0,1/4]$. $\square $

D: Proofs

Proof of Lemma 4.2

We start proving (i). Let $\{x_n\}$ be a Cauchy sequence in H: then for each n we have $x_n = Q_\infty ^{1/2}z_n$, where $z_n\in [\ker Q_\infty ^{1/2}]^\perp $ is uniquely determined, and by (29) $\{Q_\infty ^{-1/2}x_n\}=\{z_n\}$ is a Cauchy sequence in X, so that it converges to some $z\in [\ker Q_\infty ^{1/2}]^\perp $. As $Q_\infty ^{1/2}\in \mathcal{L}(X)$, $\{x_n\}$ is a Cauchy sequence in X, too, and it converges to some $x\in X$. It follows that $Q_\infty ^{1/2}z = x$, so that $x\in H$ and $x_n \rightarrow x$ in H. This shows that H is complete. To prove that H is continuously embedded into X, take $x\in H$: then $x=Q_\infty ^{1/2}y$ for a unique $y= Q_\infty ^{-1/2}x \in [\ker Q_\infty ^{1/2}]^\perp $. Thus,

$$\begin{aligned} \Vert x\Vert _X= & {} \Vert Q_\infty ^{1/2}y\Vert _X \le \Vert Q_\infty ^{1/2}\Vert _{\mathcal{L}(X)}\Vert y\Vert _X = \Vert Q_\infty ^{1/2}\Vert _{\mathcal{L}(X)}\Vert Q_\infty ^{-1/2}x\Vert _X \\= & {} \Vert Q_\infty ^{1/2}\Vert _{\mathcal{L}(X)}\Vert x\Vert _H\,. \end{aligned}$$

Concerning (ii), let $x \in H$. Then, there exists a unique $z \in [\ker Q_\infty ^{1/2}]^\perp =\overline{R(Q_\infty ^{1/2})} $ such that $x=Q_\infty ^{1/2}z$. So there exists a sequence $\{z_n\}\subset R(Q_\infty ^{1/2})$ such that $z_n \rightarrow z$ in X as $n \rightarrow + \infty $. Setting $x_n=Q_\infty ^{1/2}z_n$ we have as $n\rightarrow \infty $

$$\begin{aligned} \Vert x_n-x\Vert _H =\Vert Q_\infty ^{1/2}(z_n-z)\Vert _H =\Vert z_n-z\Vert _X \rightarrow 0, \end{aligned}$$

and the claim follows.

The statement (iii) follows from (29) by just taking $x=y$.

To prove the statement (iv) we observe first that, for all $x \in H$ with $x=Q^{1/2}_\infty z$, $z \in X$,

$$\begin{aligned} \frac{\Vert Q^{1/2}_\infty x\Vert _H}{\Vert x\Vert _H}= \frac{\Vert x\Vert _X}{\Vert z\Vert _X}\frac{\Vert Q^{1/2}_\infty z\Vert _X}{\Vert z\Vert _X}, \end{aligned}$$

which implies $\Vert Q^{1/2}_\infty \Vert _{\mathcal{L}(H)}\le \Vert Q^{1/2}_\infty \Vert _{\mathcal{L}(X)}$. On the other hand, if $z_n\in X$ is such that

$$\begin{aligned} \frac{\Vert Q^{1/2}_\infty z_n\Vert _X}{\Vert z_n\Vert _X}>\Vert Q^{1/2}_\infty \Vert _{\mathcal{L}(X)}-\frac{1}{n} \end{aligned}$$

then, setting $x_n=Q^{1/2}_\infty z_n \in H$, we have

$$\begin{aligned} \Vert Q^{1/2}_\infty \Vert _{\mathcal{L}(H)}\ge \frac{\Vert Q^{1/2}_\infty x_n\Vert _H}{\Vert x_n\Vert _H} = \frac{\Vert Q^{1/2}_\infty z_n\Vert _X}{\Vert z_n\Vert _X}> \Vert Q^{1/2}_\infty \Vert _{\mathcal{L}(X)}-\frac{1}{n}. \end{aligned}$$

which gives the claim.

Finally, (v) follows by observing that $Q_\infty ^{-1/2}F$ is a well-defined closed linear operator from X to X and applying the closed graph theorem. $\square $

Proof of Lemma 4.3

We just consider the case $t=+\infty $, since the case $T_0\le t <+\infty $ is quite similar. Fix $x\in H$. Then, there is a unique $z\in [\ker Q_\infty ]^\perp $ such that $Q_\infty ^{1/2}z=x$. As $[\ker Q_\infty ]^\perp = \overline{R(Q_\infty )}=\overline{R(Q_\infty ^{1/2})}$, there exists $\{z_n\}\subset X$ such that $Q_\infty ^{1/2}z_n \rightarrow z$ in X. Since $D(A^*)$ is dense in X, for each $n\in {\mathbb N}^+$ we can find $y_n \in D(A^*)$ such that $\Vert y_n-z_n\Vert _X<1/n$, so that $Q_\infty ^{1/2}y_n \rightarrow z$ in X, too. Hence,

$$\begin{aligned} \Vert Q_\infty y_n - x\Vert _H = \Vert Q_\infty ^{1/2} y_n - Q_\infty ^{-1/2}x\Vert _X = \Vert Q_\infty ^{1/2} y_n - z\Vert _X\rightarrow 0, \end{aligned}$$

i.e., x belongs to the closure of $Q_\infty (D(A^*))$ in H. The density of $D(A) \cap H$ follows since, by Lemma 3.1-(i), we have $Q_\infty (D(A^*))\subset D(A)\cap H$. $\square $

Proof of Lemma 4.4

We recall that for $0<t\le +\infty $ we have

$$\begin{aligned} Q_t^{1/2}Q_t^{-1/2}z = z \quad \forall z\in R(Q_t^{1/2}), \qquad Q_t^{-1/2}Q_t^{1/2}x = P_{[\ker Q_t]^\perp }\,x \quad \forall x\in X. \end{aligned}$$

(73)

By (73), using also the fact that $Q_t^{1/2}$ is selfadjoint in X, we get (i):

$$\begin{aligned} \langle z, Q_t^{-1/2}w\rangle _X= & {} \langle Q_t^{1/2}Q_t^{-1/2}z, Q_t^{-1/2}w\rangle _X \\= & {} \langle Q_t^{-1/2}z, Q_t^{1/2}Q_t^{-1/2}w\rangle _X = \langle Q_t^{-1/2}z, w\rangle _X\qquad \forall z,w\in R(Q_t^{1/2}). \end{aligned}$$

About (ii), we have by (73), (29) and (i):

$$\begin{aligned} \langle Q_\infty ^{1/2}x,y\rangle _H= & {} \langle Q_\infty ^{1/2}x,Q_\infty ^{1/2}Q_\infty ^{-1/2}y\rangle _H = \langle x, Q_\infty ^{-1/2}y\rangle _X = \langle Q_\infty ^{-1/2}x, y\rangle _X \\= & {} \langle Q_\infty ^{-1/2}x, P_{[\ker Q_\infty ]^\perp }\,y \rangle _X = \langle Q_\infty ^{-1/2}x, Q_\infty ^{-1/2}Q_\infty ^{1/2}y\rangle _X \\= & {} \langle x, Q_\infty ^{1/2}y\rangle _H \qquad \forall x,y\in H. \end{aligned}$$

Finally, (iii) follows immediately by applying (ii) twice. $\square $

Proof of Lemma 4.5

For fixed $t\ge T_0$, let $x\in H$ be such that $Q_t^{1/2}Q_\infty ^{-1/2}x=0$: then, by (73),

$$\begin{aligned} Q_\infty ^{-1/2}x = Q_t^{-1/2}(Q_t^{1/2}Q_\infty ^{-1/2}x) = P_{[\ker Q_t]^\perp }Q_\infty ^{-1/2}x =0, \end{aligned}$$

which implies $Q_\infty ^{-1/2}x \in \ker Q_t= \ker Q_\infty $. On the other hand, by definition we also have $Q_\infty ^{-1/2}x \in [\ker Q_\infty ]^\perp $, so that $Q_\infty ^{-1/2}x=0$ and consequently $x=Q_\infty ^{1/2}(Q_\infty ^{-1/2}x)=0$. This proves that $Q_t^{1/2}Q_\infty ^{-1/2}$ is one-to-one. Moreover, for each $y\in H$ the equation $Q_t^{1/2}Q_\infty ^{-1/2}x=y$ is equivalent to $P_{[\ker Q_t]^\perp }Q_\infty ^{-1/2}x = Q_t^{-1/2}y$, but since $Q_\infty ^{-1/2}x\in [\ker Q_\infty ]^\perp =[\ker Q_t]^\perp $, we deduce $Q_\infty ^{-1/2}x = Q_t^{-1/2}y$ and hence $x=Q_\infty ^{1/2}Q_t^{-1/2}y$. This shows that $Q_t^{1/2}Q_\infty ^{-1/2}$ is surjective.

We now claim that $Q_t^{1/2}Q_\infty ^{-1/2}$ has closed graph in $H\times H$. Indeed, let $\{x_n\}$ be a sequence in H such that $(x_n,Q_t^{1/2}Q_\infty ^{-1/2}x_n)\rightarrow (x,y)$ in $H\times H$. This means, by definition,

$$\begin{aligned} Q_\infty ^{-1/2}x_n \rightarrow Q_\infty ^{-1/2}x \quad \text {in } X, \end{aligned}$$

(74)

and

$$\begin{aligned} Q_\infty ^{-1/2}Q_t^{1/2}Q_\infty ^{-1/2}x_n \rightarrow Q_\infty ^{-1/2}y \quad \text {in } X; \end{aligned}$$

(75)

if we apply $Q_t^{1/2}$ to (74) we obtain

$$\begin{aligned} Q_t^{1/2}Q_\infty ^{-1/2}x_n \rightarrow Q_t^{1/2}Q_\infty ^{-1/2}x \quad \text {in } X, \end{aligned}$$

whereas if we apply $Q_\infty ^{1/2}$ to (75) we get

$$\begin{aligned} Q_t^{1/2}Q_\infty ^{-1/2}x_n \rightarrow y \quad \text {in } X, \end{aligned}$$

so that $y=Q_t^{1/2}Q_\infty ^{-1/2}x$ and our claim is proved. By Lemma 4.2 it follows that $Q_t^{1/2}Q_\infty ^{-1/2}\in \mathcal{L}(H)$. Finally, the inverse $Q_\infty ^{1/2}Q_t^{-1/2}$ is also in $\mathcal{L}(H)$ by the same argument, or by the open mapping theorem.

The proof of the second statement is quite analogous. $\square $

Proof of Lemma 4.6

It is clear that $Q_t^{-1/2}Q_\infty ^{1/2}$ maps X into X and vanishes on $\ker Q_\infty =\ker Q_\infty ^{1/2}$. Moreover, if $y\in [\ker Q_\infty ]^\perp $ then $x = Q_\infty ^{-1/2}Q_t^{1/2} y$ is in $[\ker Q_\infty ]^\perp $ and satisfies $Q_t^{-1/2}Q_\infty ^{1/2}x=y$, so that $Q_t^{-1/2}Q_\infty ^{1/2}$ is one-to-one from $[\ker Q_\infty ]^\perp $ onto itself.

Now we prove that $Q_t^{-1/2}Q_\infty ^{1/2}$ has closed graph. Let $(x_n,Q_t^{-1/2}Q_\infty ^{1/2}x_n) \rightarrow (x,y)$ in $X\times X$: then $x_n \rightarrow x$ in X and $Q_t^{-1/2}Q_\infty ^{1/2}x_n \rightarrow y$ in X, so that $y\in [\ker Q_\infty ]^\perp $. It follows that

$$\begin{aligned} Q_\infty ^{1/2}x_n \rightarrow Q_\infty ^{1/2}x \end{aligned}$$

and also

$$\begin{aligned} Q_\infty ^{1/2}x_n = Q_t^{1/2}[Q_t^{-1/2}Q_\infty ^{1/2}x_n]\rightarrow Q_t^{1/2}y, \end{aligned}$$

so that $Q_\infty ^{1/2}x=Q_t^{1/2}y$, but since $y\in [\ker Q_\infty ]^\perp $, we deduce $y=Q_t^{-1/2}Q_\infty ^{1/2}x$. Thus, $Q_t^{-1/2}Q_\infty ^{1/2}\in \mathcal{L}(X)$ and the result follows. $\square $

Proof of Lemma 4.7

Indeed, by (29) and Lemma 4.4 (ii)-(iii),

$$\begin{aligned} \langle Lx,y\rangle _H= & {} \langle Q_\infty ^{-1/2}Lx,Q_\infty ^{-1/2}y\rangle _X = \langle Lx,Q_\infty ^{-1}y\rangle _X \\= & {} \langle x,L^*Q_\infty ^{-1}y\rangle _X = \langle Q_\infty ^{1/2}x,Q_\infty ^{1/2}L^*Q_\infty ^{-1}y\rangle _H = \langle x,Q_\infty L^*Q_\infty ^{-1}y\rangle _H\,. \end{aligned}$$

$\square $

Proof of Proposition 4.8

(i)
Fix $x\in H$ and $t, \tau >0$ with $\tau <t$. For any $u\in \mathcal{U}_{[0,\tau ]}(x)$, define
$$\begin{aligned} \tilde{u}(s)=\left\{ \begin{array}{l@{\quad }l}0 &{} \text {if } \ s\in [0,t-\tau ] \\ u(s-t+\tau ) &{} \text {if } \ s\in \, ]t-\tau ,t]. \end{array}\right. \end{aligned}$$
We have $\tilde{u} \in \mathcal{U}_{[0,t]}(x)$, since obviously
$$\begin{aligned} \int _0^t e^{(t-s)A}B\tilde{u}(s)\,\mathrm {d}s= & {} \int _{t-\tau }^t e^{(t-s)A}Bu(s-t+\tau )\,\mathrm {d}s \\= & {} \int _0^\tau e^{(\tau -\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma =x, \end{aligned}$$
and, moreover,
$$\begin{aligned} \int _0^t \Vert \tilde{u}(s)\Vert _U^2 \, \mathrm {d}s = \int _{t-\tau }^t \Vert u(s-t+\tau )\Vert _U^2 \, \mathrm {d}s = \int _0^\tau \Vert u(\sigma )\Vert _U^2 \, \mathrm {d}\sigma . \end{aligned}$$
Now, for a fixed $\varepsilon >0$ we may select $u\in \mathcal{U}_{[0,\tau ]}(x)$ such that
$$\begin{aligned} V(\tau , x) \le \frac{1}{2} \int _0^\tau \Vert u(\sigma )\Vert _U^2 \, \mathrm {d}\sigma < V(\tau ,x)+\varepsilon , \end{aligned}$$
so that for the corresponding $\tilde{u}$ we get
$$\begin{aligned} V(t,x) \le \frac{1}{2} \int _0^t \Vert \tilde{u}(s)\Vert _U^2 \, \mathrm {d}s = \frac{1}{2} \int _0^\tau \Vert u(\sigma )\Vert _U^2 \, \mathrm {d}\sigma < V(\tau ,x)+\varepsilon \end{aligned}$$
and finally $V(t,x)\le V(\tau ,x)$.
(ii)
Formula (19) shows that V is quadratic with respect to x. Moreover, (19), rewritten in H, becomes
$$\begin{aligned} V(t,x)= & {} \frac{1}{2} \langle Q_{\infty }^{1/2} Q_{t}^{-1/2}x,Q_{\infty }^{1/2} Q_{t}^{-1/2}x \rangle _H \\= & {} \frac{1}{2} \langle [Q_{\infty }^{1/2} Q_{t}^{-1/2}]^{*H} Q_{\infty }^{1/2} Q_{t}^{-1/2}x,x \rangle _{[R(Q_{t}^{1/2})]^{*H},R(Q_{t}^{1/2})} \quad \\&\times \, \forall t>0, \ \ \forall x\in R(Q_{t}^{1/2}), \end{aligned}$$
and the claim is proved.
(iii)
(a) Under Assumption 2.4 formula (34) immediately follows from the fact that $R(Q_t^{1/2})=H$ for every $t\ge T_0$. To prove (35) we take $x\in R(Q_t)$. Then, by (19),
$$\begin{aligned} V(t,x)= \frac{1}{2} \Vert Q_{t}^{-1/2}x \Vert _X^2 =\frac{1}{2} \langle Q_{t}^{-1/2}x, Q_{t}^{-1/2}x \rangle _X = \frac{1}{2} \langle Q_{t}^{-1}x, x \rangle _X \end{aligned}$$
where in the last step we used Lemma 4.4 (i). Now passing to the inner product in H and using Lemma 4.4 (ii) we get
$$\begin{aligned} V(t,x)= \frac{1}{2} \langle [Q_{\infty }^{1/2} Q_{t}^{-1}]x, Q_{\infty }^{1/2}x\rangle _H =\frac{1}{2} \langle [Q_{\infty } Q_{t}^{-1}]x, x\rangle _H \,, \end{aligned}$$
which is the claim. Finally, the estimate (36) is an immediate consequence of the monotonicity of $V(\cdot , x)$.
(iii)
(b) First we show that if $t\in [T_0,+\infty [\,$, then for each $\varepsilon >0$ and $R>0$ there is $\delta >0$ such that
$$\begin{aligned} V(\tau ,x) < V(t,x) + \varepsilon \qquad \forall x\in H, \; \Vert x\Vert _H\le R, \quad \forall \tau \in [T_0\vee (t-\delta ), t[\,.\qquad \end{aligned}$$
(76)
To this purpose, fix $\tau \in [T_0,t[\,$ and $x \in B_H(0,R)$, take $u\in \mathcal{U}_{[0,t]}(x)$ such that
$$\begin{aligned} \frac{1}{2} \int _0^t \Vert u(s)\Vert _U^2\, \mathrm {d}s < V(t,x)+ \frac{\varepsilon }{2}\,, \end{aligned}$$
and define
$$\begin{aligned} \overline{u}(s) = u(s+t-\tau ), \qquad s\in [0,\tau ]. \end{aligned}$$
Since
$$\begin{aligned} \int _0^\tau e^{(\tau -s)A}B\overline{u}(s)\,\mathrm {d}s= & {} \int _0^\tau e^{(\tau -s)A}Bu(s+t-\tau )\,\mathrm {d}s \\= & {} \int _{t-\tau }^t e^{(t-\sigma )A}Bu(\sigma ) \,\mathrm {d}\sigma = x - \int _0^{t-\tau }e^{(t-\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma , \end{aligned}$$
we have $\overline{u}\in \mathcal{U}_{[0,\tau ]}\left( x-\int _0^{t-\tau } e^{(t-\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma \right) $. Hence,
$$\begin{aligned} V\left( \tau ,x-\int _0^{t-\tau } e^{(t-\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma \right)\le & {} \frac{1}{2} \int _0^\tau \Vert \overline{u}(s)\Vert _U^2 \,\mathrm {d}s = \frac{1}{2} \int _{t-\tau }^t \Vert u(\sigma )\Vert _U^2 \mathrm {d}\sigma \\\le & {} \frac{1}{2} \int _0^t \Vert u(\sigma )\Vert _U^2\,\mathrm {d}\sigma < V(t,x)+ \frac{\varepsilon }{2}\,. \end{aligned}$$
On the other hand, by (34) we have, for $x,y \in H$
$$\begin{aligned} V(\tau ,x-y)-V(\tau ,x)= & {} \frac{1}{2} \langle P_V(\tau )(x-y),x-y\rangle _H - \frac{1}{2} \langle P_V(\tau )x,x \rangle _H \\= & {} - \frac{1}{2} \langle P_V(\tau )y,x-y\rangle _H - \frac{1}{2} \langle P_V(\tau )x,y \rangle _H; \end{aligned}$$
then, by (36), for every $\varepsilon >0$ and $R>0$ there is $\eta >0$ such that
$$\begin{aligned}&\Vert y\Vert _H< \eta \qquad \Longrightarrow \qquad |V(\tau ,x-y)-V(\tau ,x)|< \frac{\varepsilon }{2} \\&\qquad \forall \tau \ge T_0, \quad \forall x \in B_H(0,R). \end{aligned}$$
Hence, we get
$$\begin{aligned} V(\tau ,x)<V\left( \tau ,x-\int _0^{t-\tau } e^{(t-\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma \right) + \frac{\varepsilon }{2}<V(t,x)+ \varepsilon \end{aligned}$$
provided we are able to find $\delta >0$ such that
$$\begin{aligned} \left\| \int _0^{t-\tau } e^{(t-\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma \right\| _H < \eta \qquad \forall \tau \in [T_0\vee (t-\delta ), t[\,. \end{aligned}$$
(77)
In order to check (77), we fix $z\in R(Q_\infty )$ with $\Vert z\Vert _H \le 1$. We can write, by Assumption 2.4, Proposition A.1 and Lemma 4.5,
$$\begin{aligned}&\quad \left| \left\langle \int _0^{t-\tau } e^{(t-\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma , z\right\rangle _H\right| = \left| \left\langle \int _0^{t-\tau } e^{(t-\sigma )A}Bu(\sigma )\,\mathrm {d}\sigma , Q_\infty ^{-1}z\right\rangle _X \right| \\&\quad = \left| \left\langle \int _0^{t-\tau } e^{(t-\sigma -T_0)A}Bu(\sigma )\,\mathrm {d}\sigma , e^{T_0 A^*}Q_\infty ^{-1}z \right\rangle _X\right| \\&\quad \le \int _0^{t-\tau } e^{(t-\sigma -T_0)A}Bu(\sigma )\Vert _X \,\mathrm {d}\sigma \, \cdot \,\Vert e^{T_0 A^*}Q_\infty ^{-1}z \Vert _X \\&\quad \le \int _0^{t-\tau } e^{(t-\sigma -T_0)A}Bu(\sigma )\Vert _X\,\mathrm {d}\sigma \, \cdot \,\Vert Q_{T_0}^{1/2}Q_\infty ^{-1}z \Vert _X \\&\quad \le c \int _0^{t-\tau } e^{-\omega (t-\sigma -T_0)}\Vert u(\sigma )\Vert _U\, \mathrm {d}\sigma \, \cdot \,\Vert Q_\infty ^{-1/2}z\Vert _X \\&\quad \le c\,\sqrt{t-\tau }\, e^{-\omega \,\delta }\Vert u\Vert _{L^2(0,t;U)} \Vert z\Vert _H. \end{aligned}$$
Hence, using the density of $R(Q_\infty )$ in H (see Lemma 4.2),
$$\begin{aligned} \sup _{\Vert z\Vert _H \le 1} \left| \left\langle \int _{-t}^{-\tau } e^{-sA}Bu(s)\,\mathrm {d}s, z\right\rangle _H \right| \le c\,\sqrt{t-\tau }\, \Vert u\Vert _{L^2(-t,0;U)}, \end{aligned}$$
so that
$$\begin{aligned} \left\| \int _{-t}^{-\tau } e^{-sA}Bu(s)\,\mathrm {d}s \right\| _H \le c\,\sqrt{t-\tau }\, \Vert u\Vert _{L^2(-t,0;U)}. \end{aligned}$$
Thus, to achieve (77) it suffices to take $\delta >0$ such that $c\sqrt{\delta }\, \Vert u\Vert _{L^2(-t,0;U)}< \eta $. Hence, we have proved (76), too. Now fix $\varepsilon >0$, $R>0$ and take $\delta $ such that (76) holds. For $(t,x), (\tau ,x')\in [T_0,+\infty ]\times B_H(0,R)$ with $|t -\tau |<\delta $ we have
$$\begin{aligned} |V(t,x)-V(\tau ,x')|\le & {} |V(t,x)-V(\tau ,x)|+ |V(\tau ,x)-V(\tau ,x')| \\\le & {} \varepsilon + \frac{1}{2} |\langle P_V(\tau )x,x \rangle _H - \frac{1}{2} \langle P_V(\tau )x',x' \rangle _H| \\\le & {} \varepsilon + \frac{1}{2} |\langle P_V(\tau )(x-x'),x \rangle _H|+\frac{1}{2} |\langle P_V(\tau )x',x-x'\rangle _H| \\\le & {} \varepsilon + \Vert P_V(T_0\Vert _{\mathcal{L}(H)}R \Vert x-x'\Vert _H\,, \end{aligned}$$
and the first part of the claim easily follows. To prove the continuity of the map $t \rightarrow P_V(t)$ we observe that, for $t,\tau \in [T_0,+\infty [$
$$\begin{aligned} \Vert P_V(t)-P_V(\tau )\Vert _{\mathcal{L}(H)}= & {} \sup _{\Vert x\Vert = 1} \langle P_V(t)-P_V(\tau )x,x \rangle _H \\= & {} 2\sup _{\Vert x\Vert = 1} |V(t,x)-V(\tau ,x)|; \end{aligned}$$
so the claim follows by (76).
(iii)
(c) The limit in (37) clearly exists and is finite by monotonicity and positivity of V. To find this limit we consider first the case when $x\in R(Q_\infty )$. Then, we have, by (35),
$$\begin{aligned} 2V(t,x)-\Vert x\Vert _H^2= & {} \langle P_V(t)x-x,x \rangle _H = \langle Q_\infty Q_t^{-1}x-x,x\rangle _H \\= & {} \langle (Q_\infty -Q_t)Q_t^{-1}x,x \rangle _H =\langle (Q_\infty -Q_t)Q_t^{-1}x,Q_\infty ^{-1}x \rangle _X\,. \end{aligned}$$
Since, for suitable $c>0$,
$$\begin{aligned} \Vert (Q_\infty -Q_t)z\Vert _X=\left\| \int _t^{+\infty } e^{sA}BB^*e^{sA^*}z\,\mathrm {d}s\right\| _X \le c \, e^{- 2\omega t}\Vert z\Vert _X \quad \forall z\in X, \end{aligned}$$
we obtain
$$\begin{aligned} 2V(t,x)-\Vert x\Vert _H^2 \le c \, e^{- 2\omega t}\Vert Q_t^{-1}x\Vert _X \Vert Q_\infty ^{-1}x\Vert _X. \end{aligned}$$
But
$$\begin{aligned} \Vert Q_t^{-1}x\Vert _X= & {} \Vert Q_\infty ^{1/2} Q_t^{-1}x\Vert _H =\Vert Q_\infty ^{1/2} Q_t^{-1/2} Q_t^{-1/2}Q_\infty ^{1/2}Q_\infty ^{-1/2} x\Vert _H \\\le & {} \Vert Q_\infty ^{1/2} Q_t^{-1/2}\Vert _{\mathcal {L}(H)} \Vert Q_t^{-1/2}Q_\infty ^{1/2}\Vert _{\mathcal {L}(H)}\Vert Q_\infty ^{-1/2}x\Vert _H, \end{aligned}$$
so that by Lemma 4.5 we get
$$\begin{aligned} \lim _{t\rightarrow +\infty } V(t,x) =\frac{1}{2}\Vert x\Vert _H^2 \qquad \forall x\in R(Q_\infty ). \end{aligned}$$
By selfadjointness of $P_V(t)$ and polarization, we also have
$$\begin{aligned} \lim _{t\rightarrow +\infty } \langle P_V(t)x,y \rangle _H = \langle x,y \rangle _H \quad \forall x,y \in R(Q_\infty ). \end{aligned}$$
Since, by (36), $P_V(t) - I$ is uniformly bounded, by density (Lemma 4.2) we deduce that
$$\begin{aligned} \lim _{t\rightarrow +\infty } \langle P_V(t)x,y \rangle _H = \langle x,y \rangle _H \quad \forall x\in R(Q_\infty ), \quad \forall y \in H, \end{aligned}$$
and using again that $P_V(t)$ is selfadjoint we get
$$\begin{aligned} \lim _{t\rightarrow +\infty } \langle x,P_V(t)y \rangle _H = \langle x,y \rangle _H \quad \forall x\in R(Q_\infty ), \quad \forall y \in H. \end{aligned}$$
With the same argument we then obtain
$$\begin{aligned} \lim _{t\rightarrow +\infty } \langle x,P_V(t)y \rangle _H = \langle x,y \rangle _H \quad \forall x,y\in H, \end{aligned}$$
and the result follows. $\square $

Rights and permissions

Reprints and permissions

About this article

Cite this article

Acquistapace, P., Gozzi, F. Minimum energy for linear systems with finite horizon: a non-standard Riccati equation. Math. Control Signals Syst. 29, 19 (2017). https://doi.org/10.1007/s00498-017-0204-y

Download citation

Received: 15 September 2016
Accepted: 02 October 2017
Published: 20 October 2017
DOI: https://doi.org/10.1007/s00498-017-0204-y

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Minimum energy for linear systems with finite horizon: a non-standard Riccati equation

Abstract

Similar content being viewed by others

The turnpike property and the longtime behavior of the Hamilton–Jacobi–Bellman equation for finite-dimensional LQ control problems

On the solvability of the Yakubovich linear-quadratic infinite horizon minimization problem

On the Asymptotic Stability for Nonlinear Oscillators with Time-Dependent Damping

1 Introduction

1.1 Description of the problem and of the main results

1.2 Plan of the paper

2 Minimum energy problems

2.1 Assumptions

Assumption 2.1

Proposition 2.2

Definition 2.3

Assumption 2.4

Remark 2.5

2.2 General formulation

Proposition 2.6

Theorem 2.7

Remark 2.8

3 The Lyapunov equation

Lemma 3.1

Proof

Definition 3.2

Proposition 3.3

Proof

Remark 3.4

4 The Riccati equation

Remark 4.1

4.1 The space H and its properties

Lemma 4.2

Lemma 4.3

Lemma 4.4

Lemma 4.5

Lemma 4.6

Lemma 4.7

4.2 Properties of the value function

Proposition 4.8

Remark 4.9

4.3 The value function solves the Riccati equation

Definition 4.10

Remark 4.11

Theorem 4.12

Proof

4.4 A partial uniqueness result

Theorem 4.13

Proof

4.5 The selfadjoint commuting case

Theorem 4.14

Proof

Theorem 4.15

Proof

Theorem 4.16

Proof

5 Examples

5.1 Delay state equation

5.2 Diagonal cases

Notes

References

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

A: Pseudoinverses

Proposition A.1

B: Some properties of commuting operators

Definition B.1

Lemma B.2

Proof

Lemma B.3

Proof

C: Controllability operators and minimum energy

Proposition C.1

Proof

Proposition C.2

Proof

Proposition C.3

Proof

Remark C.4