Abstract
We consider the existence and first order conditions of optimality for a stochastic optimal control problem inspired by the celebrated FitzHugh–Nagumo model, with nonlinear diffusion term, perturbed by a linear multiplicative Brownian-type noise. The main novelty of the present paper relies on the application of the rescaling method which allows us to reduce the original problem to a random optimal one.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Consider the following problem
\(\gamma : {\mathbb {R}} \rightarrow {\mathbb {R}}\) being a monotone, increasing continuous function, \(v=v(t, \xi )\) represents the transmembrane electrical potential, \({\mathcal {O}}\subset {\mathbb {R}}^d\), \(d=2,3,\) is a bounded and open set with smooth boundary \(\partial {\mathcal {O}}\). We indicate with \(\varDelta _\xi \) the Laplacian operator with respect to the spatial variable \(\xi \), while \(\varepsilon \) and \(\delta \) are positive constants representing phenomenological coefficients, \(f(\xi )\) is a given external forcing term, while \(I^{ion}\) is the Ionic current and, according with the FitzHugh–Nagumo model, it equals \(I^{ion}(v)= v(v-a)(v-1)\), \(v_0\), \(w_0 \, \in L^2({\mathcal {O}})\), namely it represents a cubic non-linearity. Also F is a bounded term needed to treat the general controlled equation in next section.
Equation (1) with linear diffusion, i.e. \(\gamma (x)=x\), is the well-known FitzHugh–Nagumo (FHN) equation. FHN equation is a reaction–diffusion equation, first introduced by Hodgkin and Huxley in [32] and then simplified in [31, 35]. The model has been proposed to provide a rigorous, yet simplified, analysis of electrical impulses dynamics along a nerve axon, see, e.g., [38], where the propagation of the transmembrane potential on the nerve axon is represented by a cubic nonlinear reaction term, possibly perturbed by a noisy one, see, e.g., [12, 22, 38, 41].
The random perturbation represents the effect of noisy input currents within neurons, their source being the random opening/closing actions of ion channels, see, e.g., [41]. Moreover, in two-dimensional and three-dimensional settings, Eq. (1) plays also a relevant role in statistical mechanics, under the name of Ginzburg–Landau equation, as well as concerning phase transition models of Ginzburg–Landau type, see, e.g., [27].
The general case where \(\gamma \) is a monotone function corresponds to an anomalous–diffusive FitzHugh–Nagumo (FHN) equation, see [33], also describing phase transitions in porous media, see, e.g., [36, 40].
Remark 1.1
In what follows we shall focus on the mathematical setting behind the Stochastic FitzHugh–Nagumo (FHN) model, without entering into details about the neuro-biological justification of parameters characterizing it. Appropriate details, as well as in depth analysis of the existing literature on the subject, will be provided later.
We assume that W is a \(H^{-1}:=H^{-1}({\mathcal {O}})\)-cylindrical Wiener processes, such that
where \(\{\beta _n\}_{n \ge 1}\) is a sequence of mutually independent standard Brownian motion defined on a filtered probability space \(\left( \varOmega ,{\mathcal {F}},\left\{ {\mathcal {F}}_t\right\} _{t \ge 0},{\mathbb {P}}\right) \), while \(\{e_n\}_{\ge 1}\) is an orthonormal basis in \(H^{-1}\) and \(\mu _n \in {\mathbb {R}}\).
Since the Laplacian operator \(\varDelta _\xi \) is a linear operator in \(L^2\left( {\mathcal {O}}\right) \), and \(-\varDelta _\xi \) is self-adjoint, then there exists a complete orthonormal system \(\{{\bar{e}}_k\}_{k\ge 1}\) in \(L^2\left( {\mathcal {O}}\right) \) of eigenfunctions of \(-\varDelta _\xi \), and we shall indicate the corresponding sequence of eigenvalues denoted by \(\{{\bar{\lambda }}_k\}_{k\ge 1}\). Therefore, we have
Also, we set
and note that G is monotonically nondecreasing.
The present paper addresses the problem of existence and uniqueness of a strong solution, in a sense to be better specified in a while, to Eq. (1). We stress that this is not a trivial problem as the nonlinear operator \(\varDelta \gamma \) is naturally defined on the space \(H^{-1}\) whereas the nonlinear polynomial perturbation \(I^{ion}\) is not m-accreative on the same space. In order to solve above problem we will transform the original equation, via a rescaling transformation, to a random PDE. It turns out that the existence and uniqueness of transformed random PDE can be treated by the theory of nonlinear semigroup in \(L^1\).
We will further consider the problem of existence of an optimal control for the nonlinear FHN equation. Again, in order to solve the problem we will apply a rescaling transformation to obtain a corresponding random PDE. As already emerged in [12, 22], the nonlinear polynomial term implies that standard minimization argument does not apply. Therefore, existence of an optimal control is achieved using Ekeland’s variational principle. First order conditions of optimality are given in terms of dual stochastic backward equation, see, e.g, [12, 17], whereas, due to the applied rescaling transformation are expressed in terms of a random backward dual equation which allows to simplify the setting also giving more insights on the derived optimal controller.
It is worth stressing that the present work continues the investigation of optimal control problem for a stochastic FHN system, generalizing further the result presented in [12, 22]. It must be stressed that the techniques used in the present work, although presenting some similarity with [12, 22], as for instance the usage of the Ekeland principle to treat the cubic nonlinearity typical of the FHN equation, are in general different. In fact, the nonlinear term \(\gamma \) poses several difficulties since its natural state space is \(H^{-1}\). As already mentioned, the cubic nonlinear term arising in the FHN equation is not sufficiently regular in such a space. Therefore, differently to the previous works [12, 22], existence and uniqueness of a solution for the main equation is non trivial. The nonlinear diffusion \(\gamma \) also affects the main technique used in proving the existence of an optimal control since a suitable trasnformation to reduce the problem to a random equation must be applied.
The present paper is structured as follows: Sect. 1.1 introduces main notation used thorough the paper. Section 2 addresses the problem of proving existence and uniqueness for the state equation whereas in Sect. 3 the problem of the existence for an optimal control is considered.
1.1 Main Notations
In what follows we will denote by \(| \cdot |\), resp. \(\langle \cdot ,\cdot \rangle \), the norm, resp. scalar product, on \({\mathbb {R}}^d\). Also, \(L^p\left( {\mathcal {O}}\right) =: L^p\), for \(1 \le p \le \infty \), is the standard space of p-Lebesgue measurable function over the domain \({\mathcal {O}}\subset {\mathbb {R}}^d\), with corresponding norm defined as \(|\cdot |_p\). For the case \(p=2\), we will further denote by \(\langle \cdot ,\cdot \rangle _2\) the scalar product in \(L^2\). The space \(H^1({\mathcal {O}})=: H^1\) is the Sobolev space \(\left\{ u \in L^2\,:\,\partial _\xi u(\xi ) \in L^2\right\} \), endowed with the standard norm \(\Vert u\Vert _{H^1}^2:= \int _{{\mathcal {O}}} \left( |u|^2 + |\nabla u|^2 \right) d\xi \). The dual of the space \(H^1\) will be denoted as \(H^{-1}\) equipped with corresponding norm \(|\cdot |_{-1}\).
Similarly, we will denote by \(W^{n,p}({\mathcal {O}})=:W^{n,p}\), \(n \in {\mathbb {N}}\), \(1\le p \le \infty \), the standard Sobolev space of p-integrable functions with p-integrable n-order derivatives. Coherently, \(W^{1,p}([0,T];H^{-1})\) will be the space of absolutely continuous function \(u:[0,T]\rightarrow H^{-1}\) such that both u and \(\frac{d}{dt}u \, \in L^p([0,T];H^{-1})\). Further, given a Banach space X, \(L^p([0,T];X)\) is the space of X-valued Bochner p-integrable functions on the interval [0, T]. Also, C([0, T]; X), resp. \(C^1([0,T];X)\), denotes the space of continuous, resp. continuously differentiable, functions \(u:[0,T]\rightarrow X\).
We shall also introduce \(C_W([0,T];H)\) the space of all \(H^{-1}\)-valued \(\left( {\mathcal {F}}_t\right) \)-adapted processes such that \(X \in C\left( [0,T];L^2\left( \varOmega ;H^{-1}\right) \right) \), that is X satisfies
In an analogous manner \(L^2_W([0,T];H^{-1})\) is the space of all \(H^{-1}\)-valued \(\left( {\mathcal {F}}_t\right) \)-adapted processes such that \(X \in L^2\left( [0,T];L^2\left( \varOmega ;H^{-1}\right) \right) \), that is X satisfies
At last \(L^2_W(\varOmega ;C\left( [0,T];H^{-1}\right) )\) denotes the space of all \(H^{-1}\)-valued \(\left( {\mathcal {F}}_t\right) \)-adapted and continuous processes such that
Above definition are still in place if instead of \(H^{-1}\) we consider a general Hilbert space H. It is also known that there is a natural embedding of \(L^2_W(\varOmega ;C\left( [0,T];H^{-1}\right) )\) into the space \(C_W([0,T];H^{-1})\), see, e.g. [13, Chapter 1].
We therefore can rewrite Eq. (1) as
We will assume the following to hold.
Hypothesis 1
-
(i)
\(\gamma :{\mathbb {R}}\rightarrow {\mathbb {R}}\) with \(\gamma (0)=0\) is a continuous and differentiable function such that
$$\begin{aligned} 0< C_1 \le \gamma '(x) \le C_2 < \infty \,,\quad \forall \, x \in {\mathbb {R}}\,; \end{aligned}$$ -
(ii)
\(G: {\mathbb {R}}\rightarrow {\mathbb {R}}\) is continuous monotonically non–decreasing and \(G(0) = 0\); moreover G is locally Lipschitzian;
-
(iii)
\(F \in L^\infty ((0,T)\times {\mathcal {O}})\), \({\mathbb {P}}\)-a.s. and it is progressively measurable w.r.t. \((0,T) \times \varOmega \times {\mathcal {B}}({\mathcal {O}})\); \(f \in L^\infty ({\mathcal {O}})\), and \(f \ge 0\) a.e. in \({\mathcal {O}}\);
-
(iv)
W is a \(H^{-1}:=H^{-1}({\mathcal {O}})\)-cylindrical Wiener processes, that is,
$$\begin{aligned} W = \sum _{j=1}^\infty \mu _j e_j \beta _j\,, \end{aligned}$$with
$$\begin{aligned} \sum _{j=1}^\infty \mu _j^2 |e_j|^2_{L^\infty ({\mathcal {O}})} < \infty \,, \end{aligned}$$see, [13, p. 22].
Then, we can state the notion of solution to Eq. (3) that we will consider in subsequent analysis, see [13, p. 50].
Definition 1.1
Let \(x \in H^{-1}\), we say that the process
is a solution to (3) if \(X(t): [0,T] \rightarrow H^{-1}\) is \({\mathbb {P}}\)-a.s. continuous and \(\forall \) \(t \in [0,T]\)
2 Existence for the State Equation
The main problem in proving existence and uniqueness for a solution to Eq. (3) is that the operator G in not m-accretive on the space \(H^{-1}\) and so basic existence results in [11, 13] are not applicable in the present case. It turns out that the proper space one has to consider to successfully treat Eq. (3) is the space \(L^1\), which, in turn, is not the proper one if one has to deal with SPDEs such as (3).
To overcome such a stalemate, we follow [9, 10]. In particular, we apply the transformation \(X=e^Wy\), which allows to reduce the stochastic equation (3) to a random PDE that can be treated with analytical techniques. In fact, the random equation can be successfully solved by exploiting the theory of nonlinear semigroup in \(L^1\). As noted in [10], we have still to face the problem that, because of the non regularity of the term W, the general theory cannot be applied straightforward to the resulting random PDE. Therefore, for \(\epsilon >0\), we shall consider a suitable sequence of regular approximations \(W_\epsilon \) of W, to first establish a priori estimates for solutions \(y_\epsilon \) of the associated \(W_\epsilon \)-approximating problem, and then to show that, in the limit \(\epsilon \rightarrow 0\), we obtain both existence and uniqueness of the solution for the original equation.
The following theorem constitutes the main result existence of this section.
Theorem 2.1
Let \(x \in H^{-1}\cap L^1\) with \(\gamma (x) \in H^1_0\), then there is a unique strong solution to Eq. (3) \(X=e^Wy\) which satisfies
In order to prove Theorem 2.1 we need some auxiliary lemmas. In particular, let us then introduce the transformation
so that by an application of the Itô formula we obtain the random equation
with
Following [10], we prove the existence of a unique strong solution to Eq. (5) by first considering an approximating problem. In particular, let us denote by \(\beta ^\epsilon (t) := (\beta * \rho _\epsilon )(t)\), where \(\rho _\epsilon (t) = \frac{1}{\epsilon }\rho \left( \frac{t}{\epsilon }\right) \) is a standard mollifier and \(\rho \in C^\infty _0\), then we have that \(\beta ^\epsilon \in C^1([0,T];{\mathbb {R}})\). Setting
we thus have that \(W_\epsilon \in C^1([0,T] \times {\mathcal {O}})\). Moreover
as \(\epsilon \rightarrow 0\).
For each \(\epsilon >0\), let us thus consider the approximating equation associated to (5)
where \(G_\epsilon \) is the Yosida approximation of G, that is
Note that, \(G_\epsilon \) is monotonically non–decreasing, Lipschitzian and
uniformly on compacts.
Defining \(z_\epsilon := e^{W_\epsilon }y_\epsilon \), Eq. (6) becomes
where \(F_\epsilon := e^{-W_\epsilon } F\).
Lemma 2.2
Let \(x \in H^{-1}\cap L^1\) with \(\gamma (x) \in H^1_0\), then for each \(\epsilon >0\) Eq. (6) has a unique solution such that
Proof
Let us first prove existence and uniqueness of a solution to Eq. (8) in the space \(H^{-1}\). For a fixed \(\epsilon >0\), let us define the operator \(A:D(A)\subset H^{-1}\rightarrow H^{-1}\) as
We equip the space \(H^{-1}\) with the scalar product
where \(\left( -\varDelta \right) ^{-1} y = x\) indicates the solution to the Dirichlet problem \(-\varDelta x = y\) in \({\mathcal {O}}\), \(x \in H^1_0\).
Taking into account that \(G_\epsilon \) is Lipschitz continuous in \(L^2\) and since
is m-accreative in the space \(H^{-1}\), see, e.g., [4, p. 68], we have that, for a suitable \(\alpha = \alpha _\epsilon \), it holds
which implies \((A+\alpha I)\) to be accretive in \(H^{-1}\).
Moreover, for \(\lambda >0\) sufficiently large, we also have \({\mathcal {R}}\left( (\lambda + \alpha )I + A\right) =H^{-1}\), so that A is quasi-m-accretive. In other words, for \(f \in H^{-1}\) the equation
has a unique solution in \(z \in L^2\). Indeed, introducing the operators
and
we see that Eq. (10) can be rewritten as
Since B is m-accretive and \(\varGamma \) is m-accretive and continuous in \(L^2\), it follows, see, e.g., [4, p. 104], that \({\mathcal {R}}(\epsilon I + B + \varGamma )=L^2\), so that Eq. (11) admits a unique solution z in \(L^2\). Moreover, since \(\gamma (z) + \epsilon z \in H^1\) and the inverse map of \(z \mapsto \gamma (z) + \epsilon z\) is Lipschitz, then \(z \in D(A)\). It follows that, applying [10, Lemma A.1, Corollary A.2], see also [4, Sect. 4], \(z_\epsilon \) is a strong solution to Eq. (8) in \(W^{1;\infty }([0,T];H^{-1})\). In addition, by [10, Corollary A.2], we also have
and
so that, since \(z_\epsilon \in W^{1;\infty }([0,T];H^{-1})\), we obtain
and consequently \(z_\epsilon \in L^\infty (0,T;L^2)\). Moreover we have that
which implies that \(\gamma (z_\epsilon ) + \epsilon z_\epsilon \in L^\infty (0,T;H^1_0)\) and consequently \(z_\epsilon \in L^\infty (0,T;H^1_0)\). \(\square \)
Lemma 2.3
Taking \(x \in D(A)\), then \(y_\epsilon \in L^\infty ((0,T)\times {\mathcal {O}})\), and it holds
Proof
Let \(\alpha \in C^1([0,T])\), such that \(\alpha (0) = 0\) and \(\alpha '\ge 0\). Then, defining \(M:= (1+|x|_\infty )\), we have
with
with \(\alpha \) such that \(F_\epsilon - \alpha ' \le 0\).
Following [10, Lemma 3.3], we first assume that
then, denoting by
we have
Moreover, by Hypothesis 1, it follows that \(G_\epsilon \) is monotone, so that
and since
by [10, Lemma 3.3], we conclude that
if \({\tilde{F}}_\epsilon \le \alpha '\) a.e. in \((0,T)\times {\mathcal {O}}\). Moreover, for a suitable \(\alpha \), it also holds
and
and inequalities (12) follows.
Using the approximating scheme described in [10, Lemma 3.3], we have (12) without requiring the condition (13), and the claim follows. \(\square \)
Lemma 2.4
Let \(x \in D(A)\), then there exists an increasing function \(C:[0,\infty ) \rightarrow (0,\infty )\) such that
Proof
In what follows we will use the following
with \(j(r) = \int _0^r \gamma (s)ds\), \(r \in {\mathbb {R}}^+\).
Thus, multiplying Eq. (6) by \(\gamma (y_\epsilon )\) and integrating over \((0,t) \times {\mathcal {O}}\) we obtain
Concerning the last integral in the right hand side of Eq. (14), using Assumption 1 (i) on \(\gamma \) we obtain
Using estimate (15) it follows, recalling that \(G_\epsilon \) is the Yosida approximant of G and using the monotonicity of \(\gamma \) and \(G_\epsilon \) that
From the boundedness on F and assumption on \(\gamma \) in hypothesis 1\((i)-(ii)\), we obtain
for a positive constant C independent of \(\epsilon \).
The other terms in Eq. (14) can be studied as done in [10, Lemma 3.3], so that the claim follows by Lemma 2.3. \(\square \)
Lemma 2.5
There is a unique solution to Eq. (5) with
Moreover, the process y is \(\left( {\mathcal {F}}_t\right) _{t \ge 0}\)-adapted.
Proof
Let us first prove uniqueness. Let \(y_1\) and \(y_2\) be two solutions to Eq. (5), and let \({\bar{y}} := y_1 - y_2\). Then it holds
We can rewrite Eq. (19) as
where we have denoted for short
Multiplying Eq. (20) by \((-\varDelta )^{-1}{\bar{y}}\), we obtain
Concerning the first integral in the right hand side of Eq. (21), notice that, for \(\alpha \in [0,1]\) it holds
Moreover, since G is locally Lipschitz, we have
whereas other terms can be treated as in [10, Theorem 2.2]. So that, we have
from which it follows that \({\bar{y}} = 0\), and, by Lemma 2.4, it holds
so that, see [10, Theorem 2.2], we further have
As regard existence, by Lemmas 2.3 and 2.4, we have that \((\gamma (e^{W_\epsilon }y_\epsilon ))\) is bounded in \(L^2(0,T;H^1_0)\), \((y_\epsilon )\) is bounded in \(L^\infty (0,T;L^2) \cap L^\infty ((0,T) \times {\mathcal {O}}) \cap L^2(0,T;H^1_0)\), and \(\left( \frac{d y_\epsilon }{dt}\right) \) is bounded in \(L^2(0,T;H^{-1})\). Thus, by Aubin compactness theorem, \((y_\epsilon )\) is compact in each \(L^2(0,T;L^2({\mathcal {O}}))\). It follows that, for fixed \(\omega \in \varOmega \), along a subsequence, which we still denote by \(\{\epsilon \} \rightarrow 0\) for the sake of clarity, we have
Since the map \(z \mapsto \gamma (z)\) is maximal monotone, by (22) we have that \(\eta = \gamma (e^Wy)\).
Then, since it holds
and
then, for \(\epsilon \rightarrow 0\), we get
Thus, again from the fact that \(G:{\mathbb {R}}\rightarrow {\mathbb {R}}\) is maximal monotone it follows that it is also closed and therefore we have that \(\zeta = G(e^W y)\).
Therefore, by letting \(\epsilon \rightarrow 0\), from Eq. (6) we obtain
Then, by the uniqueness result already proved, we also have that the sequence \((y_\epsilon )\) is independent of \(\omega \in \varOmega \), implying that y is \(\left( {\mathcal {F}}_t\right) \)-adapted, ending the proof. \(\square \)
We can finally prove that it exists a unique strong solution X to Eq. (3) which satisfies
Proof of Theorem 2.1
Using [9, Lemma 8.1] we have the equivalence between the stochastic PDE 3 and the random PDE 5 via the rescaling transformation 4, so that existence and uniqueness of a solution X in the sense of Definition 1.1 follows by Lemma 2.5. \(\square \)
3 The Optimal Control Problem
In this section we will focus the attention to a controlled version of Eq. (1). We denote by \({\mathcal {X}}=L^2_{ad}\left( (0,T) \times {\mathcal {O}}\right) \) the space of all \({\mathscr {F}}_t- \)adapted processes \(u : [0,T ] \rightarrow {\mathbb {R}}^d\), and we consider the following optimal control problem
subject to \(u \in {\mathcal {U}}\) and
Here
\(M>0\) being a suitable constant, while \(v_1\), \(v_2 \in L^2(\varOmega )\) \({\mathcal {F}}_0\)-adapted and \(\alpha >0\) are given.
In what follows we are going to treat the problem (P) by a rescaling procedure which allows us to reduce it to a random optimal control problem. In the current section the following Hypothesis will be assumed to hold.
Hypothesis 2
-
(i)
\(G \in C^1({\mathbb {R}})\), \(G'\) is locally Lipschitz.
Theorem 3.1
Let Hypotheses 1 and 2 hold, then, for T sufficiently small, there exists at least one optimal pair \( \left( u^*,v^*\right) \) solution to problem (P).
Proof
As in Sect. 2, we will apply the rescaling transformation \(y := e^{-W}v\) so that the optimal control problem (P) reads,
subject to
Due to the cubic nonlinear term G, existence and uniqueness of an optimal control cannot be established by standard minimization arguments. In order to overcome this problem Ekeland’s variational principle can be exploited in order to obtain a nearly optimal solution to the above control problem.
Applying Ekeland’s variational principle, see, e.g. [28] or also [12, 22], there exists a sequence \(\{u_\epsilon \}\subset {\mathcal {U}}\) such that, defining for short,
it follows
or equivalently it holds
In particular, the optimal pair \((y_\epsilon ,u_\epsilon )\) solves
A variational argument alike the one carried out in [12, 22], allow us to associate the the optimal control problem (26) the dual backward problem
where \(|\theta _\epsilon |_{L^2(\varOmega \times {\mathcal {O}}\times (0,T))}\le \sqrt{\epsilon }\). Indeed, by (26), it follows \(\varPsi '(u_\epsilon ) = \theta _\epsilon \), yielding (29). Existence and uniqueness for a solution to Eq. (25) follows from Lemma 2.5 whereas standard arguments yield, using assumption 2, that it exists a unique solution \(p_\epsilon \in L^\infty ((0,T) \times {\mathcal {O}}\times \varOmega )\), see [5].
By Eq. (29) we have \({\mathbb {P}}-a.s.\),
Therefore multiplying Eq. (30) by \((y_\epsilon - y_\lambda )\) and integrating over \({\mathcal {O}}\), we therefore obtain in virtue of Lemma 2.4,
The last two terms in Eq. (31) can be treated exploiting the Young inequality, yielding
as to obtain
Applying the Gronwall lemma and taking the mean value, we obtain using assumption 1 on \(\gamma \)
Regarding the second equation in (29), we obtain
Then, multiplying Eq. (35) by \((p_\epsilon - p_\lambda )\) and integrating over \({\mathcal {O}}\), we obtain
Rearranging terms above, we further have
Taking into account that \(u \in {\mathcal {U}}\), it follows exploiting Lemma 2.3 and using the Young inequality,
where we have used C to denote possibly different constants as to simplify notation.
Taking the expectation in Eq. (37) and combining it with Eq. (34), we thus have
so that, if T is small enough, we can infer that
implying that \((y_\epsilon ,p_\epsilon )\) is a Cauchy sequence, therefore ,along a subsequence still denoted by \(\{\epsilon \} \rightarrow 0\) for the sake of clarity, we have
Further, arguments analogous to the ones used in the proof of Lemma 2.5, it follows from the fact that G is maximal monotone, that for \(\epsilon \rightarrow 0\), we get
Letting then \(\epsilon \rightarrow 0\) in the first equation in (29), we have, denoting by \(y^*\) the solution corresponding to the optimal control \(u^*\),
hence, since \(\varPsi \) is lower–semicontinuous, previous computations give us:
and the claimed existence result follows. \(\square \)
Theorem 3.2
(Necessary condition of optimality) Let be \((v^*,u^*)\) an optimal pair for problem (P), then if \(\alpha > 0\) it holds
where p is the solution to the dual backward Eq. (45) and
Remark 3.1
We would like to underline that in literature about stochastic control problem, the first order conditions of optimality (the Pontryagin maximum principle) are expressed in terms of dual stochastic backward equation, see, e.g., [12, 17]. Here, instead, optimality conditions are given in terms of a random backward dual equation which allows to simplify the setting also giving more insights on the derived optimal controller.
Proof
We provide the result exploiting the rescaling transformation \(y := e^{-W}X\), hence proving necessary condition for the problem (P2).
Let \((y^*,u^*)\) be an optimal pair for problem (P2), therefore we have that for any \(u \in {\mathcal {U}}\), defining \(u^\lambda := u^* + \lambda {\bar{u}} = u^* + \lambda (u - u^*)\), \(\lambda \ge 0\), by the optimality of \(u^*\) it must hold,
By the Gâteaux differentiability of \(\varPsi \) it follows, taking the limit as \(\lambda \rightarrow 0\),
being z the solution to the system in variation defined as
Therefore, introducing the backward dual system
and exploiting Eqs. (44) and (45) together with Eq. (43), we have
which gives
where \(P_U\) is the projection operator defined in (42). \(\square \)
Theorem 3.3
(The bang–bang principle) Let be \((v^*,u^*)\) an optimal pair for problem (P) and let \(\alpha = 0\), then it holds
where p is the solution to the dual backward Eq. (45).
Proof
Proceeding as in Theorem 3.2 with obtain the equivalent of Eq. (46) to be
which yields Eq. (47), and the claim follows. \(\square \)
Remark 3.2
By (47) it follows that if \(\mid v^* -v_1 \mid >0 \) a.e. on \((0,T) \times {\mathcal {O}}\times \varOmega \) then the optimal controller \(u^*\) is a bang-bang controller, namely \(\mid u^* \mid =M\) a.e. on \((0,T) \times {\mathcal {O}}\times \varOmega \).
References
Albeverio, S., Di Persio, L.: Some stochastic dynamical models in neurobiology: recent developments. Eur. Commun. Math. Theor. Biol. 14, 44–53 (2011)
Albeverio, S., Di Persio, L., Mastrogiacomo, E.: Small noise asymptotic expansions for stochastic PDE’s. I. The case of a dissipative polynomially bounded non linearity. Tohoku Math. J. 63(4), 877–898 (2011)
Barbu, V.: Analysis and Control of Nonlinear Infinite-Dimensional Systems. Mathematics in Science and Engineering, 190. Academic Press Inc., Boston (1993)
Barbu, V.: Nonlinear Differential Equations of Monotone Types in Banach Spaces. Springer Monographs in Mathematics. Springer, New York (2010)
Barbu, V.: Mathematical Methods in Optimization of Differential Systems, vol. 310. Springer Science & Business Media, New York (2012)
Barbu, V., Iannelli, M.: Optimal control of population dynamics. J. Optim. Theory Appl. 102(1), 1–14 (1999)
Barbu, V., Röckner, M.: On a random scaled porous media equation. J. Differ. Equ. 251(9), 2494–2514 (2011)
Barbu, V., Precupanu, T.: Convexity and Optimization in Banach Spaces. Springer Science & Business Media, New York (2012)
Barbu, V., Röckner, M.: An operatorial approach to stochastic partial differential equations driven by linear multiplicative noise. J. Eur. Math. Soc. 17–7, 1789–1815 (2015)
Barbu, V., Röckner, M.: Nonlinear Fokker–Planck equations driven by Gaussian linear multiplicative noise. J. Differ. Equ. 265(10), 4993–5030 (2018)
Barbu, V., Da Prato, G., Röckner, M.: Existence of strong solutions for stochastic porous media equation under general monotonicity conditions. Ann. Probab. 37(2), 428–452 (2009)
Barbu, V., Cordoni, F., Di Persio, L.: Optimal control of stochastic FitzHugh–Nagumo equation. Int. J. Control 89, 1–25 (2015)
Barbu, V., Da Prato, G., Röckner, M.: Stochastic Porous Media Equations, vol. 2163. Springer, New York (2016)
Benilan, P., Brezis, H., Crandall, M.G.: A semilinear equation in \(L^1 ({\mathbb{R}}^N)\). Annali della Scuola Normale Superiore di Pisa - Classe di Scienze 2(4), 523–555 (1975)
Bonaccorsi, S., Marinelli, C., Ziglio, G.: Stochastic FitzHugh–Nagumo equations on networks with impulsive noise. Electron. J. Probab. 13, 1362–1379 (2008)
Bonaccorsi, S., Mastrogiacomo, E.: Analysis of the Stochastic FitzHugh–Nagumo system. Inf. Dim. Anal. Quantum Probab. Relat. Top. 11(3), 427–446 (2008)
Breiten, T., Kunisch, K.: Riccati-based feedback control of the monodomain equations with the Fitzhugh–Nagumo model. SIAM J. Control Optim. 52(6), 4057–4081 (2014)
Casas, E., Ryll, C., Tröltzsch, F.: Sparse optimal control of the Schlögl and FitzHugh–Nagumo systems. Comput. Methods Appl. Math. 13(4), 415–442 (2013)
Cordoni, F., Di Persio, L.: Small noise asymptotic expansion for the infinite dimensional Van der Pol oscillator. Int. J. Math. Models Method Appl. Sci. 9, 202–210 (2015)
Cordoni, F., Di Persio, L.: Stochastic reaction–diffusion equations on networks with dynamic time-delayed boundary conditions. J. Math. Anal. Appl. 451(1), 583–603 (2017)
Cordoni, F., Di Persio, L.: Gaussian estimates on networks with dynamic stochastic boundary conditions. Infinite Dimens. Anal. Quant. Probab. Relat. Top. 20(01), 1750001 (2017)
Cordoni, F., Di Persio, L.: Optimal control for the stochastic FitzHugh–Nagumo model with recovery variable. Evol. Equ. Control Theory 7(4), 571–585 (2018)
Da Prato, G.: Kolmogorov Equations for Stochastic PDEs. Advanced Courses in Mathematics. CRM Barcelona. Birkhäuser Verlag, Basel (2004)
Da Prato, G., Zabczyk, J.: Ergodicity for Infinite-Dimensional Systems. London Mathematical Society Lecture Note Series, 229. Cambridge University Press, Cambridge (1996)
Da Prato, G., Zabczyk, J.: Second Order Partial Differential Equations in Hilbert Spaces. London Mathematical Society Lecture Note Series, 293. Cambridge University Press, Cambridge (2012)
Da Prato, G., Zabczyk, J.: Stochastic Equations in Infinite Dimensions, vol. 152. Cambridge University Press, Cambridge (2014)
Diehl, H.W.: The theory of boundary critical phenomena. Int. J. Mod. Phys. B 11(30), 3503–3523 (1997)
Ekeland, I.: On the variational principle. J. Math. Anal. Appl. 47(2), 324–353 (1974)
Fuhrman, M., Orrieri, C.: Stochastic maximum principle for optimal control of a class of nonlinear SPDEs with dissipative drift. SIAM J. Control Optim. 54(1), 341–371 (2016)
Fuhrman, M., Tessitore, G.: Nonlinear Kolmogorov equations in infinite dimensional spaces: the backward stochastic differential equations approach and applications to optimal control. Ann. Probab. 30(3), 1397–1465 (2002)
FitzHugh, R.: Impulses and physiological states in theoretical models of nerve membrane. Biophys. J. 1, 445–466 (1961)
Hodgkin, A.L., Huxley, A.F.: A quantitative description of membrane current and its application to conduction and excitation in nerve. J. Physiol. 117(4), 500–544 (1952)
Iqbal, N., Ranchao, W., Liu, B.: Pattern formation by super-diffusion in FitzHugh–Nagumo model. Appl. Math. Comput. 313, 245–258 (2017)
Marinelli, C., Di Persio, L., Ziglio, G.: Approximation and convergence of solutions to semilinear stochastic evolution equations with jumps. J. Funct. Anal. 264(12), 2784–2816 (2013)
Nagumo, J., Arimoto, S., Yoshizawa, S.: An active pulse transmission line simulating nerve axon. Proc. Inst. Radio Eng. 50(10), 2061–2070 (1962)
Pankratov, L.: Homogenization of the ginzburg-landau heat flow equation in a porous medium. Appl. Anal. 69(1–2), 31–45 (1998)
Pitchaiah, S., Armaou, A.: Output feedback control of the FitzHugh-Nagumo equation using adaptive model reduction. In: Proceedings of the 49th IEEE Conference on Decision and Control. Atlanta, GA (2010)
Ringkvist, M., Zhou, Y.: On the dynamical behaviour of FitzHugh–Nagumo systems: revisited. Nonlinear Anal. Theory Methods Appl. 71(7–8), 2667–2687 (2009)
Scarpa, L.: Optimal distributed control of a stochastic Cahn–Hilliard Equation. SIAM J. Control Opti 57(5), 3571–3602 (2019)
Siddheshwar, P.G., Vanishree, R.K.: Lorenz and Ginzburg–Landau equations for thermal convection in a high-porosity medium with heat source. Ain Shams Eng. J. (2016). https://doi.org/10.1016/j.asej.2016.11.007
Tuckwell, H.C.: Random perturbations of the reduced Fitzhugh–Nagumo equation. Phys. Scr. 46(6), 481 (1992)
Funding
Open Access funding provided by Universitá degli Studi di Verona
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Cordoni, F., Persio, L.D. Optimal Control of the FitzHugh–Nagumo Stochastic Model with Nonlinear Diffusion. Appl Math Optim 84, 2947–2968 (2021). https://doi.org/10.1007/s00245-020-09735-7
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00245-020-09735-7