Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Problem Formulation

Consider a nonlinear time-varying system

$$\displaystyle{ \dot{x} = f(x,u,t),\quad x \in {R}^{n},\quad u \in {R}^{m},\quad t \in R,\quad x(0) = x_{ 0}, }$$
(1)

with control goal

$$\displaystyle{ J\left (u(\cdot )\right ) =\limsup _{t\rightarrow +\infty }Q\left (x(t,u(\cdot ),x_{0}),t\right ),\quad J\left (u(\cdot )\right ) \rightarrow \min, }$$
(2)

and constraints

$$\displaystyle{ \forall t \geq 0: \quad B_{k}\left (x(t,u(\cdot ),x_{0}),t\right ) > 0,\quad k = 1,\ldots,\mu, }$$
(3)

here x—state, u—control, t—time, x 0—initial condition, Q(⋅ ), B k (⋅ ): R n × R → R—some functions, x(t, u(⋅ ), x 0)—solutions of the system (1) with control u(⋅ ) and initial condition x 0.

2 Speed-Gradient Method

In order to design control algorithm the scalar function w(x, u, t) is calculated that is the speed of changing Q(x, t) along trajectories x(t) of (1)

$$\displaystyle{ w(x,u,t) = \frac{\partial Q(x,t)} {\partial t} + \frac{\partial Q} {\partial x} f(x,u,t). }$$
(4)

Then it is needed to evaluate the gradient of w(x, u, t) with respect to input variables

$$\displaystyle{ \nabla _{u}w(x,u,t) = \nabla _{u}\frac{\partial Q} {\partial x} f(x,u,t). }$$
(5)

Finally the algorithm of changing u(t) is determinated according to the differential equation (differential form)

$$\displaystyle{ \dot{u} = -\varGamma \nabla _{u}w(x,u,t),\quad u(0) = u_{0} }$$
(6)

or to the algebraic equation (finite form)

$$\displaystyle{ u = u_{0} -\varGamma \nabla _{u}w(x,u,t), }$$
(7)

where Γ = Γ T > 0 is the positive definite gain matrix, u 0 is some initial value of control algorithm. It can be also introduced a speed-pseudogradient algorithm

$$\displaystyle{ u = u_{0} -\varGamma \psi (x,u,t), }$$
(8)

where ψ(x, u, t) satisfies the pseudogradient condition

$$\displaystyle{ \psi {(x,u,t)}^{T}\nabla _{ u}w(x,u,t) \geq 0. }$$
(9)

The algorithm (6) is called speed-gradient algorithm [1], since it suggests to change u(t) proportionally to the gradient of the speed of changing \(Q\left (x(t),t\right )\).

2.1 Universal Speed-Gradient Method

Consider the Taylor approximation for \(w(t) = w\left (x(t),u(t),t\right )\), where \(\left (x(t),u(t)\right )\) is a trajectory of the system (1)

$$\displaystyle\begin{array}{rcl} w(t+\tau )& =& w\left (x(t),u(t),t\right ) + \\ & \quad +& \left (\frac{\partial w} {\partial t} \left (x(t),u(t),t\right ) + \frac{\partial w} {\partial x} f\left (x(t),u(t),t\right ) + \frac{\partial w} {\partial u} \dot{u}\right )\tau + o(\tau ).\quad {}\end{array}$$
(10)

If w(x, u, t) is non-positive, then (with some additional assumptions) according to a La-Salle principle the control goal (2) is fulfilled.

Consider

$$\displaystyle{ \dot{u} = -\gamma (x,u,t)\psi (x,u,t), }$$
(11)

where \(\psi: {R}^{n} \times {R}^{m} \times R \rightarrow {R}^{m}\), γ: R n × R m × R → R and

$$\displaystyle{ \gamma (x,u,t)\frac{\partial w} {\partial u} \psi (x,u,t) > \frac{\partial w(x,u,t)} {\partial t} + \frac{\partial w} {\partial x} f(x,u,t). }$$
(12)

For example it can be used ψ(⋅ ) = ∇ u w(⋅ ) and

$$\displaystyle{ \gamma (x,u,t) = \frac{\;\eta (x,u,t) +\lambda \sqrt{\eta {(x, u, t)}^{2 } +\zeta {(x, u, t)}^{2}}\;} {\zeta (x,u,t)} \;,\quad \lambda > 0, }$$
(13)

with

$$\displaystyle{ \eta (x,u,t) = \frac{\partial w(x,u,t)} {\partial t} + \frac{\partial w} {\partial x} f(x,u,t)\;,\quad \zeta (x,u,t) = \frac{\partial w} {\partial u} \psi (x,u,t). }$$
(14)

For affine systems the same algorithm was proposed by Sontag in 1989 [2].

The control algorithm (11) with the inequality (12) we named the “Universal speed-gradient method”.

Theorem 1.

Let the following assumptions be valid:

  1. 1.

    \(w\left (x_{{\ast}}(t),u_{{\ast}}(t),t\right )\) is a twice continuously differentiable function along the trajectories \(\left (x_{{\ast}}(t),u_{{\ast}}(t)\right )\) of system (1), (11);

  2. 2.

    The function Q(x,t) is nonnegative, uniformly continuous in any set of the form {(x,t): ||x|| < β,t ≥ 0} and radially unbounded;

  3. 3.

    For initial condition the inequality \(w\left (x(0),u(0),0\right ) \leq 0\) is true;

  4. 4.

    Inequality (12) is true for all (x,u,t): w(x,u,t) = 0, Q(x,t) ≠ 0;

  5. 5.

    Control (11) is a continuous in (x,u) function;

then any solution \(\left (x(t),u(t)\right )\) of (1), (11) is bounded and \(\lim \limits _{t\rightarrow +\infty }\frac{d} {\mathit{dt}}Q\left (x_{{\ast}}(t),t\right ) = 0.\)

Proof.

The Taylor approximation (10) is true according to assumption 1. According to assumptions 3, 4 inequality \(\dot{Q}\left (x(t),t\right ) < 0\) is true for all (x, u, t): Q(x, t) ≠ 0. Consequently, from assumption 2 follows that any solution \(\left (x(t),u(t)\right )\) of (1) and (11) is bounded and \(\lim \limits _{t\rightarrow +\infty }\frac{d} {\mathit{dt}}Q\left (x_{{\ast}}(t),t\right ) = 0.\) □ 

2.2 Universal Speed-Gradient Method with Constraints

Consider the derivative of constraints (3) along the trajectories of the system (1)

$$\displaystyle{ \frac{\partial B_{k}} {\partial t} (x,u,t) + \frac{\partial B_{k}} {\partial x} f(x,u,t) + \frac{\partial B_{k}} {\partial u} \dot{u} > 0\;,\quad k = 1,\ldots,\mu. }$$
(15)

According to (11) consider the following inequalities

$$\displaystyle{ \gamma (x,u,t)\frac{\partial B_{k}} {\partial u} \psi (x,u,t) > -\frac{\partial B_{k}} {\partial t} (x,u,t) -\frac{\partial B_{k}} {\partial x} f(x,u,t),\quad k = 1,\ldots,\mu.\quad }$$
(16)

The control algorithm (11) with the inequalities (12) and (16) we named the “Universal speed-gradient method with constraints”.

Theorem 2.

Let the following assumptions be valid:

  1. 1.

    \(w\left (x_{{\ast}}(t),u_{{\ast}}(t),t\right )\) is a twice continuously differentiable function along the trajectories \(\left (x_{{\ast}}(t),u_{{\ast}}(t)\right )\) of system (1), (11);

  2. 2.

    The function Q(x,t) is nonnegative, uniformly continuous in any set of the form {(x,t): ||x|| < β,t ≥ 0} and radially unbounded;

  3. 3.

    For initial condition inequalities \(w\left (x(0),u(0),0\right ) \leq 0\) and \(g\left (x(0),u(0),0\right ) > 0\) are true;

  4. 4.

    Inequalities (12), (16) are true for all (x,u,t): w(x,u,t) = 0, Q(x,t) ≠ 0;

  5. 5.

    Control (11)is a continuous in (x,u) function;

then any solution \(\left (x(t),u(t)\right )\) of (1), (11) is bounded, the constraint (3) fulfilled and \(\lim \limits _{t\rightarrow +\infty } \frac{d} {\mathit{dt}}Q\left (x_{{\ast}}(t),t\right ) = 0.\)

Proof.

The Taylor approximation (10) is true according to assumption 1. According to assumptions 3, 4 inequality \(\dot{Q}\left (x(t),t\right ) < 0\) is true for all (x, u, t): Q(x, t) ≠ 0. Consequently, from assumption 2 follows that any solution \(\left (x(t),u(t)\right )\) of (1) and (11) is bounded and \(\lim \limits _{t\rightarrow +\infty }\frac{d} {\mathit{dt}}Q\left (x_{{\ast}}(t),t\right ) = 0.\) From the assumptions 3, 4 follows that constraints are fulfilled for any solution \(\left (x(t),u(t)\right )\) of (1) and (11). □ 

3 Two Pendulums Example

Fig. 1
figure 1

Two pendulums with a single control input u

Consider the system of two pendulums (Fig. 1) with a single control input

$$\displaystyle{ \left \{\begin{array}{@{}l@{}} \dot{q}_{k} = \frac{1} {{\mathit{ml}}^{2}}p_{k}\;, \\ \dot{p}_{k} = -\mathit{mgl}\sin q_{k} + ul\cos q_{k}\;,\quad k = 1,2,\end{array} \right. }$$
(17)

the Hamiltonians of pendulums are the following

$$\displaystyle{ H_{0}^{k}(p_{ k},q_{k}) = \frac{1} {{\mathit{ml}}^{2}}p_{k}^{2} + \mathit{mgl}(1 -\cos q_{ k}),\qquad k = 1,2. }$$
(18)

Consider the control goal

$$\displaystyle{ \lim _{t\rightarrow +\infty }H_{0}^{1}\left (p_{ 1}(t),q_{1}(t)\right ) = E_{1}. }$$
(19)

with phase constraints

$$\displaystyle{ H_{0}^{2}\left (p_{ 2}(t),q_{2}(t)\right ) < E_{2}\;,\quad t \geq 0. }$$
(20)

According to the Speed-gradient approach we obtained the following control function

$$\displaystyle{ u(p,q) = -\varGamma \left (\frac{p_{1}\cos q_{1}} {\mathit{ml}} \left (H_{0}^{1}(p_{ 1},q_{1}) - E_{1}\right ) +\alpha \frac{p_{2}\cos q_{2}} {\;\mathit{ml}{\left (H_{0}^{2}(p_{2},q_{2}) - E_{2}\right )}^{2}\;}\right ). }$$
(21)

To demonstrate the ability of the controller to achieve the control goal and to fulfill the phase constraints we carried out computer simulation. The following value of system parameters and initial conditions were chosen: m = 1, l = 1, g = 10, q 1(0) = 0, q 2(0) = 0. 05, p 1(0) = 0, p 2(0) = 0. Energy goal value for the first pendulum was taken E 1 = 8, energy constraint for the second one was taken E 2 = 5. Algorithm parameters were: Γ = 0. 015, α = 10. Time for simulating was 80 s. Simulations shows that proposed algorithm solve the control problem: energy of the first pendulum converged to the goal value E 1 and the energy of the second was constrained by E 2. The simulating results are presented in (Figs. 2 and 3). The complete analysis of this control system was presented in [3].

Fig. 2
figure 2

Energy of pendulums. Solid line corresponds to the energy of the first pendulum \(H_{0}^{1}\left (q_{1}(t),p_{1}(t)\right )\), dash line—to the energy of the second one \(H_{0}^{2}\left (q_{2}(t),p_{2}(t)\right )\)

Fig. 3
figure 3

Control function: \(u(t)\,=\,u\left (q_{1}(t),q_{2}(t),p_{1}(t),p_{2}(t)\right )\)

4 Molecular Example

Consider a quantum model for diatomic molecule, described by the Schrodinger equation with control [46]

$$\displaystyle{ i\hslash \frac{\partial \varPsi (r,t)} {\partial t} = H_{0}\varPsi (r,t) + f(u)H_{1}\varPsi (r,t), }$$
(22)

where

$$\displaystyle{ H_{0} = - \frac{{\hslash }^{2}} {2M} \frac{{\partial }^{2}} {\partial {r}^{2}} + V (r)\;,\quad H_{1} = A\mu (r), }$$
(23)

and Morse potential

$$\displaystyle{ V (r) = D{\left (\mathit{exp}\left (-\alpha \frac{\;r - r_{0}\;} {r_{0}} \right ) - 1\right )}^{2} - D, }$$
(24)

here \(i = \sqrt{-1}\), \(\hslash = 1\)—Planck constant, Ψ(t, r)—wave function, r—distance between nuclei of the molecule, M—reduce mass of the molecule, α—parameter of a nonlinearity, μ(r)—molecular dipole momentum, D—dissociation energy, r 0—distance of equilibrium, u—control function of electromagnetic field, f(⋅ )—some function. All parameters are in the atomic Hartree unit system.

The problem is to design the control function u(t) to stabilize the average energy on the goal value:

$$\displaystyle{ \lim _{t\rightarrow +\infty }\phi {(t)}^{{\ast}}H_{ 0}\phi (t) = E_{{\ast}}\;. }$$
(25)

All the following calculations are made for a finite-level approximation obtained by a Bubnov-Galerkin method.

According to a speed-gradient method the following goal function is introduced

$$\displaystyle{ Q(\phi ) = {{(\phi }^{{\ast}}H_{ 0}\phi - E_{{\ast}})}^{2} }$$
(26)

and w(ϕ, u, t) is calculated

$$\displaystyle{ w(\phi,u,t) =\dot{ Q}(\phi ) = 2 \frac{i} {\hslash }{{(\phi }^{{\ast}}H_{ 0}\phi - E_{{\ast}})\phi }^{{\ast}}[H_{ 1},H_{0}]\phi f(u)\;. }$$
(27)

A Tailor approximation for \(w(t) = w\left (\phi (t),u(t),t\right )\) is the following

$$\displaystyle{ w(t+\tau ) = g_{0}(\phi,u) + \left (g_{1}(\phi,u) + g_{2}(\phi,u)\dot{u}\right )\tau + o(\tau )\;, }$$
(28)

where

$$\displaystyle\begin{array}{rcl} g_{0}(\phi,u)& =& \frac{i} {\hslash }{{(\phi }^{{\ast}}H_{ 0}\phi - E_{{\ast}})\phi }^{{\ast}}[H_{ 1},H_{0}]\phi f(u)\;,{}\end{array}$$
(29)
$$\displaystyle\begin{array}{rcl} g_{1}(\phi,u)& =& \frac{i} {\hslash }f(u) \frac{d} {\mathit{dt}}\left ({{(\phi }^{{\ast}}H_{ 0}\phi - E_{{\ast}})\phi }^{{\ast}}[H_{ 1},H_{0}]\phi \right )\;,{}\end{array}$$
(30)
$$\displaystyle\begin{array}{rcl} g_{2}(\phi,u)& =& \frac{i} {\hslash }{{(\phi }^{{\ast}}H_{ 0}\phi - E_{{\ast}})\phi }^{{\ast}}[H_{ 1},H_{0}]\phi \frac{d} {du}f(u)\;.{}\end{array}$$
(31)

According to a universal speed-gradient method the following algorithm was obtained

$$\displaystyle{ \dot{u} = \mathit{sat}\left (\frac{\;-g_{2}{(\phi,u)}^{2} - g_{1}(\phi,u)\;} {g_{2}(\phi,u)} \right ),\qquad u(0) = 0. }$$
(32)

For computer simulation we used the parameters of iodine molecule J 127 J 127: M = 114, 842, α = 4. 954, D = 0. 0572, r 0 = 5. 0366, with control function in the following form f(u) = 0. 02sin(u). Computer simulations shows that the energy converged to the goal value. The simulation results presented in Figs. 4 and 5.

Fig. 4
figure 4

Average energy: ϕ(t) H 0 ϕ(t), duration: 50 fs (10−15 s), goal energy value is E  = 0. 043

Fig. 5
figure 5

Control function \(f\left (u(t)\right ) = A\sin \left (u(t)\right )\), duration: 50 fs (10−15 s)

5 Conclusion

A new version of speed-gradient method is proposed that generates “universal” control algorithms both for differential and for finite form. Efficiency of this “universal” method is illustrated by computer simulation for energy control of quantum diatomic molecule. A speed-gradient method for control problems with phase constraints is also proposed and its efficiency is illustrated by computer simulation for selective energy control of two pendulums.