Abstract
We address the generalized variational problem of Herglotz from an optimal control point of view. Using the theory of optimal control, we derive a generalized Euler–Lagrange equation, a transversality condition, a DuBois–Reymond necessary optimality condition and Noether’s theorem for Herglotz’s fundamental problem, valid for piecewise smooth functions.
Part of first author’s Ph.D. project, which is carried out under the Doctoral Programme in Mathematics (PDMat) of University of Aveiro.
Access provided by Autonomous University of Puebla. Download conference paper PDF
Similar content being viewed by others
Keywords
- Herglotz’s variational problems
- Optimal control
- Euler–Lagrange equations
- Invariance
- Dubois–Reymond condition
- Noether’s theorem
1 Introduction
The generalized variational problem proposed by Herglotz in 1930 [3, 4] can be formulated as follows:
It consists in the determination of trajectories \(x(\cdot )\) and corresponding trajectories \(z(\cdot )\) that extremize (maximize or minimize) the value z(b), where \(L \in C^1([a,b]\times \mathbb {R}^{2n}\times \mathbb {R};\mathbb {R})\). While in [3, 4, 6] the admissible functions are \(x(\cdot ) \in C^2([a,b];\mathbb {R}^n)\) and \(z(\cdot ) \in C^1([a,b];\mathbb {R})\), here we consider (\(P_{H}\)) in the wider class of functions \(x(\cdot ) \in PC^1([a,b];\mathbb {R}^n)\) and \(z(\cdot ) \in PC^1([a,b];\mathbb {R})\).
It is obvious that Herglotz’s problem (\(P_{H}\)) reduces to the classical fundamental problem of the calculus of variations (see, e.g., [13]) if the Lagrangian L does not depend on the z variable: if \(\dot{z}(t)=L(t,x(t),\dot{x}(t))\), \(t \in [a,b]\), then (\(P_{H}\)) is equivalent to the classical variational problem
Herglotz proved that an Euler–Lagrange optimality condition for a pair \(\left( x(\cdot ),z(\cdot )\right) \) to be an extremizer of the generalized variational problem (\(P_{H}\)) is given by
\(t \in [a,b]\). The Eq. (2) is known as the generalized Euler–Lagrange equation. Observe that for the fundamental problem of the calculus of variations (1) one has \(\frac{\partial L}{\partial z}=0\) and the differential Eq. (2) reduces to the classical Euler–Lagrange equation
Since the celebrated work [5] by Pontryagin et al., the calculus of variations is seen as part of optimal control. One of the simplest problems of optimal control, in Bolza form, is the following one:
where \(f \in C^1([a,b]\times \mathbb {R}^{n}\times {\varOmega };\mathbb {R})\), \(\phi \in C^1(\mathbb {R}^{n};\mathbb {R})\), \(g \in C^1([a,b]\times \mathbb {R}^{n}\times {\varOmega };\mathbb {R}^n)\), \(x \in PC^1([a,b]; \mathbb {R}^n)\) and \(u\in PC([a,b];{\varOmega })\), with \({\varOmega }\subseteq \mathbb {R}^r\) an open set. In the literature of optimal control, x and u are called the state and control variables, respectively, while \(\phi \) is known as the payoff or salvage term. Note that the classical problem of the calculus of variations (1) is a particular case of problem (P) with \(\phi (x) \equiv 0\), \(g(t,x,u)=u\) and \({\varOmega }=\mathbb {R}^n\). In this work we show how the results on Herglotz’s problem of the calculus of variations (\(P_{H}\)) obtained in [2, 6] can be generalized by using the theory of optimal control. The main idea is simple and consists in rewriting the generalized variational problem of Herglotz (\(P_{H}\)) as a standard optimal control problem (P), and then to apply available results of optimal control theory.
The paper is organized as follows. In Sect. 2 we briefly review the necessary concepts and results from optimal control theory. In particular, we make use of Pontryagin’s maximum principle (Theorem 1); the DuBois–Reymond condition of optimal control (Theorem 2); and the Noether theorem of optimal control proved in [8] (cf. Theorem 3). Our contributions are then given in Sect. 3: we generalize the Euler–Lagrange equation and the transversality condition for problem (\(P_{H}\)) found in [6] to admissible functions \(x(\cdot ) \in PC^1([a,b];\mathbb {R}^n)\) and \(z(\cdot ) \in PC^1([a,b];\mathbb {R})\) (Theorem 4); we obtain a DuBois–Reymond necessary optimality condition for problem (\(P_{H}\)) (Theorem 5); and a generalization of the Noether theorem [2] (Theorem 6) as a corollary of the optimal control results of Torres [7–9]. We end with Sect. 4 of conclusions and future work.
2 Preliminaries
The central result in optimal control theory is given by Pontryagin’s maximum principle, which is a first-order necessary optimality condition.
Theorem 1
(Pontryagin’s Maximum Principle for Problem (P) [5]). If a pair \((x(\cdot ),u(\cdot ))\) with \(x \in PC^1([a,b]; \mathbb {R}^n)\) and \(u\in PC([a,b];{\varOmega })\) is a solution to problem (P), then there exists \(\psi \in PC^1([a,b];\mathbb {R}^n)\) such that the following conditions hold:
-
the optimality condition
$$\begin{aligned} \frac{\partial H}{\partial u}(t, x(t),u(t), \psi (t))=0; \end{aligned}$$(3) -
the adjoint system
$$\begin{aligned} {\left\{ \begin{array}{ll} \dot{x}(t)=\frac{\partial H}{\partial \psi }(t, x(t),u(t), \psi (t))\\ \dot{\psi }(t)=-\frac{\partial H}{\partial x}(t, x(t),u(t), \psi (t)); \end{array}\right. } \end{aligned}$$(4) -
and the transversality condition
$$\begin{aligned} \psi (b)=\nabla \phi (x(b)); \end{aligned}$$(5)
where the Hamiltonian H is defined by
Definition 1
(Pontryagin Extremal to (P)). A triplet \((x(\cdot ),u(\cdot ), \psi (\cdot ))\) with \(x \in PC^1([a,b]; \mathbb {R}^n)\), \(u\in PC([a,b];{\varOmega })\) and \(\psi \in PC^1([a,b];\mathbb {R}^n)\) is called a Pontryagin extremal to problem (P) if it satisfies the optimality condition (3), the adjoint system (4) and the transversality condition (5).
Theorem 2
(DuBois–Reymond Condition of Optimal Control [5]). If \((x(\cdot ),u(\cdot ), \psi (\cdot ))\) is a Pontryagin extremal to problem (P), then the Hamiltonian (6) satisfies the equality
\(t \in [a,b]\).
Noether’s theorem has become a fundamental tool of modern theoretical physics [1], the calculus of variations [10, 11], and optimal control [7–9]. It states that when an optimal control problem is invariant under a one parameter family of transformations, then there exists a corresponding conservation law: an expression that is conserved along all the Pontryagin extremals of the problem [7–9, 12]. Here we use Noether’s theorem as found in [8], which is formulated for problems of optimal control in Lagrange form, that is, for problem (P) with \(\phi \equiv 0\). In order to apply the results of [8] to the Bolza problem (P), we rewrite it in the following equivalent Lagrange form:
The notion of invariance for problem (P) is obtained by applying the notion of invariance found in [8] to the equivalent optimal control problem (7). In Definition 2 we use the little-o notation.
Definition 2
(Invariance of Problem (P)). Let \(h^s\) be a one-parameter family of \(C^1\) invertible maps
Problem (P) is said to be invariant under transformations \(h^s\) if for all \((x(\cdot ),u(\cdot ))\) the following two conditions hold:
-
(i)
$$\begin{aligned} \Big [f \circ h^s(t,x(t),u(t))+\frac{\phi (x(b))}{b-a} + \xi s + o(s&)\Big ]\frac{d\mathcal {T}^s}{dt}(t,x(t),u(t))\nonumber \\&= f(t,x(t),u(t)) + \frac{\phi (x(b))}{b-a} \end{aligned}$$(8)
for some constant \(\xi \);
-
(ii)
$$\begin{aligned} \frac{d\mathcal {X}^s}{dt}\left( t,x(t),u(t)\right) =g\circ h^s(t,x(t),u(t))\frac{d\mathcal {T}^s}{dt}(t,x(t),u(t)). \end{aligned}$$(9)
Theorem 3
(Noether’s Theorem for the Optimal Control Problem (P)). If problem (P) is invariant in the sense of Definition 2, then the quantity
is constant in t along every Pontryagin extremal \((x(\cdot ),u(\cdot ), \psi (\cdot ))\) of problem (P), where
and H is defined by (6).
Proof
The result is a simple exercise obtained by applying the Noether theorem of [8] and the Pontryagin maximum principle (Theorem 1) to the equivalent optimal control problem (7) (in particular using the adjoint equation corresponding to the multiplier associated with the state variable \(x_0\) and the respective transversality condition).
3 Main Results
We begin by introducing some basic definitions for the generalized variational problem of Herglotz (\(P_{H}\)).
Definition 3
(Admissible Pair to Problem (\(P_{H}\))). We say that \((x(\cdot ),z(\cdot ))\) with \(x(\cdot ) \in PC^1([a,b];\mathbb {R}^n)\) and \(z(\cdot ) \in PC^1([a,b];\mathbb {R})\) is an admissible pair to problem (\(P_{H}\)) if it satisfies the equation
and the initial conditions \(x(a)=\alpha \) and \(z(a)=\gamma \), \(\alpha , \gamma \in \mathbb {R}\).
Definition 4
(Extremizer to Problem (\(P_{H}\))). We say that an admissible pair \((x^*(\cdot ),z^*(\cdot ))\) is an extremizer to problem (\(P_{H}\)) if \(z(b)-z^*(b)\) has the same signal for all admissible pairs \((x(\cdot ),z(\cdot ))\) that satisfy \(\Vert z-z^* \Vert _0< \epsilon \) and \(\Vert x-x^* \Vert _0< \epsilon \) for some positive real \(\epsilon \), where \(\Vert y\Vert _0=\displaystyle \mathop {\max _{a\le t \le b}}|y(t)|\).
We now present a necessary condition for a pair \((x(\cdot ),z(\cdot ))\) to be a solution (extremizer) to problem (\(P_{H}\)). The following result generalizes [3, 4, 6] by considering a more general class of functions. To simplify notation, we use the operator \(\langle \cdot ,\cdot \rangle \) defined by
When there is no possibility of ambiguity, we sometimes suppress arguments.
Theorem 4
(Euler–Lagrange Equation and Transversality Condition for Problem (\(P_{H}\))). If \((x(\cdot ),z(\cdot ))\) is an extremizer to problem (\(P_{H}\)), then the Euler–Lagrange equation
holds, \(t \in [a,b]\). Moreover, the following transversality condition holds:
Proof
Observe that Herglotz’s problem (\(P_{H}\)) is a particular case of problem (P) obtained by considering x and z as state variables (two components of one vectorial state variable), \(\dot{x}\) as the control variable u, and by choosing \(f\equiv 0\) and \(\phi (x,z)=z\). Note that since \(x(t)\in \mathbb {R}^n\), we have \(u(t)\in \mathbb {R}^n\) (i.e., for Herglotz’s problem (\(P_{H}\)) one has \(r=n\)). In this way, the problem of Herglotz, described as an optimal control problem, takes the form
It follows from Pontryagin’s maximum principle (Theorem 1) that there exists \(\psi _x \in PC^1([a,b];\mathbb {R}^n)\) and \(\psi _z \in PC^1([a,b];\mathbb {R})\) such that the following conditions hold:
-
the optimality condition
$$\begin{aligned} \frac{\partial H}{\partial u}(t,x(t),u(t),z(t),\psi _x(t),\psi _z(t))=0; \end{aligned}$$(13) -
the adjoint system
$$\begin{aligned} {\left\{ \begin{array}{ll} \dot{x}(t)=\frac{\partial H}{\partial \psi _x}(t,x(t),u(t),z(t),\psi _x(t),\psi _z(t))\\ \dot{z}(t)=\frac{\partial H}{\partial \psi _z}(t,x(t),u(t),z(t),\psi _x(t),\psi _z(t))\\ \dot{\psi }_x(t)=-\frac{\partial H}{\partial x}(t,x(t),u(t),z(t),\psi _x(t),\psi _z(t))\\ \dot{\psi }_z(t)=-\frac{\partial H}{\partial z}(t,x(t),u(t),z(t),\psi _x(t),\psi _z(t)); \end{array}\right. } \end{aligned}$$(14) -
and the transversality conditions
$$\begin{aligned} {\left\{ \begin{array}{ll} \psi _x(b)=0,\\ \psi _z(b)=1, \end{array}\right. } \end{aligned}$$(15)
where the Hamiltonian H is defined by
Observe that the adjoint system (14) implies that
This means that \(\psi _z\) is solution of a first-order linear differential equation, which is solved using an integrand factor to find that \(\psi _z=ke^{-\int _a^t\frac{\partial L}{\partial z}d\theta }\) with k a constant. From the second transversality condition in (15), we obtain that \(k=e^{\int _a^b\frac{\partial L}{\partial z}d\theta }\) and, consequently,
The optimality condition (13) is equivalent to \(\psi _x+\psi _z\frac{\partial L}{\partial u}=0\) and, after derivation, we obtain that
Now, comparing with (16), we have
Since \(\psi _z(t)\ne 0\) for all \(t \in [a,b]\) and \(\dot{x}=u\), we obtain the Euler–Lagrange Eq. (10):
Note that from the optimality condition (13) we obtain that \(\psi _x=-\psi _z\frac{\partial L}{\partial u}=-\psi _z\frac{\partial L}{\partial \dot{x}}\), which together with transversality condition (15) for \(\psi _x\) leads to the transversality condition (11):
This concludes the proof.
Definition 5
(Extremal to Problem (\(P_{H}\))). We say that an admissible pair \((x(\cdot ), z(\cdot ))\) is an extremal to problem (\(P_{H}\)) if it satisfies the Euler–Lagrange Eq. (10) and the transversality condition (11).
Theorem 5
(DuBois–Reymond Condition for Problem (\(P_{H}\))). If \((x(\cdot ), z(\cdot ))\) is an extremal to problem (\(P_{H}\)), then
\(t \in [a,b]\), where \(\psi _z(t)=e^{\int _t^b\frac{\partial L}{\partial z}\langle x,z \rangle (\theta )d\theta }\).
Proof
The result follows from Theorem 2, rewriting problem (\(P_{H}\)) as the optimal control problem (12).
We define invariance for (\(P_{H}\)) using Definition 2 for the equivalent optimal control problem (12).
Definition 6
(Invariance of Problem (\(P_{H}\))). Let \(h^s\) be a one-parameter family of \(C^1\) invertible maps
Problem (\(P_{H}\)) is said to be invariant under the transformations \(h^s\) if for all admissible pairs \((x(\cdot ),z(\cdot ))\) the following two conditions hold:
-
(i)
$$\begin{aligned} \left( \frac{z(b)}{b-a}+\xi s + o(s)\right) \frac{d\mathcal {T}^s}{dt}\langle x,z \rangle (t) =\frac{z(b)}{b-a} \end{aligned}$$(17)
for some constant \(\xi \);
-
(ii)
$$\begin{aligned}&\frac{d \mathcal {Z}^s}{dt}\langle x,z \rangle (t)\nonumber \\&= L\left( \mathcal {T}^s\langle x,z \rangle (t),\mathcal {X}^s\langle x,z \rangle (t), \frac{d\mathcal {X}^s}{d\mathcal {T}^s}\langle x,z \rangle (t), \mathcal {Z}^s\langle x,z \rangle (t)\right) \frac{d\mathcal {T}^s}{dt}\langle x,z \rangle (t), \end{aligned}$$(18)
where
Follows the main result of the paper.
Theorem 6
(Noether’s Theorem for Problem (\(P_{H}\))). If problem (\(P_{H}\)) is invariant in the sense of Definition 6, then the quantity
is constant in t along every extremal of problem (\(P_{H}\)), where
and \(\psi _z(t)=e^{\int _t^b\frac{\partial L}{\partial z}\langle x,z \rangle (\theta )d\theta }\).
Proof
As before, we rewrite problem (\(P_{H}\)) in the equivalent optimal control form (12), where x and z are the state variables and u the control. We prove that if problem (\(P_{H}\)) is invariant in the sense of Definition 6, then (12) is invariant in the sense of Definition 2. First, observe that if Eq. (17) holds, then (8) holds for (12): here \(f \equiv 0\), \(\phi (x,z) = z\) and (8) simplifies to \(\left[ \frac{z(b)}{b-a} + \xi s + o(s)\right] \frac{d\mathcal {T}^s}{dt}\langle x,z \rangle (t) = \frac{z(b)}{b-a}\). Note that the first equation of the control system of problem (12) (\(u(t) = \dot{x}(t)\)) defines \(\mathcal {U}^s:=\frac{d\mathcal {X}^s}{d\mathcal {T}^s}\), that is,
Hence, if Eqs. (18) and (20) holds, then there is also invariance of the control system of (12) in the sense of (9) and consequently problem (12) is invariant in the sense of Definition 2. We are now in conditions to apply Theorem 3 to problem (12), which guarantees that the quantity
is constant in t along every Pontryagin extremal of problem (12), where
This means that the quantity
is constant in t along all extremals of problem (\(P_{H}\)), where
Equivalently,
is a constant along the extremals. To conclude the proof, we just need to prove that the quantity
is a constant. From the invariance condition (17) we know that
Integrating from a to t, we conclude that
Differentiating (22) with respect to s, and then putting \(s=0\), we obtain
We conclude from (23) that expression (21) is the constant \((b-a)\xi - \frac{z(b)}{b-a}T\langle x,z\rangle (a)\).
4 Conclusion
We introduced a different approach to the generalized variational principle of Herglotz, by looking to Herglotz’s problem as an optimal control problem. A Noether type theorem for Herglotz’s problem was first proved by Georgieva and Guenther in [2]: under the condition of invariance
they obtained
where \(\lambda (t)=e^{-\int _a^t\frac{\partial L}{\partial z}\langle x,z \rangle (\theta )d\theta }\), as a conserved quantity along the extremals of problem (\(P_{H}\)). Our results improve those of [2] in three ways: (i) we consider a wider class of piecewise admissible functions; (ii) we consider a more general notion of invariance whose transformations \(\mathcal {T}^s\), \(\mathcal {X}^s\) and \(\mathcal {Z}^s\) may also depend on velocities, i.e., on \(\dot{x}(t)\) (note that if (18) holds with \(\mathcal {Z}^s\langle x,z \rangle = z\), then (24) also holds); (iii) the conserved quantity (25), up to multiplication by a constant, is a particular case of (19) when there is no transformation in z (\(Z=\left. \frac{\partial \mathcal {Z}^s}{\partial s}\right| _{s=0}=0\)). The results here obtained can be generalized to higher-order variational problems of Herglotz type. This is under investigation and will be addressed elsewhere.
References
Frederico, G.S.F., Torres, D.F.M.: Fractional isoperimetric Noether’s theorem in the Riemann-Liouville sense. Rep. Math. Phys. 71(3), 291–304 (2013)
Georgieva, B., Guenther, R.: First Noether-type theorem for the generalized variational principle of Herglotz. Topol. Methods Nonlinear Anal. 20(2), 261–273 (2002)
Guenther, R.B., Guenther, C.M., Gottsch, J.A.: The Herglotz Lectures on Contact Transformations and Hamiltonian Systems. Lecture Notes in Nonlinear Analysis. Juliusz Schauder Center for Nonlinear Studies, Nicholas Copernicus University, Torún (1996)
Herglotz, G.: Berührungstransformationen. Lectures at the University of Göttingen, Göttingen (1930)
Pontryagin, L.S., Boltyanskii, V.G., Gamkrelidze, R.V., Mishchenko, E.F.: The Mathematical Theory of Optimal Processes. Interscience Publishers, London (1962)
Santos, S.P.S., Martins, N., Torres, D.F.M.: Higher-order variational problems of Herglotz type. Vietnam J. Math. 42(4), 409–419 (2014)
Torres, D.F.M.: On the Noether theorem for optimal control. Eur. J. Control 8(1), 56–63 (2002)
Torres, D.F.M.: Conservation laws in optimal control. In: Colonius, F., Grüne, L. (eds.) Dynamics, Bifurcations, and Control. Lecture Notes in Control and Information Science, vol. 273, pp. 287–296. Springer, Berlin (2002)
Torres, D.F.M.: Quasi-invariant optimal control problems. Port. Math. 61(1), 97–114 (2004). (N.S.)
Torres, D.F.M.: Carathéodory equivalence Noether theorems, and tonelli full-regularity in the calculus of variations and optimal control. J. Math. Sci. 120(1), 1032–1050 (2004). (N. Y.)
Torres, D.F.M.: Proper extensions of Noether’s symmetry theorem for nonsmooth extremals of the calculus of variations. Commun. Pure Appl. Anal. 3(3), 491–500 (2004)
Torres, D.F.M.: A Noether theorem on unimprovable conservation laws for vector-valued optimization problems in control theory. Georgian Math. J. 13(1), 173–182 (2006)
van Brunt, B.: The Calculus of Variations. Universitext, New York (2004)
Acknowledgments
This work was supported by Portuguese funds through the Center for Research and Development in Mathematics and Applications (CIDMA), within project UID/MAT/04106/2013, and the Portuguese Foundation for Science and Technology (FCT). The authors would like to thank an anonymous Reviewer for valuable comments.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Santos, S.P.S., Martins, N., Torres, D.F.M. (2015). An Optimal Control Approach to Herglotz Variational Problems. In: Plakhov, A., Tchemisova, T., Freitas, A. (eds) Optimization in the Natural Sciences. EmC-ONS 2014. Communications in Computer and Information Science, vol 499. Springer, Cham. https://doi.org/10.1007/978-3-319-20352-2_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-20352-2_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-20351-5
Online ISBN: 978-3-319-20352-2
eBook Packages: Computer ScienceComputer Science (R0)