Abstract
In this paper, we introduce Hamel’s formalism for infinite-dimensional mechanical systems and in particular consider its applications to the dynamics of nonholonomically constrained systems. This development is a nontrivial extension of its finite-dimensional counterpart. The analysis is applied to several continuum mechanical systems of interest, including coupled systems and systems with infinitely many constraints.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Hamel’s formalism is a natural extension of Euler’s ideas of using nonmaterial velocity in mechanics. Nonmaterial velocity carries information about system’s velocity, but is not the rate of change of system’s configuration with respect to time. For a system with finite number of degrees of freedom, nonmaterial velocity is a collection of velocity components relative to a set of vector fields that span the fibers of the tangent bundle of the configuration space. In the finite-dimensional setting, this development was carried out by Hamel himself in Hamel (1904). Here we introduce Hamel’s formalism for infinite-dimensional mechanical systems.
One of the reasons for using nonmaterial velocity is that the Euler–Lagrange equations are not always effective for analyzing the dynamics of a mechanical system of interest. For example, it is difficult to study the motion of the Euler top if the Euler–Lagrange equations are used to represent the dynamics. On the other hand, the use of the angular velocity components relative to a body frame as pioneered by Euler (1752) results in a much simpler representation of dynamics. In a similar fashion, Euler (1757a, (1757b, (1761) uses convective velocity to represent the dynamics of ideal incompressible fluid. Euler’s approach was further developed by Lagrange (1788) for reasonably general Lagrangians on the rotation group and by Poincaré (1901) for arbitrary Lie groups (see Marsden and Ratiu 1999 for details and history).
The nonmaterial velocity used in Lagrange (1788) and Poincaré (1901) is associated with a group action. Hamel (1904) obtained the equations of motion in terms of nonmaterial velocity that is unrelated to a group action on the configuration space. Hamel’s equations include both the Euler–Lagrange and Euler–Poincaré equations (for the rigid body for example) as special cases.
As clearly seen from his paper, Hamel was particularly motivated by nonholonomic mechanics. His formalism features the simplicity of an analytic representation of constraints and the intrinsic absence of Lagrange multipliers in equations of motion. It is exceptionally effective for studying (finite-dimensional) constrained systems and understanding their dynamics, both analytically and numerically; see e.g., Bloch et al. (2009), Ball and Zenkov (2015), Zenkov et al. (2012) and references therein.
As mentioned above, in finite dimensions nonmaterial velocity refers to velocity components relative to a set of vector fields that span the fibers of the tangent bundle of the configuration space. Alternatively, one interprets nonmaterial velocity as a result of configuration-dependent velocity substitution. In the infinite-dimensional setting, the former interpretation fails to work as generically bases fail to exist.
General properties of infinite-dimensional constrained systems are studied from the field-theoretic viewpoint in e.g., Binz et al. (2002) and Vankerschaver (2005, (2007a, (2007b). Infinite-dimensional nonholonomic mechanics, in relation to electromechanical systems, diffeomorphism groups, and optimal transport, has been utilized in Neimark and Fufaev (1972) and Khesin and Lee (2009), respectively. At the moment, the general theory of infinite-dimensional nonholonomic systems in the form of ordinary differential equations on an infinite-dimensional configuration space does not exist. As demonstrated by Ebin and Marsden (1970) (see also Ebin 2015), the use of the infinite-dimensional formalism in combination with nonmaterial (spatial) velocity is crucial for proving existence of solutions in fluid mechanics. Motivated by this and as a part of the development of Hamel’s formalism, we introduce the general theory of infinite-dimensional nonholonomic systems.
Our paper develops the general formalism of Hamel’s equations in infinite-dimensions with careful attention to the analytic setting in which the equations are well defined. We also discuss mechanical examples, including a string, a constrained string, and strings attached to rigid bodies. The differential equations describing these systems are derived and analyzed.
Having in mind possible future applications to the dynamics of systems with nonBanach configuration manifolds (such as infinite-dimensional Lie groups), we develop the formalism on convenient manifolds. These are (infinite-dimensional) manifolds modeled on convenient spaces. A convenient space is a locally convex space with ‘tweaked’ topology. By definition, the \(c^\infty \)-topology on a locally convex space E is the final topology with respect to all smooth curves \(\gamma :{\mathbb {R}} \rightarrow E\), i.e., it is the finest topology on E with respect to which the aforementioned curves are continuous. Recall that smoothness is a bornological concept and is independent of the choice of topology of a locally convex space. In general, the \(c^\infty \)-topology is finer than any locally convex topology with the same collection of bounded sets. A locally convex vector space is said to be \(c^\infty \)-complete or convenient if it is \(c^\infty \)-closed in any locally convex space. For a Fréchet space, the \(c^\infty \)-topology coincides with the given locally convex topology. In general, a convenient space is not a topological vector space.
A mapping between two convenient spaces is called smooth if it maps smooth curves to smooth curves. The important property of the smooth mappings,
is known as the Cartesian closedness. We refer the reader to Kriegl and Michor (1997) for details and proofs.
The use of such general spaces is natural in variational calculus and helps to clarify various technical assumptions one needs to impose in the infinite-dimensional setting with velocity constraints. Meanwhile, the complexity of proofs is unaffected by the use of convenient analysis. Moreover, the results remain correct after a rollback to ‘simpler’ (e.g., Fréchet or Banach) manifolds.
Based on the earlier observations of the authors in the finite-dimensional setting and recent publications on geometric continuum mechanics, we expect that the proposed formalism will be useful in:
-
Systematic derivation of simple representations of equations of motion in continuum mechanics, which includes elimination of unnecessary Lagrange multipliers from the equations of motion and singling out control directions when they are inconsistent with configuration coordinates and/or do not commute.
-
Multibody dynamics of systems with continuum components.
-
Dynamics of contacting elastic rods (Gay-Balmaz and Putkaradze 2012, 2015), molecular strands (Ellis et al. 2010), and certain non-Newtonian fluids (Gay-Balmaz and Yoshimura 2015).
-
Stability and qualitative analysis, including existence of conservation laws in constrained systems and Lyapunov function construction strategies for partial stability analysis, with applications to stability of rigid bodies with fins moving in a fluid, fluid-filled bodies, and rolling elastic bodies.
-
Development of structure-preserving integrators for constrained infinite-dimensional systems.
The paper is organized as follows: In Sect. 2 we review the finite-dimensional Euler–Lagrange, Euler–Poincaré, and Hamel equations. In Sect. 3, the infinite-dimensional Hamel formalism is introduced. In Sects. 4 and 5, systems with constraints and symmetry are treated. In particular, systems with infinitely many constraints are studied, which, in the presence of symmetry, requires a somewhat different approach than the standard formalism for systems with symmetry. Numerous illustrative examples are given.
2 Preliminaries
Lagrangian mechanics provides a systematic approach to deriving the equations of motion as well as establishes the equivalence of force balance and variational principles.
2.1 The Euler–Lagrange Equations
A Lagrangian mechanical system is specified by a smooth manifold Q called the configuration space and a function \(L:TQ\rightarrow {\mathbb {R}}\) called the Lagrangian. In many cases, the Lagrangian is the kinetic minus potential energy of the system, with the kinetic energy defined by a Riemannian metric and the potential energy being a smooth function on the configuration space Q. If necessary, nonconservative forces can be introduced (e.g., gyroscopic forces that are represented by terms in L that are linear in the velocity), but this is not discussed in detail in this paper.
In local coordinates \(q=(q^1,\ldots ,q^n)\) on the configuration space Q, we write \(L=L(q,\dot{q})\). The dynamics is given by the Euler–Lagrange equations
These equations were originally derived by Lagrange (1788) by requiring that simple force balance be covariant, i.e., expressible in arbitrary generalized coordinates. A variational derivation of the Euler–Lagrange equations, namely Hamilton’s principle (see Theorem 2.1 below), came later in the work of Hamilton (1834, 1835).
Let q(t), \(a \le t \le b\), be a smooth curve in Q. A variation of the curve q(t) is a smooth map \(\beta : [a,b] \times [- \varepsilon , \varepsilon ] \rightarrow Q\) that satisfies the condition \(\beta (t, 0) = q(t)\). This variation defines the vector field
along the curve q(t).
Theorem 2.1
The following statements are equivalent:
-
(i)
The curve q(t), where \(a \le t \le b\), is a critical point of the action functional
$$\begin{aligned} \int _a^b L (q,{\dot{q}})\,\mathrm{d}t \end{aligned}$$on the space of curves in Q connecting \(q_a\) to \(q_b\) on the interval [a, b], where we choose variations of the curve q(t) that satisfy \(\delta q(a) = \delta q(b) = 0\).
-
(ii)
The curve q(t) satisfies the Euler–Lagrange equations.
We point out here that this principle assumes that a variation of the curve q(t) induces the variation \(\delta \dot{q}(t)\) of its velocity vector according to the formula
For more details and a proof, see e.g., Bloch (2015) and Marsden and Ratiu (1999).
2.2 The Euler–Poincaré Equations
The classical Euler equations for freely rotating rigid body read
where \(\Omega \) is the body angular velocity and J is the inertia tensor. First derived by Euler (1752), these equations, as well as the Euler equation for an incompressible fluid flow,
were generalized by Poincaré (1901, 1910) to any Lie algebra. These Euler–Poincaré equations for a Lagrangian \(l(\xi )\) defined on a Lie algebra \(\mathfrak g\) are
These equations are variational, with variations satisfying certain constraints, as the following theorem clarifies.
Theorem 2.2
Let \(\mathfrak g\) be a Lie algebra and \(l:\mathfrak g \rightarrow \mathbb R\) be a Lagrangian. The following statements are equivalent:
-
(i)
The variational principle
$$\begin{aligned} \delta \int _a^b l (\xi (t))\,\mathrm{d}t = 0 \end{aligned}$$holds on \(\mathfrak {g}\), using variations of the form
$$\begin{aligned} \delta \xi = \dot{\eta }\pm {\text {ad}}_\xi \eta , \end{aligned}$$where \(\eta \) vanishes at the endpoints.
-
(ii)
The Euler–Poincaré Eq. (2.1) hold.
See Bloch et al. (1996b), Cendra et al. (1988), Marsden (1992), Holm et al. (1998), and Marsden and Ratiu (1999) for details, history, and proofs.
2.3 Lagrangian Mechanics in Non-Coordinate Frames
In many cases, the Lagrangian and the equations of motion have a simpler structure when the velocity components are measured against a frame that is unrelated to the system’s local configuration coordinates. An example of such a system is the rigid body.
Let \(q=(q^1,\ldots , q^n)\) be local coordinates on the configuration space Q and \(u_i \in TQ\), \(i = 1,\ldots ,n\), be smooth independent local vector fields on Q defined in the same coordinate neighborhood. In certain cases, some or all of the fields \(u_i\) can be chosen to be global vector fields on Q.
Let \(\xi = (\xi ^1,\ldots ,\xi ^n) \in {\mathbb {R}}^n\) be the components of the velocity vector \(\dot{q} \in TQ\) relative to the frame \(u(q) = (u_1(q),\ldots , u_n(q))\), i.e.,
The Lagrangian of the system written in the local coordinates \((q, \xi )\) on the velocity phase space TQ reads
The coordinates \((q, \xi )\) are Lagrangian analogue of noncanonical variables in Hamiltonian dynamics.
Define the structure functions \(c_{ij}^k (q)\) by the equations
\(i, j, k = 1,\ldots , n\). These quantities vanish if and only if the vector fields \(u_i (q)\), \(i = 1,\ldots , n\), commute.
Viewing \(u _i\) as vector fields on TQ whose fiber components equal 0 (that is, taking the vertical lift of the frame vector fields), one defines the directional derivatives \(u_i [l]\) for a function \(l:TQ\rightarrow {\mathbb {R}}\) in a usual way.
The evolution of the variables \((q,\xi )\) is governed by the Hamel equations
\(i, j, k = 1,\ldots ,n\), coupled with equation \(\dot{q} = \xi ^i u_i(q)\). These equations were introduced in Hamel (1904) (see also Neimark and Fufaev 1972 and Bloch et al. 2009 for details, history, and contemporary geometric exposition). If \(u_i = \partial / \partial q^i\), the Hamel equations become the Euler–Lagrange equations.
2.4 Ideal Constraints
Assume now that there are velocity constraints imposed on the system. We confine our attention to constraints that are linear and homogeneous in the velocity. Accordingly, we consider a configuration space Q and a distribution \({\mathcal {D}}\) on Q that describes these constraints. Recall that a distribution \({\mathcal {D}}\) is a collection of linear subspaces of the tangent spaces of Q; we denote these subspaces by \({\mathcal {D}}_q\subset T_q Q\), one for each \(q \in Q\).
A curve \(q(t) \in Q\) is said to satisfy the constraints if \(\dot{q}(t)\in {\mathcal {D}}_{q(t)}\) for all t. This distribution will, in general, be nonintegrable, i.e., the constraints will be, in general, nonholonomic.Footnote 1
As discussed in e.g., Suslov (1946) and Chetaev (1989), it is assumed in classical mechanics that the constraints imposed on the system can be replaced with the reaction force. This means that after the force is imposed on the unconstrained system, the constraint distribution \({\mathcal {D}} \subset TQ\) becomes a conditional invariant manifold of the forced unconstrained Lagrangian system whose dynamics on this invariant manifold is identical to that of the constrained system.
Definition 2.3
Constraints (either holonomic or nonholonomic) are called ideal if their reaction force at each \(q\in Q\) belong to the null space \({\mathcal {D}}^\circ _q \subset T_q^{*}Q\) of \({\mathcal {D}}_q\subset T_q Q\).
As shown in Suslov (1946) and Chetaev (1989), the reaction force of ideal constraints is defined uniquely at each state \((q, \dot{q}) \in TQ\).
In summary, for a system subject to ideal constraints, the forced dynamics is equivalent to the Lagrange–d’Alembert principle. We refer the reader to books Suslov (1946) and Chetaev (1989) for a more detailed exposition and history of the concept of ideal constraints.
Utilizing Hamel’s formalism and assuming the ideal velocity constraints read \(\xi ^{m+1}=\cdots =\xi ^n=0\), the dynamics of the constrained system is given by (2.2) for \(j=1,\ldots ,m\). The remaining \(n-m\) equations serve for computing the reaction force and do not affect the dynamics of the system.
For the early development of these equations see Poincaré (1901) and Hamel (1904). We refer the readers to Marsden and Ratiu (1999) and Bloch et al. (2009) for the history and development of variational principles for the Euler–Lagrange, Euler–Poincaré, and Hamel equations, and to Ball et al. (2012) for the Hamilton–Pontryagin principle for the Hamel equations.
2.5 The Chaplygin Sleigh
Here we describe the Chaplygin sleigh which is one of the simplest but nonetheless instructive example of a nonholonomic mechanical system. The sleigh is essentially a vertical blade moving on a horizontal plane, with no motion perpendicular to the blade allowed. There is a single contact point of the blade and the plane, and the center of mass of the blade coincides with this contact point. The sleigh is often thought of as a balanced platform on the top of the blade. See Fig. 1 where the platform, the blade, and the contact point are depicted as an oval, a bold segment, and a bold dot, respectively.
Let \(\theta \) be the angular orientation of the sleigh and (x, y) be the coordinates of the contact point as shown in Fig. 1. The configuration space for the sleigh is the Euclidean group \({\text {SE}}(2) = {\text {SO}}(2) {\circledS } {\mathbb {R}}^2\). We parametrize the elements of SE(2) as \((\theta , x, y)\). The body frame is
Using this frame,
where \(\omega \) is the angular velocity of the sleigh relative to the vertical line through the contact point and where \((v^1, v^2)\) are the components of the linear velocity of the contact point in the directions along and orthogonal to the blade, respectively. Thus, the constraint reads \(v^2 = 0\).
Having in mind infinite-dimensional generalizations of the Chaplygin sleigh, we start using the complex configuration variable \(z = x + iy\) on the plane. Similarly, the linear velocity relative to the body frame is written as \(v = v^1 + iv^2\). Formulae (2.3) become
whereas the constraint in this complex representation reads
Denote the mass and the moment of inertia of the sleigh by m and J. The Lagrangian is just the kinetic energy of the sleigh, which is the sum of the kinetic energies of the linear and rotational modes of the sleigh. Therefore, the reduced Lagrangian of the Chaplygin sleigh is
The elements of the algebra \(\mathfrak {se}(2) = \mathfrak {so}(2) {\circledS } {\mathbb {C}}\) in the complex representation used here are written as \((i \omega , v)\), where \(i \omega \in \mathfrak {so}(2)\), \(\omega \in {\mathbb {R}}\), and \(v \in {\mathbb {C}}\). The bracket operation on the algebra \(\mathfrak {se}(2)\) reads
Using (2.5), the constrained Hamel equations for the Chaplygin sleigh are computed to be
Recall that in (2.6) the quantity v is real-valued. The reduced dynamics of the Chaplygin sleigh is given by the constrained Hamel Eq. (2.6) along with the kinematic Eq. (2.4).
Solving these equations gives
and so generically the sleigh moves along a circle at a uniform rate.
3 Infinite-Dimensional Systems
Starting from this section, we no longer assume that systems have a finite number of degrees of freedom. As the use of frames and bases in the infinite-dimensional setting is unnatural, cumbersome, and not even always possible, we introduce a coordinate-free approach to Hamel’s formalism. Thus, instead of frames, we use linear velocity substitutions. These substitutions, however, are not induced by a (local) configuration coordinate change.
3.1 Lagrangian Mechanics
Let M be an infinite-dimensional smooth manifold modeled on a convenient vector space W and let TM be its kinematic tangent bundle with the projection \(\pi _M:TM\rightarrow M\). Consider the initial inclusion map \(i: Q\rightarrow M\) and the pullback vector bundle \(P=i^{*}TM\).Footnote 2 The importance of the manifold Q will be demonstrated shortly.
A Lagrangian is a smooth function \(L:P\rightarrow \mathbb R\). The dynamics for this Lagrangian is defined in a usual way by Hamilton’s principle: The curve \(\gamma :[a,b] \rightarrow Q\) is a trajectory if
along \(\gamma \).
To demonstrate the necessity of the initial inclusion map in infinite-dimensional mechanics, consider the wave equation
on \({\mathbb {R}}^n\). This is the Euler–Lagrange equation for the Lagrangian
where \(\langle \cdot , \cdot \rangle \) is the standard Riemann metric on \(L^2 ({\mathbb {R}}^n)\). This Lagrangian is defined on the space \(P=H^1 \times L^2 \subset L^2 \times L^2 = TL^2\) and not on the entire \(TL^2\). Using our notations, \(Q = H^1 \subset L^2 = M\). See Marsden and Hughes (1983) for details.
3.2 Hamel’s Formalism and Hamilton’s Principle
Turning to Hamel’s formalism, let U be an open subset of M containing \(q\in Q\) and let
be a fiber-preserving diffeomorphism that is linear in the second input. Hence, for each \(q \in U\), both \(\Psi _q:W\rightarrow T_q M\) and \(\Psi _q^{-1}:T_q M\rightarrow W\) are invertible bounded linear operators smoothly dependent on q in an open subset \(i^{-1}(U)\subset Q\). The latter is verified by selecting the map (3.1) as a bundle chart on TM and using the Cartesian closedness \(C^\infty (U\times W, W)\cong C^\infty (U,C^\infty (W,W))\) along with the fact that the space of all bounded linear operators from a convenient vector space E to a convenient vector space F is a closed linear subspace of \(C^\infty (E,F)\).Footnote 3
Remark
As pointed out in Marsden and Ratiu (1999), the space W and its dual are chosen to be suitable, in the functional-analytic sense, to the problem in question. As the Lagrangian fails to be defined, in general, on TM, it is necessary to consider various forms of equations of motion, such as weak and strong forms. In particular, the weak form is important in understanding the existence of solutions as well as in numerical methods. Further, the objects involved may not even be defined on the entire TM but only on a dense subset already in the Banach case. See Marsden and Ratiu (1999) for details and references.
For each \(q \in U\) and \(\xi \in W\), the operator \(\Psi _q: W \rightarrow T_q M\) introduced in (3.1) outputs the vector \({\Psi }_q \xi \in T_q M\).Footnote 4 Thus, each \(\xi \in W\) defines the vector field
on U, which usually will be written as
Given two vectors \(\xi ,\eta \in W\), define an antisymmetric bilinear operation \([\cdot , \cdot ]_q:W \times W \rightarrow W\) by
where \([\cdot , \cdot ]\) is the Jacobi–Lie bracket of two vector fields on the manifold M. Next, for arbitrary \(\xi \), \(\eta \), \(\zeta \in W\), we have
implying, in view of invertibility of \(\Psi _q\), the Jacobi identity for the bracket \([\cdot , \cdot ]_q\). Therefore, for each \(q \in U\), the space W with the operation \([\cdot , \cdot ]_q\) is a Lie algebra, denoted hereafter \(W_q\).
The dual of \([\cdot ,\cdot ]_q\) is, by definition, the operation \( [\cdot ,\cdot ]_q^{*}: W_q \times W_q^{*} \rightarrow W_q^{*}\) given by
As in the finite-dimensional setting, let \(\dot{q}\) and \(\delta q\) denote the velocity and the virtual displacement at \(q \in Q\). From now on, the inverse images of \(\dot{q}\) and \(\delta q\) are written as \(\xi , \eta \in W\), that is, \(\dot{q} = {\Psi }_q \xi \) and \(\delta q = {\Psi }_q \eta \).
Interpreting \(\xi \) as an independent variable that replaces \(\dot{q}\) (locally) defines the Lagrangian as a smooth function of \((q, \xi )\) on \(V \times W\):
where we used the smoothness of the mapping \(V\times W \ni (q, \xi ) \mapsto {\Psi }_q \xi \in \pi _M^{-1}(U)\). The equations of motion written when \((q, \xi )\) are selected as (local) coordinates on the velocity phase space are called the Hamel equations.
Recall that, given a smooth curve \(q(t) \in Q\), \(t \in [a, b]\), its variation is a smooth one-parameter family of curves
An infinitesimal variation, also known as variation field \(\delta q\), is defined by
When this field is evaluated along the curve q(t), we write \(\delta q(t)\), i.e.,
Thus, a variation of a smooth curve \(q(t) \in Q\) defines a curve \(\eta (t) \in W\)
Theorem 3.1
(Hamilton’s Principe for Hamel’s Equations). Let \(L:P\rightarrow {\mathbb {R}}\) be a Lagrangian and l be its representation in local coordinates \((q,\xi )\). Then, the following statements are equivalent:
-
(i)
The curve q(t), where \(a \le t \le b\), is a critical point of the action functional
$$\begin{aligned} \int _a ^b L (q, \dot{q})\,\mathrm{d}t \end{aligned}$$on the space of curves in Q connecting \(q_a\) to \(q_b\) on the interval [a, b], where we choose variations of the curve q(t) that satisfy \(\delta q(a) = \delta q(b) = 0\).
-
(ii)
The curve q (t) satisfies the weak form of the Euler–Lagrange equations
$$\begin{aligned} \int _a^b \bigg \langle \frac{\delta L}{\delta q} - \frac{d}{\mathrm{d}t} \frac{\delta L}{\delta \dot{q}}, \delta q\bigg \rangle \,\mathrm{d}t = 0. \end{aligned}$$(3.4)Furthermore, if \(\delta L / \delta q \in T_q^{*} M\) and \(i_{*} T_q Q\) is dense in \(T_q M\) for every \(q\in Q\), the curve q (t) satisfies the strong form of the Euler–Lagrange equations
$$\begin{aligned} \frac{d}{\mathrm{d}t} \frac{\delta L}{\delta \dot{q}} - \frac{\delta L}{\delta q}=0. \end{aligned}$$(3.5) -
(iii)
The curve \((q(t), \xi (t))\) is a critical point of the functional
$$\begin{aligned} \int _a^b l(q, \xi )\,\mathrm{d}t \end{aligned}$$(3.6)with respect to variations \(\delta \xi \), induced by the variations
$$\begin{aligned} \delta q = {\Psi }_q \eta , \end{aligned}$$and given by
$$\begin{aligned} \delta \xi = \dot{\eta }+ [\xi , \eta ]_q. \end{aligned}$$(3.7) -
(iv)
The curve \((q(t), \xi (t))\) satisfies the weak form of the Hamel equations
$$\begin{aligned} \int _a^b \bigg \langle {\Psi }^{*}_q \frac{\delta l}{\delta q} + \bigg [\xi , \frac{\delta l}{\delta \xi } \bigg ]_q^{*} - \frac{d}{\mathrm{d}t} \frac{\delta l}{\delta \xi }, \eta \bigg \rangle \,\mathrm{d}t = 0, \quad \eta \in {\Psi }^{-1}_q (T_q Q) \end{aligned}$$(3.8)coupled with the equations \( \dot{q} = {\Psi }_q \xi . \) If \(\delta l /\delta q \in T_q^{*} M\) and \( i_{*}T_q Q\) is dense in \(T_q M\) for every \(q\in Q\), the curve \((q(t), \xi (t))\) satisfies the strong form of the Hamel equations
$$\begin{aligned} \frac{d}{\mathrm{d}t} \frac{\delta l}{\delta \xi } = \bigg [\xi , \frac{\delta l}{\delta \xi }\bigg ]_q^{*} + {\Psi }^{*}_q \frac{\delta l}{\delta q} \end{aligned}$$(3.9)coupled with the equation \(\dot{q} = {\Psi }_q \xi \).
For the early development of these equations in the finite-dimensional setting see Poincaré (1901) and Hamel (1904).
Proof
The equivalence of (i) and the weak form of the Euler–Lagrange Eq. (3.4) is proved by integration by parts:
where \(\langle \cdot ,\cdot \rangle \) is the paring of \(T_q^{*}Q\) with \(T_q Q\). The strong form of the Euler–Lagrange Eq. (3.5) follows easily from a standard contradiction argument,Footnote 5 which finishes the proof of the equivalence of (i) and (ii).
To prove the equivalence of (i) and (iii), we first compute the quantities \(\delta \dot{q}\) and \(d(\delta q)/\mathrm{d}t\). Recall that
Using the definition (3.3) of the field \(\delta q\), one concludes that
Hereafter, v[f] denotes the derivative of the function f along the vector filed v; in particular, in (3.10) an operator-valued function is differentiated.
Similarly,
and therefore
From \(\delta \dot{q} = \frac{d}{\mathrm{d}t} \delta q\), we obtain
which implies formula (3.7).
To prove the equivalence of (iii) and the weak form of Hamel’s Eq. (3.8), we use the above formula and compute the variation of the action (3.6):
If \( \delta l/\delta q \in T_q^{*} M\) and \(i_{*}T_q Q\) is dense in \(T_q M\) for every \(q\in Q\), then for each t the subspace \({\Psi }^{-1}_{q(t)}(i_{*}T_{q(t)}Q)\) is dense in W, and the variational derivative vanishes if and only if the strong form of the Hamel Eq. (3.9) is satisfied. \(\square \)
Example 3.2
For an incompressible fluid flow in a compact domain \(D \subset {\mathbb {R}}^3\) with a smooth boundary the configuration space is the group \({\text {Diff}} (D)\) of (volume-preserving) diffeomorphism of D, which is a regular Lie group in convenient setting. Let q(t) be a curve in this group, one may think of q(t) as a particular fluid flow. Following Euler (1757a, (1757b, (1761), one typically uses the spatial velocity \(\xi := \dot{q} \circ q^{-1} \in T_e {\text {Diff}} (D)\). Selecting \(W = T_e {\text {Diff}} (D) = {\mathcal {X}} (D)\), the space of smooth vector fields on D tangent to the boundary, and \(\Psi _q = TR_q\) gives \(\dot{q} = {\Psi }_q \xi \). Therefore, the use of spatial velocity in fluid dynamics is an instance of infinite-dimensional Hamel’s formalism. The variation formula (3.7) becomes
where \([\cdot , \cdot ]\) is the Jacobi–Lie bracket on D. The dynamics, in the form of Hamel’s equations, reads
The latter are the Euler–Poincaré equations, as established by Poincaré (1901) and Arnold (1966).
Remark
The convective representation of fluid dynamics is straightforward to obtain by setting \(\Psi _q = TL_q\). See Gay-Balmaz et al. (2012) for details on the convective representation in continuum mechanics.
Example 3.3
For an inextensible string moving in the plane, the configuration manifold is the space of smooth embeddings \({\text {Emb}}([0,1],{\mathbb {R}}^2)\). We will view \({\mathbb {R}}^2\) as a complex plane.
Given \(z \in {\text {Emb}}([0,1], {\mathbb {C}})\), the inextensibility constraint reads \(\Vert z_s\Vert = 1\), \(0\le s\le 1\). For simplicity, we assume no resistance to bending. Therefore, the Lagrangian reads
where \(\lambda :[0,1]\rightarrow {\mathbb {R}}\) is the Lagrange multiplier (tension). The boundary conditions for the Lagrange multiplier are a part of the requirement \(\delta L = 0\). For a free motion of a string, these conditions read
Let
so the velocity components to be used to construct Hamel’s equations are represented by a complex-valued function \(\xi = \xi (s)\). The real and imaginary parts of \(\xi \) are the tangent and normal velocity components of the points of the string, as is illustrated in Fig. 2.
The Lagrangian becomes
in which the density should be understood as a function of \((z_s, \bar{z}_s, \xi , \bar{\xi })\) and the Lagrange multiplier \(\lambda \).
Next, the bracket formula (3.2) for the string becomes
That is,
Instead of establishing the formulae for the dual bracket and dual operator \(\Psi ^{*}\), it is more efficient in this example to directly work with the variational principle. We have:
and since l is real-valued, the two last terms are obtained from the first two by conjugation. Thus, it is sufficient to evaluate the last two terms in (3.14):
which, after imposing the constraint
implies Hamel’s string equation
as well as the tension conditions (3.11). Here \(\varkappa = i z_s \bar{z}_{ss}\) is the signed curvature of the curve \([0,1] \ni s \mapsto z(s) \in {\mathbb {C}}\). One of the implications of (3.16) is
it will be used to identify the Lagrange multiplier \(\lambda \) in terms of the state of the string.
Using the identity
that follows from (3.12) and (3.15), one obtains an alternative representation of Eq. (3.16),
Either of the Eqs. (3.16) and (3.19) is equivalent to the Euler–Lagrange equations for the string.
The Lagrange multiplier as a function of string’s state is obtained by solving the differential equation
subject to the boundary conditions (3.11). To derive Eq. (3.20), one differentiates (3.17) with respect to s to obtain
Formula (3.21) becomes (3.20) after a transformation using the identity
that follows from (3.12) and (3.15).
Remark
Alternatively, one defines the operator \(\Psi \) by
Because of the constraint \(|z_s| = 1\), the resulting Hamel equation is (3.16). However, the bracket \([\xi , \eta ]_z\) is now given not by (3.13) but by a slightly different formula. This latter bracket is in fact induced by the (standard) bracket of the Lie algebra of the infinite-dimensional group
To see that, the string dynamics should be interpreted as a motion on G specified by the degenerate Lagrangian
subject to the constraint
where \((\theta , z)\) are the (standard) coordinates on \({\text {SE}}(2) = {\text {SO}}(2) \mathop {\circledS } {\mathbb {C}}\) and \(\xi \) is the \({\mathbb {C}}\)-component of the body velocity \(g^{-1} \dot{g}\). One then composes the \({\mathbb {C}}\)-component of the Euler–Poincaré equations on G and imposes constraint (3.22). This results in the string Eq. (3.16). It is worth noticing that the derivation of these equations using formula (3.12) is simper and more efficient.
3.3 The Hamilton–Pontryagin Principle
Here we utilize the Hamilton–Pontryagin principle of Yoshimura and Marsden (2006a, (2006b, (2007) for the derivation of Hamel’s equations. This approach provides an alternative interpretation of Hamel’s formalism in general and of the bracket term in particular. The Hamilton–Pontryagin principle for finite-dimensional systems of Hamel type was introduced in Ball et al. (2012).
Recall that the Hamilton–Pontryagin principle identifies the trajectories for the Lagrangian \(L:TQ\rightarrow \mathbb R\) with the critical points of the functional
and so the curves the functional is calculated along belong to the Whitney sum \(TQ \oplus T^{*}Q\). See Yoshimura and Marsden (2006a, (2006b, (2007) for details, motivation, and history.
Theorem 3.4
(The Hamilton–Pontryagin Principe for Hamel’s Equations). Let \(L:P\rightarrow \mathbb R\) be a Lagrangian and l be its representation in local coordinates \((q,\xi )\) on P. Then, the following statements are equivalent:
-
(i)
The curve \((q(t),\xi (t),\mu (t)) \in V\times (W\oplus W^{*})\) is a critical point of the action functional
$$\begin{aligned}\int _a^b \big ( l(q, \xi ) + \big \langle \mu , \Psi ^{-1}_q \dot{q} - \xi \big \rangle \big ) \,\mathrm{d}t \end{aligned}$$with respect to independent variations \(\delta q = {\Psi }_q \eta \), \(\delta \xi \), and \(\delta \mu \), with \(\eta (a)=\eta (b)=0\).
-
(ii)
The curve \((q(t), \xi (t),\mu (t))\) satisfies the weak form of the implicit Hamel equations
$$\begin{aligned} \int _a^b \bigg \langle \Psi _q^{*} \frac{\delta l}{\delta q} - \dot{\mu }+[\xi ,\mu ]_q^{*}, \eta \bigg \rangle = 0, \quad \frac{\delta l}{\delta \xi } = \mu , \quad \mathrm{where}\quad \eta \in {\Psi }^{-1}_q (T_q Q), \end{aligned}$$together with the constraint \( \xi =\Psi ^{-1}_q \dot{q}. \) If \(\delta l / \delta q \in T_q^{*} M\) and \(i_{*}T_q Q\) is dense in \(T_q M\) for every \(q\in Q\), the curve \((q(t), \xi (t),\mu (t))\) satisfies the strong form of the implicit Hamel equations
$$\begin{aligned} \Psi _{q}^{*}\frac{\delta l}{\delta q} - \dot{\mu }+ [\xi ,\mu ]^{*}_q = 0,\quad \frac{\delta l}{\delta \xi } = \mu , \end{aligned}$$together with the constraint \( \xi =\Psi ^{-1}_{q}\dot{q}. \)
Proof
From \(\Psi _q^{-1}\Psi _q = {\text {id}}\) we have
which implies
Therefore, using earlier calculation,
where \(\eta = \Psi _q^{-1} \delta q \in {\Psi }^{-1}_q (T_q Q)\).
Finally, taking the variation and using the above formula along with integration by parts gives
Setting the latter equal to zero and noting that \(\delta \xi \) is an arbitrary element of W yields the desired results. \(\square \)
4 Systems with Constraints
In this section, we consider systems with ideal velocity constraints, both holonomic and nonholonomic. The dynamics of such systems is equivalent to the Lagrange–d’Alembert principle. To simplify the exposition, in the rest of the paper, we assume that \(\delta L / \delta q \in T_q^{*} M\) and \(i_{*} T_q Q\) is dense in \(T_q M\) for every \(q\in Q\), and thus all results will be stated for strong equations of motion. Similar statements for weak equations are straightforward to obtain.
4.1 The Lagrange–d’Alembert Principle
Recall that in this paper the constraints imposed on on the system are assumed linear and homogeneous in the velocity. Such constraints are specified by a vector subbundle \({\mathcal {D}}\) of the bundle \(P=i^{*}TM\). The base of this subbundle is the manifold Q. This subbundle will, in general, be nonintegrable.
One of the ways to construct a subbundle of P is to take a pullback (induced by the initial inclusion map \(i:Q\rightarrow M\)) of a distribution on the manifold M. Verification of integrability of a distribution in the infinite-dimensional setting may be nontrivial as the Frobenius theorem has not been established in the general infinite-dimensional setting. See Kriegl and Michor (1997), Hiltunen (2000), Teichmann (2001), Filipović and Teichmann (2003) for some of the infinite-dimensional versions of Frobenius theorem.
The condition for a curve to satisfy the constraints is of course insufficient for the development of constrained mechanics. One needs a mechanism for constructing the vector field that captures the dynamics of the constrained system. For the ideal constraints in the finite-dimensional setting, this is accomplished by a projection. Such constraints define a submanifold of the velocity phase space and a projection onto this submanifold.
For a projection to be meaningful in the infinite-dimensional case, a submanifold has to be splitting. Thus, in the infinite-dimensional setting, we require that \({\mathcal {D}}\) is a locally splitting subbundle of P. That is, for each \(q \in Q\) there exists a chart (U, h) of M with \( i(q) \in U\) such that
where the closed subspace \( W^{{\mathcal {D}}} \) of the model space W is splitting, or complemented, i.e., there is a closed subspace \(W^{{\mathcal {U}}}\) of W such that \(W^{{\mathcal {D}}} \oplus W^{\mathcal {U}} = W\) and the projection \(\pi ^{{\mathcal {D}}}\) uniquely determined by setting \(({\text {Ker}} \pi ^{{\mathcal {D}}}, {\text {Im}} \pi ^{{\mathcal {D}}}) = (W^{\mathcal {U}}, W^{{\mathcal {D}}})\) is continuous. Note that the continuous projection \(\pi ^{{\mathcal {D}}}\) in the convenient space W is automatically smooth as a bounded linear mapping.Footnote 6
The following Lagrange–d’Alembert principle is known to be equivalent to the dynamics of systems with ideal holonomic and nonholonomic constraints:
Definition 4.1
The Lagrange–d’Alembert equations of motion for the system are those determined by
where we choose variations \(\delta q(t)\) of the curve q(t) that satisfy \(\delta q(a) = \delta q(b) = 0\) and \(\delta q(t) \in {\mathcal {D}}_{q(t)}\) for each t where \( a\le t\le b\).
This principle is supplemented by the condition that the curve q(t) itself satisfies the constraints. Note that we take the variation before imposing the constraints; that is, we do not impose the constraints on the family of curves defining the variation. This is well known to be important to obtain the correct mechanical equations (see Arnold et al. 2006 and Bloch 2015 for a discussion of other types of constraints and references).
The Lagrange–d’Alembert principle is equivalent to the equations
Here,
One way to give a more explicit representation of dynamics (4.1), under certain technical conditions, is to make use of the Euler–Lagrange equations with multipliers. Let the fibers of the subbundle \({\mathcal {D}}\) be locally written as
where A(q) for each \(q \in Q\) is a continuous linear operator on \(T_q M\) with values in the vector space \(W^{\mathcal {U}}\). For instance, if there is a single constraint imposed on the system, A(q) is a linear functional. We assume smooth dependence of A(q) on q.
The constrained variations \(\delta q(t) \in T_{q(t)}Q\) satisfy the condition
Using Definition 4.1 and formula (4.2), one writes the equations of motion as
where \(A^{*}(q):{W^{\mathcal {U}}}^{*} \rightarrow T_q^{*}M\) is the adjoint of A(q) and where the closure should be understood as weak* closure. Thus, if \(A^{*}(q)\) has a weak* closed range,Footnote 7 there exist Lagrange multipliers \(\lambda \in {W^{\mathcal {U}}}^{*}\) such that
4.2 The Constrained Hamel Equations
Given a nonholonomic system, that is, a Lagrangian \(L:P\rightarrow \mathbb R\) and constraint distribution \({\mathcal {D}}\), select the operators \(\Psi _q:W \rightarrow T_q M\), \(q\in U \subset Q\), such that there exist closed subspaces \(W^{{\mathcal {D}}}\), \(W^{\mathcal {U}} \subset W\) with the properties \(W = W^{{\mathcal {D}}} \oplus \, W^{\mathcal {U}}\) and \(\Psi _q = \Psi _q^{{\mathcal {D}}} \oplus \, \Psi _q^{\mathcal {U}}\), where \(\Psi _q^{{\mathcal {D}}}: W^{{\mathcal {D}}} \rightarrow {\mathcal {D}}_q\) and \(\Psi _q^{\mathcal {U}}: W^{\mathcal {U}} \rightarrow \mathcal U_q\) and their inverses are bounded linear operators smoothly dependent on \(q \in U\). One way to choose the operators \(\Psi _q\) is to use the above subbundle chart. In general, \( U \ne Q\), as numerous finite-dimensional examples demonstrate.
Each \(\dot{q}\in TM\) is then uniquely written as
i.e., \({\Psi }_q \xi ^{{\mathcal {D}}}\) is the component of \(\dot{q}\) along \({\mathcal {D}}_q\). Similarly, each \(\alpha \in W^{*}\) can be uniquely decomposed as
where \(\alpha _{{\mathcal {D}}}\) and \(\alpha _{\mathcal {U}}\) denote the component of \(\alpha \) along the duals of \(W^{{\mathcal {D}}}\) and \(W^{\mathcal {U}}\), respectively. Actually, we have
where \(\big (\pi ^{{\mathcal {D}}}\big )^{*}\) is the adjoint of \(\pi ^{{\mathcal {D}}}\). Using (4.3), the constraints read
This implies
The Lagrange–d’Alembert principle then implies the following theorem:
Theorem 4.2
The dynamics of a nonholonomic system is represented by the strong form of constrained Hamel equations
Example 4.3
Consider an inextensible string moving in the plane subject to the vanishing normal velocity constraint. See Fig. 2. One may think of a motion of a ‘sharp’ string on the horizontal ice. Using the notations introduced in Example 3.3, the constraint reads
i.e., \(\xi \in \mathbb R\). Equations (4.4) for the constrained string thus become
supplemented by the inextensibility condition.
Identity (3.18) in the presence of constraint (4.5) implies
That is, all points of the string have the same speed, and (4.6) becomes
Using the boundary conditions (3.11), we conclude that
Summarizing, \(\xi ={\mathrm {const}}\) throughout the motion. This is in agreement with the motion of the Chaplygin sleigh for which the velocity of the contact point relative to the body frame is constant. Unlike the sleigh, the constrained string motion is not completely determined by the initial state: Any solution of (4.7) is of the form
where \(\phi \) is an arbitrary twice-differentiable complex-valued function. The initial conditions define \(\phi \) on the segment [0, 1]. Outside this segment, the function \(\phi \) is unknown, unless, for example, the motion of the front end of the string is prescribed. The motion of the constrained string is therefore purely kinematic: The string follows its front end at a constant speed.
This behavior is similar to that of the degenerate Chaplygin sleigh specified by the Lagrangian \(l = \tfrac{1}{2} \xi \bar{\xi }\) and the constraint \(\xi = \bar{\xi }\), where \(\xi = e ^{-i \theta } \dot{z}\). The Lagrangian is degenerate as the term quadratic in \(\dot{\theta }\) is absent, i.e., the moment of inertia of the sleigh equals zero.
For the degenerate sleigh, the dynamics reads
where \(\theta (t)\) is an arbitrary function. Thus, the motions are not identified by the initial conditions.Footnote 8
Another remark about the string on ice is that, similar to \({\text {SE}}(2)\)-snakes of Krishnaprasad and Tsakiris (1994), it is almost holonomic: Adding a single (holonomic) constraint \(z(0,t) = z(1,t)\) renders the system holonomic. The string becomes closed and will slide, at a constant speed, along its initial shape.
5 Systems with Symmetry
Here we present infinite-dimensional analogues of some of the results of Bloch et al. (1996a, (2009) on systems with symmetry. The mechanical and nonholonomic connections, the momentum equation, and mechanics of systems with infinitely many velocity constraints are discussed. Recall that \( \delta L / \delta q \in T_q^{*} M \) and \(i_{*} T_q Q\) is dense in \(T_q M\) for every \(q\in Q\), and thus all results are stated for strong equations of motion.
5.1 The Lagrange–Poincaré Equations
Recall that in finite-dimensional mechanics with symmetry one starts with an action of a Lie group G on the configuration space Q and Lagrangian and constraints (if any) that are invariant with respect to the lifted action on the velocity phase space. The quotient space Q / G, whose points are the group orbits, is called the shape space. It is known that if the group action is free and proper, the shape space is a smooth manifold and the projection \(\pi :Q\rightarrow Q/G\) is a smooth surjective map with a surjective derivative at each point. The configuration space thus has the structure of a principal fiber bundle, with the group acting on the fibers by (left) multiplication.
We thus assume that in the general infinite-dimensional setting in the presence of symmetry the manifold M is a principal fiber bundle (see Kriegl and Michor 1997 for details in the infinite-dimensional case). The base of this bundle is a smooth manifold, and the group G involved in the construction of this bundle may be finite- or infinite-dimensional. In the latter case, the group is assumed regular (see Kriegl and Michor 1997 and Omori 1997).Footnote 9
We denote the bundle coordinates (r, g) where r is a local coordinate in the base, or shape space M / G, and g is a group coordinate. Such a local trivialization is characterized by the fact that in such coordinates the group does not act on the factor r but acts on the group coordinate by group multiplication. Thus, locally in the base, the space M is isomorphic to the product \(M/G \times G\), and in this local trivialization the map \(\pi \) becomes the projection onto the first factor. The model space of the manifold M / G is denoted \(W_B\).
Recall that in general in the infinite-dimensional setting Lagrangians are defined on the bundle \(P = i^{*} TM\), where \(i:Q\rightarrow M\) is the initial inclusion map and Q is a manifold. We will assume in the rest of the paper that Q is a principal fiber bundle with the same group G and that in the aforementioned local trivialization the group component of the initial inclusion map is the identity map.
Definition 5.1
We say that the Lagrangian is G-invariant if L is invariant under the induced action of G on P.
Both left and right actions may be of interest. Below we develop the formalism assuming that the action is left; the case of the right action is similar.
We start with the construction of the operator \(\Psi \). Let \(\mathfrak g\) be the Lie algebra, and thus the model space, of the group G. When the configuration space is a principal fiber bundle, the group component of the operator \(\Psi \) can be defined globally, as shown below. For the fibers of T(M / G) we write the corresponding operator as \(\psi _r: W_B \rightarrow T_r (M/G)\), i.e., \(\dot{r}=\mathop {\psi _r} \xi \), whereas for the fibers of TG we set \(\dot{g}:=\mathop {L_{g^{*}} \circ \varphi _r} \zeta \), where \(q = (r, g)\) in a local trivialization and \(\varphi _r:\mathfrak g_r \rightarrow \mathfrak g\) is a linear operator. The operators \(\psi _r\) and \(\varphi _r\) and their inverses are, for each \(r \in Q/G\), bounded, continuous, and depend smoothly on r. This implies that the algebras \(\mathfrak g\) and \(\mathfrak g_r\) are isomorphic. The bracket on the space \(W_B\) is
whereas the bracket on \(\mathfrak g_r\) is defined by
The use of the nonmaterial shape velocity is of importance already in the finite-dimensional setting in some of the problem involving the rolling rigid body, such as the rattleback. In a number of interesting finite-dimensional instances, it suffices to use the shape velocity \(\dot{r}\). The version of the infinite-dimensional formalism developed here is motivated by the utility of nonmaterial velocity in accounting for fluid- and elastic-type shapes.
The operator \(\varphi _r\) is important for accounting for various subbundle structures associated with subspaces of the Lie algebra \(\mathfrak g\). Such structures originate in the presence of control inputs and/or velocity constraints, particularly in the motion generation problems (see Ostrowski et al. 1994 and Bloch et al. 1996a for the details in the finite-dimensional setting). In general, the position of these subspaces in the Lie algebra \(\mathfrak g\) is shape-dependent. While this operator may not appear to be important in the current part of the paper, it is more straightforward to introduce it here in order to make the exposition of the rest of the paper more straightforward.
Let \({\mathcal {A}}_\text {s}\) be a principal connection on the bundle \(Q \rightarrow Q/G\) (see Kriegl and Michor 1997 for details on principal connections in the infinite-dimensional setting). Below we will discuss how the structure of the Lagrangian facilitates the selection of a connection. Tangent vectors in a local trivialization \(Q = Q/G \times G\) at the point (r, g) are denoted (x, y). We write the action of \({\mathcal {A}}_\text {s}\) on this vector as \({\mathcal {A}}_\text {s} (x, y)\).Footnote 10 Using this notation, we write the connection form in the local trivialization as
where \(y_\text {c}\) is the left translation of y to the identity and, since we are working locally in shape space, we regard \({\mathcal {A}}_\text {c}\) as a \(\mathfrak g\)-valued one-form on Q / G.
The curvature of the principal connection \({\mathcal {A}}_\text {s}\) is the \(\mathfrak g\)-valued two-form
where \(X_1\) and \(X_2\) are two vector fields and \({\text {hor}} X_1\) and \({\text {hor}} X_2\) are their horizontal components. In a local trivialization one writes
where \(x_1, x_2 \in T(Q/G)\), \(y_1, y_2 \in \mathfrak g\), and where
The quantity \(y_\text {c} + {\mathcal {A}}_\text {c} x\) regarded as an element of \({\mathfrak {g}}_r\) will be denoted \(\zeta + {\mathcal {A}} x\), i.e.,
Thus, the velocity vector \(\dot{q}\) in such a representation is characterized by the components
where \(\mathop {\varphi _r} \zeta = g^{-1} \dot{g} \in \mathfrak g\). The curvature of the connection \({\mathcal {A}}\) on Q / G in this representation is hereafter denoted \({\mathcal {B}}\).
With nonmaterial shape velocity being used, the velocity components become
Summarizing,
Theorem 5.2
The strong Lagrange–Poincaré equations Footnote 11 for a system with a G-invariant Lagrangian \(L:P \rightarrow {\mathbb {R}}\) are
where \({\mathcal {B}}\) is the curvature of \({\mathcal {A}}\), \(\gamma := \varphi _r^{-1} \circ \mathrm{d}\varphi _r\), and \({\mathcal {E}}:= \gamma -{\text {ad}}_{{\mathcal {A}}}\), so that \(\gamma \) and \({\mathcal {E}}\) are \({\mathfrak {g}}_r\otimes {\mathfrak {g}}_{r}^{*}\)-valued one-forms on Q / G. The shape equation (5.4) and the momentum equation (5.5) govern the reduced dynamics for the Lagrangian L. The full dynamics is given by equations (5.4) and (5.5) along with the kinematic shape equation
and reconstruction equation
Proof
The principal step is to uncover the structure of the bracket terms in the Hamel equations. Recall that these terms are induced by the map \(\Psi \) defined in (5.3). Evaluating the Jacobi–Lie brackets of two vector fields \(X_1\) and \(X_2\) whose components in local trivialization are \((\xi _i, \zeta _i)\), that is,
gives:
Next,
Summarizing the components of \([X_1, X_2]\) are
and
Applying the inverse of \(\Psi _q\) outputs the components
and
Utilizing the formula
for the exterior derivative of one-forms, the definition of the curvature, and the Cartan structure equation, the bracket \(\big [(\xi _1, \zeta _1), (\xi _2, \zeta _2)\big ]_r\) on \(W_B \oplus \mathfrak g_r\) reads
From (5.7), the components of the dual bracket \([(\xi , \zeta ), (\alpha , \beta )]^{*}_q \) are given by the formulae
Equations (5.4) and (5.5) then follow from Theorem 3.1 and formula (5.8). \(\square \)
In certain situations, it may be useful to know the formulae for variations. For systems considered in this part of the paper, the general formula (3.7) transforms into
where the curves \(\eta (t) \in W_B\) and \(\mathop {\varphi _{r(t)}} \Sigma (t) \in \mathfrak g\) satisfy the boundary conditions \(\eta (a) = \eta (b)= 0\) and \(\Sigma (a) = \Sigma (b) = 0\), respectively. The reconstruction Eq. (5.6) is just the group component of the equation \(\dot{q} = {\Psi }_q \xi \).
In the case that the material shape velocity is utilized, Eqs. (5.4) and (5.5) read
whereas the reconstruction equation becomes
Now assume that the G-invariant Lagrangian equals the kinetic minus potential energy of the system and that the kinetic energy is given by a (weak) Riemannian metric \(\left\langle \left\langle \,\cdot \,, \, \cdot \, \right\rangle \right\rangle \) on the configuration space Q. In such a setting, the isomorphism \(\varphi _r\) is usually selected to be the identity map.
Definition 5.3
The mechanical connection \({\mathcal {A}}^{\mathrm {mech}}\) is, by definition, the connection on Q regarded as a bundle over shape space Q / G that is defined by declaring its horizontal space at a point \(q\in Q\) to be the subspace that is the orthogonal complement to the tangent space to the group orbit through \(q\in Q\) using the kinetic energy metric. The locked inertia tensor \({\mathcal {I}} (q): {\mathfrak {g}} \rightarrow {\mathfrak {g}}^{*}\) is defined by \(\langle {\mathcal {I}} (q)\,\xi ,\eta \rangle =\langle \!\langle \xi _Q (q), \eta _Q (q) \rangle \!\rangle \), where \(\xi _Q\) is the infinitesimal generator of \(\xi \in {\mathfrak {g}}\) and where \(\langle \!\langle \cdot \,, \cdot \rangle \!\rangle \) is the kinetic energy inner product.
Of course, in the infinite-dimensional setting the mechanical connection may fail to exist when the group metric is not strong.Footnote 12 It is certain to exist in the important case of a finite-dimensional symmetry group.
Given a system with symmetry, one may use the mechanical connection to set up Eq. (5.4) and (5.5). This choice of a connection changes the Lie algebra variables from \(\zeta \) to the local version of the locked group velocity \(\Omega \). With this connection choice the kinetic energy metric becomes block-diagonal, that is,
For a rotating rigid body, the locked group velocity has the physical interpretation of the body angular velocity.
Example 5.4
Here we revisit the planar string motion, but now derive the dynamics with an emphasis on the \({\text {SE(2)}}\) symmetry. We also assume that one of the ends of the string is attached to a platform (a flat rigid body). Both the platform and string move in the same horizontal plane.
The elements of the Euclidean group \({\text {SE(2)}}\) are written using complex notation, \(g = (\hbox {e}^{i \theta }, w)\), i.e., the points of the \(\mathbb R^2\) are identified with complex numbers, as before. The group \({\text {SE(2)}}\) is the configuration space of the platform and simultaneously the symmetry group of the entire system.
The angular and linear velocities of the platform (measured relative to the platform) are denoted \(\omega \in \mathbb R\) and \(v \in {\mathbb {C}}\), so that
and the operator \(\varphi \) in (5.3) is the identity operator. The position of the string relative to the platform is given by the embedding map \(z \in {\text {Emb}}([0,1], {\mathbb {C}})\) subject to the conditions \(z(0) = 0\) and \(z_s (0) = 1\).
The absolute position of the points of the string is
and thus the velocity of the points of the string is given by
Written relative to the platform’s frame, the absolute velocity of the point z of the string reads
That is, the operator \(\psi \) is defined by \(\dot{z} = \psi _z \xi = z_s \xi \).
The Lagrangian reads
where m and J are the mass and inertia of the platform. Recall that the quantities \(\omega \) and v are s-independent. The components of the mechanical connection denoted here \({\mathcal {A}}_\theta \), \({\mathcal {A}}_w\), and \({\mathcal {A}}_{\bar{w}}\), so that
are computed to be
The reduced dynamics is given by Eq. (5.4) and (5.5), where \([\cdot , \cdot ]_r^{*}\) is the dual of the bracket (3.13) and \([\cdot , \cdot ]_{\mathfrak g_r}^{*}\) is the operator \({\text {ad}}^{*}\) on the algebra \(\mathfrak {se}(2)\).
5.2 Nonholonomic Systems with Symmetry
Recall that the manifolds M and Q are principal fiber bundles. The Lagrangian L and constraint distribution \({\mathcal {D}}\) are now invariant with respect to the induced action of G on P. Recall also that \({\mathcal {D}}\) is a (locally) splitting subbundle of P.
Assumption 5.5
The constraints and the orbit directions span the entire tangent space to the configuration space:
for each \(q \in Q\).Footnote 13 If this condition is satisfied, we say that the principal case holds.
Let \( \mathcal {S} \) be the subbundle of \({\mathcal {D}}\) whose fiber at q is \(\mathcal S_q = {\mathcal {D}}_q \cap T_q {\text {Orb}}(q)\). We assume here that \(\mathcal S_q \ne \{0\}\).Footnote 14 The subbundle \(\mathcal S\) is invariant with respect to the action of G on P induced by the left action of G on M. We assume that \({\mathcal {S}}\) is a (locally) splitting subbundle.Footnote 15 Therefore, for each \(q\in Q\) there exist a subspace \(\mathcal U_q \subset {\text {Orb}}(q)\) such that \({\text {Orb}}(q) = \mathcal S_q \oplus \, \mathcal U_q\) and the subbundle \(\mathcal U\) is (locally) splitting and G-invariant. Since the distributions \(\mathcal S\) and \(\mathcal U\) are left-invariant, for each \(r \in Q/G\) there exist subspaces \(\mathfrak b^{\mathcal S}_r\) and \(\mathfrak b^{\mathcal {U}}_r\) of the Lie algebra \(\mathfrak g_r = \mathfrak b^{\mathcal S}_r\! \oplus \mathfrak b^{\mathcal {U}}_r\) and an operator \(\varphi _r:\mathfrak g_r \rightarrow \mathfrak g\) such that in a local trivialization
The operators \(\varphi _r\) are constructed in such a way that \(\mathfrak b_r^{\mathcal S}\) and \(\mathfrak b_r^{\mathcal {U}}\) are fixed subspaces of \(\mathfrak g_r\). Such an arrangement is necessary for accounting for constraint subspaces that change their location in the Lie algebra \(\mathfrak g\) of the symmetry group depending on system’s shape configuration.
Let \(\mathfrak b^{\mathcal S}\) and \(\mathfrak b^{\mathcal {U}}\) be the bundles over Q / G whose fibers are the subspaces \(\mathfrak b^{\mathcal S}_r\) and \(\mathfrak b^{\mathcal {U}}_r\) of \(\mathfrak g_r\). Given \(\zeta \in \mathfrak g_r\), we write its components along these subspaces as \(\zeta ^{\mathcal S}\) and \(\zeta ^{\mathcal {U}}\) so that for each \(r\in Q/G\) we have
Let \({\mathcal {A}}_\text {s}\) be a connection defined in a local trivialization by the formula
where \(\zeta \in \mathfrak g_r\) and where \({\mathcal {A}}\) is a \(\mathfrak g_r\)-valued one-form on Q / G. This form is such that the constraints in a local trivialization read
That is, the \( \mathcal {U} \)-component of the connection \(\mathcal {A}\) is defined by the constraints. The \(\mathcal {S}\)-component of \(\mathcal {A}\) is arbitrary at the moment; later on we will see how the structure of the Lagrangian affects the choice of this component.
As in Sect. 5.1, introduce the operator \(\psi _r:W_B \rightarrow T_r (M/G)\) (locally), so that \(\dot{r} = \mathop {\psi _r} \xi \) and \(\xi \) is now the shape velocity component used instead of \(\dot{r}\). Utilizing the constrained Hamel Eq. (4.4) and the G-invariance of the constraint distribution \(\mathcal {D}\), one obtains the reduced nonholonomic equations of motion from (5.4) and (5.5) by projecting equation (5.5) onto the fibers of the bundle \({\mathfrak {b}}^*_{\mathcal {S}}\) and imposing constraints, i.e., setting \(\Omega = \Omega ^{\mathcal {S}}\). Summarizing, we have:
Theorem 5.6
The strong form of the reduced nonholonomic dynamics is given by the equations
coupled with the kinematic shape equation
In the above, the Lagrangian l is written as a function of \((r, \xi , \Omega )\), \(\mathcal {B}\) is the curvature of \(\mathcal {A}\) on Q / G (see (5.1)), and the quantities \(\gamma \) and \(\mathcal {E}\) are defined as in Theorem 5.2. Note that the partial derivatives of l in (5.12) and (5.13) are evaluated before setting \(\Omega = \Omega ^{\mathcal S}\), i.e., before imposing the constraints.
The shape and momentum Eqs. (5.12) and (5.13) can be rewritten as
where \(l_\text {c} (r, \xi , \Omega ^{\mathcal S}):= l(r,\xi , \Omega ^{\mathcal S})\) is the constrained reduced Lagrangian. These equations follow directly from (5.12) and (5.13) as
Note that in general
5.3 The Nonholonomic Connection
In the rest of the section, we assume that the Lagrangian equals the kinetic minus potential energy, that the kinetic energy is given by a (weak) Riemannian metric on the manifold M, and that the principal case holds (see Assumption 5.5).
Definition 5.7
The nonholonomic connection \({\mathcal {A}}^{\mathrm {nhc}}\) is, by definition, the connection on the principal bundle \(Q \rightarrow Q/G\) whose horizontal space at \(q \in Q\) is given by the orthogonal complement to the space \(\mathcal S_q = {\mathcal {D}}_q \cap T_q {\text {Orb}}(q)\) within the space \({\mathcal {D}}_q\).
Similar to the remark on the existence of the mechanical connection, in the infinite-dimensional setting, the nonholonomic connection may fail to exist when the group metric is not strong, but is certain to exist when the symmetry group is finite-dimensional.
Under the assumption that the distribution \(\mathcal {D}\) is invariant and from the fact that the group action preserves orthogonality (since it is assumed to preserve the Lagrangian and hence the kinetic energy metric), it follows that the distribution and the horizontal spaces transform to themselves under the group action. Therefore, the nonholonomic connection in a local trivialization is defined by the formula
where \(y_\text {c}\in \mathfrak g\) and where \({\mathcal {A}}_\text {c}\) is a \(\mathfrak g\)-valued one-form on Q / G. Given \(\dot{q} = (\dot{r}, \dot{g}) \in T_q Q\), the horizontal and vertical components of \(\dot{q}\) in a local trivialization are
In order to reflect the bundle structure associated with velocity constraints, we use Definition 5.7 and the map \(\varphi _r\) and rewrite \(y_\text {c} + {\mathcal {A}}_\text {c}\dot{r}\) in (5.15) as \(\varphi _r (\zeta + {\mathcal {A}} \dot{r})\), where
Using the nonholonomic connection, define the group velocity \(\Omega \in \mathfrak g\) by the formula
The constraints then are represented by formula (5.11).
Let the kinetic energy metric in a local trivialization be written as
The constrained locked inertia tensor \(\mathcal I_{\mathcal S}: \mathfrak b^{\mathcal S} \rightarrow \big (\mathfrak b^{\mathcal S}\big )^{*}\) is given in a local trivialization by
Similarly, define \(\mathcal I_{\mathcal {U}} (r): \mathfrak b^{\mathcal {U}} \rightarrow \big (\mathfrak b^{\mathcal {U}}\big )^{*}\), \(\mathcal I_{\mathcal S \mathcal U} (r): \mathfrak b^{\mathcal {U}} \rightarrow \big (\mathfrak b^{\mathcal S}\big )^{*}\), and \(\mathcal I_{\mathcal U \mathcal S} (r):\mathfrak b^{\mathcal S} \rightarrow \big (\mathfrak b^{\mathcal {U}}\big )^{*}\) by
respectively.
Definition 5.7 implies that the constrained kinetic energy metric written as a function of \((\dot{r}, \Omega ^{\mathcal S})\) is block-diagonal, that is, it reads
Substituting \(\zeta = \Omega ^{\mathcal S} - {\mathcal {A}} \dot{r}\) in (5.16), one concludes that the \(\mathcal S\)-component of the nonholonomic connection satisfies the equation
This equation may or may not have a solution in the infinite dimensional setting. If the restriction of the metric on the subspace \(\mathfrak b^{\mathcal S}\) is strong, the \(\mathcal S\)-component of the nonholonomic connection is well defined and reads
The \(\mathcal U\)-component of the nonholonomic connection is defined by the constraints. As in Sect. 5.2, the reduced dynamics is given by Eqs. (5.12)–(5.14).
Example 5.8
Consider the planar motion of an inextensible string coupled with the two identical Chaplygin sleighs. Assume that the ends of the string are attached at the contact points of the sleighs and the plane. See Fig. 3 for the illustration of the basic configuration.
The manifold \(M (= Q) \) for this system is \({\text {SE}}(2)\times {\text {SE}}(2)\times {\text {Emb}}([0,1], {\mathbb {C}}) \), and the system is invariant with respect to the diagonal action of the group \({\text {SE}}(2)\) on M. We identify the symmetry group with configurations of the sleigh at \(s = 0\). In such a representation, the position of the sleigh at the front end of the string (i.e., at \(s = 0\)) is characterized by the element \((\hbox {e}^{i\theta _0}, w_0) \in {\text {SE(2)}}\), while the relative position of the sleigh at the opposite end of the string is \((\hbox {e}^{i\phi _1}, z_1) \in {\text {SE(2)}}\), so that the absolute position of that sleigh is
The angular and linear velocity (relative to the body frame) of the sleigh at the front end of the string are
while the angular and linear velocity of the other sleigh are
Thus, the constraints are
Formulae (5.17) define the operator \(\varphi \) and the constraint component of the nonholonomic connection. Both are nontrivial for this coupled system.
Recall that the nonholonomic connection modifies the elements of the Lie algebra of the symmetry group. The equations of motion that use such ‘shifted’ Lie algebra elements are useful in numerous situations, including stability analysis of relative equilibria. However, one may verify that this form of equations is rather complicated for the system considered here. Below we give a simpler representation of system’s dynamics using the operators introduced earlier for the sleigh and string.
Using the notations introduced in Sect. 2.5 and Example 3.3, parametrize the configuration spaces of the sleighs as \((\theta _j, w_j)\), \(j =0,1\), where \(w_j=x_j+i y_j\), and denote the absolute position of the string by \(w\in {\text {Emb}}([0,1], {\mathbb {C}})\). The Lagrangian reads
Coupling is accomplished by matching the position of the sleighs and the string’s ends:
That is, the string is attached at and can rotate freely around the contact point of each sleigh.
The constrained Hamel equations for this system read
where, by definition, \(\lambda _0:=\lambda |_{s=0}\), \(\lambda _1:= \lambda |_{s=1}\), \(w_s^0:= w_s |_{s=0}\), and \(w_s^1:= w_s |_{s=1}\). Equations (5.20), (5.22), (5.24) and the left-hand sides of (5.21), (5.23) have been obtained earlier. The right-hand sides of (5.21) and (5.23) account for the coupling constraints (5.19) and are obtained as projections of the external terms (i.e., those due to integration by parts) associated with the Lagrange multiplier \(\lambda \) in (5.18) along the sleigh directions.
5.4 Systems with Shape Constraints
Here we study systems with a ‘slim’ constraint distribution. That is, at each \(q \in Q\), the complimentary subspace of \({\mathcal {D}}_q \subset T_q M\) is ‘larger’ than the tangent space to the group orbit at q. While this may happen when the symmetry group is infinite-dimensional as well as when the system is finite-dimensional, our main objects of interest here are systems with finite-dimensional symmetry group and constraint distributions of infinite codimension. Assumption 5.5 does not hold for such systems, and the formalism of Bloch et al. (1996a) and its infinite-dimensional analogue presented in Sect. 5.2 should be modified.
Example 5.9
Consider the Chaplygin sleigh with an inextensible string attached at and allowed to rotate around the contact point of the sleigh and the plane, as shown in Fig. 4. Assume that the string is constrained as in Example 4.3. This system is \({\text {SE}}(2)\)-invariant. Using formulae (5.9) and (5.10) there, the condition of vanishing normal velocity component is equivalent to
Here the string position z is measured relative to the sleigh, and \(\omega \) and v are the sleigh’s angular and linear velocity measured relative to its frame. Formula (5.25) imposes infinitely many constraints on the system, one for each \(s \in [0,1]\).
For a generic string placement, constraints (5.25) exhaust the Lie algebra \(\mathfrak {se}(2)\). Indeed, the vectors that span the constraint subspaces for \(s_0 = 0\) and \(0< s_1< s_2 <1\) are verified to be \(\mathbb R\)-independent if and only if
which generically holds. The rest of the constraints are effectively imposed on the shape degrees of freedom of the system, and thus Assumption 5.5 does not hold in this example and is void in the rest of the paper.
We begin by developing a special version of the formalism for unconstrained G-invariant systems. Using notations introduced in Sect. 5.1, this system is characterized by the Lagrangian \(l(r, \xi , \zeta )\). Recall that \(\dot{r} = \psi _r \xi \). In order to simplify the exposition, we assume here that \(\varphi _r\) is the identity operator so that \(\dot{g} = L_{g^{*}} \zeta \). Instead of the velocity components (5.2), we will use
where \({\mathcal {C}}:\mathfrak g \rightarrow W_B\) is a linear map. It depends parametrically and smoothly on \(r \in U\), where U is an open subset of Q / G. This is motivated by the structure of the constraints (5.25). Thus, for the operator \(\Psi _q \) we have
The inverse of \(\Psi _q\) is
One can think of \({\mathcal {C}}\) as a connection on the local fiber bundle whose base and fiber are open subsets of G and Q / G, respectively.
Theorem 5.10
The strong Lagrange–Poincaré equations are
Proof
The key step is to establish the bracket operation associated with the operator \(\Psi _q\). One begins with the computation of the Jacobi–Lie bracket
where \(\eta _i\) and \(\zeta _i\) are constant vectors. For the M / G-component of the Jacobi–Lie bracket, we have, using the chain rule:
For the G-component, we obtain
Applying the inverse operator \(\Psi _q\) outputs the bracket operation on \(W_B \oplus \mathfrak g\):
Pairing with an element \((\alpha , \beta ) \in W_B^{*} \oplus \mathfrak g^{*}\), we obtain:
which yields the components of the dual bracket,
Utilizing the dual bracket (5.26) and (5.27) and composing Eq. (3.9) completes the proof. \(\square \)
Next, assume that there exists a splitting distribution \({\mathcal {D}}\) along with its complimentary distribution \(\mathcal U\) such that the constraint \(\eta \in {\mathcal {D}}_q\) reads
Thus, the constraint is defined by the operator \({\mathcal {C}}^{\mathcal {U}}:\mathfrak g \rightarrow \mathcal U _q\). This representation is, in general, local, i.e., (5.28) represents the constraint on an open subset U of the shape space Q / G. The operator \({\mathcal {C}}^{\mathcal {U}}\) is then extended to \({\mathcal {C}}: \mathfrak g \rightarrow W_B\) at each \(r \in U\).
There are, of course, infinitely many such extensions, and one needs to utilize additional requirements to single out the desired one. Utilizing the general Hamel’s equations established in Sect. 4.2, the reduced dynamics becomes
If desired, one can give the representation of these equations using the constrained Lagrangian and constrained versions of the velocity components \(\eta \) and \(\zeta \), as in the approach in Sect. 5.2.
Example 5.9 (continued). Consider again the motion of the Chaplygin sleigh with the flexible string attached. The normal velocity component of the string at each point is constrained to be zero, as in (5.25). The system is \({\text {SE(2)}}\)-invariant and the symmetry group is also the configuration space of the sleigh. The Lagrangian of the system is the sum of the kinetic energy of the sleigh and the string Lagrangian and reads
where \(\xi = \bar{z}_s \dot{z}\) is string’s shape velocity. The coupling is accomplished by setting
and the constraint is given by formula (5.25). We assume from now on that \(v>0\), i.e., the string trails the sleigh.
Define the quantity \(\eta \) and the operator \({\mathcal {C}}\) by the formula
so that \(\eta \) is string’s absolute velocity. Doing so results in the Lagrangian
The constraint in this representation reads \(\eta = \bar{\eta }\), which is formula (5.28) written for the string. Equations (5.29) and (5.30) for the sleigh-string system become
where v and \(\eta \) are real-valued. These equations should be amended with the coupling conditions
Note also that the position of the front end of the string coincides with the position of the contact point of the sleigh and the plane.
Arguing as in Example 4.3, one concludes that \(\eta \) is independent of s. Thus, equation (5.33) becomes
Equation (5.31) implies \(\omega = \mathrm {const}\).
The tension \(\lambda \) is obtained by integrating (5.35) with respect to s and, since \(\lambda |_{s=1} = 0\), we conclude that
Therefore, \(\lambda |_{s=0} = -\dot{\eta }\), which in combination with (5.32) and (5.34) yields \(v = \mathrm {const}\). The velocity coupling condition then implies that the blade moves at the constant speed \(\eta = v\). Using (5.36), we conclude that \(\lambda = 0\).
Summarizing, the Chaplygin sleigh with the constrained string attached generically undergoes uniform circular motion. Nongeneric trajectories are straight lines. The string (possibly after some period of time) follows the trajectory of the contact point of the sleigh.
It is interesting to point out that in this example the shape dynamics (string’s motion) is modulated by the group dynamics (skate’s motion). This is the opposite of typical reconstruction in finite-dimensional constrained systems discussed in Bloch et al. (1996a).
We note also that the qualitative dynamics of this system—uniform circular or straight line motion—is consistent with the behavior of integrable Hamiltonian systems. One may raise the question of whether it is integrable in a more precise sense—with infinitely many conserved quantities. We intend to return to this issue in a forthcoming publication.
6 Conclusions
This paper introduced Hamel’s formalism for infinite-dimensional mechanical systems, proved some general results, and illustrated them with examples. A number of topics remain outside the scope of the paper. For instance, the nature of nonholonomic constraints in the infinite-dimensional setting should be studied in more detail and will be addressed in a forthcoming publication. We also intend to develop Hamel’s formalism for field theories and apply it for constructing structure-preserving integrators for constrained continuum mechanical systems.
Examples in this paper were intentionally kept relatively simple for pedagogical reasons, and solutions there are classical, i.e., not generalized. More realistic examples, including systems with generalized solutions, will be treated in the forthcoming paper on the field-theoretic Hamel formalism.
Notes
Constraints are nonholonomic if and only if they cannot be rewritten as position constraints.
The map i satisfies the following property: A map \(f:N\rightarrow Q\) is smooth if and only if \(i\circ f:N\rightarrow M\) is smooth. Note that Q is usually not a submanifold of M. See Kriegl and Michor (1997) for details.
Here, U can be thought of as an open subset of a convenient space.
Each vector from \(T_q M\) can be represented this way.
If \(\frac{\delta L}{\delta q}-\frac{d}{\mathrm{d}t}\frac{\delta L}{\delta \dot{q}}\ne 0\) at some \(t_0\), and \(i_{*}T_{q(t_0)} Q\) is dense in \(T_{q(t_0)} M\), there exists \(X\in T_{q(t_0)} Q\) such that \(\big \langle \frac{\delta L}{\delta q}-\frac{d}{\mathrm{d}t}\frac{\delta L}{\delta \dot{q}},X\big \rangle >0\). Using continuity, it is straightforward to construct a variation of the curve q(t) for which \(\int _a^b \big \langle \frac{\delta L}{\delta q}-\frac{d}{\mathrm{d}t}\frac{\delta L}{\delta \dot{q}},\delta q(t)\big \rangle \,\mathrm{d}t>0\), which is a contradiction.
Nonsplitting closed subspaces already exist in Banach spaces; for more information on splitting subspaces, see Domański and Mastyło (2007) and references therein. The continuity of \(\pi ^{{\mathcal {D}}}\) in a Banach space is a consequence of the closed graph theorem. For more spaces with this property see Jarchow (1981).
This holds if \(T_q M\) is a Fréchet space and A(q) has closed range, see Jarchow (1981) for details.
Unlike the regular sleigh, for which \(\dot{\theta }= \mathrm {const}\).
According to Kriegl and Michor (1997), all finite-dimensional Lie groups and all known infinite-dimensional Lie groups are regular.
Here and below, the subscripts ‘c’ and ‘s’ stand for ‘convective’ and ‘spatial’, respectively.
The orthogonal complement may not exist in nonHilbert spaces.
This is called the dimension assumption in the finite-dimensional setting, see Bloch et al. (1996a) for details.
If \(\mathcal S_q = \{0\}\), a set of nonholonomic constraints is said to be purely kinematic.
Note that the intersection of two splitting subspaces may fail to be splitting already in a Banach space. For the intersection of two subspaces to be splitting, additional assumptions are necessary. According to Bill Johnson, asking that two subspaces are norm one complemented and the space itself is uniformly convex is sufficient. See http://mathoverflow.net/questions/85492/intersection-of-complemented-subspaces-of-a-banach-space for details.
References
Arnold, V.I.: Sur la géometrie differentialle des groupes de Lie de dimension infinie et ses applications à l’hydrodynamique des fluids parfaits. Ann. Inst. Fourier 16, 319–361 (1966)
Arnold, V.I., Kozlov, V.V., Neishtadt, A.I.: Mathematical Aspects of Classical and Celestial Mechanics, 3rd edn. Springer, Berlin (2006)
Ball, K., Zenkov, D.V., Bloch, A.M.: Variational structures for Hamel’s equations and stabilization. IFAC Proc. 45, 178–183 (2012)
Ball, K.R., Zenkov, D.V.: Hamel’s formalism and variational integrators. In: Chang, D.E., Holm, D.D., Patrick, G., Ratiu, T. (eds.) Geometry, Mechanics, and Dynamics: the Legacy of Jerry Marsden, Fields Institute Communications, vol. 73, pp. 477–506. Springer, Berlin (2015)
Binz, E., de León, M., Martín de Diego, D., Socolescu, D.: Nonholonomic constraints in classical field theories. Rep. Math. Phys. 49, 151–166 (2002)
Bloch, A.M.: Nonholonomic Mechanics and Control, Interdisciplinary Applied Mathematics, vol. 24, 2nd edn. Springer, New York (2015)
Bloch, A.M., Krishnaprasad, P.S., Marsden, J.E., Murray, R.: Nonholonomic mechanical systems with symmetry. Arch. Ration. Mech. Anal. 136, 21–99 (1996a)
Bloch, A.M., Krishnaprasad, P.S., Marsden, J.E., Ratiu, T.S.: The Euler–Poincaré equations and double bracket dissipation. Commun. Math. Phys. 175, 1–42 (1996b)
Bloch, A.M., Marsden, J.E., Zenkov, D.V.: Quasivelocities and symmetries in nonholonomic systems. Dyn. Syst. Int. J. 24, 187–222 (2009)
Cendra, H., Holm, D.D., Marsden, J.E., Ratiu, T.S.: Lagrangian reduction, the Euler–Poincare equations, and semidirect products. Am. Math. Soc. Transl. 186, 1–25 (1988)
Cendra, H., Marsden, J.E., Ratiu, T.S.: Geometric mechanics, lagrangian reduction, and nonholonomic systems. In: Enguist, B., Schmid, W. (eds.) Mathematics Unlimited-2001 and Beyond, pp. 221–273. Springer, New York (2001)
Cendra, H., Marsden, J.E., Ratiu, T.S.: Lagrangian Reduction by Stages, Memoirs of the American Mathematical Socity, vol. 152. AMS, Providence (2001b)
Chetaev, N.G.: Theoretical Mechanics. Springer, New York (1989)
Domański, P., Mastyło, M.: Characterization of splitting for Fréchet–Hilbert spaces via interpolation. Mathematische Annalen 338, 317–340 (2007)
Ebin, D.G.: Groups of diffeomorphisms and fluid motion: reprise. In: Chang, D.E., Holm, D.D., Patrick, G., Ratiu, T. (eds.) Geometry, Mechanics, and Dynamics: the Legacy of Jerry Marsden, Fields Institute Communications, vol. 73, pp. 477–506. Springer, Berlin (2015)
Ebin, D.G., Marsden, J.E.: Groups of diffeomorphisms and the motion of an incompressible fluid. Ann. Math. 92, 102–163 (1970)
Ellis, D.C.P., Gay-Balmaz, F., Holm, D.D., Putkaradze, V., Ratiu, T.S.: Symmetry reduced dynamics of charged molecular strands. Arch Ration Mech Anal 197, 811–902 (2010)
Euler, L.: Decouverte d’un nouveau principe de mecanique. Mémoires de l’académie des sciences de Berlin 6, 185–217 (1752)
Euler, L.: Principes généraux de l’état d’équilibre des fluides. Mémoires de l’académie des sciences de Berlin 11, 217–273 (1757a)
Euler, L.: Principes généraux du mouvement des fluides. Mémoires de l’académie des sciences de Berlin 11, 274–315 (1757b)
Euler, L.: Principia motus fluidorum. Novi Commentarii Acad. Sci. Petropolitanae 6, 271–311 (1761)
Filipović, D., Teichmann, J.: Existence of invariant manifolds for stochastic equations in infinite dimension. J. Funct. Anal. 197, 398–432 (2003)
Gay-Balmaz, F., Marsden, J.E., Ratiu, T.S.: Reduced variational formulations in free boundary continuum mechanics. J. Nonlinear Sci. 22, 463–497 (2012)
Gay-Balmaz, F., Putkaradze, V.: Dynamics of elastic rods in perfect friction contact. Phys. Rev. Lett. 109, 244303 (2012)
Gay-Balmaz, F., Putkaradze, V.: Dynamics of elastic strands with rolling contact. Phys. D 294, 6–23 (2015)
Gay-Balmaz, F., Yoshimura, H.: Dirac reduction for nonholonomic mechanical systems and semidirect products. Adv. Appl. Mech. 63, 131–213 (2015)
Hamel, G.: Die Lagrange–Eulersche gleichungen der mechanik. Z. Math. Phys. 50, 1–57 (1904)
Hamilton, W.R.: On a general method in dynamics, part I. Philos. Trans. R. Soc. Lond. 124, 247–308 (1834)
Hamilton, W.R.: On a general method in dynamics, part II. Philos. Trans. R. Soc. Lond. 125, 95–144 (1835)
Hiltunen, S.: A Frobenius theorem for locally convex global analysis. Monatshefte für Mathematik 129, 109–117 (2000)
Holm, D.D., Marsden, J.E., Ratiu, T.S.: The Euler–Poincaré equations and semidirect products with applications to continuum theories. Adv. Math. 137, 1–81 (1998)
Jarchow, H.: Locally Convex Spaces. Teubner, Stuttgart (1981)
Khesin, B., Lee, P.: A nonholonomic Moser theorem and optimal transport. J. Symplectic Geom. 7, 381–414 (2009)
Kriegl, A., Michor, P.W.: The Convenient Setting of Global Analysis, Mathematical Surveys and Monographs, vol. 53. AMS, Providence (1997)
Krishnaprasad, P.S., Tsakiris, D.P.: \(G\)-Snakes: Nonholonomic Kinematic Chains on Lie Groups. ISR Technical Report (1994)
Lagrange, J.L.: Mécanique Analytique. Chez la Veuve Desaint, Paris (1788)
Marsden, J.E.: Lectures on Mechanics, London Mathematical Society Lecture Note Series 174. Cambridge University Press, Cambridge (1992)
Marsden, J.E., Hughes, T.J.R.: Mathematical Foundations of Elasticity. Prentice Hall, Upper Saddle River (1983)
Marsden, J.E., Ratiu, T.S.: Introduction to Mechanics and Symmetry, Texts in Applied Mathematics, vol. 17, 2nd edn. Springer, New York (1999)
Neimark, JuI, Fufaev, N.A.: Dynamics of Nonholonomic Systems, Translations of Mathematical Monographs, vol. 33. AMS, Providence (1972)
Omori, H.: Infinite-Dimensional Lie Groups, Translations of Mathematical Monographs, vol. 158. AMS, Providence (1997)
Ostrowski, J., Lewis, A., Murray, R., Burdick, J.: Nonholonomic mechanics and locomotion: the snakeboard example. In: Proceedings IEEE International Conference on Robotics and Automation, pp. 2391–2397. (1994)
Poincaré, H.: Sur une forme nouvelle des équations de la mécanique. CR Acad. Sci. 132, 369–371 (1901)
Poincaré, H.: Sur la precession des corps deformables. Bull. Astron. 27, 321–356 (1910)
Suslov, G.K.: Theoretical Mechanics, 3rd edn. GITTL, Moscow-Leningrad (1946)
Teichmann, J.: A Frobenius Theorem on convenient manifolds. Monatshefte für Mathematik 134, 159–167 (2001)
Vankerschaver, J.: The momentum map for nonholonomic field theories with symmetry. Int. J. Geom. Meth. Mod. Phys. 2, 1029–1041 (2005)
Vankerschaver, J.: A class of nonholonomic kinematic constraints in elasticity. J. Phys. A: Math. Theor. 40, 3889–3913 (2007a)
Vankerschaver, J.: (2007b), Continuous and Discrete Aspects of Lagrangian Field Theories with Nonholonomic Constraints. Ph.D. Thesis, Ghent University
Yoshimura, H., Marsden, J.E.: Dirac structures in lagrangian mechanics. Implicit lagrangian systems. J. Geom. Phys. 57, 133–156 (2006a)
Yoshimura, H., Marsden, J.E.: Dirac structures in lagrangian mechanics. Variational structures. J. Geom. Phys. 57, 209–250 (2006b)
Yoshimura, H., Marsden, J.E.: Reduction of dirac structures and the Hamilton–Pontryagin principle. Rep. Math. Phys. 60, 381–426 (2007)
Zenkov, D.V., Leok, M., Bloch, A.M.: Hamel’s Formalism and Variational Integrators on a Sphere. Proc. CDC 51, 7504–7510 (2012)
Acknowledgments
We would like to thank Professors Yongxin Guo, Francois Gay-Balmaz, Vakhtang Putkaradze, and Tudor Ratiu for valuable discussions, and the reviewers for helpful remarks.
The research of AMB was partially supported by NSF Grants DMS-1207893, DMS-1613819, INSPIRE-1363720, and the Simons Foundations. The research of DS was partially supported by the China Scholarship Council. DS wishes to thank support and hospitality of North Carolina State University during his visit. The research of YBK was partially supported by NDSEG Fellowship. The research of DVZ was partially supported by NSF grants DMS-0908995 and DMS-1211454. DVZ would like to acknowledge support and hospitality of the Beijing Institute of Technology, where a part of this work was carried out.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Paul Newton.
Rights and permissions
About this article
Cite this article
Shi, D., Berchenko-Kogan, Y., Zenkov, D.V. et al. Hamel’s Formalism for Infinite-Dimensional Mechanical Systems. J Nonlinear Sci 27, 241–283 (2017). https://doi.org/10.1007/s00332-016-9332-7
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00332-016-9332-7