1 Variational approaches for the Kepler problem

The equation of motions for the Kepler problem:

$$\begin{aligned} \mu \ddot{x}= & {} -\alpha \frac{x}{|x|^3}\, \end{aligned}$$
(1.1)

is the Euler–Lagrange equation for the Lagrangian action functional:

$$\begin{aligned} I_{\mu ,\alpha ,T}(x)= & {} \int _0^T \frac{\mu }{2}|\dot{x}|^2 + \frac{\alpha }{|x|}\mathrm{d}t. \end{aligned}$$
(1.2)

Here, \(\mu \) and \(\alpha \) are positive constants involving masses. We call solutions of (1.1) Keplerian orbits and \(I_{\mu ,\alpha ,T}\) the Keplerian action functional. It is natural to regard Keplerian orbits as extrema of this functional in suitable function spaces. To include generalized solutions and allow more functional analytic tools, a natural choice of the function space would be \(H^1_\text {loc}(\mathbb R,\mathbb C)\) or \(H^1(\mathbb R/T\mathbb Z,\mathbb C)\).

Speaking of variational approaches for the Kepler problem, one should begin with the pioneering work of Gordon [17], in which elliptic Keplerian orbits, together with periodic collision–ejection orbits, were characterized as minimizers of the Keplerian action functional among non-contractible \(H^1\)-loops in the punctured complex plane. This observation played an important role in later discoveries of periodic solutions for several equivariant n-body problems, especially those which based on global estimates for the action functional. See [5, 6, 8, 1214, 23] for examples and further references. Another setting by the author characterizes some of these orbits as minimizing paths between two rays emanating from the attractive focus. This viewpoint is useful in showing existence of retrograde orbits of the three-body problem without the equal mass constraint [8, 10].

Parabolic orbits for the n-body problem, as well as for some other singular Hamiltonian systems, arise naturally in the study of generalized solutions near collisions. They are interesting from variational perspective, because they can be characterized as free-time action-minimizers, and can, therefore, be linked with the Mather and weak KAM theories. Detailed discussions can be found in [2, 3, 18, 19].

Given two different points with the same positive distance from the attractive focus, the parabolic collision–ejection path which connects them has larger action value than some direct and indirect Keplerian arcs which connects them with the same transfer time. This result was attributed to C. Marchal in [4] and proved in [16, 21]. Similar proofs have also appeared in [20, 22]. Here, we will give another proof, a proof which we think might be the simplest, more direct and informative. The proof is more informative, because it not only shows the existence of Keplerian arc will less action value, but also specifies eccentricities, semi-latus rectum, and action values of such hyperbolic Keplerian arcs. This result allows us to characterize hyperbolic Keplerian orbits as action minimizers among paths from a point to a ray emanating from the attractive focus. Details will be shown in Sect. 3.

2 Some preliminaries

For convenience, here, we briefly review some facts about hyperbolic Keplerian orbits. We refer readers to two introductory books [1, Sect. 4.4], [15, Chap 3] for details.

In terms of polar coordinates \((x,y)=(r\cos \theta ,r\sin \theta )\), solutions of (1.1) with pericenters on \(\mathbb R_+=\{0\}\times [0,\infty )\) are of the form:

$$\begin{aligned} r = \frac{\mathfrak {p}}{1+\mathfrak {e}\cos \theta }, \end{aligned}$$
(2.1)

where \(\mathfrak {p}>0\) is the semi-latus rectum and \(\mathfrak {e}\ge 0\) is the eccentricity. This is Kepler’s first law. Kepler’s second law can be written:

$$\begin{aligned} r^2\dot{\theta } \;=\; \sqrt{\frac{\mathfrak {p}\alpha }{\mu }}. \end{aligned}$$
(2.2)

The energy H of orbit is given by:

$$\begin{aligned} H \;=\; \frac{\alpha (\mathfrak {e}^2-1)}{2\mathfrak {p}}. \end{aligned}$$
(2.3)

Conics described by (2.1) form a two-parameters’ family. For each \(\xi \) in the upper half space \(\mathbb R\times (0,\infty )\), there is a one-parameter family of such conics in the upper half space which emanate from their pericenters on \(\mathbb R_+{\setminus }\{0\}\) and end at \(\xi \). The corresponding Keplerian orbit is uniquely determined if we fix the transfer time.

A natural parametrization for hyperbolic orbits (\(1<\mathfrak {e}\)) is by the hyperbolic anomaly \(\psi _\text {H}\in \mathbb R\):

$$\begin{aligned}&r \;=\; \frac{\mathfrak {p}}{\mathfrak {e}^2-1} (\mathfrak {e}\cosh \psi _\text {H}-1). \end{aligned}$$
(2.4)

The true anomaly \(\theta \) and the hyperbolic anomaly \(\psi _\text {H}\) are related by:

$$\begin{aligned} \displaystyle {\tan \frac{\theta }{2}}= & {} \displaystyle {\sqrt{\frac{\mathfrak {e}+1}{\mathfrak {e}-1}} \tanh \frac{\psi _\text {H}}{2},} \nonumber \\ \displaystyle {\frac{\mathrm{d}\psi _\text {H}}{\mathrm{d}\theta }}= & {} \displaystyle {\frac{\mathfrak {e}\cosh \psi _\text {H}-1}{\sqrt{\mathfrak {e}^2-1}} \;=\; \frac{r}{\mathfrak {p}}\sqrt{\mathfrak {e}^2-1}.} \end{aligned}$$
(2.5)

The hyperbolic Kepler equation relates the hyperbolic anomaly with the transfer time \(\tau \), since the last pericenter passage:

$$\begin{aligned} \sqrt{\frac{\alpha (\mathfrak {e}^2-1)^3}{\mu \mathfrak {p}^3}}\, \tau= & {} \mathfrak {e}\sinh \psi _\text {H}- \psi _\text {H}. \end{aligned}$$
(2.6)

In particular, in the upper half space (\(\psi _\text {H}>0\)), we have \(\mathfrak {e}\sinh \psi _\text {H}- \psi _\text {H}>0\).

Using (2.2) and (2.5), \(\psi _\text {H}\) as a function of \(\tau \) is proportional to the integration of the potential term on \([0,\tau ]\):

$$\begin{aligned} \int _0^\tau \frac{\alpha }{r}\mathrm{d}t \;=\; \alpha \int _0^{\psi _\text {H}(\tau )} \frac{1}{r} \frac{\mathrm{d}t}{\mathrm{d}\theta } \frac{\mathrm{d}\theta }{\mathrm{d}\psi _\text {H}}\, \mathrm{d}\psi _\text {H}\;=\; \sqrt{\frac{\mu \alpha \mathfrak {p}}{\mathfrak {e}^2-1}}\,\psi _\text {H}(\tau ). \end{aligned}$$
(2.7)

3 The main theorem

A simple rectilinear solution of (1.1) is the parabolic ejection orbit \(x_\text {pe}^{(\phi )}\) given by:

$$\begin{aligned} x_\text {pe}^{(\phi )}(t)= & {} c_\text {pe}\, t^\frac{2}{3} e^{i\phi }, \quad c_\text {pe}= \left( \frac{9\alpha }{2\mu }\right) ^\frac{1}{3}. \end{aligned}$$
(3.1)

It is easy to check that this is the only solution for (1.1) of the form \(c\, t^\lambda e^{i\phi }\), it is indeed parabolic, and its action value is:

$$\begin{aligned} I_{\mu ,\alpha ,\tau }(x_\text {pe}^{(\phi )}) \;=\; 2(6\mu \alpha ^2)^\frac{1}{3}\tau ^\frac{1}{3}. \end{aligned}$$
(3.2)

Our main theorem compares these parabolic ejection orbits with some hyperbolic orbits:

Main theorem.  Fix \(\phi \in (0,\pi )\) and \(\tau >0\). Let \(x_\text {pe}^{(\phi )}\) be the parabolic ejection orbit in (3.1). Let \(x^{(\phi )}=r e^{i\theta }\) be the Keplerian orbit such that \(x^{(\phi )}(0)\in \mathbb R_+\) is its pericenter, and \(x^{(\phi )}(\tau )=x_\text {pe}^{(\phi )}(\tau )\). Then, \(x^{(\phi )}\) is hyperbolic and

$$\begin{aligned} I_{\mu ,\alpha ,\tau }(x^{(\phi )})< & {} I_{\mu ,\alpha ,\tau }(x_\text {pe}^{(\phi )}). \end{aligned}$$

Proof

The polar equation for \(x^{(\phi )}\) is of the form (2.1), since the pericenter is on \(\mathbb R_+\). The area of the sector swept by the vector from 0 to \(x^{(\phi )}\) for \(t\in [0,\tau ]\) is

$$\begin{aligned} \frac{\mathfrak {p}^2}{2}\int _0^\phi \frac{1}{(1+\mathfrak {e}\cos \theta )^2} \mathrm{d}\theta \;=\; \frac{1}{2}\int _0^\phi r^2\,\mathrm{d}\theta \;=\; \frac{1}{2}\int _0^\tau r^2\dot{\theta }\,\mathrm{d}t \;=\; \frac{1}{2}\sqrt{\frac{\mathfrak {p}\alpha }{\mu }}\, \tau . \end{aligned}$$

The last identity uses (2.2). Observe that

$$\begin{aligned} \frac{\mathfrak {p}}{1+\mathfrak {e}\cos \phi } \;=\; |x^{(\phi )}(\tau )| \;=\; |x_\text {pe}^{(\phi )}(\tau )| \;=\; \left( \frac{9\alpha }{2\mu }\right) ^\frac{1}{3}\tau ^\frac{2}{3}, \end{aligned}$$

and so

$$\begin{aligned} \tau ^2= & {} \left( \frac{2\mu }{9\alpha }\right) \left( \frac{\mathfrak {p}}{1+\mathfrak {e}\cos \phi }\right) ^3. \end{aligned}$$
(3.3)

Therefore,

$$\begin{aligned} \int _0^\phi \frac{1}{(1+\mathfrak {e}\cos \theta )^2} \mathrm{d}\theta= & {} \frac{\sqrt{2}}{3} \frac{1}{(1+\mathfrak {e}\cos \phi )^\frac{3}{2}}. \end{aligned}$$
(3.4)

By differentiating this identity with respect to \(\phi \), we find

$$\begin{aligned} \mathfrak {e}\sin \phi \;=\; \sqrt{2} \sqrt{1+\mathfrak {e}\cos \phi }, \end{aligned}$$

from which one can square both sides to deduce

$$\begin{aligned} (1+\mathfrak {e}\cos \phi )^2= & {} \mathfrak {e}^2 - 1. \end{aligned}$$
(3.5)

This implies \(\mathfrak {e}>1\); i.e. the orbit \(x^{(\phi )}\) is hyperbolic.

In addition, from (3.3) and (3.5), we have:

$$\begin{aligned} \tau ^2= & {} \left( \frac{2\mu }{9\alpha }\right) \left( \frac{\mathfrak {p}}{\sqrt{\mathfrak {e}^2-1}}\right) ^3. \end{aligned}$$
(3.6)

In terms of hyperbolic anomaly \(\psi _\text {H}=\psi _\text {H}(\tau )\), by (2.4) and (3.5), we find:

$$\begin{aligned} \mathfrak {e}\cosh \psi _\text {H}- 1 \;=\; \frac{\mathfrak {e}^2-1}{1+\mathfrak {e}\cos \phi } \;=\; \sqrt{\mathfrak {e}^2-1}. \end{aligned}$$
(3.7)

The action value for \(x^{(\phi )}\) can be easily evaluated using the hyperbolic eccentric anomaly:

$$\begin{aligned} I_{\mu ,\alpha ,\tau }(x^{(\phi )})= & {} \frac{1}{2}\sqrt{\frac{\mathfrak {p}\mu \alpha }{\mathfrak {e}^2-1}} (\mathfrak {e}\sinh \psi _\text {H}+3\psi _\text {H}). \end{aligned}$$
(3.8)

This follows easily by writing the Lagrangian as \(2\alpha /r+H\) and using (2.3), (2.6), (2.7).

By (3.6), we can rewrite (3.2) in terms of \(\mathfrak {e}\), \(\mathfrak {p}\):

$$\begin{aligned} I_{\mu ,\alpha ,\tau }\left( x_\text {pe}^{(\phi )}\right)= & {} 2\sqrt{2} \sqrt{\frac{\mathfrak {p}\mu \alpha }{\mathfrak {e}^2-1}} (\mathfrak {e}^2-1)^\frac{1}{4}. \end{aligned}$$

Comparing with the above formula for \(I_{\mu ,\alpha ,\tau }(x^{(\phi )})\), all we have to show is

$$\begin{aligned} \mathfrak {e}\sinh \psi _\text {H}+ 3\psi _\text {H}< & {} 4\sqrt{2} (\mathfrak {e}^2-1)^\frac{1}{4}. \end{aligned}$$

By (3.7), we have

$$\begin{aligned} (\mathfrak {e}\cosh \psi _\text {H}-1)^2= & {} \mathfrak {e}^2 -1. \end{aligned}$$

Expanding the left-hand side yields

$$\begin{aligned} \mathfrak {e}\sinh \psi _\text {H}\;=\; \sqrt{2} \sqrt{\mathfrak {e}\cosh \psi _\text {H}-1} \;=\; \sqrt{2}\,(\mathfrak {e}^2-1)^\frac{1}{4}. \end{aligned}$$

From the hyperbolic Kepler equation, we know \(\mathfrak {e}\sinh \psi _\text {H}-\psi _\text {H}>0\). Therefore,

$$\begin{aligned} \mathfrak {e}\sinh \psi _\text {H}+ 3\psi _\text {H}\;<\; 4\mathfrak {e}\sinh \psi _\text {H}\;=\; 4\sqrt{2}\, (\mathfrak {e}^2-1)^\frac{1}{4} \end{aligned}$$

as desired. \(\square \)

Identity (3.4) was also obtained in [16]. The main difference is that our argument bases on properties of the hyperbolic anomaly. As mentioned in the first section, this result in slightly variant forms can be found in [16, 2022]. Apart from these references, an application to the n-center problem can be found in [11]. The proof above is informative, because it identifies eccentricity, semi-latus rectum, hyperbolic anomaly, and action value of the Keplerian arc with smaller action value through equations (3.5)–(3.8).

The following corollary characterizes hyperbolic Keplerian orbits as minimizing paths of the Keplerian action functional in the space of curves from a ray emanating from the attractive focus to a point in space. An application to the four-body problem can be found in [9].

Corollary

Given \(\tau >0\) and \(\xi \in \mathbb R\times (0,\infty )\). Minimizers of the Keplerian action functional on the space

$$\begin{aligned} \Gamma _\tau (\mathbb R_+,\xi ) \;:=\; \{x\in H^1([0,\tau ],\mathbb C): \; x(0)\in \mathbb R_+,\; x(\tau )=\xi \} \end{aligned}$$

are collision-free.

Proof

Existence of action minimizer follows from weak compactness of the space \( \Gamma _\tau (\mathbb R_+,\xi )\), weak lower semi-continuity, and coercivity of the Keplerian action functional (see [7, 17] for instances).

If an action minimizer has collision, then following a standard blow-up argument (see [14, 21] for instances), a limiting minimizing path along the ray \(\mathbb R_+\xi \) is a parabolic ejection orbit, but the main theorem tells us that such an orbit do not have least action value in spaces of the form \(\Gamma _{\tau '}(\mathbb R_+,\xi )\). \(\square \)