Homoclinic dynamics in a restricted four-body problem: transverse connections for the saddle-focus equilibrium solution set

Kepley, Shane; Mireles James, J. D.

doi:10.1007/s10569-019-9890-8

Original Article
Published: 04 March 2019

Volume 131, article number 13, (2019)

Celestial Mechanics and Dynamical Astronomy


Abstract

We describe a method for computing an atlas for the stable or unstable manifold attached to an equilibrium point and implement the method for the saddle-focus libration points of the planar equilateral restricted four-body problem. We employ the method at the maximally symmetric case of equal masses, where we compute atlases for both the stable and unstable manifolds. The resulting atlases are comprised of thousands of individual chart maps, with each chart represented by a two-variable Taylor polynomial. Post-processing the atlas data yields approximate intersections of the invariant manifolds, which we refine via a shooting method for an appropriate two-point boundary value problem. Finally, we apply numerical continuation to some of the BVP problems. This breaks the symmetries and leads to connecting orbits for some nonequal values of the primary masses.


1 Introduction

Illuminating studies by Darwin, Strömgren, and Moulton in the first decades of the Twentieth Century established the importance of numerical calculations in the qualitative theory of Hamiltonian systems (Darwin 1897; Strömgren 1934; Moulton et al. 1920). In particular, their work gave new insights into the orbit structure of the circular restricted three-body problem (CRTBP), a problem already immortalized by Poincaré. Interest in the CRTBP was reinvigorated in the 1960s with the inauguration of the space race and a number of authors including Szebehely and Nacozy (1967), Szebehely and Flandern (1967) harnessed the newly available power of digital computing to settle some questions raised by Strömgren. The interested reader will find a delightful retelling of this story with many additional references in the book of Szebehely (1967).

Motivated by the works just mentioned, in 1973 Henrard proved a theorem settling a conjecture of Strömgren about the role of asymptotic orbits. More precisely, Henrard showed that the existence of a transverse homoclinic for a saddle-focus equilibrium in a two-degree-of-freedom Hamiltonian system implies the existence of a tube of periodic orbits parameterized by energy and accumulating to the homoclinic (Henrard 1973). In the same paper he showed that the period of the orbits in the family goes to infinity and their stability changes infinitely often as they accumulate to the homoclinic. This phenomenon was called the blue sky catastrophe by Abraham (1985) and has been studied by a number of authors including Shilnikov et al. (2014), Devaney (1977).

In 1976 it was further shown by Devaney that such a transverse homoclinic—again for a saddle-focus in a two-degree-of-freedom Hamiltonian system—implies the existence of chaotic dynamics in the energy level of the equilibrium (Devaney 1976). See also the works of Lerman (1991, 2000). Such theorems should be thought of as Hamiltonian versions of the homoclinic bifurcations studied by Shilńikov (1967, 1970a, b). Taken together the results cited so far paint a vivid picture of the rich dynamics near a transverse homoclinic connection in a two-degree-of-freedom Hamiltonian system.

The present study concerns asymptotic orbits in the planar equilateral restricted four-body problem, henceforth referred to as the circular restricted four-body problem (CRFBP). The problem has a rich literature dating at least back to the work of Pedersen (1944, 1952). Detailed numerical studies of the equilibrium set, as well as the planar and spatial Hill’s regions, are found in Simó (1978), in Baltagiannis and Papadakis (2011a), and in Álvarez-Ramírez and Vidal (2009). Mathematically rigorous theorems about the equilibrium set and its bifurcations are proven by Leandro (2006), Barros and Leandro (2011, 2014) (with computer assistance). They show that for any value of the masses there are either 8, 9, or 10 equilibrium solutions with 6 outside the equilateral triangle formed by the primary bodies (see Fig. 1).

Fundamental families of periodic orbits are considered by Papadakis (2016a, b), and by Burgos-García and Delgado (2013a), Burgos-García and Bengochea (2017). A study by Burgos-García, Lessard, and Mireles James proves the existence of some spatial periodic orbits for the CRFBP (Burgos-García et al. 2019) (again with computer assistance). An associated Hill’s problem is derived, and its periodic orbits are studied by Burgos-García (2016), Burgos-García and Gidea (2015).

Regularization of collisions is studied by Alvarez-Ramírez et al. (2014). Chaotic motions were studied numerically by Gidea and Burgos (2003) and by Alvarez-Ramírez and Barrabés (2015). Perturbative proofs of the existence of chaotic motions are found in the work of Cheng and She (2017), She and Cheng (2014), She et al. (2013) and also in the work of Alvarez-Ramírez et al. (2018). Blue sky catastrophes in the CRFBP were previously studied by Burgos-García and Delgado (2013b) and by Kepley and Mireles James (2018). This last reference develops (computer-assisted) methods of proof for verifying the hypotheses of the theorems of Henrard and Devaney.

The main goal of the present work is to study orbits which are homoclinic to a saddle-focus equilibrium solution in the equilateral restricted four-body problem. We apply the parameterization method of Cabré, Fontich, and de la Llave to compute a chart for the stable or unstable manifold in a neighborhood of the equilibrium (Cabré et al. 2003a, b, 2005). Then, we implement the analytic continuation scheme for local invariant manifolds developed by Kalies et al. (2018), where it was applied to some two-dimensional manifolds in the Lorenz system. We adapt this scheme for the CRFBP and compute atlases for the local stable/unstable manifolds attached to a saddle-focus equilibrium. By an atlas, we mean a collection of analytic maps or charts of the form, $P :[-1, 1]^2 \rightarrow \mathbb {R}^4$, where the image of P lies in the stable or unstable manifold. The union of these charts is a piecewise approximation for a large portion of the manifold away from the equilibrium. For a more formal definition, see any standard text on differential geometry. The charts are computed using high-order polynomial approximations with algorithms that exploit automatic manipulations of formal series.

After computing the stable/unstable manifold atlases, we post-process to find approximate intersections. Once a potential intersection is located, we refine the approximation using a Newton scheme for a two-point boundary value problem as in the classical work of Doedel and Friedman (1989), Doedel et al. (1997). In the case of the CRFBP, our algorithm identifies a large collection of connecting orbits which are naturally ordered by connection time. We focus on the maximally symmetric case of equal masses, which we refer to as the triple Copenhagen problem. We prove that a rotational symmetry in this case reduces the complexity of the atlas computations by a factor of 3.

The algorithm for producing the atlases utilizes an adaptive subdivision routine to carefully control errors. This results in a large number of charts, on the order of tens of thousands, in only a few minutes of computation time. These computations are expensive in terms of memory usage, and it is impractical to recompute the atlases for a large number of parameter values, at least given the resources of the present study, namely laptop/desktop computers running single threads. Instead, after computing an ensemble of connecting orbits for the triple Copenhagen problem, we apply numerical continuation to the boundary value problem describing the homoclinics. That is, we use the connections found for the equal mass case as a jumping off point for exploring nearby—but nonsymmetric—mass parameters. Continuation of the connecting orbits is much more efficient than continuing the entire invariant manifold atlas.

As is well known, the bifurcation structure of the homoclinic continuation problem in the Hamiltonian setting is rich. We do not attempt automatic tracking of new branches, nor do we follow folds. A more systematic study of the branching would make an excellent topic for future study, perhaps by combining our invariant manifold atlas data with powerful continuation software such as AUTO (Champneys et al. 1996).

We emphasize that our restriction to the equal masses case is due to convenience and is not a technical restriction on the method itself. Our atlas algorithm applies to any choice of parameters or even to other Hamiltonian systems. Thus, even though we abandon the branch whenever the homoclinic continuation algorithm fails, we always have the ability to dig deeper into the cause of failure by running the full atlas computation from scratch.

We remark that our method is deployed in the full phase space and does not require choosing a fixed surface of section in which to study intersections of the invariant manifolds. This is advantageous as many problems do not admit a single section for which the return map is topologically conjugate to the true dynamics. Considering the intersections of the stable/unstable manifolds in a particular section may not reveal all the connecting orbits. Moreover, the first intersections to appear in phase space may not be the first to appear in a given section. Indeed, projecting to a section can introduce discontinuities which make it impossible to precisely formulate notions like “first intersection.” The great virtue of a surface of section (restricted to an energy level) is that it leads—at least in the case of a two-degree-of-freedom Hamiltonian—to a two-dimensional representation of the dynamics. We remark that the methods of the present work generalize to systems with three or more degrees of freedom, where considering surfaces of section is less fruitful.

2 Saddle-focus equilibrium solutions of the equilateral CRFBP

In this section, we review well known results about the set of equilibrium solutions in the CRFBP, focusing on material which informs the calculations carried out in the remainder of the work. We are especially interested in the number and location of saddle-foci and in how these depend on the mass ratios. First, we recall the mathematical formulation of the problem and some of its elementary properties.

2.1 The planar equilateral circular restricted four-body problem

Consider three particles with masses $0< m_3 \le m_2 \le m_1 < 1$, normalized so that

$$\begin{aligned} m_1 + m_2 + m_3 = 1. \end{aligned}$$

These massive particles are referred to as the “primaries.” Suppose that the primaries are located at the vertices of a planar equilateral triangle, rotating with constant angular velocity. That is, we assume that the three massive bodies are in the triangular configuration of Lagrange. We choose a co-rotating coordinate frame which puts the triangle in the xy-plane and fixes the center of mass at the origin. We orient the triangle so that the first primary is on the negative x-axis, the second body is in the lower right quadrant, and the smallest body is in the upper right quadrant. Once in co-rotating coordinates, we are interested in the dynamics of a fourth, massless particle with coordinates (x, y), moving in the gravitational field of the primaries. The situation is illustrated in Fig. 1.

We write $(x_1, y_1)$, $(x_2, y_2)$ and $(x_3, y_3)$ to denote the locations of the primary masses. Let

$$\begin{aligned} K = m_2(m_3 - m_2) + m_1(m_2 + 2 m_3). \end{aligned}$$

Taking into account the normalizations discussed above, the precise positions of the primary bodies are given by the formulas

$$\begin{aligned} x_1= & {} \frac{-|K| \sqrt{m_2^2 + m_2 m_3 + m_3^2}}{K}, \quad \quad \quad \quad y_1 = 0,\\ x_2= & {} \frac{|K|\left[ (m_2 - m_3) m_3 + m_1 (2 m_2 + m_3) \right] }{ 2 K \sqrt{m_2^2 + m_2 m_3 + m_3^2} } \quad \quad y_2 = \frac{-\sqrt{3} m_3}{2 m_2^{3/2}} \sqrt{\frac{m_2^3}{m_2^2 + m_2 m_3 + m_3^2}}\\ x_3= & {} \frac{|K|}{2 \sqrt{m_2^2 + m_2 m_3 + m_3^2}}, \quad \quad \quad y_3 = \frac{\sqrt{3}}{2 \sqrt{m_2}} \sqrt{\frac{m_2^3}{m_2^2 + m_2 m_3 + m_3^2}}. \end{aligned}$$
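The position formulas above can be checked numerically. The following is a minimal sketch, not the authors' code: the function name `primary_positions` is a hypothetical helper introduced only for illustration. It evaluates the formulas for a given mass triple so that one can confirm the center of mass sits at the origin and the triangle has unit side length.

```python
import math

def primary_positions(m1, m2, m3):
    """Evaluate the formulas for (x_j, y_j), j = 1, 2, 3 (illustrative helper)."""
    K = m2 * (m3 - m2) + m1 * (m2 + 2.0 * m3)
    S = math.sqrt(m2**2 + m2 * m3 + m3**2)
    sgn = abs(K) / K                      # the factor |K| / K in the formulas
    root = math.sqrt(m2**3 / S**2)        # sqrt(m2^3 / (m2^2 + m2 m3 + m3^2))

    x1 = -sgn * S
    x2 = sgn * ((m2 - m3) * m3 + m1 * (2.0 * m2 + m3)) / (2.0 * S)
    x3 = abs(K) / (2.0 * S)
    y2 = -math.sqrt(3.0) * m3 / (2.0 * m2**1.5) * root
    y3 = math.sqrt(3.0) / (2.0 * math.sqrt(m2)) * root
    return [(x1, 0.0), (x2, y2), (x3, y3)]

# Example: a nonsymmetric mass triple satisfying m3 <= m2 <= m1 and m1 + m2 + m3 = 1.
masses = (0.5, 0.3, 0.2)
positions = primary_positions(*masses)
center_x = sum(m * p[0] for m, p in zip(masses, positions))
center_y = sum(m * p[1] for m, p in zip(masses, positions))
```

For the equal mass case the same function returns the vertices $(-1/\sqrt{3}, 0)$ and $(1/(2\sqrt{3}), \mp 1/2)$.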

Define the potential function

$$\begin{aligned} \varOmega (x,y) := \frac{1}{2} (x^2 + y^2) + \frac{m_1}{r_1(x,y)} + \frac{m_2}{r_2(x,y)} + \frac{m_3}{r_3(x,y)}, \end{aligned}$$

(1)

where

$$\begin{aligned} r_j(x,y) := \sqrt{(x-x_j)^2 + (y-y_j)^2}, \quad \quad \quad j = 1,2,3, \end{aligned}$$

(2)

and let $\mathbf {x} = (x, \dot{x}, y, \dot{y}) \in \mathbb {R}^4$ denote the state of the system. The equations of motion in the rotating frame are

$$\begin{aligned} \mathbf {x}' = f(\mathbf {x}), \end{aligned}$$

where

$$\begin{aligned} f(x, \dot{x}, y, \dot{y}) := \left( \begin{array}{c} \dot{x} \\ 2 \dot{y} + \varOmega _x(x, y) \\ \dot{y} \\ -2 \dot{x} + \varOmega _y(x, y) \\ \end{array} \right) . \end{aligned}$$

(3)

The system conserves the quantity

$$\begin{aligned} E(x, \dot{x}, y, \dot{y})&= -\left( {\dot{x}}^2 + {\dot{y}}^2 \right) + 2\varOmega (x,y), \end{aligned}$$

(4)

which is called the Jacobi integral. Note that E is smooth—in fact real analytic—away from the primaries. The zero velocity curves are defined by fixing a value of the energy and setting $\dot{x}, \dot{y}$ to zero. These curves are useful for understanding the structure of the phase space and are illustrated in Fig. 2.
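As an illustration of Eqs. (1)-(4), the sketch below implements the potential, the vector field, and the Jacobi integral for the equal mass case, and advances a state with a classical fourth-order Runge-Kutta step so that conservation of $E$ can be observed numerically. The names `omega`, `field`, `jacobi`, and `rk4_step` are hypothetical, and the Runge-Kutta integrator is a stand-in chosen for brevity; it is not the Taylor integration scheme used in this work.

```python
import math

# Equal masses; the positions follow from the formulas for (x_j, y_j)
# evaluated at m1 = m2 = m3 = 1/3.
MASSES = (1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0)
PRIMARIES = ((-1.0 / math.sqrt(3.0), 0.0),
             (0.5 / math.sqrt(3.0), -0.5),
             (0.5 / math.sqrt(3.0), 0.5))

def omega(x, y):
    """Potential function of Eq. (1)."""
    val = 0.5 * (x * x + y * y)
    for m, (xj, yj) in zip(MASSES, PRIMARIES):
        val += m / math.hypot(x - xj, y - yj)
    return val

def grad_omega(x, y):
    """Partial derivatives (Omega_x, Omega_y)."""
    gx, gy = x, y
    for m, (xj, yj) in zip(MASSES, PRIMARIES):
        r3 = math.hypot(x - xj, y - yj) ** 3
        gx -= m * (x - xj) / r3
        gy -= m * (y - yj) / r3
    return gx, gy

def field(state):
    """Vector field of Eq. (3), state ordered as (x, xdot, y, ydot)."""
    x, xd, y, yd = state
    gx, gy = grad_omega(x, y)
    return (xd, 2.0 * yd + gx, yd, -2.0 * xd + gy)

def jacobi(state):
    """Jacobi integral of Eq. (4)."""
    x, xd, y, yd = state
    return -(xd * xd + yd * yd) + 2.0 * omega(x, y)

def rk4_step(state, h):
    """One classical Runge-Kutta step of size h (stand-in integrator)."""
    def shift(s, k, c):
        return tuple(si + c * ki for si, ki in zip(s, k))
    k1 = field(state)
    k2 = field(shift(state, k1, 0.5 * h))
    k3 = field(shift(state, k2, 0.5 * h))
    k4 = field(shift(state, k3, h))
    return tuple(s + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))

# Usage: integrate a short arc and compare the Jacobi integral at both ends.
start = (0.1, 0.0, 0.05, 0.0)
state = start
for _ in range(500):
    state = rk4_step(state, 1e-3)
drift = abs(jacobi(state) - jacobi(start))
```

Setting both velocities to zero in `jacobi` and fixing its value gives the implicit equation for a zero velocity curve.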

As mentioned in the introduction, the CRFBP has exactly 8, 9 or 10 equilibrium solutions, depending on the values of the mass parameters $m_1, m_2,$ and $m_3$. The equilibria are referred to as libration points in the dynamical astronomy literature, and we denote them by $\mathcal {L}_j$ for $0 \le j \le 9$. A typical configuration of these libration points is illustrated in Fig. 1, which also illustrates our naming convention. In the present work we are interested in the linear stability of the libration points. We are especially interested in determining the mass ratios where $\mathcal {L}_j$ with $j = 0, 4, 5, 6$ are saddle-focus equilibria, as opposed to real saddle or center $\times $ center equilibria. This question is considered from a numerical point of view in Sect. 2.2.

We note that, for all values of the masses, the points $\mathcal {L}_j$ with $j = 1, 2, 3, 7, 8, 9$ have either saddle $\times $ center or center $\times $ center stability. The local two-dimensional invariant manifolds attached to all ten libration points are illustrated in Fig. 3, for the case of equal masses.

2.2 Saddle-foci in parameter space

The CRFBP admits as many as four and as few as zero saddle-focus equilibrium points, depending on the mass ratios. We now consider briefly what happens in between these extremes as the masses are varied. The problem is normalized so that $m_1 + m_2 + m_3 = 1$, with $m_3 \le m_2 \le m_1$, so we have that $m_1 \in [1/3, 1]$, $m_2 \in [0, 1/2]$ and $m_3 \in [0, 1/3]$. Considering the 2-simplex in $\mathbb {R}^3$ satisfying these constraints, we see that when $m_1 \in [1/3, 1/2]$ we have

$$\begin{aligned} m_3 \in \left[ -2m_1 + 1, \frac{-1}{2}m_1 +\frac{1}{2} \right] , \end{aligned}$$

while for $m_1 \in [1/2, 1]$ we have

$$\begin{aligned} m_3 \in \left[ 0, \frac{-1}{2}m_1 +\frac{1}{2} \right] . \end{aligned}$$

In either case, once we choose $m_1$ and $m_3$, the value of $m_2$ is determined by

$$\begin{aligned} m_2 = 1 - m_1 - m_3. \end{aligned}$$
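The description of the parameter simplex translates directly into a small helper for sampling admissible masses. This is a sketch under the constraints stated above; `m3_interval` and `mass_triple` are hypothetical names introduced for illustration.

```python
def m3_interval(m1):
    """Admissible range of m3 for a given m1, from m3 <= m2 <= m1 and m1 + m2 + m3 = 1."""
    if not 1.0 / 3.0 <= m1 <= 1.0:
        raise ValueError("m1 must lie in [1/3, 1]")
    lower = max(0.0, 1.0 - 2.0 * m1)   # equals -2 m1 + 1 when m1 <= 1/2, else 0
    upper = 0.5 * (1.0 - m1)           # equals -m1 / 2 + 1 / 2
    return lower, upper

def mass_triple(m1, m3):
    """Return (m1, m2, m3) with m2 = 1 - m1 - m3, after checking the constraints."""
    lower, upper = m3_interval(m1)
    if not lower - 1e-14 <= m3 <= upper + 1e-14:
        raise ValueError("m3 is outside the admissible interval for this m1")
    return m1, 1.0 - m1 - m3, m3

# Usage: a point in the interior of the simplex.
m1, m2, m3 = mass_triple(0.4, 0.25)
```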

The question is, how does the stability of the libration points depend on the mass ratios? We address the question for each of the points, $\mathcal {L}_{0,4,5,6}$, as follows. Beginning with the case of equal masses, $m_1 = m_2 = m_3 = 1/3$, we numerically continue each equilibrium to the opposite boundary of the parameter simplex at $m_3 = 0$. Throughout the computation, we track the stability of each libration point and label a parameter point with a black dot whenever the stability is of saddle-focus type. The results are summarized in Fig. 4. We refer to the curve in the parameter simplex where the stability changes as the Routh–Gascheau curve.

Roughly speaking, we see that when ${1}/{3} \le m_1 \le 0.42$ the libration point $\mathcal {L}_0$ is a saddle-focus for all allowable values of $m_2$, $m_3$. When $m_1 > 0.43$, the libration point $\mathcal {L}_0$ is no longer a saddle, no matter the values of $m_2$, $m_3$. The points $\mathcal {L}_{4, 6}$ on the other hand have saddle-focus stability for most parameter values, and only bifurcate after $m_1 > 0.95$ (with $\mathcal {L}_6$ a little more robust than $\mathcal {L}_4$ except when $m_2 = m_3$). The libration point $\mathcal {L}_5$ is the most robust. It maintains saddle-focus stability until $m_1 \approx 0.99$. For $m_1 > 0.995$ there are no more saddle-foci at all. By reading parameter values off of the frames in Fig. 4, we can arrange that the CRFBP has 1, 2, 3 or 4 saddle-focus equilibria. In the sequel we are interested in homoclinic connections for such parameters.
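The stability tracking described in this section amounts to classifying the eigenvalues of $Df$ at an equilibrium. The sketch below shows one way this might be done; the names `jacobian` and `classify` and the tolerance are assumptions made for illustration, and the continuation in the masses is not reproduced. It is applied to the equal mass case, where the origin is an equilibrium by symmetry.

```python
import numpy as np

def jacobian(x, y, masses, primaries):
    """Df at the state (x, 0, y, 0), with state ordering (x, xdot, y, ydot)."""
    oxx, oyy, oxy = 1.0, 1.0, 0.0
    for m, (xj, yj) in zip(masses, primaries):
        dx, dy = x - xj, y - yj
        r2 = dx * dx + dy * dy
        r3, r5 = r2 ** 1.5, r2 ** 2.5
        oxx += m * (3.0 * dx * dx / r5 - 1.0 / r3)   # Omega_xx
        oyy += m * (3.0 * dy * dy / r5 - 1.0 / r3)   # Omega_yy
        oxy += m * 3.0 * dx * dy / r5                # Omega_xy
    return np.array([[0.0, 1.0, 0.0, 0.0],
                     [oxx, 0.0, oxy, 2.0],
                     [0.0, 0.0, 0.0, 1.0],
                     [oxy, -2.0, oyy, 0.0]])

def classify(eigs, tol=1e-9):
    """Linear stability type from the four eigenvalues (tolerance is an assumption)."""
    off_axis = np.abs(eigs.real) > tol      # eigenvalues off the imaginary axis
    complex_part = np.abs(eigs.imag) > tol  # eigenvalues with nonzero imaginary part
    if off_axis.all() and complex_part.all():
        return "saddle-focus"
    if off_axis.all():
        return "real saddle"
    if not off_axis.any():
        return "center x center"
    return "saddle x center"

# Equal masses: vertices from the position formulas at m1 = m2 = m3 = 1/3.
masses = (1.0 / 3.0, 1.0 / 3.0, 1.0 / 3.0)
primaries = ((-1.0 / np.sqrt(3.0), 0.0),
             (0.5 / np.sqrt(3.0), -0.5),
             (0.5 / np.sqrt(3.0), 0.5))
eigs = np.linalg.eigvals(jacobian(0.0, 0.0, masses, primaries))
kind = classify(eigs)
```

In this symmetric case $\varOmega _{xx} = \varOmega _{yy} = 1 + 3\sqrt{3}/2$ and $\varOmega _{xy} = 0$ at the origin, so the eigenvalues are $\pm \sqrt{3\sqrt{3}/2} \pm i$, consistent with the saddle-focus type reported for $\mathcal {L}_0$ near equal masses.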

2.3 Two ways to formulate a connecting orbit: phase space geometry and boundary value problems

There are two standard ways to think about connecting orbits and—while they are completely equivalent from a mathematical point of view—in practice they have different advantages and disadvantages. In the following let $f :\mathbb {R}^n \rightarrow \mathbb {R}^n$ denote a smooth vector field and let $\mathbf {x}_0 \in \mathbb {R}^n$ be an equilibrium solution for f. We write $W^s(\mathbf {x}_0)$ and $W^u(\mathbf {x}_0)$ to denote, respectively, the stable and unstable manifolds attached to $\mathbf {x}_0$.

Analytic definition If $\mathbf {x} :\mathbb {R}\rightarrow \mathbb {R}^n$ satisfies
$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t} \mathbf {x}(t) = f(\mathbf {x}(t)), \end{aligned}$$
for all $t \in \mathbb {R}$, and satisfies the asymptotic boundary conditions
$$\begin{aligned} \lim _{t \rightarrow \pm \infty } \mathbf {x}(t) = \mathbf {x}_0, \end{aligned}$$
then we say that $\mathbf {x}$ is a homoclinic connecting orbit for $\mathbf {x}_0$.
Geometric definition If
$$\begin{aligned} \hat{x} \in W^s(\mathbf {x}_0) \cap W^u(\mathbf {x}_0), \end{aligned}$$
and $\mathbf {x} = \text{ orbit }(\hat{x})$ denotes the orbit which passes through $\hat{x}$, then $\mathbf x$ is a homoclinic connecting orbit for $\mathbf {x}_0$. If the intersection of the manifolds is transverse, then we say that $\mathbf {x}$ is a transverse homoclinic connection.

The analytic definition is recast as a finite time boundary value problem by projecting the boundary conditions onto local stable/unstable manifolds. If P, Q are parameterizations of the local unstable and stable manifolds, respectively, then we look for $T > 0$ and $\mathbf {x} :[0, T] \rightarrow \mathbb {R}^n$, so that $\mathbf {x}$ solves the differential equation subject to the boundary conditions

$$\begin{aligned} \mathbf {x}(0) \in \text{ image }(P), \qquad \text{ and } \qquad \mathbf {x}(T) \in \text{ image }(Q). \end{aligned}$$

In applications one frequently replaces P and Q by their linear approximations. In Sect. 3 we review an approach called the parameterization method for computing high-order polynomial approximations of the local charts P, Q.

Remark 1

(Relative strengths and weaknesses) One great advantage of the analytic formulation is that, since it is equivalent to a two-point boundary value problem, we can utilize the Newton method to find very accurate solutions—often on the order of machine precision. The formulation as a boundary value problem also lends itself to numerical continuation schemes, which are very useful for exploring the parameter space. The disadvantages are twofold. First, in this formulation it is necessary to begin the Newton iteration with a fairly good approximate solution and this raises the question: Where do the approximate solutions come from? Second, it is difficult to rule out solutions using the BVP approach.

In the geometric approach, there is no need to make a guess. Instead, one moves along the stable and unstable manifolds and identifies connections by locating intersections in phase space. At the same time, the geometric approach allows one to rule out connecting orbits by showing that a particular region of phase space does not contain any intersections. The difficulty with the geometric perspective is that it provides information only as good as our knowledge of the embeddings of the stable/unstable manifolds. Computing embeddings of invariant manifolds is challenging, and methods tend to decrease in accuracy the farther from the equilibrium they are applied.

The important point, from the perspective of the present work, is that these two approaches complement one another. The geometric formulation is good for locating and ruling out connections, while the analytic formulation is good for refining approximations and for continuation with respect to parameters. This suggests the approach of the present work: namely that we use the two formulations in concert, playing the strengths of one against the weaknesses of the other as appropriate.

We remark that in many applications it is convenient to examine the intersections of the invariant manifolds in an intermediate surface of section. This is especially true for two-degree-of-freedom systems as the section intersected with the energy level leads to a two-dimensional image which is easy to visualize. Often an appropriate section is suggested by the geometry of the problem, or by the goals of a particular space mission. We refer the interested reader to the works (Koon et al. 2000; Canalias and Masdemont 2006; Barrabés et al. 2009) for examples and fuller discussion.

3 Numerical computation of the stable/unstable manifolds

The results of Sect. 2.2 show that for most parameter values, the CRFBP has either three or four saddle-focus equilibria, though for some parameters it may have only two, or one, or none. For a given saddle-focus equilibrium with fixed values of the mass parameters, we compute the invariant manifolds in two steps. First, we find a high-order expansion of an initial local chart containing the equilibrium solution. Then we use a high-order Taylor integration scheme to advect the boundary of the initial chart one subarc at a time. The second step is repeated until a certain integration time has been reached, or until some error tolerance has been exceeded. Along the way, it is sometimes necessary to subdivide boundary arcs in order to manage the truncation errors.

Our computation of the initial chart employs the parameterization method, which is reviewed in Sect. 3.1. Advection of the boundary uses a Taylor integration scheme similar to the one developed in Kalies et al. (2018), but adapted to the problem at hand. Both procedures exploit differential-algebraic manipulations of formal power series, and these manipulations are delicate due to the presence of the minus three-halves power in the nonlinearity of the CRFBP vector field.

One technique for manipulating power series of several complex variables involves automatic differentiation combined with the radial gradient. This procedure is developed in Haro et al. (2016) and is reviewed in “Appendix B.” Another technique involves appending additional variables and equations to the problem, so that the enlarged field is polynomial and equivalent to the original CRFBP on a certain submanifold. This option is discussed at length for the CRFBP in Kepley and Mireles James (2018) which also includes a more precise definition of what “equivalent” means here. See also Lessard et al. (2016) and Rabe (1961).

3.1 Parameterization method for the local invariant manifold

We now review the parameterization method adapted to the needs of the present work, namely for a stable/unstable manifold attached to a saddle-focus equilibrium in $\mathbb {R}^4$. Much more general treatment of the parameterization method is found in Cabré et al. (2003a, b, 2005). See also the book on this topic (Haro et al. 2016).

Let $\mathbf {x}_0 \in \mathbb {R}^4$ denote a saddle-focus equilibrium point. Specifically, we suppose that $f(\mathbf {x}_0) = 0$, that

$$\begin{aligned} \lambda _{1,2} = -\,\alpha \pm i \beta , \end{aligned}$$

with $\alpha , \beta > 0$, are the stable eigenvalues of $Df(\mathbf {x}_0)$, and that $\xi _{1,2} \in \mathbb {C}^4$ is a choice of associated complex conjugate eigenvectors.

Since the eigenvalues are complex, it is convenient to look for a complex parameterization of a local stable manifold. Let

$$\begin{aligned} D^2 = \left\{ (z_1, z_2) \in \mathbb {C}^2 \, : \, |z_j| < 1, \ j = 1,2 \right\} \end{aligned}$$

denote the unit complex polydisc. We look for a parameterization $P :D^2 \rightarrow \mathbb {C}^4$ satisfying the infinitesimal conjugacy given by

$$\begin{aligned} D P(\mathbf {z}) \varLambda \mathbf {z} = f(P(\mathbf {z})), \end{aligned}$$

(5)

where $\mathbf {z} = (z_1, z_2)^{\text{ T }}$, and

$$\begin{aligned} \varLambda = \left( \begin{array}{cc} \lambda _1 &{}\quad 0 \\ 0 &{}\quad \lambda _2 \end{array} \right) . \end{aligned}$$

Equation (5) is subject to the first-order constraints

$$\begin{aligned} P(0, 0) = \mathbf {x}_0, \quad \hbox {and} \quad \frac{\partial }{\partial z_{1,2}} P(0, 0) = \xi _{1,2}. \end{aligned}$$

(6)

Note that

$$\begin{aligned} DP(\mathbf {z}) \varLambda \mathbf {z} = \lambda _1 z_1 \frac{\partial }{\partial z_1} P(z_1, z_2) + \lambda _2 z_2 \frac{\partial }{\partial z_2} P(z_1, z_2), \end{aligned}$$

is the push forward of the linear vector field by P. The geometric meaning of Eq. (5) is illustrated in Fig. 5.

Let $\varPhi $ denote the flow generated by f. Any P satisfying Eq. (5) on $D^2$ also satisfies the flow conjugacy

$$\begin{aligned} \varPhi (P(z_1, z_2), t) = P(e^{\lambda _1 t} z_1, e^{\lambda _2 t} z_2), \quad (z_1, z_2) \in D^2. \end{aligned}$$

(7)
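The passage from Eq. (5) to Eq. (7) can be checked in one line. The following is a sketch of the standard argument, with the auxiliary curve $\gamma $ introduced here only for illustration. Fix $\mathbf {z} \in D^2$ and note that $e^{\varLambda t} \mathbf {z} \in D^2$ for all $t \ge 0$, since the eigenvalues have negative real part. Then

$$\begin{aligned} \gamma (t)&:= P\left( e^{\varLambda t} \mathbf {z}\right) , \qquad \gamma (0) = P(\mathbf {z}), \\ \gamma '(t)&= DP\left( e^{\varLambda t} \mathbf {z}\right) \varLambda \, e^{\varLambda t} \mathbf {z} = f\left( P\left( e^{\varLambda t} \mathbf {z}\right) \right) = f(\gamma (t)), \end{aligned}$$

where the middle equality is Eq. (5) evaluated at the point $e^{\varLambda t} \mathbf {z}$. So $\gamma $ solves the differential equation with initial condition $P(\mathbf {z})$, and uniqueness of solutions gives $\gamma (t) = \varPhi (P(\mathbf {z}), t)$ for $t \ge 0$.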

In particular, if P satisfies both Eq. (5) and the constraints of Eq. (6), then for any $(z_1, z_2) \in D^2$ it follows that

$$\begin{aligned} \lim _{t \rightarrow \infty } \varPhi (P(z_1, z_2), t)&= \lim _{t \rightarrow \infty } P(e^{\lambda _1t} z_1, e^{\lambda _2 t} z_2) \\&= P(0, 0) \\&= \mathbf {x}_0, \end{aligned}$$

so that $P(D^2) \subset W^s(\mathbf {x}_0)$. Combining this with the fact that the image of P contains $\mathbf {x}_0$ and is tangent to the stable eigenspace at $\mathbf {x}_0$ we see that P parameterizes a local stable manifold for $\mathbf {x}_0$. Moreover, we recover the dynamics on the manifold through the conjugacy.

When the vector field f is analytic near $\mathbf {x}_0$, then $W^{s}(\mathbf {x}_0)$ is an analytic manifold, and it makes sense to look for an analytic chart of the form

$$\begin{aligned} P(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty p_{m,n} z_1^m z_2^n, \end{aligned}$$

with $p_{m,n} \in \mathbb {C}^4$ for all $m,n \in \mathbb {N}$. Since we are interested in the real image of the chart, we look for a solution of Eq. (5) with

$$\begin{aligned} P(z, \bar{z}) \in \mathbb {R}^4, \end{aligned}$$

for all $|z| < 1$. This is achieved whenever the power series coefficients of the solution satisfy

$$\begin{aligned} p_{n,m} = \overline{p_{m,n}}, \end{aligned}$$

(8)

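for all $(m,n) \in \mathbb {N}^2$. The real parameterization $\tilde{P} :B \rightarrow \mathbb {R}^4$, defined on the real unit disk $B = \{ (\sigma _1, \sigma _2) \in \mathbb {R}^2 \, : \, \sigma _1^2 + \sigma _2^2 < 1 \}$, is recovered using complex conjugate variables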

$$\begin{aligned} \tilde{P}(\sigma _1, \sigma _2) = P(\sigma _1 + i \sigma _2, \sigma _1 - i \sigma _2). \end{aligned}$$

Elementary proofs of the facts discussed in this section are found, for example, in Kepley and Mireles James (2018).
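To make the first-order constraints of Eq. (6) and the conjugate-variable construction concrete, the sketch below builds the linear approximation $P^1(z_1, z_2) = \mathbf {x}_0 + \xi _1 z_1 + \xi _2 z_2$ at the origin of the equal mass problem and evaluates the defect in Eq. (5), which should shrink quadratically with the parameter. The function names are hypothetical, and the closed-form Jacobian uses $\varOmega _{xx} = \varOmega _{yy} = 1 + 3\sqrt{3}/2$, $\varOmega _{xy} = 0$, which holds at the origin only for equal masses.

```python
import numpy as np

MASSES = np.full(3, 1.0 / 3.0)
PRIMARIES = np.array([[-1.0 / np.sqrt(3.0), 0.0],
                      [0.5 / np.sqrt(3.0), -0.5],
                      [0.5 / np.sqrt(3.0), 0.5]])

def field(u):
    """CRFBP vector field, state ordered as (x, xdot, y, ydot)."""
    x, xd, y, yd = u
    d = np.array([x, y]) - PRIMARIES            # displacements from the primaries
    r3 = np.sum(d * d, axis=1) ** 1.5
    grad = np.array([x, y]) - (MASSES / r3) @ d  # (Omega_x, Omega_y)
    return np.array([xd, 2.0 * yd + grad[0], yd, -2.0 * xd + grad[1]])

def stable_eigenpair():
    """Stable eigenvalue with positive imaginary part, and its eigenvector."""
    a = 1.0 + 1.5 * np.sqrt(3.0)                # Omega_xx = Omega_yy at the origin
    Df = np.array([[0.0, 1.0, 0.0, 0.0],
                   [a, 0.0, 0.0, 2.0],
                   [0.0, 0.0, 0.0, 1.0],
                   [0.0, -2.0, a, 0.0]])
    evals, evecs = np.linalg.eig(Df)
    k = next(i for i in range(4) if evals[i].real < 0 and evals[i].imag > 0)
    return evals[k], evecs[:, k]

def linear_chart(s1, s2, xi):
    """Real first-order chart: xi z + conj(xi) conj(z) with z = s1 + i s2 (x0 = 0)."""
    z = s1 + 1j * s2
    return 2.0 * np.real(z * xi)

def defect(s1, s2, lam, xi):
    """Norm of DP(z) Lambda z - f(P(z)) for the first-order chart."""
    z = s1 + 1j * s2
    lhs = 2.0 * np.real(lam * z * xi)
    return np.linalg.norm(lhs - field(linear_chart(s1, s2, xi)))

lam, xi = stable_eigenpair()
ratio = defect(2e-3, 1e-3, lam, xi) / defect(1e-3, 5e-4, lam, xi)
```

Halving the parameter reduces the defect by a factor close to four, which is the expected behavior of a chart that is exact only to first order.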

3.2 Power series solution of Eq. (5)

We describe three methods for computing the power series coefficients of an analytic solution of the invariance equation given in Sect. 3.1. Combining these methods leads to very efficient numerical methods.

3.2.1 Solution by power matching

Plugging the unknown power series expansion for P into Eq. (5) leads to

$$\begin{aligned} \sum _{m=0}^\infty \sum _{n=0}^\infty (m \lambda _1 + n \lambda _2) p_{m,n} z_1^m z_2^n = \sum _{m=0}^\infty \sum _{n=0}^\infty [f \circ P]_{m,n} z_1^m z_2^n. \end{aligned}$$

It is shown in Cabré et al. (2003a) (see also the discussion in Haro et al. 2016) that when we match like powers and isolate $p_{m,n}$ we are led to an expression of the form

$$\begin{aligned} (m \lambda _1 + n \lambda _2) p_{m,n}&= [f \circ P]_{m,n} \\&= Df(p_{0,0}) p_{m,n} + R(P)_{m,n}, \end{aligned}$$

where $R(P)_{m,n}$ depends in a nonlinear way on coefficients $p_{j,k}$ with $ 0 \le j+k < m+n$. Isolating the variable $p_{m,n}$ on the left leads to the homological equations

$$\begin{aligned} \left[ Df(\mathbf {x}_0) - (m \lambda _1 + n \lambda _2) \text{ Id } \right] p_{m,n} = -R(P)_{m,n}. \end{aligned}$$

(9)

Remark 2

(The formal solution is well defined) Observe that Eq. (9) is linear in $p_{m,n}$ and has a unique solution as long as $m \lambda _1 + n \lambda _2$ is not an eigenvalue of $Df(\mathbf {x}_0)$. But $\lambda _2 = \overline{\lambda _1}$, so the real part of $m \lambda _1 + n \lambda _2$ is $-(m+n)\alpha \le -2\alpha $ whenever $m + n \ge 2$. This differs from the real part $-\alpha $ of the stable eigenvalues, and since any remaining eigenvalues are assumed to be unstable, $m \lambda _1 + n \lambda _2$ is never an eigenvalue of $Df(\mathbf {x}_0)$. Hence the matrix on the left-hand side of the homological equation (9) is invertible for all $m + n \ge 2$.

Given any first-order data as in the constraint Eq. (6), the homological equations are uniquely solvable to all orders and the corresponding formal series solution of Eq. (5) is well defined. Since each Taylor coefficient is uniquely determined by the homological equations (9), it follows that the formal series solution is unique up to the choice of the scalings of the eigenvectors in Eq. (6). Solving the homological equations recursively to order $N \ge 2$ provides a polynomial chart $P^N$ which approximately parameterizes the local stable manifold.
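The recursive solution of the homological equations is easiest to see on a polynomial vector field, where $R(P)_{m,n}$ reduces to Cauchy products. The sketch below therefore swaps in the Lorenz system at the origin as a toy problem: it is not the CRFBP, its stable eigenvalues are real rather than complex conjugate, and the parameter values and function names are assumptions made for illustration. The structure of the recursion is the same as in Eq. (9).

```python
import numpy as np

SIGMA, RHO, BETA = 10.0, 28.0, 8.0 / 3.0      # classical Lorenz parameters (assumed)
A = np.array([[-SIGMA, SIGMA, 0.0],
              [RHO, -1.0, 0.0],
              [0.0, 0.0, -BETA]])              # Df at the origin

def lorenz(u):
    x, y, z = u
    return np.array([SIGMA * (y - x), x * (RHO - z) - y, x * y - BETA * z])

def cauchy(a, b, m, n):
    """Coefficient (m, n) of the product of two double power series."""
    return sum(a[j, k] * b[m - j, n - k]
               for j in range(m + 1) for k in range(n + 1))

def stable_chart(order):
    """Coefficients p[m, n] of a polynomial chart for the 2D stable manifold."""
    evals, evecs = np.linalg.eig(A)
    stable = [i for i in range(3) if evals[i].real < 0]
    lam = evals[stable].real
    p = np.zeros((order + 1, order + 1, 3))    # p[0, 0] = 0 is the equilibrium
    p[1, 0] = evecs[:, stable[0]].real         # first-order data, as in Eq. (6)
    p[0, 1] = evecs[:, stable[1]].real
    for total in range(2, order + 1):
        for m in range(total + 1):
            n = total - m
            # R(P)_{m,n}: only lower-order coefficients contribute because p[0, 0] = 0
            R = np.array([0.0,
                          -cauchy(p[..., 0], p[..., 2], m, n),
                          cauchy(p[..., 0], p[..., 1], m, n)])
            M = A - (m * lam[0] + n * lam[1]) * np.eye(3)
            p[m, n] = np.linalg.solve(M, -R)   # homological equation
    return lam, p

def invariance_defect(lam, p, s1, s2):
    """Norm of DP(s) Lambda s - f(P(s)) for the polynomial chart."""
    order = p.shape[0] - 1
    P, lhs = np.zeros(3), np.zeros(3)
    for m in range(order + 1):
        for n in range(order + 1 - m):
            mono = s1 ** m * s2 ** n
            P += p[m, n] * mono
            lhs += (m * lam[0] + n * lam[1]) * p[m, n] * mono
    return np.linalg.norm(lhs - lorenz(P))

lam, p = stable_chart(6)
defect_high = invariance_defect(lam, p, 0.02, -0.01)
lam1, p1 = stable_chart(1)
defect_linear = invariance_defect(lam1, p1, 0.02, -0.01)
```

Comparing `defect_high` with `defect_linear` shows how much the higher-order coefficients reduce the defect in the invariance equation at the same parameter point. For the CRFBP the term $R(P)_{m,n}$ cannot be formed from Cauchy products alone, which is why the techniques of Sect. 3 for the non-polynomial nonlinearity are needed.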

Remark 3

(Reality of the parameterization) Taking complex conjugates in the homological equations (9) shows that the coefficients $p_{m,n}$ have the symmetry of Eq. (8).

3.2.2 A Newton scheme

A quadratically convergent scheme for Eq. (5) is obtained as follows. Define the nonlinear operator

$$\begin{aligned} \varPsi [P](\sigma ) = DP(\sigma ) \varLambda \sigma - f(P(\sigma )), \end{aligned}$$

where f is the CRFBP vector field, and note that a zero of $\varPsi $ is a solution of Eq. (5). Moreover, we note that, at least formally, the Fréchet derivative is given by

$$\begin{aligned} D\varPsi [P] H (\sigma ) = DH(\sigma ) \varLambda \sigma - Df(P(\sigma )) H(\sigma ). \end{aligned}$$

In fact this is the correct Fréchet derivative of $\varPsi $ when, for example, we consider $\varPsi $ defined on a Banach space of analytic functions, see Cabré et al. (2003a, 2005), de la Llave and Mireles James (2012).

Choose $P_0$ an approximate zero of $\varPsi $, and define the sequence

$$\begin{aligned} P_{n+1} = P_n + \Delta _n, \end{aligned}$$

where $\Delta _n$ is the formal series solution of the linear equation

$$\begin{aligned} D\varPsi [P] \Delta = - \varPsi [P]. \end{aligned}$$

(10)

If $P_0$ is a good enough approximate solution of Eq. (5) we expect $P_n$ to converge quadratically to a zero of $\varPsi $. The linear operator $D \varPsi [P]$ has nonconstant coefficients, but Eq. (10) may still be solved recursively via the following power matching scheme. Define

$$\begin{aligned} \Delta (\sigma _1, \sigma _2)= & {} \sum _{m=0}^\infty \sum _{n=0}^\infty \Delta _{m,n} \sigma _1^m \sigma _2^n,\\ Df(P(\sigma ))= & {} \sum _{m=0}^\infty \sum _{n=0}^\infty A_{m,n} \sigma _1^m \sigma _2^n, \end{aligned}$$

and

$$\begin{aligned} -\,\varPsi [P](\sigma ) = \sum _{m=0}^\infty \sum _{n=0}^\infty q_{m,n} \sigma _1^m \sigma _2^n. \end{aligned}$$

Here $\Delta _{m,n}, q_{m,n} \in \mathbb {C}^4$, and $A_{mn}$ are $4 \times 4$ complex valued matrices for all $(m,n) \in \mathbb {N}^2$. Plugging these series expansions into Eq. (10) leads to

$$\begin{aligned} \sum _{m+n \ge 2} \left( (m \lambda _1 + n \lambda _2) \Delta _{m,n} - \sum _{j=0}^m \sum _{k=0}^n A_{m-j, n-k} \Delta _{j,k} \right) \sigma _1^m \sigma _2^n = \sum _{m+n \ge 2} q_{m,n} \sigma _1^m \sigma _2^n, \end{aligned}$$

or, upon matching like powers,

$$\begin{aligned} (m \lambda _1 + n \lambda _2) \Delta _{m,n} - \sum _{j=0}^m \sum _{k=0}^n A_{m-j, n-k} \Delta _{j,k} = q_{m,n}, \end{aligned}$$

for all $m+n \ge 2$. We note that the sum contains one term of order $\Delta _{mn}$, appearing when $j = m$ and $k = n$. That is

$$\begin{aligned} \sum _{j=0}^m \sum _{k=0}^n A_{m-j, n-k} \Delta _{j,k} = A_{00} \Delta _{mn} + ``\text{ lower-order } \text{ terms } \text{ of } \Delta \hbox {''}. \end{aligned}$$

Let

$$\begin{aligned} \tilde{\delta }_{j,k}^{m,n} = {\left\{ \begin{array}{ll} 1 &{}\quad j< m \,\,\text{ or }\,\, k < n \\ 0&{}\quad j = m \,\,\text{ and }\,\, k = n \end{array}\right. }. \end{aligned}$$

Then we use $\tilde{\delta }_{j,k}^{m,n}$ to extract terms of order (m, n) from the sum and write the equation for $\Delta _{mn}$ as

$$\begin{aligned} (m \lambda _1 + n \lambda _2) \Delta _{m,n} - A_{0,0} \Delta _{m,n} - \sum _{j=0}^m \sum _{k=0}^n \tilde{\delta }_{j,k}^{m,n} A_{m-j, n-k} \Delta _{j,k} = q_{m,n}. \end{aligned}$$

Recall that $A_{0,0} = Df(P(0)) = Df(\mathbf {x}_0)$, so that rearranging terms leads to the linear equations

$$\begin{aligned} \left( Df(\mathbf {x}_0) - (m \lambda _1 + n \lambda _2)\text{ Id } \right) \Delta _{m,n} = - q_{m,n} - \sum _{j=0}^m \sum _{k=0}^n \tilde{\delta }_{j,k}^{m,n} A_{m-j, n-k} \Delta _{j,k}, \end{aligned}$$

(11)

for $m + n \ge 2$. Since the matrix on the left-hand side of Eq. (11) is exactly the matrix appearing in the homological equations (9) of Sect. 3.2.1, arguing as in Remarks 2 and 3 shows that the equations of (11) are uniquely solvable for all $m + n \ge 2$ just as before, and that the resulting power series coefficients have the desired symmetry. Hence this Newton scheme is well defined on the space of formal power series.

3.2.3 A pseudo-Newton scheme

While the Newton scheme of the previous section converges rapidly (in the sense of the number of necessary iterations), solving the required nonconstant coefficient linear equations is expensive. The overall computation may therefore be slow simply because of the cost of computing the individual corrections. The iterations can be sped up as follows.

First, we note that

$$\begin{aligned} D\varPsi [P] \Delta (\sigma ) = D \Delta (\sigma ) \varLambda \sigma - Df(\mathbf {x}_0) \Delta (\sigma ) + ``\text{ higher-order } \text{ terms }\hbox {''}, \end{aligned}$$

and we define a new iterative scheme

$$\begin{aligned} P_{k+1}(\sigma ) = P_k(\sigma ) + \tilde{\Delta }_k(\sigma ), \end{aligned}$$

where $ \tilde{\Delta }_k$ is a solution of the constant coefficient linear equation

$$\begin{aligned} D \tilde{\Delta }_k(\sigma ) \varLambda \sigma - Df(\mathbf {x}_0) \tilde{\Delta }_k(\sigma ) = - \varPsi [P_k](\sigma ). \end{aligned}$$

On the level of power series, this equation becomes

$$\begin{aligned} \sum _{m=0}^\infty \sum _{n=0}^\infty \left[ (m \lambda _1 + n \lambda _2) \text{ Id } - Df(\mathbf {x}_0)\right] \tilde{\Delta }_{m,n} \sigma _1^m \sigma _2^n = \sum _{m=0}^\infty \sum _{n=0}^\infty q_{m,n} \sigma _1^m \sigma _2^n, \end{aligned}$$

and matching like powers yields the linear equations

$$\begin{aligned} \left[ Df(\mathbf {x}_0) - (m \lambda _1 + n \lambda _2) \text{ Id } \right] \tilde{\Delta }_{m,n} = -q_{m,n}. \end{aligned}$$

These homological equations uniquely determine the coefficients $\tilde{\Delta }_{m,n}$ and have the virtue of being “diagonal” in Taylor coefficient space. In practice we find that the pseudo-Newton scheme requires more iterates than the Newton method to converge. However, a single iteration step is much faster and for reasonable values of N the pseudo-Newton method is faster overall. We discuss this further below.
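The diagonal structure of the pseudo-Newton step can be illustrated with a minimal Python sketch. Everything specific in it is an assumption made for illustration: the planar toy field $f(x,y) = (-x - y + x^2, x - y)$ stands in for the CRFBP field, its equilibrium is a stable focus with eigenvalues $-1 \pm i$, and the names `cauchy`, `residual`, and `pseudo_newton` are hypothetical. Coefficients are stored on the index square $0 \le m, n < N$ rather than by total degree.

```python
import numpy as np

# Toy planar field (an assumption for illustration, NOT the CRFBP field):
#   f(x, y) = (-x - y + x^2, x - y), whose linearization at 0 has eigenvalues -1 +/- i.
A = np.array([[-1.0, -1.0], [1.0, -1.0]])   # Df(x0) at x0 = 0
lam1, lam2 = -1.0 + 1.0j, -1.0 - 1.0j
xi1 = np.array([1.0, -1.0j])                # eigenvector for lam1
xi2 = np.conj(xi1)                          # eigenvector for lam2


def cauchy(a, b):
    """Two-variable Cauchy product, truncated to the index square 0 <= m, n < N."""
    N = a.shape[0]
    c = np.zeros_like(a)
    for m in range(N):
        for n in range(N):
            # sum over j <= m, k <= n of a[j, k] * b[m - j, n - k]
            c[m, n] = np.sum(a[:m + 1, :n + 1] * b[m::-1, n::-1])
    return c


def residual(P):
    """Taylor coefficients of Psi[P] = DP(sigma) Lambda sigma - f(P(sigma))."""
    N = P.shape[1]
    m, n = np.meshgrid(np.arange(N), np.arange(N), indexing="ij")
    kappa = m * lam1 + n * lam2
    x, y = P[0], P[1]
    f_of_P = np.array([-x - y + cauchy(x, x), x - y])
    return kappa * P - f_of_P


def pseudo_newton(N=8, scale=0.3, tol=1e-13, max_iter=100):
    """Iterate P <- P + Delta with [Df(x0) - kappa Id] Delta_{m,n} = -q_{m,n} = Psi_{m,n}."""
    P = np.zeros((2, N, N), dtype=complex)
    P[:, 1, 0] = scale * xi1                # first-order data: scaled eigenvectors
    P[:, 0, 1] = scale * xi2
    for _ in range(max_iter):
        Psi = residual(P)
        if np.max(np.abs(Psi)) < tol:
            break
        for m in range(N):
            for n in range(N):
                if m + n >= 2:              # one small linear solve per coefficient
                    kappa = m * lam1 + n * lam2
                    P[:, m, n] += np.linalg.solve(A - kappa * np.eye(2), Psi[:, m, n])
    return P


P = pseudo_newton()
print("max coefficient defect:", np.max(np.abs(residual(P))))
```

Each correction requires only one $2 \times 2$ solve per multi-index, which is the sense in which the equations are diagonal in coefficient space. For this quadratic toy field every iterate fixes at least one further total degree, so the loop terminates after finitely many steps up to roundoff.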

Remark 4

In practice the linear approximation of P by the eigenvectors provides a good initial guess for the Newton and pseudo-Newton schemes, especially when computations are started “from scratch.” However, within the context of calculations based on parameter continuation, we will take $P_0$ as the high-order parameterization from the previous mass values.

Indeed, it seems that the best results are obtained by a “hybrid” approach. That is, we compute an initial guess $P_0$ by recursively solving Eq. (9) to some fixed order $N_0$. Then, we refine this approximation via the Newton or pseudo-Newton scheme to obtain a polynomial approximation of order $N > N_0$. The runtime performance for this hybrid approach is recorded in Table 1.

Table 1 Runtime data for the parameterization method: here the manifolds are first computed to order $N_0$ in order to measure the exponential decay rate associated with the Taylor coefficients

Table 2 Taylor order, scaling, and error bounds for the parameterization method: table reports the numerical defect and numerical conjugacy error associated with the local stable/unstable manifold parameterization for a number of example computations, as functions of the polynomial order and eigenvector scalings

Remark 5

(Quantifying the errors) Suppose that the polynomial

$$\begin{aligned} P^N(z_1, z_2) = \sum _{0 \le m+n \le N} p_{m,n} z_1^m z_2^n, \end{aligned}$$

is an approximate solution of Eq. (5). One way to measure the quality of the approximation is to compute the defect associated with $P^N$, defined by the quantity

$$\begin{aligned} \text{ defect }(P^N) = \sup _{\mathbf {z} \in D^2} \left| \left| DP^N(\mathbf {z}) \varLambda \mathbf {z} - f(P^N(\mathbf {z})) \right| \right| _{\mathbb {C}^4}. \end{aligned}$$

This quantity could be approximated by evaluating on a mesh of points in $D^2$. On the other hand, we can use the fact that a power series $g(\mathbf {z}) = \sum _{m+n = 0}^\infty a_{m,n} z_1^m z_2^n$ on the unit polydisc satisfies the bound

$$\begin{aligned} \sup _{\mathbf {z} \in D^2} \Vert g(\mathbf {z}) \Vert _{\mathbb {C}^4} \le \sum _{m+n = 0}^\infty \Vert a_{m,n} \Vert _{\mathbb {C}^4}, \end{aligned}$$

where the infinite sum can be approximated by a finite sum. Then another useful a-posteriori indicator is obtained by choosing an $N' > N$ and computing the quantity

$$\begin{aligned} \varepsilon _{\text{ a-posteriori }} = \sum _{0 \le m+n \le N'} \left\| (m \lambda _1 + n \lambda _2) p^N_{m,n} - [f \circ P^N]_{m,n} \right\| _{\mathbb {C}^4}, \end{aligned}$$

where $p_{m,n}^N$ are the power series coefficients of $P^N$, and $[f \circ P^N]_{m,n}$ are the coefficients of $f(P^N(\mathbf {z}))$. Since the supremum is taken over the complex polydisc, this indicator also controls the defect on the real image of $P^N$.

If f is a polynomial of order K, then we take $N' = K N$. If f is not a polynomial, then the power series for $f\circ P^N$ has infinitely many terms even though $P^N$ is polynomial. Then we choose $N' > N$ somewhat arbitrarily. Note that $p_{m,n}^N$ are zero when $m + n > N$, so that eventually the sum involves only the coefficients of the composition.

Yet another useful error indicator is obtained by considering the dynamical conjugacy of Eq. (7). Since the true solution satisfies the dynamical conjugacy exactly, we consider also the quantity defined by

$$\begin{aligned} \text{ conjugacyDefect }(P^N) = \sup _{\mathbf {z} \in D^2} \sup _{t > 0} \left| \left| \varPhi (P^N(z_1, z_2), t) - P^N(e^{\lambda _1 t} z_1, e^{\lambda _2 t} z_2) \right| \right| _{\mathbb {C}^4}. \end{aligned}$$

To approximate this quantity, we fix $\tau > 0$, let $\varPhi _{\text{ num }}$ denote a numerical integrator, and let $z_{k}$, $1 \le k \le K$, be a mesh of the unit circle in $\mathbb {C}$, so that $|z_k| = 1$. Define the indicator

$$\begin{aligned} \varepsilon _{\text{ conjugacy }} = \max _{1 \le k \le K} \left\| \varPhi _{\text{ num }}(P^N(z_k, \overline{z_k}), \tau ) - P^N(e^{\lambda _1 \tau } z_k, e^{\lambda _2 \tau } \overline{z_k}) \right\| _{\mathbb {C}^4}. \end{aligned}$$

Error bounds for a number of example computations are recorded in Table 2.

Remark 6

(Eigenvector scaling and coefficient decay) Solutions of Eq. (5) are only unique up to the choice of the scalings of the eigenvectors and this freedom is exploited in our numerical algorithms. Indeed, this is the reason we can always take our domain to be the unit disk. The results in Table 2 describe the dependence of the numerical errors on the approximation order and the eigenvector scalings. These numerical experiments lead to the following heuristic. If we scale the eigenvectors so that the final coefficients—that is the N-th-order coefficients of $P^N$—are on the order of machine epsilon, then we obtain a-posteriori errors on the order of machine epsilon.

3.3 Integration of analytic arcs

In Sect. 4 we present a scheme for computing an atlas for the stable/unstable manifolds which relies on integrating analytic arcs of initial conditions by the flow generated by f. We describe this integrator in terms of power series expansions. Let us assume that $\gamma :(-1, 1) \rightarrow \mathbb {R}^4$ is an analytic arc with power series expansion

$$\begin{aligned} \gamma (s) = \sum _{n = 0}^\infty \gamma _n s^n \qquad \gamma _n \in \mathbb {R}^4. \end{aligned}$$

Denote the formal series expansion

$$\begin{aligned} \varGamma (s, t) = \varPhi (\gamma (s), t) = \sum _{m=0}^\infty \sum _{n=0}^\infty a_{m,n} s^n t^m \qquad a_{m,n} \in \mathbb {R}^4. \end{aligned}$$

Here, we use the variables (s, t) in place of $(z_1,z_2)$ to emphasize the intuition that s corresponds to the “spatial” parameterization along the initial data, and t corresponds to the “time” parameterization along the flow. In other words, we consider $\varGamma $ as the solution of the parameterized family of initial value problems

$$\begin{aligned} \frac{\mathrm{d}}{\mathrm{d}t} \varGamma (s, t) = f(\varGamma (s, t)), \quad \varGamma (s, 0) = \gamma (s), \quad s \in (-1,1). \end{aligned}$$

Substituting the formal series into this IVP and matching like powers leads to the recursion relations

$$\begin{aligned} a_{m+1, n} = \frac{1}{m+1} [f \circ \varGamma ]_{m,n}, \qquad a_{0, n} = \gamma _n, \end{aligned}$$

which allow us to compute the coefficients of $\varGamma $ to arbitrary order using the same methods described in Sect. 3.2. We note that the accuracy of these formal series computations depends on the convergence and domain decomposition of the series expansions. These issues have not yet been addressed and are taken up in the following section.
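The recursion $a_{m+1,n} = \frac{1}{m+1}[f \circ \varGamma ]_{m,n}$ can be sketched in Python for a toy problem. The scalar field $f(x) = x^2$ is an assumption chosen only because its flow $\varPhi (x, t) = x/(1 - xt)$ is known in closed form, which allows a direct check. The function name `advect_arc` and the array layout `a[m, n]` (time index first, space index second) are hypothetical.

```python
import numpy as np
from numpy.polynomial.polynomial import polyval2d


def advect_arc(gamma, M):
    """Taylor-advect the arc gamma(s) = sum_n gamma[n] s^n under x' = x^2 (toy field).

    Returns a[m, n], the coefficient of t^m s^n in Gamma(s, t), for 0 <= m < M
    and 0 <= n < len(gamma).
    """
    gamma = np.asarray(gamma, dtype=float)
    N = gamma.size
    a = np.zeros((M, N))
    a[0] = gamma                                   # a_{0,n} = gamma_n
    for m in range(M - 1):
        # time-order-m coefficient of f(Gamma) = Gamma^2, as a series in s:
        # Cauchy product in t, truncated convolution in s
        f_of_Gamma = np.zeros(N)
        for i in range(m + 1):
            f_of_Gamma += np.convolve(a[i], a[m - i])[:N]
        a[m + 1] = f_of_Gamma / (m + 1)            # a_{m+1,n} = [f o Gamma]_{m,n} / (m+1)
    return a


# Arc gamma(s) = 0.2 + 0.1 s, padded so that no spatial coefficient is truncated.
gamma = np.zeros(30)
gamma[0], gamma[1] = 0.2, 0.1
a = advect_arc(gamma, M=25)

s, t = 0.3, 0.5
approx = polyval2d(t, s, a)                        # sum_{m,n} a[m, n] t^m s^n
x0 = 0.2 + 0.1 * s
exact = x0 / (1.0 - x0 * t)                        # closed-form flow of x' = x^2
print(abs(approx - exact))
```

For a vector field in $\mathbb {R}^4$ the same loop applies componentwise, with the convolution replaced by whatever series arithmetic evaluates $f \circ \varGamma $.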

4 Building an atlas for the local stable/unstable manifold

In this section, let $W^*(\mathbf {x}_0)$ denote an invariant stable/unstable manifold for a saddle-focus equilibrium, $\mathbf {x}_0$. Our goal is to describe an algorithm for producing an atlas of chart maps which parameterizes a large portion of the invariant manifold. The union of the images of these maps is a piecewise parameterization of a two-dimensional subset of $W^*(\mathbf {x_0})$. Our procedure is iterative and at each step outputs a (strictly) larger piecewise parameterization.

It is important to emphasize that our computations are carried out only to finite order. In particular, the charts described in this section are analytic functions of two complex variables. However, in practice we fix $(M,N) \in \mathbb {N}^2$, and for each chart we compute a finite polynomial approximation of order (M, N). Nevertheless, throughout this section we denote these analytic charts and their polynomial approximations using the same notation. We end this section by outlining methods for reliably, efficiently, and automatically computing these atlases. This includes algorithms for estimating and controlling truncation errors, identifying Taylor series blowup, domain decomposition, and stiffness.

4.1 Iterative method for computing charts

Before elaborating on the technical details of our method, we briefly describe the overall strategy. Starting from the parameterized local invariant manifolds obtained via methods described in Sect. 3.1, we want to build an even larger representation of the manifold. There are many ways to grow such a representation. We could, for example, simply integrate a collection of initial conditions meshing the boundary of the parameterization. However, as is well known, the exponential separation of initial conditions will force these orbits apart and eventually degrade the description of the manifold. Instead, we mesh the boundary into a collection of one-dimensional arcs and advect each of these under the flow. Propagating these arcs maintains the fidelity of the representation, and leads to new “patches” of the manifold.

Since the initial chart is parameterized by a high-order polynomial, we would like the same representation for new charts. To this end we develop a high-order Taylor integration scheme which applies to analytic arcs of initial conditions. This results in a power series representation of the flow of a boundary arc, and we take this as our next chart. After advecting each one of the boundary arcs, we have a new and strictly larger representation of the local stable/unstable manifold. The idea is illustrated in Fig. 4.

After one step of this procedure, we have moved the boundary of the local invariant manifold. In some cases, the image of the advected arc undergoes excessive stretching due to the exponential separation of initial conditions. This stretching in phase space is matched by a corresponding blow up in coefficients of the Taylor expansion, and the computations become numerically unstable.

This problem is overcome by occasionally remeshing the boundary of the atlas. This comes at a cost of increasing the number of charts in the next step of the algorithm. Hence, efficiently computing large atlases while controlling numerical error requires automatic algorithms for managing the growth of the power series coefficients, deciding how long to integrate each individual arc, and deciding when and how to subdivide the new boundaries. These topics account for much of the technical details which follow (Fig. 6).

4.1.1 The initial local manifold

The first step in our algorithm is to compute a polynomial approximation of the local parameterization, either by directly solving the homological equations or by iterating the Newton or pseudo-Newton schemes described in Sect. 3.1. Let $\varGamma _0$ be a solution of Eq. (5), and $D^2$ denote the unit polydisc in $\mathbb {C}^2$. Recall that $\varGamma _0: D^2 \rightarrow W^*_{\text {loc}}(\mathbf {x}_0)$ is analytic, and that $\varGamma _0(\partial D^2)$ is flow transverse. In particular, $\varGamma _0$ serves as our initial local parameterization, and we refer to it as a zeroth-generation interior chart and we write $\varGamma _0(D^2) = W^*_0(\mathbf {x}_0)$.

In practice, we compute $\varGamma _0$ to order (N, N) with $N \in \mathbb {N}$ chosen by applying the heuristic methods discussed in Sect. 3.2. This chart is represented in the computer as a polynomial in two complex variables of total degree $\deg (\varGamma _0) = (N-1)^2$. The truncation error of this approximation is controlled directly by choosing the eigenvector scaling as described in Remark 6, and in practice, is on the order of machine epsilon.

4.1.2 The initial manifold boundary

With $\varGamma _0$ in hand, we fix $K_0 \in \mathbb {N}$ and subdivide $\partial D$ into $K_0$-many analytic segments, each of which has the form, $c_j: [-\,1,1] \rightarrow \partial D$, for $1 \le j \le K_0$. We parameterize $\partial W^*_0(\mathbf {x}_0)$ by defining $\gamma _j(s) = \varGamma _0 \circ c_j(s)$ and we refer to $\gamma _j$ as a lifted boundary. Note that for each $1\le j\le K_0$, $\gamma _j: [-\,1,1] \rightarrow \partial W^*_{\text {loc}}(\mathbf {x}_0)$ and $\gamma _j([-\,1,1])$ is a flow transverse arc since $\varGamma _0$ is a dynamical conjugacy and the image of $c_j$ is transverse to the linear flow. Now, we define the zeroth-generation boundary to be

$$\begin{aligned} \partial W^*_0(\mathbf {x}_0) = \bigcup _{j=1}^{K_0} \gamma _j([-\,1,1]), \end{aligned}$$

and refer to each $\gamma _j$ as a zeroth-generation boundary chart.

4.1.3 The next generation

Now, we apply the high-order Taylor advection described in Sect. 3.3 to grow a larger local manifold denoted by $W^*_1(\mathbf {x}_0)$. Specifically, for $1 \le j \le K_0$, we choose $|\tau _j| > 0$, and our advection algorithm takes $\gamma _j,\tau _j$ as input and produces a chart, $\varGamma _{1,j}: D \rightarrow W^*(\mathbf {x}_0)$ which satisfies

$$\begin{aligned} \varGamma _{1,j}(s,t) = \varPhi \left( \gamma _j(s),\frac{t}{\tau _j} \right) \qquad \text {for} \quad (s,t) \in [-\,1,1]^2. \end{aligned}$$

In other words, $\varGamma _{1,j}$ parameterizes the advected image of $\gamma _j$ under the flow over the time interval $[0,\tau _j]$. These new charts are referred to as first-generation interior charts which we add to our atlas to obtain the first-generation local parameterization

$$\begin{aligned} W^*_1(\mathbf {x}_0) = W^*_0(\mathbf {x}_0) \cup \bigcup _{j=1}^{K_0} \varGamma _{1,j}(D). \end{aligned}$$

Note that $\tau _j \ne 0$ and, since $\gamma _j$ is flow transverse, $W^*_0(\mathbf {x}_0) \subsetneq W^*_1(\mathbf {x}_0)$ is a strict subset. In fact, transversality of $\gamma _j$ implies the stronger condition that $\partial W^*_0(\mathbf {x}_0) \subset \text {Int}(W^*_1(\mathbf {x}_0))$, i.e., the manifold has grown through every point on the previous boundary.

Remark 7

(Time rescaling) In this description, $\tau _j$ serves as a time rescaling of the flow. This allows direct control over the truncation error (in the time direction) and is analogous to the eigenvector scaling for the initial parameterization described in Remark 6. However, choosing this time rescaling is typically more difficult than choosing the eigenvector scaling and we postpone the discussion of this problem to Sect. 4.2.1.

Once the first-generation interior charts are computed by advection, the first-generation boundary arcs are now obtained by evaluation of the time variable. In particular, for $1 \le j \le K_0$, the evaluation, $\varGamma _{1,j}([-\,1,1],1) \subset \partial W^*_1(\mathbf {x}_0)$ is a flow transverse arc segment. We perform spatial rescaling as needed (see Remark 8 below) to obtain the next-generation boundary arcs, $\gamma _{1,j} : [-\,1,1] \rightarrow \partial W^*_1(\mathbf {x}_0)$ where $1 \le j \le K_1$ for some $K_1 \ge K_0$ and

$$\begin{aligned} \gamma _{1,j}([-\,1,1]) \subset \varGamma _{1,j'}([-\,1,1],1) \qquad \text {for some} \quad 1\le j' \le K_0 \end{aligned}$$

is flow transverse. The advection and evaluation algorithms are then iterated to increase the number of charts in the atlas. The $L^\mathrm{th}$ step in the iteration chain has the form

where $W^*_L(\mathbf {x}_0)$ is parameterized by $K_{L-1}$-many interior charts (polynomials in both the space and time variables), $\partial W^*_L(\mathbf {x}_0)$ is parameterized by $K_{L}$-many boundary charts (polynomials in the space variable only), and $K_{L-1} \le K_L$.

If we stop iteration, say at the Lth step, then the final atlas,

$$\begin{aligned} \mathcal {A} = \left\{ \varGamma _0, \bigcup _{j=1}^{K_0} \varGamma _{1,j}, \bigcup _{j=1}^{K_1} \varGamma _{2,j}, \cdots , \bigcup _{j=1}^{K_{L-1}} \varGamma _{L,j} \right\} , \end{aligned}$$

is a collection of $\left| \mathcal {A} \right| = 1 + \sum \limits _{l=0}^{L-1} K_l$ analytic charts which together form a piecewise parameterization of a portion of the invariant manifold.

Remark 8

(Spatial rescaling) The parameters, $K_0,\cdots ,K_L$, control the number of boundary subdivisions, and therefore, allow direct control over scaling in the spatial direction. As in the time rescaling problem, choosing these parameters effectively is a nontrivial problem which we take up in Sect. 4.2.2.

4.2 Convergence, manifold subdivision, and numerical integration

Thus far, we have ignored the issue of convergence for our formal power series computations. The best method for studying this issue is to combine rigorous numerical computations with a-posteriori analysis and obtain a proof of the existence of an analytic solution and explicit error bounds on the polynomial approximation. Rigorously validated numerical methods for invariant manifold atlases are described in detail in Kalies et al. (2018), Kepley and Mireles James (2018). In the present work we explore the utility of invariant manifold atlases as a purely numerical tool and trade the computer-assisted proof of rigorous error bounds for improved runtime performance.

In the absence of a rigorous validation scheme, we develop more heuristic checks to ensure the reliability of the computations. More precisely, we must automatically identify and fix numerical accuracy issues related to numerical Taylor integration. This amounts to rescaling our Taylor coefficients whenever the decay in either space or time becomes too slow. However, this is less straightforward than the eigenvector rescaling for the initial local parameterization described in Remark 6. In particular, it is helpful to consider the rescaling in space and time “directions” separately.

4.2.1 Time-stepping

Recall that at the saddle-focus equilibrium, the stable/unstable eigenvalues occur in complex conjugate pairs. In particular, both eigenvalues in each pair have equal real parts. It follows that identically rescaling each pair of eigenvectors is the ideal strategy. In fact, this strategy is also necessary and sufficient to ensure that the initial parameterization is real-valued, see Van den Berg et al. (2016). Moreover, in the general case of a hyperbolic equilibrium, the real part of each eigenvalue is a measure of the expansion or contraction rate in the direction of its associated eigenvector. Thus, in cases for which they are not equal, the real parts are still explicitly known and the eigenvectors are scaled proportional to these rates.

On the other hand, all charts in our atlas except the initial one are obtained via our advection scheme. In this case, neither the expansion/contraction rates nor their directions are explicitly known. Obtaining these estimates would require solving for the (spatial) derivative of the flow on each chart. For a general vector field defined on $\mathbb {R}^n$, this amounts to increasing the phase space dimension of our ODE solver from $n$ to $n + n^2$, which would significantly reduce the size of each manifold which is computationally feasible to produce.

Instead, we take an approach similar to Kalies et al. (2018), which describes heuristics for rescaling time and space independently of one another. Specifically, we adopt a time rescaling which ensures that the norm of the $M^\mathrm{th}$ “coefficient” (with respect to t) for each chart is less than machine epsilon. Note that for a classical IVP this coefficient is of course just a scalar. However, in our case the coefficient is actually an analytic function of the spatial variable, represented as a power series, and the norm of this coefficient is measured using the $\ell _1$ norm. This is made more precise in the following section.

This choice is highly conservative, which gives us tight control over the truncation error in the time direction. On the other hand, the spatial rescaling in the present work deviates from the scheme presented in Kalies et al. (2018) and is detailed in Sect. 4.2.2.
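One way such a time rescaling could be realized is sketched below. It is an illustration under stated assumptions, not the scheme of the paper: we assume the rescaling replaces $t$ by $\tau t$, so that the $m$-th time coefficient is multiplied by $\tau ^m$, and we choose $\tau $ so that the $\ell _1$ norm of the last stored coefficient equals machine epsilon. The name `rescale_time` and the layout `a[m, n]` (time index first) are hypothetical.

```python
import numpy as np


def rescale_time(a, eps=np.finfo(float).eps):
    """Rescale t -> tau * t so the last time coefficient has l1 norm about eps.

    a[m, n] is the coefficient of t^m s^n; row m is a power series in s whose
    l1 norm is the sum of the absolute values of its entries.
    """
    a = np.asarray(a, dtype=float)
    M = a.shape[0]
    last_norm = np.sum(np.abs(a[M - 1]))
    if last_norm <= eps:                     # already small enough, no rescaling
        return 1.0, a.copy()
    tau = (eps / last_norm) ** (1.0 / (M - 1))
    powers = tau ** np.arange(M)             # tau^m for each time order m
    return tau, a * powers[:, None]


# Synthetic coefficients with slow decay in time (ratio 0.9) and fast decay in space.
a = np.outer(0.9 ** np.arange(20), 0.5 ** np.arange(10))
tau, b = rescale_time(a)
print(tau, np.sum(np.abs(b[-1])))
```

A smaller $\tau $ means a shorter flow time per chart, so this kind of choice trades the number of charts for tight control of the truncation error in time.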

4.2.2 Manifold subdivision

Next, we describe the spatial rescaling scheme which we refer to as manifold subdivision. We assume that the time rescaling described in the previous section has been carried out on each chart, and our interest is in rescaling each boundary arc to control truncation errors accumulating in the “space direction.” This is equivalent to subdividing a manifold since it is reasonable to assume the rescaling will always shrink the domain. Thus, a single boundary arc will give rise to multiple subarcs defined on reduced domains.

To be more precise, we let $C^{\omega }$ denote the collection of real-valued, analytic functions defined on $(-1,1)$, and let $\mathcal {S}$ denote the collection of real-valued sequences. We define the Taylor transform, $\mathcal {T}: C^{\omega } \rightarrow \mathcal {S}$, to be the mapping which sends an analytic function to its sequence of Taylor coefficients centered at $z = 0$. Specifically, if $g \in C^\omega $ has the Taylor expansion,

$$\begin{aligned} g(z) = \sum _{n=0}^{\infty } a_n z^n \qquad a_n \in \mathbb {R}, \quad z \in (-1,1), \end{aligned}$$

then $\mathcal {T} \left( g \right) = \{a_n\} = a \in \mathcal {S}$. Now, we equip $\mathcal {S}$ with the $\ell _1$-norm defined by

$$\begin{aligned} \left| \left| a \right| \right| _{1} = \sum _{n=0}^{\infty } \left| a_n \right| , \end{aligned}$$

and we note that elements of $\mathcal {S}$ with finite norm form a closed subalgebra denoted as

$$\begin{aligned} \ell _1 = \{x \in \mathcal {S} \ : \left| \left| x \right| \right| _{1} < \infty \}, \end{aligned}$$

and we write $\left| \left| a \right| \right| _{\ell _1}$ when we want to emphasize that $a \in \ell _1$ (i.e., we write $\left| \left| a \right| \right| _{\ell _1}$ for the norm $\left| \left| a \right| \right| _1$ when $\left| \left| a \right| \right| _1$ is finite).

We remark that our error analysis is carried out using the $\ell _1$-norm due to the efficiency of computing this norm for polynomials. However, if $\overline{g}\approx g$ is a numerical approximation, then the errors we are interested in are of the form

$$\begin{aligned} \left| \left| \overline{g}- g \right| \right| _{\infty } = \sup _{z \in [-\,1,1]} \left\{ \left| \overline{g}(z) - g(z) \right| \right\} . \end{aligned}$$

We are justified in using the $\ell _1$ norm due to the well known result that $\left| \left| \overline{g}- g \right| \right| _{\infty } \le \left| \left| \overline{g}- g \right| \right| _{\ell _1}$.
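For completeness, the inequality follows from the triangle inequality. Writing $h = \overline{g}- g$ with Taylor coefficients $h_n$ (notation introduced here only for this step) and assuming $\mathcal {T} \left( h \right) \in \ell _1$, we have for every $z \in [-\,1,1]$

$$\begin{aligned} \left| h(z) \right| = \left| \sum _{n=0}^{\infty } h_n z^n \right| \le \sum _{n=0}^{\infty } \left| h_n \right| \left| z \right| ^n \le \sum _{n=0}^{\infty } \left| h_n \right| = \left| \left| \mathcal {T} \left( h \right) \right| \right| _{\ell _1}, \end{aligned}$$

and taking the supremum over $z$ gives the claim.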

Now, suppose $\gamma \in C^\omega $ and assume that $\mathcal {T} \left( \gamma \right) = a \in \ell _1$. Since $\varPhi $ is a nonlinear flow, a typical arc segment undergoes rapid deformation and stretching when advected. This implies that for a single step in our algorithm with the general form,

we expect both the arc length and curvature of $\gamma '$ to be larger than for $\gamma $. On the level of Taylor coefficients, this statement about deformation/stretching says that if $b = \mathcal {T} \left( \gamma ' \right) $, then in general we expect $\left| \left| a \right| \right| _{\ell _1} \le \left| \left| b \right| \right| _{\ell _1}$. The relationship between this norm and the truncation error implies that advecting an arc adversely impacts the propagation error.

To see this, we recall that in practice our computation stores a truncated polynomial approximation for $\gamma '$ in the form $\overline{b}= \left( b_0,\cdots ,b_{N-1}\right) $. In order that $\overline{b}\approx b$ is a “good” approximation (in the $\ell _1$ topology), $|b_{n}|$ must be “small” for each $n \ge N$. These higher-order terms correspond to the truncation error for $\gamma '$ and primarily arise from two sources. One source which we cannot control (once N is fixed) is the truncation error associated with $\gamma $. However, by inspection of the Cauchy product formula in Eq. (19), it is clear that the polynomial coefficients stored for $\gamma $ also contribute to this truncation error for $\gamma '$ after applying the nonlinearity. We refer to these contributions as spillover terms.

This observation implies that for $\bar{b} \approx b$ to be a good approximation, we must also require that $\left| a_n \right| $ is “small” for each $n > N'$ where $N' < N$ depends on the degree of the nonlinearity. This motivates the following heuristic method for controlling truncation error for propagated arcs. We begin by assuming that a has approximately geometric decay. Specifically, we expect that there exists some $r < 1$ such that the tail of the series defined by $\gamma $ decays faster than the geometric series with ratio r. In this case, the truncation error is of order $\mathcal {O}(r^N)$. Now, fix $0< N' < N$, and we define the tail ratio for a by

$$\begin{aligned} T_{N'}(a) := \frac{\sum _{n=N'}^{N-1} \left| a_n \right| }{\sum _{n=0}^{N-1} \left| a_n \right| } = \frac{\left| \left| a - a^{N'} \right| \right| _{\ell _1}}{\left| \left| a \right| \right| _{\ell _1}}. \end{aligned}$$

(12)

Evidently, $T_{N'}(a)$ is small whenever “most” of the $\ell _1$ weight of a is carried in the first $N'$-many coefficients. It follows that if $T_{N'}(a)$ is sufficiently small, then under the action of a nonlinear function, $f : \ell _1 \rightarrow \ell _1$, the spillover terms for f(a) remain small. Of course, small is dependent on context and in particular, choices for $N'$ as well as thresholding values for $T_{N'}$ are problem specific. In the present work, we prove it is always possible to control $T_{N'}$.
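The tail ratio of Eq. (12) is cheap to evaluate for a stored coefficient vector. The following Python sketch computes it and compares two geometrically decaying sequences. The function name `tail_ratio` and the sample decay rates are illustrative choices, not taken from the paper.

```python
import numpy as np


def tail_ratio(a, n_prime):
    """Tail ratio T_{N'}(a) of Eq. (12) for a coefficient vector a = (a_0, ..., a_{N-1})."""
    weights = np.abs(np.asarray(a, dtype=float))
    total = weights.sum()
    if total == 0.0:                 # the zero polynomial carries no tail weight
        return 0.0
    return weights[n_prime:].sum() / total


N, n_prime = 20, 10
fast = 0.5 ** np.arange(N)           # most of the l1 weight in the leading coefficients
slow = 0.9 ** np.arange(N)           # substantial weight in the tail
print(tail_ratio(fast, n_prime), tail_ratio(slow, n_prime))
```

For a geometric sequence with ratio $r$ the tail ratio equals $r^{N'}(1 - r^{N-N'})/(1 - r^{N})$, so the first sequence gives a value near $10^{-3}$ while the second gives roughly $0.26$.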

Remark 9

Strictly speaking, for the CRFBP we have $\gamma = \left( \gamma ^{(1)}, \cdots , \gamma ^{(4)}\right) $ where each $\gamma ^{(j)} \in C^\omega $ is a coordinate for the boundary chart. Similarly, $\mathcal {T} \left( \gamma \right) = \left( a^{(1)},\cdots ,a^{(4)}\right) \in \ell _1^4$, and thus the discussion in Sect. 4.2.2 thus far is technically not applicable. However, our restriction to scalar-valued functions is justified by the fact that if $a \in \ell _1^4$, then defining

$$\begin{aligned} \left| \left| a \right| \right| _{\ell _1^4} = \max \left\{ \left| \left| a^{(1)} \right| \right| _{\ell _1},\cdots ,\left| \left| a^{(4)} \right| \right| _{\ell _1}\right\} \end{aligned}$$

makes $\ell ^4_1$ into a normed vector space. This choice of norm gives us the freedom to restrict the discussion of remeshing and tail ratios to scalar-valued functions.

Next, we describe our scheme for controlling the tail ratio. This algorithm takes a polynomial representation for $\gamma $, defined on $[-\,1,1]$ as input, and returns a list of polynomials, $\{\gamma _1,\cdots ,\gamma _K\}$, as outputs. The key point is that these polynomials are also defined on $[-\,1,1]$, and they can be chosen such that $T_{N'}(\gamma _j)$ is arbitrarily small for $1 \le j \le K$. In this work, we assume the output polynomials are specified as coefficient vectors of length N (i.e., the same degree as the input); however, this is not required.

This gives rise to an additional remeshing step in our algorithm which is performed as needed after an evaluation step and prior to an advection step leading to an updated schematic

In the remeshing step, the tail ratio for each boundary arc from the previous step is computed and checked against a threshold. Boundary arcs which exceed this threshold are flagged as poorly conditioned, and subdivided into smaller subarcs which satisfy the threshold. The collection of resulting subarcs and well-conditioned arcs from the previous step is passed to the advection step where each results in a separate chart.

Before proving this threshold can always be satisfied, we describe the subdivision algorithm. As noted in Remark 9, it suffices to consider a single coordinate for a parameterized boundary arc. Thus, we assume $\gamma (s): [-\,1,1] \rightarrow \mathbb {R}$ is analytic with Taylor series

$$\begin{aligned} \gamma (s) = \sum _{n=0}^\infty a_n s^n, \end{aligned}$$

and fix a subinterval, $[s_1,s_2] \subset [-\,1,1]$. Define the constants

$$\begin{aligned} \hat{s} := \frac{s_1+s_2}{2} \qquad \qquad \delta := \frac{s_2-s_1}{2} \end{aligned}$$

(13)

and define $\hat{\gamma }: [-\,1,1] \rightarrow \mathbb {R}$ by

$$\begin{aligned} \hat{\gamma }(s) = \sum _{n = 0}^{\infty } c_n s^n \qquad \text {where} \quad c_n = \delta ^n \sum _{k = n}^{\infty } a_k \left( {\begin{array}{c}k\\ n\end{array}}\right) \hat{s}^{k-n}. \end{aligned}$$

(14)

Then $\hat{\gamma }$ is a parameterization for the arc segment parameterized by $\gamma $ restricted to $[s_1,s_2]$. In fact, $\hat{\gamma }$ is the Taylor series for $\gamma $ after recentering at $\hat{s}$ and rescaling by $\delta $ which satisfies the functional equation

$$\begin{aligned} \hat{\gamma }(s) = \gamma (\hat{s} + \delta s) \qquad s \in [-\,1,1]. \end{aligned}$$

(15)

Moreover, the mapping $a \mapsto c$ is a linear transformation on $\mathcal {S}$, and in particular, if $a_n = 0$ for all $n \ge N$, then $c_n = 0$ for all $n \ge N$ also. Now, we prove that we have explicit control over the tail ratio for $\hat{\gamma }$.
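The recentering formula in Eq. (14) is straightforward to check numerically. The following is a minimal Python sketch for a polynomial arc stored as a finite coefficient list; the function names `recenter_rescale` and `evaluate` and the sample coefficients are illustrative assumptions and are not taken from our implementation.

```python
from math import comb

def recenter_rescale(a, s_hat, delta):
    """Coefficients c_n of gamma_hat(s) = gamma(s_hat + delta*s), following Eq. (14).

    `a` holds the coefficients a_0, ..., a_{N-1} of gamma, so the infinite sum
    over k in Eq. (14) terminates at k = N-1.
    """
    N = len(a)
    c = []
    for n in range(N):
        inner = sum(a[k] * comb(k, n) * s_hat ** (k - n) for k in range(n, N))
        c.append(delta ** n * inner)
    return c

def evaluate(coeffs, s):
    """Horner evaluation of sum_n coeffs[n] * s**n."""
    result = 0.0
    for coeff in reversed(coeffs):
        result = result * s + coeff
    return result

# Illustrative data (hypothetical): a cubic arc restricted to [s1, s2] = [0, 0.5].
a = [1.0, -2.0, 0.5, 3.0]
s1, s2 = 0.0, 0.5
s_hat, delta = (s1 + s2) / 2, (s2 - s1) / 2   # Eq. (13)
c = recenter_rescale(a, s_hat, delta)

# Eq. (15): gamma_hat(s) and gamma(s_hat + delta*s) agree on [-1, 1].
for s in (-1.0, 0.0, 0.3, 1.0):
    assert abs(evaluate(c, s) - evaluate(a, s_hat + delta * s)) < 1e-12
```

Note that the output has the same length as the input, reflecting the fact that the map $a \mapsto c$ preserves the truncation order.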

Proposition 1

(Controlling tail ratios) Suppose $\gamma : [-\,1,1] \rightarrow \mathbb {R}$ is analytic, fix $\hat{s} \in (-1,1)$, $1 \le N' \le N$, and let $\epsilon > 0$. Then there exists $\delta > 0$ such that $T_{N'}(c) < \epsilon $ where c is the truncation to order N for $\hat{\gamma }: [-\,1,1] \rightarrow \mathbb {R}$ defined by $\hat{s},\delta $ as in Eq. (14).

Proof

Define $\gamma ^N: [-\,1,1] \rightarrow \mathbb {R}$ to be the Taylor polynomial obtained by truncating the Taylor series for $\gamma $ to order N. For $k \in \mathbb {N}$, define the usual $C^k$-norm on $[-\,1,1]$ to be

$$\begin{aligned} \left| \left| g \right| \right| _{C^k} = \max \limits _{0 \le j \le k} \left\{ \left| \left| g^{(j)} \right| \right| _{\infty } \right\} . \end{aligned}$$

Since $\gamma ^N$ is a polynomial, we have the bound

$$\begin{aligned} \left| \left| \gamma ^N \right| \right| _{C^k} \le M := \left| \left| \gamma ^N \right| \right| _{C^{N-1}} \qquad \text {for all} \quad k \in \mathbb {N}. \end{aligned}$$

In particular, for any $\hat{s} \in (-1,1)$, we have $\left| \gamma ^{(n)}(\hat{s}) \right| \le M$, for $0 \le n \le (N-1)$, and we define

$$\begin{aligned} \delta := \min \limits _{N' \le n \le N} \left\{ \left( \frac{\epsilon \left| \gamma (\hat{s}) \right| }{M (N - N')}\right) ^{\frac{1}{n}} \right\} . \end{aligned}$$

It follows that

$$\begin{aligned} \delta ^n \left| \gamma ^{(n)}(\hat{s}) \right| \le \frac{\epsilon \left| \gamma (\hat{s}) \right| }{N-N'} \quad \text {for all} \quad N' \le n \le N. \end{aligned}$$

Now, let $\hat{\gamma }$ be defined as in Eq. (14). Recall that $\hat{\gamma }$ is also analytic on $[-\,1,1]$, and by differentiating Eq. (15) we have the derivative formula, $\hat{\gamma }^{(n)}(s) = \delta ^n \gamma ^{(n)}\left( \hat{s} + \delta s\right) $, for all $n \in \mathbb {N}$. By Taylor’s theorem, we obtain another explicit formula for $c_n$ given by

$$\begin{aligned} c_n = \frac{\hat{\gamma }^{(n)}(0)}{n!} = \frac{\delta ^n \gamma ^{(n)}(\hat{s})}{n!}, \end{aligned}$$

and we note that $c_0 = \hat{\gamma }(0) = \gamma (\hat{s})$ does not depend on $\delta $. We have the estimate for the tail ratio of $\hat{\gamma }$:

$$\begin{aligned} T_{N'}(c)&= \frac{1}{\left| \left| c \right| \right| _{\ell _1}} \sum _{n=N'}^{N-1} \left| c_n \right| \\&= \frac{1}{\left| \left| c \right| \right| _{\ell _1}} \sum _{n=N'}^{N-1} \frac{\delta ^n \left| \gamma ^{(n)}(\hat{s}) \right| }{n!} \\&\le \frac{1}{\left| c_0 \right| } \sum _{n=N'}^{N-1} \frac{\epsilon \left| \gamma (\hat{s}) \right| }{N - N'} \\&= \epsilon \end{aligned}$$

which completes the proof. $\square $

Proposition 1 establishes the fact that we may reparameterize $\gamma $ on subintervals of $[-\,1,1]$ with width, $2\delta $, and that as $\delta \rightarrow 0$ the tail ratio also approaches zero. We note that $\delta $ does not depend on the subinterval, and therefore, for a fixed $\epsilon $ the number of required subarcs is finite. In particular, no more than $K = \lceil \frac{2}{\delta } \rceil $ subarcs are required. To summarize the usefulness of this result, we present the following algorithm for controlling the spatial truncation error which was implemented for the atlases in this work.

1.
Fix a threshold $0 < \epsilon \ll 1$, a cutoff $1 \le N' < N$, and $K \in \mathbb {N}$. The threshold and cutoff are both chosen based on the alignment of $\gamma $ with the flow, the degree of the nonlinearity in f, and the truncation size. In practice, these are problem specific choices which require some ad hoc experimentation in order to balance computational efficiency and truncation error.
2.
Following each evaluation step in our algorithm, a boundary arc has the form $\gamma : [-\,1,1] \rightarrow \mathbb {R}$ which is stored in the computer as a polynomial approximation, $\overline{a}= \left( a_0,\cdots ,a_{N-1}\right) $. If $T_{N'}(\overline{a}) < \epsilon $, continue to the advection step.
3.
If $T_{N'}(\overline{a}) \ge \epsilon $, specify a partition of $[-\,1,1]$ into K-many subintervals by choosing their endpoints, $\{s_0,s_1,\cdots ,s_K\}$. Apply the formula in Eq. (14) to obtain $\{\gamma _1,\cdots ,\gamma _K\}$ where for $1 \le j \le K$, $\gamma _j(s) = \gamma (\hat{s}_j + \delta _j s)$ where $\hat{s}_j = \frac{s_j + s_{j-1}}{2}$ and $\delta _j = \frac{s_{j} - s_{j-1}}{2}$.
4.
Each resulting subarc which satisfies the tail ratio threshold passes to the advection step. Subarcs which violate the threshold are subdivided again by repeating step 3. By Proposition 1, this condition is eventually met for every subarc and the algorithm proceeds to the advection step.
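The four steps above can be sketched as follows. This is a simplified illustration rather than our implementation: it assumes a uniform partition into $K$ subintervals at every subdivision, uses the tail ratio $T_{N'}(c) = \frac{1}{\left| \left| c \right| \right| _{\ell _1}}\sum _{n = N'}^{N-1} \left| c_n \right| $ as it appears in the proof of Proposition 1, and adds a hypothetical `max_depth` safeguard. All function names are assumptions made for the example.

```python
from math import comb

def tail_ratio(c, n_cut):
    """T_{N'}(c): l1 mass of the coefficients of order >= n_cut over the full l1 norm."""
    total = sum(abs(x) for x in c)
    return 0.0 if total == 0.0 else sum(abs(x) for x in c[n_cut:]) / total

def recenter_rescale(a, s_hat, delta):
    """Coefficients of gamma(s_hat + delta*s), as in Eq. (14)."""
    N = len(a)
    return [delta ** n * sum(a[k] * comb(k, n) * s_hat ** (k - n) for k in range(n, N))
            for n in range(N)]

def remesh(a, n_cut, eps, K=2, max_depth=30):
    """Split gamma into subarcs whose tail ratio is below eps.

    Returns a list of (s_left, s_right, coeffs) sorted by s_left, where coeffs
    parameterizes gamma restricted to [s_left, s_right] over [-1, 1].
    """
    accepted = []
    pending = [(-1.0, 1.0, list(a), 0)]
    while pending:
        left, right, coeffs, depth = pending.pop()
        if tail_ratio(coeffs, n_cut) < eps:        # step 2: arc is well conditioned
            accepted.append((left, right, coeffs))
            continue
        if depth >= max_depth:
            raise RuntimeError("tail ratio threshold not reached")
        width = (right - left) / K                  # step 3: uniform partition (assumption)
        for j in range(K):
            lo = left + j * width
            hi = lo + width
            s_hat, delta = (lo + hi) / 2, (hi - lo) / 2
            # Always recenter the original coefficients, so errors do not compound.
            pending.append((lo, hi, recenter_rescale(a, s_hat, delta), depth + 1))
    return sorted(accepted, key=lambda arc: arc[0])  # step 4 handled by the loop

# Illustrative input (hypothetical coefficients with a slowly decaying tail).
a = [1.0, 0.9, 0.8, 0.7, 0.6, 0.5]
arcs = remesh(a, n_cut=3, eps=1e-2)
```

Each entry of `arcs` would then be passed to the advection step as a separate boundary arc.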

4.2.3 Stiffness

The final numerical consideration which we address is the stiffness problem. We recall that the CRFBP vector field is analytic away from the primary masses which correspond to singularities of Eq. (3). Since this system is Hamiltonian, any trajectory which collides with one of these primaries must blow up in finite time. However, smooth trajectories may pass arbitrarily close to these primaries and as they do, the velocity coordinates, $\dot{x}, \dot{y}$, become arbitrarily large.

Recall that a single boundary arc, $\gamma : [-\,1,1] \rightarrow \mathbb {R}^4$, is a parameterized manifold of initial data. Then its advected image, $\varGamma :[-\,1,1] \times [0,1] \rightarrow \mathbb {R}^4$, is a parameterized bundle of trajectory segments. For any $s_0 \in [-\,1,1]$, $\varGamma (s_0,t)$ parameterizes the trajectory passing through $\gamma (s_0)$ over the (nonscaled) time interval, $[0,\tau ]$.

Now, suppose that for $s_0 \in [-\,1,1]$, the trajectory through $\gamma (s_0)$ passes “close” to a primary at time $t = t_0$. Then, we have

$$\begin{aligned} \left| \left| f\left( \varGamma (s_0,t_0)\right) \right| \right| _{\mathbb {C}^4} \gg 1. \end{aligned}$$

Recalling our time rescaling algorithm described in Sect. 4.2.1, it is clear that controlling truncation in the time direction will require taking increasingly shorter time-steps. Of course, this is not surprising; however, the difficulty arises from the fact that other choices of $s \in [-\,1,1]$ often correspond to trajectory segments which remain far away from the primary and our time rescaling is applied uniformly on $[-\,1,1]$. Hence, the advection of the entire boundary chart is slowed dramatically whenever any portion of its image approaches a primary. We refer to these charts as stiff. Obviously, this is a major problem for our “breadth-first” approach for computing the manifold atlas. Namely, the integrator gets stuck on the stiff charts causing the computation to stall.

A naive method for dealing with this is to define the speed for a boundary chart, which is a parameterized curve of the form $\gamma (s) = \left( x(s),\dot{x}(s), y(s), \dot{y}(s)\right) $, by

$$\begin{aligned} S(\gamma ) = \sup _{s \in [-\,1,1]} \left\{ \sqrt{\dot{x}(s)^2 + \dot{y}(s)^2}\right\} , \end{aligned}$$

(16)

set a threshold, $\kappa $, and cease advection of $\gamma $ whenever $S(\gamma ) > \kappa $. While this fixes the problem of computational efficiency, we also lose large portions of the manifold which remain far from the primaries. Instead, we leverage the manifold subdivision procedure introduced in Sect. 4.2.2 to modify the naive algorithm in order to retain these portions of the manifold as follows.

1.
Fix a maximum speed threshold, $\kappa >0$. For each boundary chart, $\gamma $, present after the evaluation step, check that $S(\gamma ) \le \kappa $ and if so, continue to the remeshing step.
2.
If $S(\gamma ) > \kappa $, write $\gamma (s) = \left( x(s),\dot{x}(s), y(s), \dot{y}(s)\right) $ and compute
$$\begin{aligned} \left\{ s \in [-\,1,1] : \dot{x}(s)^2 + \dot{y}(s)^2 - \kappa ^2 = 0\right\} . \end{aligned}$$
Since $\dot{x}, \dot{y}$ are polynomial approximations, this set is a finite collection of roots of a polynomial which we denote by, $\left\{ s_0,\dots ,s_K\right\} $.
3.
For $1\le j \le K$, check that $\dot{x}(s)^2 + \dot{y}(s)^2 - \kappa ^2 < 0$ holds on $[s_{j-1},s_{j}]$ and if so, compute $\hat{\gamma }_j$ as in Eq. (14) and continue to the remeshing step. Subintervals which fail this check are discarded.

To summarize, our algorithm identifies regions of the manifold boundary which pass close to a primary by checking the maximum speed. Regions which exceed a threshold are cut away, while regions of the nearby boundary continue to be advected. The cut regions cause the apparent holes punched out around each primary in the manifold plots, as in Figs. 7 and 8.
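A minimal sketch of the cutting procedure, using NumPy's polynomial routines, is given below. It makes two assumptions not stated above: the endpoints $\pm \,1$ are added to the list of roots so that the subintervals cover $[-\,1,1]$, and the sign of $\dot{x}(s)^2 + \dot{y}(s)^2 - \kappa ^2$ on each subinterval is tested at its midpoint (which suffices because the polynomial has no roots in the interior of a subinterval). The function name `slow_subintervals` and the tolerance `tol` are hypothetical.

```python
import numpy as np
from numpy.polynomial import polynomial as P

def slow_subintervals(xdot, ydot, kappa, tol=1e-10):
    """Subintervals of [-1, 1] on which xdot(s)^2 + ydot(s)^2 < kappa^2.

    xdot, ydot are coefficient sequences in ascending order of degree.
    """
    # q(s) = xdot(s)^2 + ydot(s)^2 - kappa^2
    q = P.polyadd(P.polymul(xdot, xdot), P.polymul(ydot, ydot))
    q = P.polysub(q, [kappa ** 2])
    roots = P.polyroots(q)
    # Keep (numerically) real roots strictly inside the parameter interval.
    real = sorted(r.real for r in roots if abs(r.imag) < tol and -1.0 < r.real < 1.0)
    nodes = [-1.0] + real + [1.0]
    keep = []
    for lo, hi in zip(nodes[:-1], nodes[1:]):
        if hi - lo <= tol:
            continue  # skip degenerate subintervals from repeated roots
        if P.polyval(0.5 * (lo + hi), q) < 0.0:
            keep.append((lo, hi))
    return keep

# Illustrative arc (hypothetical): xdot(s) = 3s, ydot(s) = 0, speed threshold 1.5.
keep = slow_subintervals([0.0, 3.0], [0.0], kappa=1.5)
```

Each retained pair `(lo, hi)` would then be recentered via Eq. (14) with $\hat{s} = (lo + hi)/2$ and $\delta = (hi - lo)/2$ before continuing to the remeshing step.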

4.3 Computational results: manifold atlases for the triple Copenhagen problem

Performance results for atlas computations at the libration points $\mathcal {L}_{0}$ and $\mathcal {L}_5$ are given in Tables 3 and 4, respectively. The computations are performed for the case of equal masses, that is, for the triple Copenhagen problem. The tables report the advection time, that is, the number of time units over which the boundary of the local parameterization is integrated, as well as the time required to complete the computations and the number of polynomial charts comprising the atlas. All computations were performed on a MacBook Air laptop running Sierra version 10.12.6, on a 1.8 GHz Intel Core i5, with 8 GB of 1600 MHz DDR3 memory.

The resulting atlases for $\mathcal {L}_0$ and $\mathcal {L}_5$ are illustrated in Figs. 7 and 8 for various integration times. The boundaries for the charts are also shown, making it clear that the computational effort goes up dramatically near the primaries. Note that the chart boundary lines running out of the local parameterizations are actual orbits of the system and hence give a sense of the dynamics on the manifold. The pictures provide some insight into the dynamics of the problem; however, their complexity illustrates the need for more sophisticated search techniques in order to extract further useful qualitative information from the atlases.

5 Homoclinic dynamics in the CRFBP

In this section, we discuss connecting orbits found for the symmetric $m_1 = m_2 = m_3 = 1/3$ case by searching the manifold atlases computed in the previous section.

5.1 Mining the atlases

Assume we have computed atlases, $\mathcal {A}^{s,u}$, for the stable/unstable manifolds of $\mathbf {x}_0$. We are interested in “mining” the chart data to find transverse connections. Since each atlas is stored as a collection of polynomial charts, it suffices to identify pairwise intersections between stable and unstable charts. Thus, throughout we assume $\varGamma ^{s,u} : [-\,1,1]^2 \rightarrow W^{s,u}(\mathbf {x}_0)$ is a pair of charts which parameterize a portion of the stable/unstable manifold. We write $\varGamma ^{s,u}_{1,2,3,4}$ to denote the scalar coordinates of each chart. The following theorem, whose proof can be found in Kepley and Mireles James (2018), provides a computable condition for verifying transverse intersection of a pair of charts.

Table 3 Atlas computations at $\mathcal {L}_0$ in the triple Copenhagen problem: each chart is computed to polynomial order 20 in space and order 40 in time


Table 4 Atlas computations at $\mathcal {L}_5$ in the triple Copenhagen problem: each chart is computed to polynomial order 20 in space and order 40 in time


Theorem 1

Define $G :[-\,1,1]^3 \rightarrow \mathbb {R}^3$ by

$$\begin{aligned} G(s, t, \sigma ) := \left( \begin{array}{c} \varGamma _1^u(s,t) - \varGamma _1^s(\sigma , 0) \\ \varGamma _2^u(s,t) - \varGamma _2^s(\sigma , 0) \\ \varGamma _3^u(s,t) - \varGamma _3^s(\sigma , 0) \\ \end{array} \right) , \end{aligned}$$

and suppose $(\hat{s}, \hat{t}, \hat{\sigma }) \in [-1, 1]^3$ satisfies $G(\hat{s}, \hat{t}, \hat{\sigma }) = 0$. If $\varGamma _4^u(\hat{s}, \hat{t})$ and $\varGamma _4^s(\hat{\sigma }, 0)$ have the same sign, then $\hat{\mathbf {x}} := \varGamma ^u(\hat{s}, \hat{t})$ is homoclinic to $\mathbf {x}_0$. Moreover, if $DG(\hat{s}, \hat{t}, \hat{\sigma })$ is nonsingular and if $\nabla E(\hat{\mathbf {x}}) \ne 0$ (where E is the CRFBP energy), then the energy level set is a smooth 3-manifold near $\hat{\mathbf {x}}$ and the stable/unstable manifolds of $\mathbf {x}_0$ intersect transversally in the energy manifold.

We emphasize that Theorem 1 provides a computable condition for verifying a transverse intersection using rigorous numerics. However, we will use the same theorem to detect transverse intersections in the purely numerical setting of this paper. This is made explicit in the following algorithm utilized in the mining scheme for all results in the present work.

Assume $\varGamma ^{s,u}, G$ are as defined in Theorem 1. Apply Newton’s method to find an approximate root of G. Let $\hat{v} = \left( \hat{s}, \hat{t}, \hat{\sigma }\right) $ denote an approximate solution with $G(\hat{v}) \approx 0$, and check the following conditions:

1.
$\varGamma _4^u(\hat{s},\hat{t})$, and $\varGamma _4^s(\hat{\sigma },0)$ are both “far” from 0.
2.
$\varGamma _4^u(\hat{s},\hat{t})$, and $\varGamma _4^s(\hat{\sigma },0)$ have the same sign.

If condition 1 holds without condition 2, then these charts are nonintersecting. In this case, these charts lie on separated portions of the stable/unstable manifolds which are symmetric with respect to the fourth coordinate. We refer to these as “pseudo-intersections.” On the other hand, if both conditions hold, then we conclude from Theorem 1 that we have numerically found a transverse homoclinic for $\mathbf {x}_0$ passing through $\varGamma ^u(\hat{s}, \hat{t}) = {\hat{\mathbf{x}}}$.
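The check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the charts `gamma_u` and `gamma_s` are hypothetical toy maps rather than charts from our atlases, the Jacobian of $G$ is approximated by central differences (for polynomial charts it could instead be differentiated exactly), and the threshold `far` for condition 1 is an arbitrary placeholder.

```python
import numpy as np

def gamma_u(s, t):
    """Toy unstable chart (hypothetical), returning a point in R^4."""
    return np.array([s, t, s + t ** 2, 1.0 + 0.1 * s])

def gamma_s(sigma, tau):
    """Toy stable chart (hypothetical), returning a point in R^4."""
    return np.array([0.5 * sigma, 0.2 + tau, 0.24 + 0.1 * sigma, 0.9 + 0.25 * sigma])

def G(v):
    """First three coordinates of Gamma^u(s, t) - Gamma^s(sigma, 0), as in Theorem 1."""
    s, t, sigma = v
    return gamma_u(s, t)[:3] - gamma_s(sigma, 0.0)[:3]

def newton_root(func, v0, tol=1e-10, max_iter=25, h=1e-6):
    """Newton's method with a central-difference Jacobian (an assumption of this sketch)."""
    v = np.asarray(v0, dtype=float)
    for _ in range(max_iter):
        g = func(v)
        if np.linalg.norm(g) < tol:
            return v, True
        J = np.empty((3, 3))
        for j in range(3):
            e = np.zeros(3)
            e[j] = h
            J[:, j] = (func(v + e) - func(v - e)) / (2.0 * h)
        v = v - np.linalg.solve(J, g)
    return v, bool(np.linalg.norm(func(v)) < tol)

def classify(v, far=1e-2):
    """Apply conditions 1 and 2 to an approximate zero v of G."""
    s, t, sigma = v
    if np.any(np.abs(v) > 1.0):
        return "outside chart domain"
    u4, s4 = gamma_u(s, t)[3], gamma_s(sigma, 0.0)[3]
    if min(abs(u4), abs(s4)) < far:
        return "ambiguous"                     # condition 1 fails
    return "homoclinic candidate" if u4 * s4 > 0 else "pseudo-intersection"

v_hat, converged = newton_root(G, [0.0, 0.0, 0.0])
label = classify(v_hat)
```

For these toy charts the zero of $G$ is $(\hat{s}, \hat{t}, \hat{\sigma }) = (0.25, 0.2, 0.5)$, where both fourth coordinates equal 1.025, so both conditions hold.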

Note that condition 1 serves two purposes in this setting. First, it serves as an easily computable condition for checking that $\nabla E({\hat{\mathbf{x}}}) \ne 0$ as required in the theorem. This follows by noting that

$$\begin{aligned} \pi _4 \circ \nabla E({\hat{\mathbf{x}}}) = {\hat{\mathbf{x}}}_4 = \varGamma _4^u(\hat{s},\hat{t}), \end{aligned}$$

so it follows that $\nabla E(\hat{\mathbf {x}}) \ne 0$ is satisfied automatically whenever condition 1 is satisfied.

In addition, condition 1 gives us some confidence that the sign difference from condition 2 holds due to transversality of the homoclinic, as opposed to numerical error. Indeed, if condition 1 is not satisfied, then $\varGamma _4^u(\hat{s},\hat{t})$, and $\varGamma _4^s(\hat{\sigma },0)$ take values near zero in which case sign errors for either coordinate are likely due to integration errors. In this case, even if condition 2 is satisfied we are unable to trust the result, and hence unable to conclude whether the zero of G corresponds to a transverse intersection or a pseudo-intersection. Fortunately, this situation can be remedied as discussed in Remark 11. As a result, we are free to choose our threshold for what is meant by “far” in the statement of condition 1 very conservatively which leads to a great deal of confidence that our mining algorithm returns only transverse homoclinic orbits.

We further increase our confidence in the approximate connection by using it as the input for a BVP solver based on Newton’s method, which allows us to refine our approximation to nearly machine precision, and it is the BVP formulation to which we then apply continuation methods. Every connection reported in this section has been so certified, and none of the connections identified from the mining algorithm had a BVP which failed to converge. In other words, the mining algorithm did not return any false homoclinics.

5.2 Efficient atlas mining

It is not desirable to check every pair of charts from each atlas using the above procedure, and we introduce two methods which significantly reduce the number of chart pairs which must be checked via the Newton intersection scheme based on Theorem 1.

5.2.1 The $\ell _1$ box approximation

The first method for improving the mining efficiency is to apply a coarse preprocessing step to each pair of charts which must be compared. The main idea is based on the fact that for most pairs of charts which do not intersect, these charts will “obviously” not intersect in the sense that their images in phase space will be very far apart. We exploit this using a fast algorithm for identifying many such pairs and in this case skip the slower Newton-based intersection attempt.

To be more precise, consider an arbitrary polynomial $P: [-\,1,1]^2 \rightarrow \mathbb {R}$ defined by

$$\begin{aligned} P(s,t) = \sum _{m = 0}^{M} \sum _{n = 0}^{N} a_{m,n}s^nt^m \quad a_{m,n} \in \mathbb {R}. \end{aligned}$$

We define the $\ell _1$ box for P to be

$$\begin{aligned} B_P = [a_{0,0} - r, a_{0,0} + r] \qquad \text {where } \ r = \sum _{(m,n) \ne (0,0)} \left| a_{m,n} \right| . \end{aligned}$$

The significance of $B_P$ is that we have the bound

$$\begin{aligned} \left| P(s,t) - a_{0,0} \right| \le r \qquad \text {for all } (s,t) \in [-\,1,1]^2 \end{aligned}$$

or equivalently, $P(s,t) \in B_P$ for all $(s,t) \in [-\,1,1]^2$. Analogously, we extend this to higher dimensions component-wise and apply this to geometrically rule out pairs of charts which can not intersect because their images are “well separated.” Specifically, consider a pair of stable/unstable charts

$$\begin{aligned} \varGamma ^s(s,t) = \sum _{m = 0}^{M} \sum _{n = 0}^{N} a_{m,n}s^nt^m \qquad \varGamma ^u(s,t) = \sum _{m = 0}^{M} \sum _{n = 0}^{N} b_{m,n}s^nt^m. \end{aligned}$$

which have $\ell _1$ boxes described by rectangles in $\mathbb {R}^4$ and satisfying $\varGamma ^s(s,t) \in B_{\varGamma ^s}$, and $\varGamma ^u(s,t) \in B_{\varGamma ^u}$. Then, if the set distance, $d(B_{\varGamma ^s}, B_{\varGamma ^u})$ is large enough, we can conclude that $\varGamma ^s,\varGamma ^u$ do not intersect.
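As an illustration, the following Python sketch computes component-wise $\ell _1$ boxes and tests whether two boxes are separated in at least one coordinate. It uses plain floating point arithmetic rather than interval arithmetic, and the array layout (`coeffs[i, m, n]` multiplying $s^nt^m$ in coordinate $i$) and the names `l1_box` and `boxes_disjoint` are assumptions made for this example.

```python
import numpy as np

def l1_box(coeffs):
    """Component-wise l1 box of a chart with coefficient array of shape (4, M+1, N+1).

    Returns (center, radius), each of length 4, where center[i] = a_{0,0} and
    radius[i] is the sum of |a_{m,n}| over (m, n) != (0, 0) in coordinate i.
    """
    coeffs = np.asarray(coeffs, dtype=float)
    center = coeffs[:, 0, 0]
    radius = np.abs(coeffs).sum(axis=(1, 2)) - np.abs(center)
    return center, radius

def boxes_disjoint(box_a, box_b):
    """True if the two boxes are separated along at least one coordinate axis."""
    (ca, ra), (cb, rb) = box_a, box_b
    return bool(np.any(np.abs(ca - cb) > ra + rb))

# Two small illustrative charts (hypothetical coefficients).
chart_a = np.zeros((4, 2, 2))
chart_a[:, 0, 0] = [0.0, 0.0, 0.0, 1.0]
chart_a[:, 0, 1] = 0.1
chart_a[:, 1, 0] = 0.05
chart_b = chart_a.copy()
chart_b[0, 0, 0] = 5.0   # shift the first coordinate far away

skip_newton = boxes_disjoint(l1_box(chart_a), l1_box(chart_b))
```

When `skip_newton` is true the images of the two charts cannot meet, so the pair need not be passed to the Newton-based intersection scheme.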

Using $\ell _1$ boxes has two advantages. The first is that computing and checking $\ell _1$ boxes for pairwise intersections is much faster than our Newton-like intersection method. This is due to the fact that for each coordinate the box radius, r, is equivalently computed as

$$\begin{aligned} r = \left| \left| P \right| \right| _{\ell _1} - \left| a_{0,0} \right| \end{aligned}$$

which is extremely fast to compute using modern implementations. Determining whether two boxes intersect or not is also fast due to efficient interval arithmetic libraries such as the INTLAB library for MATLAB (Rump 1999) which was utilized in our implementation.

The second advantage is that an $\ell _1$ box is typically a very coarse enclosure for the true values of P. This “problem” is often referred to as the data-dependence problem or the wrapping effect. In our situation, however, we consider the coarseness to be a feature since it makes our numerical estimates more conservative. Thus, we are able to rule out many pairs of charts which clearly do not intersect without introducing false negatives, that is, without discarding any pair of charts which actually intersects.

In practice, a single pairwise $\ell _1$ box intersection check is approximately 1,000 times more efficient than the Newton-based scheme and this method rules out around 90 percent of nonintersecting chart pairs. Moreover, the $\ell _1$ box for each chart can be computed only once during the atlas construction and stored. This leaves the cost of a single box intersection check as the only significant computational operation.

Finally, we remark that once $\ell _1$ boxes have been computed and stored for each chart in both atlases, one can make careful use of the triangle inequality to reduce the computation even further. This provides roughly an additional order of magnitude improvement in the efficiency of our algorithm which could be crucial to the feasibility of mining extremely large atlases. However, we took limited advantage of this fact in the present work.

5.2.2 Fundamental domains

The other main source of efficiency gain in our algorithm relies on using the dynamics explicitly. Recalling our notation in Sect. 4, assume $\mathcal {A}^s$ is the stable manifold atlas which we have computed to include the $L_s^\mathrm{th}$ generation and let $W^{s}_{k}(\mathbf {x}_0)$ denote the $k^\mathrm{th}$ generation local stable manifold. Then, $W^{s}_k(\mathbf {x}_0)$ is a fundamental domain for $W^{s}(\mathbf {x}_0)$. In other words, if $\mathbf {x}(t)$ is any orbit which satisfies $\lim \limits _{t \rightarrow \infty } \mathbf {x}(t) = \mathbf {x}_0$ and if $\mathbf {x}(0) \ne \mathbf {x}_0$, then there exists $t_k \in \mathbb {R}$ such that $\mathbf {x}(t_k) \in W^{s}_k(\mathbf {x}_0)$. Of course, the same claim holds for the unstable manifold. Taken together, if we assume we have computed the unstable manifold, $\mathcal {A}^u$, up to the $L_u^\mathrm{th}$ generation, then we have the following observation.

Proposition 2

Let $\mathbf {x}(t)$ be a transverse homoclinic to $\mathbf {x}_0$. Then $\mathbf {x}(t) \in W^s(\mathbf {x}_0) \cap W^u(\mathbf {x}_0)$ for all $t \in \mathbb {R}$. Let $W^{s,u}_0(\mathbf {x}_0),W^{s,u}_1(\mathbf {x}_0),\cdots ,W^{s,u}_{L_{s,u}}(\mathbf {x}_0)$ denote the generation sequence of local stable/unstable manifolds. Then exactly one of the following is true.

1.
There exists $k_s,k_u$ and $t_0 \in \mathbb {R}$, such that $\mathbf {x}(t_0) \in W^s_{k_s}(\mathbf {x}_0)\bigcap W^u_{k_u}(\mathbf {x}_0)$ and $k_s + k_u$ is constant for all pairs $(k_s,k_u)$ which satisfy this property.
2.
There exists $t_0\in \mathbb {R}$ such that for all $0 \le k_s \le L_s$, and $0\le k_u \le L_u$, we have
$$\begin{aligned} \mathbf {x}((-\infty , t_0)) \bigcap W^s_{k_s}(\mathbf {x}_0) = \emptyset \quad \text {and} \quad \mathbf {x} (( t_0,\infty ))\bigcap W^u_{k_u}(\mathbf {x}_0) = \emptyset . \end{aligned}$$

Proposition 2 says that any transverse homoclinic for $\mathbf {x}_0$ satisfying the second condition is a connection which does not intersect in the atlases which we have computed. Restricting to those that do, this proposition says that there is a “first” generation for both the stable and unstable atlases for which the connection will appear.

The significance of this situation is that we need only do pairwise comparisons between stable/unstable charts one generation at a time. Thus, mining intersections between the two atlases has computational complexity of order $\mathcal {O}(K_sK_u (L_s + L_u))$ where $K_s,K_u$ are the sizes of the largest stable/unstable generations, respectively. This is a dramatic improvement over the naive solution of checking every pair in both atlases, which has complexity of order $\mathcal {O}(L_sL_uK_sK_u)$.
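To get a feel for the size of this gap, consider hypothetical atlas sizes chosen purely for arithmetic convenience (they are not taken from Tables 3 and 4): $L_s = L_u = 10$ generations with at most $K_s = K_u = 500$ charts per generation. Then the two bounds evaluate to

$$\begin{aligned} L_sL_uK_sK_u&= 10 \cdot 10 \cdot 500 \cdot 500 = 2.5 \times 10^{7}, \\ K_sK_u (L_s + L_u)&= 500 \cdot 500 \cdot (10 + 10) = 5 \times 10^{6}, \end{aligned}$$

so the generation-wise search requires fewer chart comparisons by a factor of $L_sL_u/(L_s + L_u) = 5$. When $L_s = L_u = L$ this factor is $L/2$, so the saving grows linearly with the number of generations computed.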

Remark 10

We note that often the atlases we compute in practice do not technically satisfy the fundamental domain property. This is due to the fact that sections of manifold boundary which pass near a primary are “cut out” as described in Sect. 4.2.3. Nevertheless, this has no impact on our mining algorithm. Specifically, each generation is still a fundamental domain for the subset of the global manifold which satisfies the speed constraint. Thus, mining for connections via “leapfrogging” through pairwise generations is still assured to find all connections which are present in the computed atlases, and therefore, all connections which satisfy the speed constraint.

Remark 11

The result in Proposition 2 gives rise to a natural mining algorithm. Namely, at each generation, all chart pairs are compared and transverse intersections are identified. It follows that once a transverse intersection is identified, then the next/previous generation must also contain an orbit segment corresponding to the same homoclinic. Hence, in addition to gaining a computational speedup, exploiting the fundamental domain property also ensures that all homoclinics identified are distinct. This follows from the existence of the minimum value for $k_s + k_u$ in Proposition 2.

Furthermore, this observation yields a method of resolving the ambiguous case in which the Newton intersection method finds a zero for G but condition 1 from Sect. 5.1 is not satisfied. Specifically, if $G(\hat{s},\hat{t},\hat{\sigma }) \approx 0$ and $\varGamma ^u(\hat{s},\hat{t}) \approx \varGamma ^s(\hat{\sigma },0) \approx 0$, then we may follow the suspected intersection through earlier-/later-generation charts until the sign condition can be verified or refuted in appropriate predecessor/successor charts. Lastly, we mention that by storing “parent/child” information about the charts in the atlas, we can perform the search just described in post-processing.

5.3 The symmetric case: locating, refining, and classifying, connections

We now describe the homoclinic mining procedure in the case of the triple Copenhagen problem. Assume that we have computed stable/unstable atlases denoted by $\mathcal {A}^s,\mathcal {A}^u$, respectively. Each atlas is of the form described in Sect. 4, i.e., each atlas is a union of chart maps having the form, $\varGamma ^{s,u}: D \rightarrow \mathbb {R}^4$ with $\varGamma ^{s,u}(D) \subset W^{s,u}(\mathbf {x_0})$.

We begin with a lemma to motivate the choice to grow each atlas in the symmetric case and then do continuation as opposed to growing the atlas for nonsymmetric cases.

Lemma 1

Assume f is the symmetric CRFBP vector field, i.e., $m_1 = m_2 = m_3 = \frac{1}{3}$ and define two linear maps, $\varphi ^{\pm }: \mathbb {R}^4 \rightarrow \mathbb {R}^4$ by $\varphi ^{\pm }(x,\dot{x},y,\dot{y}) = \varphi ^{\pm }(\mathbf {x}) = R^{\pm }\mathbf {x}$ where $R^{\pm }$ is the matrix given by

$$\begin{aligned} R^{\pm } = \left( \begin{array}{cccc} \cos (\pm \,\theta ) &{}\quad 0 &{}\quad - \sin (\pm \,\theta ) &{}\quad 0 \\ 0 &{}\quad \cos (\pm \,\theta ) &{}\quad 0 &{}\quad - \sin (\pm \,\theta ) \\ \sin (\pm \,\theta ) &{}\quad 0 &{}\quad \cos (\pm \,\theta ) &{}\quad 0 \\ 0 &{}\quad \sin (\pm \,\theta ) &{}\quad 0 &{}\quad \cos (\pm \,\theta ) \\ \end{array} \right) \qquad \theta = \frac{2 \pi }{3}, \end{aligned}$$

then $\varphi ^{\pm }$ is a rotational conjugacy for f and $\varphi ^{\pm } \circ f(\mathbf {x}) = f \circ \varphi ^{\pm }(\mathbf {x})$ for all $\mathbf {x} \in \mathbb {R}^4$. In particular, if $\gamma $ parameterizes a homoclinic orbit for $\mathcal {L}_0$, then $\varphi ^{\pm } \circ \gamma $ are parameterizations for two additional, distinct “symmetric” homoclinic orbits for $\mathcal {L}_0$. Moreover, if $\gamma $ parameterizes a homoclinic orbit for $\mathcal {L}_5$, then $\varphi ^{+} \circ \gamma $ and $\varphi ^- \circ \gamma $ parameterize symmetric homoclinics for $\mathcal {L}_4$ and $\mathcal {L}_6$.

The proof of Lemma 1 is included in “Appendix A.” The significance of this symmetry is the fact that global stable/unstable atlases for the triple Copenhagen problem can be separated into three distinct equivalence classes where for $\mathbf {x},\mathbf {y} \in W^*({\mathbf {x}_0})$, the equivalence relation $\mathbf {x} \sim \mathbf {y}$ is satisfied if and only if $\mathbf {x} \in \{\mathbf {y}, \varphi ^+(\mathbf {y}), \varphi ^-(\mathbf {y})\}$. Thus, each atlas is obtained by advection of only a single representative for each class. In other words, in the equal masses case, we only need to advect $\frac{1}{3}$ of each initial parameterization boundary to obtain the entire atlas. Specifically, we define

$$\begin{aligned} D' = \{\mathbf {z} \in D : 0\le \text {Arg}(z_1) < \theta , \ z_2 = \text {conj}(z_1) \} \end{aligned}$$

and we globalize only $\partial D'$ to obtain a partial atlas, $\mathcal {A}'$. We can then access the full global atlas by applying $\varphi ^+,\varphi ^-$ to each chart in $\mathcal {A}'$ and we set

$$\begin{aligned} \mathcal {A} = \mathcal {A}' \cup \varphi ^+ \left( \mathcal {A}'\right) \cup \varphi ^- \left( \mathcal {A}'\right) . \end{aligned}$$

The advantage is a ninefold increase in computational efficiency for the atlas computation and a threefold improvement in efficiency for the atlas mining scheme. Applying the procedure for the triple Copenhagen problem results in the connecting orbits illustrated in Fig. 9. These results are further described and classified in the next section.
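Because $\varphi ^{\pm }$ are linear, applying them to a polynomial chart amounts to multiplying each vector-valued coefficient by $R^{\pm }$. The following Python sketch builds the matrices from Lemma 1 and rotates a chart stored as a coefficient array; the array layout (`coeffs[i, m, n]` for coordinate $i$) and the function names are assumptions made for illustration.

```python
import numpy as np

def rotation(sign):
    """The matrix R^+ (sign=+1) or R^- (sign=-1) from Lemma 1, with theta = 2*pi/3."""
    theta = sign * 2.0 * np.pi / 3.0
    c, s = np.cos(theta), np.sin(theta)
    return np.array([[c, 0.0, -s, 0.0],
                     [0.0, c, 0.0, -s],
                     [s, 0.0, c, 0.0],
                     [0.0, s, 0.0, c]])

def rotate_chart(coeffs, R):
    """Coefficients of R * Gamma(s, t) for a chart with coefficients of shape (4, M+1, N+1)."""
    return np.einsum("ij,jmn->imn", R, np.asarray(coeffs, dtype=float))

R_plus, R_minus = rotation(+1), rotation(-1)

# Illustrative chart in the partial atlas (hypothetical coefficients).
chart = np.arange(16, dtype=float).reshape(4, 2, 2)
full = [chart, rotate_chart(chart, R_plus), rotate_chart(chart, R_minus)]

# Consistency checks: R^- inverts R^+, and three rotations return to the start.
assert np.allclose(R_plus @ R_minus, np.eye(4))
assert np.allclose(np.linalg.matrix_power(R_plus, 3), np.eye(4))
```

In this way the rotated charts never need to be computed by advection; they are generated on demand from the stored charts of $\mathcal {A}'$.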

5.3.1 Quantitative/qualitative classifications of the homoclinic orbit set at $\mathcal {L}_{0,5}$

Suppose $\mathbf {x}_0 \in \mathbb {R}^4$ is an equilibrium solution and $W^{s,u}_{\mathrm{loc}}(\mathbf {x}_0)$ a local stable/unstable manifold. Let $\gamma $ be an orbit homoclinic to $\mathbf {x}_0$, and suppose that $T \in \mathbb {R}$ is the elapsed time from when $\gamma $ passes through the boundary of the local unstable manifold to when $\gamma $ passes through the boundary of the local stable manifold. Observe that if $W^{s}_{\mathrm{loc}}(\mathbf {x}_0) \cap W^{u}_{\mathrm{loc}}(\mathbf {x}_0) = \{\mathbf {x}_0\}$ and if the vector field is inflowing/outflowing on the boundaries of $W^{s,u}_{\mathrm{loc}}(\mathbf {x}_0)$, respectively, then $T > 0$ is well defined.

When the local parameterizations intersect only at $\mathbf {x}_0$, it makes sense to talk about the “shortest” connection time, the “second shortest” connection time, and so on. This natural ordering on connection times provides a useful observable for classifying homoclinic connections relative to fixed local stable/unstable manifolds. Generically, we expect a one-to-one correspondence between connecting orbits and connection times, though this expectation will fail in the presence of symmetries as seen below.

In the CRFBP, when we “mine” the stable/unstable atlases for connecting orbits and order them by connection time we see something interesting. In each of the cases we studied, the shortest homoclinic orbits appear to organize the longer connections. Informally speaking, we find that a small number of short homoclinic orbits serve as a sort of alphabet of “letters,” and the longer connections can be roughly identified as “words” in this alphabet.

For example, the first 42 homoclinic connecting orbits (up to symmetry) at $\mathcal {L}_0$ in the triple Copenhagen problem are classified in Table 5. These results are obtained by integrating initial local stable/unstable manifolds for $\pm \,5$ time units subject to the speed constraint, $\dot{x}^2 + \dot{y}^2 \le 4$. Our method finds all of the connections satisfying these constraints. The classification is in terms of the connection time, the order of appearance, and a geometric description in terms of words and letters.

We give the names $L_{0A}$ and $L_{0B}$ to the shortest two connections at $\mathcal {L}_0$. These orbits are illustrated in Fig. 10 and have connection times approximately 1.717 and 2.331, respectively. Rotating either of these by $\pm \,120^{\circ }$ gives another connecting orbit with exactly the same shape and connection time. We refer to these rotations as $L_{0A^{\pm }}$ and $L_{0B^{\pm }}$. These six shortest connections—$L_{0A}, L_{0B}$ and their symmetric counterparts—organize the rest of the homoclinic behavior seen at $\mathcal {L}_0$ as we now describe.

We associate the third shortest connection with the word $L_{0A^+} \cdot L_{0A}$ because the orbit moves off the unstable manifold appearing to follow $L_{0A^+}$, passes near the equilibrium at $\mathcal {L}_0$, and makes another excursion following $L_{0A}$ before returning to the stable manifold. Similarly, we associate with the 5th longest connecting orbit the word $L_{0B}^2$, as this orbit moves off the unstable manifold and appears to follow $L_{0B}$, making two loops around the second primary before returning to the stable manifold. Heuristically speaking, $L_{0A}$, $L_{0B}$ and their symmetric counterparts comprise a system of “homoclinic channels” or simple allowable motions and other homoclinic orbits seem to follow in their wake.

Table 6 records analogous information for the first 23 connections found at $\mathcal {L}_5$ in the triple Copenhagen problem. In this case there are six basic letters $L_{5A}, L_{5B}, L_{5C}, L_{5D}, L_{5E}, L_{5F}$. Words are formed from these letters just as discussed above. Applying $\pm \,120^{\circ }$ rotations produces connections with the same shapes and connection times at $\mathcal {L}_4$ and $\mathcal {L}_6$, respectively. We stress that this description of the connecting orbits in terms of words and letters, while intuitively appealing, is based on qualitative observations and is subordinate to the rigid quantitative classification of the orbits by connection time.

Table 5 Classification of the connecting orbits for $\mathcal {L}_0$: advecting one-third of the boundary of the local stable/unstable manifolds for $T = \pm \,5$ time units and imposing a speed threshold of 2 reveals the 42 homoclinic connections illustrated in the left frame of Fig. 9


Table 6 Classification of the connecting orbits for $\mathcal {L}_5$: the 23 homoclinic connections which appear on the right side of Fig. 9 satisfying the same connection time and speed constraints as in the $\mathcal {L}_0$ case


Remark 12

Several comments about the results reported in Tables 5 and 6 are in order.

Additional symmetries: Some of the orbits, for example the 27th and 28th shortest orbits at $\mathcal {L}_0$ and the 21st and 22nd shortest orbits at $\mathcal {L}_5$, have the same reported connection times to three decimal places. In fact the connection times agree to within numerical errors. This is because the equal mass problem has reversible symmetries that we are not exploiting in our computations. Rather, these serve as a check on the numerics.
Connection time versus ordering: While the connection times reported in these tables depend on the choice of local stable/unstable manifold, the ordering of the connections does not change as long as the parameterization method is used to represent the local manifolds as discussed in Sect. 3. It is easy to check that the boundaries of the parameterized manifolds are inflowing/outflowing and that the manifolds intersect only at the equilibrium solution. Moreover, since the eigenvalues are complex conjugate, the local parameterizations are unique up to the choice of a single eigenvector scale factor. Once the unit disk is chosen as the domain of the parameterization, the scaling is the only free parameter in the problem. Decreasing the scaling by a factor of $\tau > 0$ is equivalent to flowing the boundary by the same time $\tau $. So rescaling the eigenvectors changes all the times of flight by exactly the same amount, and hence does not reorder them.
Qualitative classification: The decomposition of the connecting orbits into words is performed “by eye” in the present work. That is, we simply inspect the connections and describe what we see. We now sketch a method which could be used to formalize our qualitative description and note that the idea is computationally feasible. Recall that if $\gamma $ is a closed rectifiable curve in the plane, and $z_0 = x_0 + i y_0$ is a point not on $\gamma $, then the number of times that $\gamma $ winds counterclockwise around $z_0$ is given by the integral
$$\begin{aligned} \text{ Ind }_\gamma (z_0) = \frac{1}{2 \pi i} \int _\gamma \frac{1}{z - z_0} \, \mathrm{d}z. \end{aligned}$$
So, observe for example that the curve $L_{0A}$ winds once around the libration point $\mathcal {L}_1$, while $L_{0A^+}$ and $L_{0A^-}$ each wind once, respectively, around $\mathcal {L}_2$ and $\mathcal {L}_3$. Similarly, the curve $L_{0B}$ winds once around the second primary, while $L_{0B^+}$ and $L_{0B^-}$ wind once each around the third and first primaries. Then for a homoclinic orbit $\gamma $ at $\mathcal {L}_0$ define the integer vector $(\alpha _1, \alpha _2, \alpha _3, \beta _1, \beta _2, \beta _3) \in \mathbb {Z}^6$, where $\alpha _j = \text{ Ind }_{\gamma }(\mathcal {L}_j)$ and $\beta _j = \text{ Ind }_{\gamma }(P_j)$ for $j = 1,2,3$. (Here $P_j$ denotes the coordinates of the $j$th primary.) Then $\alpha _1$ counts the contribution of $L_{0A}$ to $\gamma $, while $\beta _2$ counts the contribution of $L_{0B}$, and so on. This description amounts to an Abelianization of the previous notion of words/letters, as the winding vector loses track of the order of the letters in the word. (It is often the case that mechanical calculation of topological data is facilitated by passing to an Abelianization.) This notion is extended to the homoclinic orbits at $\mathcal {L}_{5}$ in a similar way; see Fig. 18.
Blue skies: The main theorem of Henrard (1973), already mentioned in the introduction, gives that there is a family of periodic orbits accumulating to each of the homoclinic orbits found by our procedure. In some cases we can venture a guess as to which families of periodic orbits accumulate to which homoclinic. For example, Fig. 17 illustrates the orbits $L_{0A}$ and $L_{0A^{\pm }}$ along with the planar Lyapunov families attached to the inner libration points $\mathcal {L}_{1,2,3}$. The results suggest that the planar families may accumulate to these homoclinics. Similarly, comparing the orbits $L_{5E}$ and $L_{5F}$ in the bottom left and right frames of Fig. 18 with the planar Lyapunov families at $\mathcal {L}_{7,9}$ illustrated in the left frame of Fig. 3 suggests that these may be the families of periodic orbits attached to these homoclinics. The orbits $L_{0B}$ and $L_{0B^\pm }$, as well as the orbits $L_{5A}$ and $L_{5B}$, must be the limits of families of periodic orbits winding around the primary bodies. Making a systematic study of the periodic families associated with the homoclinic orbits discussed here would make a nice topic for a future study.
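
The winding vector sketched under “Qualitative classification” in Remark 12 is straightforward to approximate for an orbit stored as sampled points. The following Python sketch is illustrative only and is not the authors' code. It assumes NumPy, assumes the homoclinic orbit has been sampled finely and closed up by joining its two ends near the equilibrium, and uses hypothetical names (`winding_number`, `winding_vector`). The line integral is replaced by the sum of the angle increments of $z - z_0$ between consecutive samples, which agrees with $\text{Ind}_\gamma (z_0)$ whenever every increment is smaller than $\pi $ in absolute value.

```python
import numpy as np


def winding_number(curve_xy, point):
    """Winding number of a closed polygonal curve about `point`.

    curve_xy : array of shape (n, 2), samples taken in order along the
               closed curve (the last sample is joined back to the first).
    point    : (x0, y0), assumed not to lie on the curve.
    """
    curve_xy = np.asarray(curve_xy, dtype=float)
    z = curve_xy[:, 0] + 1j * curve_xy[:, 1] - complex(point[0], point[1])
    # Angle swept between consecutive samples, each in (-pi, pi].
    increments = np.angle(np.roll(z, -1) / z)
    return int(np.rint(increments.sum() / (2.0 * np.pi)))


def winding_vector(curve_xy, libration_points, primaries):
    """Integer vector (alpha_1, ..., beta_1, ...) of winding numbers.

    `libration_points` and `primaries` are sequences of (x, y) pairs
    supplied by the caller; nothing here is specific to the CRFBP.
    """
    targets = list(libration_points) + list(primaries)
    return tuple(winding_number(curve_xy, p) for p in targets)
```

For a homoclinic orbit at $\mathcal {L}_0$ one would pass the coordinates of $\mathcal {L}_{1,2,3}$ and of the three primaries. As noted in the remark, the resulting vector records how often each point is encircled but not the order in which the excursions occur.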

Remark 13

(Some related work on asymptotic orbits) There are interesting similarities between some of the orbits discussed above and some asymptotic orbits already discovered in Baltagiannis and Papadakis (2011b). The interested reader might, for example, compare the homoclinic orbit in the bottom right frame of Fig. 20 with the heteroclinic termination orbit illustrated in Figure 5 of Baltagiannis and Papadakis (2011b). (To make such a comparison one has to “flip” Figure 5 of Baltagiannis and Papadakis (2011b) $180^{\circ }$ about the y-axis, as the two papers use different normalizations of the four-body problem. Also, their $L_3$ is our $\mathcal {L}_5$.) In that study the heteroclinic is discovered by numerical continuation of the authors’ $f_{10}$ family of periodic orbits: a family of orbits with winding number one about all three of the primary masses. We note that our homoclinic of Fig. 20 is similar, but that the $\pm \,120^{\circ }$ rotational symmetry is broken. We conjecture that there are three families of periodic orbits bifurcating from the $f_{10}$ family after a symmetry breaking, and that these families terminate on the homoclinic of Fig. 20 (bottom right) and its counterparts rotated by $\pm \,120^{\circ }$.

Similarly, the heteroclinic orbit illustrated in Figure 4 of Baltagiannis and Papadakis (2011b), which is the termination of the authors’ $f_5$ family, is related to the pair of homoclinic orbits $L_{5E}$ and $L_{5F}$ illustrated in Fig. 18. To see this, imagine an orbit obtained by combining our $L_{5F}$ with the orbit $L_{4E}$, that is, our $L_{5E}$ rotated by $-\,120^{\circ }$ so that it is based at $\mathcal {L}_4$. The resulting union of curves has the same shape as the heteroclinic illustrated in Figure 4 of Baltagiannis and Papadakis (2011b). This suggests that the families of periodic orbits which terminate at our $L_{5E}$ and $L_{5F}$ could emerge from the planar Lyapunov families after symmetry breaking.

In general we note that the homoclinic orbits tend to have less symmetry than the heteroclinic orbits, so that studying the periodic orbits terminating at the homoclinics is a good way to obtain asymmetric periodic orbits, even in the symmetric versions of the problem. We also note that changing the mass parameters will tend to destroy heteroclinic connections, as the libration points will move into distinct energy levels. Homoclinic orbits, on the other hand, persist under generic Hamiltonian perturbations of the vector field. In particular, they persist after a small change in the mass ratios, facilitating numerical continuation as discussed below.

5.4 Numerical continuation of ensembles of connections

The fact that the homoclinic connecting orbits are formulated as solutions of boundary value problems makes parameter continuation natural. We give only an outline of our continuation algorithm, as numerical continuation of homoclinic orbits for Hamiltonian systems is described in great detail in the literature. References are discussed in the introduction.

Begin with an ensemble of connecting orbits for a libration point $\mathcal {L}$ at the mass parameters $m_1, m_2, m_3$ (initially we have $m_1 = m_2 = m_3 = 1/3$).

We choose a new parameter set $\bar{m}_1 = m_1 + \delta _1$, $\bar{m}_3 = m_3 + \delta _3$. Then we compute $\bar{m}_2 = 1 - \bar{m}_1 - \bar{m}_3$ and apply a first-order predictor-corrector to find the libration point at the new parameter values. We numerically compute the eigenvalues and eigenvectors of the new libration point, and if it remains a saddle-focus (i.e., if there has been no bifurcation) we proceed.
We recompute the local invariant manifolds at the new parameter set. A good strategy is to compute the coefficients to order $N_0$ by recursively solving the homological equations. Initially we take the eigenvector scaling from the previous step and rescale if needed. For the higher-order coefficients we use the coefficients from the previous step. This gives an initial guess for the Newton or pseudo-Newton method, which usually converges very fast.
The new local parameterizations provide the boundary conditions for the multiple shooting scheme for the homoclinic orbits. We take the connecting orbits from the previous step as the initial guesses for the Newton method at the current mass parameters. If necessary we can apply a first-order predictor-corrector, but this is often unnecessary because the boundary value problem formulated with the high-order parameterizations of the local manifold is very well conditioned. Note that in a given continuation step, the same local parameterizations serve as the boundary conditions for the entire ensemble of connecting orbits. This justifies the cost of computing high-order representations of the manifolds.
Once we have applied Newton to all the connections in the ensemble, we are ready to take a new step. If Newton fails to converge for any of the connecting orbits, we must decide whether to throw the orbit away or to recompute with smaller $\delta _1, \delta _3$.
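
The stepping logic of the outline above (reuse the previous solution as the Newton guess, and shrink the parameter step when Newton fails) can be sketched on a toy problem. The code below is a minimal illustration under strong simplifying assumptions and is not the continuation code used for the results in this section. The unknown `u` stands in for the discretized connecting orbit, the scalar `p` for a mass parameter, and `G`, `DG`, `newton`, and `continue_branch` are hypothetical names. Recomputing the libration point and the local parameterizations at each step is omitted.

```python
import numpy as np


def newton(G, DG, u0, tol=1e-12, max_iter=20):
    """Newton iteration for G(u) = 0; returns (u, converged)."""
    u = np.array(u0, dtype=float)
    with np.errstate(all="ignore"):  # divergence is reported via the flag
        for _ in range(max_iter):
            r = G(u)
            if np.linalg.norm(r) < tol:
                return u, True
            try:
                u = u - np.linalg.solve(DG(u), r)
            except np.linalg.LinAlgError:
                return u, False
        return u, bool(np.linalg.norm(G(u)) < tol)


def continue_branch(G, DG, u_start, p_start, p_end, step, min_step=1e-6):
    """Follow a solution of G(u, p) = 0 from p_start up to p_end.

    The solution at the previous parameter value is the initial guess at
    the next one. When Newton fails, the parameter step is halved and the
    step is retried; below min_step the branch is abandoned.
    """
    p, u = p_start, np.array(u_start, dtype=float)
    branch = [(p, u.copy())]
    while p < p_end:
        p_new = min(p + step, p_end)
        u_new, ok = newton(lambda v: G(v, p_new), lambda v: DG(v, p_new), u)
        if ok:
            p, u = p_new, u_new
            branch.append((p, u.copy()))
        else:
            step *= 0.5
            if step < min_step:
                break  # corresponds to throwing the orbit away
    return branch


# Toy problem: u^3 + u - p = 0, continued from (u, p) = (0, 0) to p = 2.
def G_toy(u, p):
    return np.array([u[0] ** 3 + u[0] - p])


def DG_toy(u, p):
    return np.array([[3.0 * u[0] ** 2 + 1.0]])


toy_branch = continue_branch(G_toy, DG_toy, [0.0], 0.0, 2.0, 0.5)
```

For an ensemble, the same loop would be run over every connecting orbit at each parameter value, with the step reduced for the whole ensemble or the failing orbit dropped, as described in the last item of the outline.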

We also remark that the atlas is not recomputed at the new mass parameter set. That is, we continue only the connecting orbits (the intersections of the stable and unstable manifolds), not the manifolds themselves. Continuation of ensembles of connections is much cheaper than recomputing the atlas each time we change parameters.

Results of several numerical continuations are illustrated in Figs. 21 and 22. As we change the masses we break the rotational symmetry of the triple Copenhagen problem, and the symmetric counterparts resolve into distinct connections, no longer obtainable by rotations of a single representative. During the numerical continuation we sometimes encounter bifurcations of the connecting orbits themselves, which involve no bifurcation of the underlying equilibrium. Figure 23 illustrates a common scenario where a family of homoclinic orbits undergoes a doubling bifurcation. These bifurcations seem very common and we have not made a systematic effort to track them. This would make an interesting topic for a future study.

6 Conclusions

In this work we implemented a numerical method for computing an atlas for the stable/unstable manifold attached to a libration point in the CRFBP. We consider saddle-focus equilibria, as in this case topological theorems give rich dynamical structure near a transverse homoclinic. We then developed algorithms for searching or “mining” the atlas for approximate connections. After an approximate connection is found we sharpen it using a Newton scheme for an appropriate boundary value problem. The procedure is completely automated and locates all the homoclinic orbits up to a specified integration time. (To make the calculations less stiff we ignore orbits passing too close to the primaries). The resulting collection of homoclinic orbits is ordered by connection time. A different choice of local parameterization may yield different connection times, but the ordering of these connections is universal. This last comment requires that the local stable/unstable manifolds are computed using the parameterization method.

Our method locates dozens of distinct homoclinic connections and we consider their qualitative properties in addition to their connection times. The geometry of the homoclinic orbit set is organized by the several shortest connections, in the sense that they form a system of channels that other connections appear to follow. We decompose the motions of the longer connections into words built from these simple letters and discuss briefly how this decomposition could be calculated in an automated way using the formulation of the winding number as a complex line integral.

Finally, we continue some of the orbits found in the equal mass case to other nonsymmetric mass values using a predictor-corrector scheme for the boundary value problem. Rather than recomputing the entire atlas, the continuation scheme only recomputes the initial parameterization at the new masses, and this can be done via a Newton scheme.

We remark that it would be easy to intersect the atlas data computed here with any desired surface of section. We have not used sections in the present work because (a) we wanted to find all the connections up to a certain integration time and a given section may find some orbits and not others and (b) projecting to a section may not preserve the ordering of the homoclinics by connection time. Moreover, while the present project focuses on the CRFBP, a two-degree-of-freedom Hamiltonian system, the scheme described here could be extended to higher-dimensional systems where sections are four or more dimensional and hence less useful for visualization purposes. In such a situation, for example the spatial CRFBP, it is desirable to have an automated procedure.

Other interesting topics of future research would be to combine our methods with more sophisticated continuation and branch following algorithms. It would also be nice to return to the ideas of Strömgren and examine the “tubes” of periodic orbits attached to each of our homoclinic connections. These periodic families would themselves undergo bifurcations which one could try to follow numerically.

Another improvement to our method would be to remove the speed constraints on our manifold computations. This could be done by regularizing binary collisions. The idea would be that whenever a chart gets too close to a primary, then instead of subdividing we would change to the regularized coordinates where computations are less stiff. This idea of using such regularizations to improve numerics goes back at least to the work of Thiele. This would also provide a natural way of computing collision orbits between $\mathcal {L}_{0, 5}$ and each of the primaries, a topic we have avoided via our imposed speed constraints. A modern implementation combined with our approach to computing atlases would be valuable, and is the subject of ongoing work.

If such advancements let us compute larger and more complete atlases, a very interesting question is to see if other “fundamental” connecting orbits appear. For example, at $\mathcal {L}_0$ all the connections we find shadow two basic orbits $L_{0A}$, $L_{0B}$ and their symmetric counterparts. Is this true of all the connections? Or is this simply an artifact of the fact that we only consider connections whose velocity is never too large? Will performing longer searches yield more fundamental letters for the alphabets at $\mathcal {L}_{0,5}$?

Of course, with more computing power one could perform the atlas computations at more values of the mass parameters, say for a mesh of ten or twenty different points in the simplex $m_1 + m_2 + m_3 = 1$. This would provide a more complete picture of the global orbit structure. Such a project would greatly benefit from a cluster computing implementation exploiting the data independence of the computations at different parameter sets, and indeed the independence of different portions of the atlas at a given parameter set. Numerical continuation could then be applied to “fill in the gaps” between the mesh points.

References

Abraham, R.H.: Chaostrophes, intermittency, and noise. In: Chaos, Fractals, and Dynamics (Guelph, Ont., 1981/1983), Volume 98 of Lecture Notes in Pure and Applied Mathematics, pp. 3–22. Dekker, New York (1985)
Álvarez-Ramírez, M., Vidal, C.: Dynamical aspects of an equilateral restricted four-body problem. Math. Probl. Eng. 23 (2009). Article ID 181360
Álvarez-Ramírez, M., Barrabés, E.: Transport orbits in an equilateral restricted four-body problem. Celest. Mech. Dyn. Astron. 121(2), 191–210 (2015)
Álvarez-Ramírez, M., Delgado, J., Vidal, C.: Global regularization of a restricted four-body problem. Int. J. Bifurc. Chaos Appl. Sci. Eng. 24(7), 1450092, 15 (2014)
Álvarez-Ramírez, M., García, A., Palacián, J.F., Yanguas, P.: Oscillatory motions in restricted n-body problems. J. Differ. Equ. 265, 779–803 (2018)
Baltagiannis, A.N., Papadakis, K.E.: Equilibrium points and their stability in the restricted four-body problem. Int. J. Bifurc. Chaos Appl. Sci. Eng. 21(8), 2179–2193 (2011a)
Baltagiannis, A.N., Papadakis, K.E.: Families of periodic orbits in the restricted four-body problem. Astrophys. Space Sci. 336, 357–367 (2011b)
Barrabés, E., Mondelo, J.M., Ollé, M.: Numerical continuation of families of homoclinic connections of periodic orbits in the RTBP. Nonlinearity 22(12), 2901–2918 (2009)
Barros, J.F., Leandro, E.S.G.: The set of degenerate central configurations in the planar restricted four-body problem. SIAM J. Math. Anal. 43(2), 634–661 (2011)
Barros, J.F., Leandro, E.S.G.: Bifurcations and enumeration of classes of relative equilibria in the planar restricted four-body problem. SIAM J. Math. Anal. 46(2), 1185–1203 (2014)
Burgos-García, J.: Families of periodic orbits in the planar Hill’s four-body problem. Astrophys. Space Sci. 361(11), 353, 21 (2016)
Burgos-García, J., Bengochea, A.: Horseshoe orbits in the restricted four-body problem. Astrophys. Space Sci. 362(11), 212, 14 (2017)
Burgos-García, J., Delgado, J.: Periodic orbits in the restricted four-body problem with two equal masses. Astrophys. Space Sci. 345(2), 247–263 (2013a)
Burgos-García, J., Delgado, J.: On the “blue sky catastrophe” termination in the restricted four-body problem. Celest. Mech. Dyn. Astron. 117(2), 113–136 (2013b)
Burgos-García, J., Gidea, M.: Hill’s approximation in a restricted four-body problem. Celest. Mech. Dyn. Astron. 122(2), 117–141 (2015)
Burgos-García, J., Lessard, J.P., Mireles James, J.D.: Spatial periodic orbits in the equilateral circular restricted four body problem: computer assisted proofs of existence. Celest. Mech. Dyn. Astron. 131, 2 (2019)
Cabré, X., Fontich, E., de la Llave, R.: The parameterization method for invariant manifolds. I. Manifolds associated to non-resonant subspaces. Indiana Univ. Math. J. 52(2), 283–328 (2003a)
Cabré, X., Fontich, E., de la Llave, R.: The parameterization method for invariant manifolds II. Regularity with respect to parameters. Indiana Univ. Math. J. 52(2), 329–360 (2003b)
Cabré, X., Fontich, E., de la Llave, R.: The parameterization method for invariant manifolds. III. Overview and applications. J. Differ. Equ. 218(2), 444–515 (2005)
Canalias, E., Masdemont, J.J.: Homoclinic and heteroclinic transfer trajectories between planar Lyapunov orbits in the sun–earth and earth–moon systems. Discrete Contin. Dyn. Syst. Ser. A 14(2), 261–279 (2006)
Champneys, A.R., Kuznetsov, Y.A., Sandstede, B.: A numerical toolbox for homoclinic bifurcation analysis. Int. J. Bifurc. Chaos Appl. Sci. Eng. 6(5), 867–887 (1996)
Cheng, X., She, Z.: Study on chaotic behavior of the restricted four-body problem with an equilateral triangle configuration. Int. J. Bifurc. Chaos Appl. Sci. Eng. 27(2), 1750026, 12 (2017)
Darwin, G.H.: Periodic orbits. Acta Math. 21(1), 99–242 (1897)
de la Llave, R., Mireles James, J.D.: Parameterization of invariant manifolds by reducibility for volume preserving and symplectic maps. Discrete Contin. Dyn. Syst. 32(12), 4321–4360 (2012)
Devaney, R.L.: Homoclinic orbits in Hamiltonian systems. J. Differ. Equ. 21(2), 431–438 (1976)
Devaney, R.L.: Blue sky catastrophes in reversible and Hamiltonian systems. Indiana Univ. Math. J. 26(2), 247–263 (1977)
Doedel, E.J., Friedman, M.J.: Numerical computation of heteroclinic orbits. J. Comput. Appl. Math. 26(1–2), 155–170 (1989). (Continuation techniques and bifurcation problems)
Doedel, E.J., Friedman, M.J., Kunin, B.I.: Successive continuation for locating connecting orbits. Numer. Algorithms 14(1–3), 103–124 (1997). [Dynamical numerical analysis (Atlanta, GA, 1995)]
Gidea, M., Burgos, M.: Chaotic transfers in three- and four-body systems. Phys. A 328(3–4), 360–366 (2003)
Haro, À., Canadell, M., Figueras, J.-L., Luque, A., Mondelo, J.-M.: The Parameterization Method for Invariant Manifolds, Volume 195 of Applied Mathematical Sciences. Springer, Cham (2016). (From rigorous results to effective computations)
Henrard, J.: Proof of a conjecture of E. Strömgren. Celest. Mech. 7, 449–457 (1973)
Kalies, W., Kepley, S., Mireles James, J.D.: Analytic continuation of local (un)stable manifolds with rigorous computer assisted error bounds. SIAM J. Appl. Dyn. Syst. 17(1), 157–202 (2018)
Kepley, S., Mireles James, J.D.: Chaotic motions in the restricted four body problem via Devaney’s saddle-focus homoclinic tangle theorem. J. Differ. Equ. 266(4), 1709–1755 (2019)
Koon, W.S., Lo, M.W., Marsden, J.E., Ross, S.D.: Heteroclinic connections between periodic orbits and resonance transitions in celestial mechanics. Chaos 10(2), 427–469 (2000)
Leandro, E.S.G.: On the central configurations of the planar restricted four-body problem. J. Differ. Equ. 226(1), 323–351 (2006)
Lerman, L.M.: Complex dynamics and bifurcations in a Hamiltonian system having a transversal homoclinic orbit to a saddle focus. Chaos 1(2), 174–180 (1991)
Lerman, L.M.: Dynamical phenomena near a saddle-focus homoclinic connection in a Hamiltonian system. J. Stat. Phys. 101(1/2), 357–372 (2000)
Lessard, J.-P., Mireles James, J.D., Ransford, J.: Automatic differentiation for Fourier series and the radii polynomial approach. Phys. D 334(1), 174–186 (2016)
Moulton, F.R., Buchanan, D., Buck, T., Griffin, F.L., Longley, W.R., MacMillan, W.D.: Periodic orbits. Number Publication No. 161. Carnegie Institution of Washington (1920)
Papadakis, K.E.: Families of asymmetric periodic solutions in the restricted four-body problem. Astrophys. Space Sci. 361(12), 377, 15 (2016a)
Papadakis, K.E.: Families of three-dimensional periodic solutions in the circular restricted four-body problem. Astrophys. Space Sci. 361(4), 129, 14 (2016b)
Pedersen, P.: Librationspunkte im restringierten vierkörperproblem. Dan. Mat. Fys. Medd. 21(6), 1–80 (1944)
Pedersen, P.: Stabilitätsuntersuchungen im restringierten vierkörperproblem. Dan. Mat. Fys. Medd. 26(16), 1–38 (1952)
Rabe, E.: Determination and survey of periodic Trojan orbits in the restricted problem of three bodies. Astron. J. 66, 500–513 (1961)
Rump, S.M.: INTLAB—INTerval LABoratory. In: Csendes, T. (ed.) Developments in Reliable Computing, pp. 77–104. Kluwer Academic Publishers, Dordrecht (1999). http://www.ti3.tu-harburg.de/rump/
She, Z., Cheng, X.: The existence of a Smale horseshoe in a planar circular restricted four-body problem. Celest. Mech. Dyn. Astron. 118(2), 115–127 (2014)
She, Z., Cheng, X., Li, C.: The existence of transversal homoclinic orbits in a planar circular restricted four-body problem. Celest. Mech. Dyn. Astron. 115(3), 299–309 (2013)
Shilńikov, L.P.: Existence of a countable set of periodic motions in a four-dimensional space in an extended neighborhood of a saddle-focus. Dokl. Akad. Nauk SSSR 172, 54–57 (1967)
Shilńikov, L.P.: On the question of the structure of an extended neighborhood of a structurally stable state of equilibrium of saddle-focus type. Mat. Sb. (N.S.) 81(123), 92–103 (1970a)
Shilńikov, L.P.: A contribution to the problem of the structure of an extended neighborhood of a rough equilibrium state of saddle-focus type. Math. USSR Sb. 10(1), 91 (1970b)
Shilnikov, L.P., Shilnikov, A.L., Turaev, D.V.: Showcase of blue sky catastrophes. Int. J. Bifurc. Chaos Appl. Sci. Eng. 24(8), 1440003, 10 (2014)
Simó, C.: Relative equilibrium solutions in the four-body problem. Celest. Mech. 18(2), 165–184 (1978)
Strömgren, E.: Connaissance actuelle des orbites dans le probleme des trois corps. Bull. Astron. 9, 87–130 (1934)
Szebehely, V.: Theory of Orbits. Academic Press Inc., Cambridge (1967)
Szebehely, V., Flandern, T.V.: A family of retrograde orbits around the triangular equilibrium points. Astron. J. 72(3), 373–379 (1967)
Szebehely, V., Nacozy, P.: A class of E. Strömgren’s direct orbits in the restricted problem. Astron. J. 72(2), 184–190 (1967)
Van den Berg, J.B., Mireles James, J.D., Reinhardt, C.: Computing (un)stable manifolds with validated error bounds: non-resonant and resonant spectra. J. Nonlinear Sci. 26, 1055–1095 (2016)

Acknowledgements

The authors would like to sincerely thank two anonymous referees who carefully read the submitted version of the manuscript. Their suggestions greatly improved the final version. The second author was partially supported by NSF Grant DMS-1813501. Both authors were partially supported by NSF Grant DMS-1700154 and by the Alfred P. Sloan Foundation Grant G-2016-7320.

Author information

Authors and Affiliations

Department of Mathematics, Rutgers University, New Brunswick, USA
Shane Kepley
Department of Mathematical Sciences, Florida Atlantic University, Boca Raton, USA
J. D. Mireles James

Authors

Shane Kepley
J. D. Mireles James

Corresponding author

Correspondence to J. D. Mireles James.

Ethics declarations

Conflict of interest

The authors of this manuscript certify that they have no affiliations with or involvement in any organization or entity with any financial interest (such as honoraria, educational grants, participation in speaker’s bureaus, membership, employment, consultancies, stock ownership, or other equity interest, and expert testimony or patent-licensing arrangements), or nonfinancial interest (such as personal or professional relationships, affiliations, knowledge or beliefs) in the subject matter or materials discussed in this manuscript.

Appendices

Rotational symmetry for the equal mass case

Let $m_1 = m_2 = m_3 = 1/3$ and $\theta = \frac{2 \pi }{3}$. Define the linear map, $\varphi : \mathbb {R}^4 \rightarrow \mathbb {R}^4$, by

$$\begin{aligned} \varphi (x,\dot{x},y,\dot{y}) = \left( \begin{array}{cccc} \cos (\theta ) &{}\quad 0 &{}\quad - \sin (\theta ) &{}\quad 0 \\ 0 &{}\quad \cos (\theta ) &{}\quad 0 &{}\quad - \sin (\theta ) \\ \sin (\theta ) &{}\quad 0 &{}\quad \cos (\theta ) &{}\quad 0 \\ 0 &{}\quad \sin (\theta ) &{}\quad 0 &{}\quad \cos (\theta ) \\ \end{array} \right) \left( \begin{array}{c} x \\ \dot{x} \\ y \\ \dot{y} \end{array} \right) =(\varphi _1, \varphi _2, \varphi _3, \varphi _4)^\mathrm{T}. \end{aligned}$$

Note that $\varphi $ acts as a rotation by $\theta $ in the (x, y) and $(\dot{x},\dot{y})$ coordinate planes independently. Now, if $\mathbf {x}: \mathbb {R}\rightarrow \mathbb {R}^4$ is a trajectory for f, then $\tilde{\mathbf {x}} = \varphi \circ \mathbf {x}$ is also a trajectory for f. Moreover, if $\mathbf {x} \subset W^{s,u}(\mathcal {L}_i)$ for $i \in \{0,4,5,6\}$, then $\tilde{\mathbf {x}} \subset W^{s,u}(\mathcal {L}_{\sigma (i)})$, where $\sigma $ is the permutation given by $\sigma = (0)(4,5,6)$.

Proof

Let $\hat{x} = (x, \dot{x},y,\dot{y}) \in \mathbb {R}^4$ and suppose $\mathbf {x}$ is the trajectory through $\hat{x}$ satisfying $\mathbf {x}(0) = \hat{x}$. By definition, $\tilde{\mathbf {x}}(0) = \varphi (\mathbf {x}(0)) = \varphi (\hat{x})$, and $\tilde{\mathbf {x}}$ parameterizes a trajectory for f if and only if $\tilde{\mathbf {x}}'(t) = f(\tilde{\mathbf {x}}(t))$ for all $t \in \mathbb {R}$. Since $\varphi $ is linear, $\tilde{\mathbf {x}}'(t) = \varphi (\mathbf {x}'(t)) = \varphi (f(\mathbf {x}(t)))$, so it suffices to prove that $f \circ \varphi = \varphi \circ f$ holds on $\mathbb {R}^4$.
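
The commutation relation $f \circ \varphi = \varphi \circ f$ to which the proof reduces can be spot-checked numerically before carrying out the algebra. The sketch below is only a sanity check, not part of the proof. It assumes NumPy and uses the equal-mass primary positions and the form of the vector field recalled later in this appendix. The names `PHI`, `PRIMARIES`, `f`, and `commutator_defect` are hypothetical. The exponent `power` on $r_i$ is exposed as a parameter because the identity does not depend on it; `power=3` corresponds to the Newtonian inverse-square force.

```python
import numpy as np

THETA = 2.0 * np.pi / 3.0
C, S = np.cos(THETA), np.sin(THETA)

# phi acts on (x, xdot, y, ydot): a rotation by THETA in the (x, y)
# plane and, independently, in the (xdot, ydot) plane.
PHI = np.array([[C, 0.0, -S, 0.0],
                [0.0, C, 0.0, -S],
                [S, 0.0, C, 0.0],
                [0.0, S, 0.0, C]])

# Equal-mass primaries P_1, P_2, P_3 (rows).
PRIMARIES = np.array([[-np.sqrt(3.0) / 3.0, 0.0],
                      [np.sqrt(3.0) / 6.0, -0.5],
                      [np.sqrt(3.0) / 6.0, 0.5]])


def f(state, power=3):
    """Equal-mass rotating-frame vector field at state = (x, xdot, y, ydot)."""
    x, xd, y, yd = state
    dx = x - PRIMARIES[:, 0]
    dy = y - PRIMARIES[:, 1]
    r = np.hypot(dx, dy)
    return np.array([xd,
                     2.0 * yd + x - np.sum(dx / r**power) / 3.0,
                     yd,
                     -2.0 * xd + y - np.sum(dy / r**power) / 3.0])


def commutator_defect(state, power=3):
    """Largest component of f(phi(state)) - phi(f(state))."""
    state = np.asarray(state, dtype=float)
    return np.max(np.abs(f(PHI @ state, power) - PHI @ f(state, power)))
```

Evaluating `commutator_defect` at a handful of states away from the primaries should return values at the level of rounding error. A defect well above that level would indicate a transcription error in the primaries or in `PHI`.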

With this in mind, define the planar rotation $\eta : \mathbb {R}^2 \rightarrow \mathbb {R}^2$ by

$$\begin{aligned} \eta (x,y) = \left( \begin{array}{cc} \cos (\theta ) &{}\quad - \sin (\theta ) \\ \sin (\theta ) &{}\quad \cos (\theta ) \\ \end{array} \right) \left( \begin{array}{c} x \\ y \end{array} \right) = \left( \begin{array}{c} \eta _1(x,y) \\ \eta _2(x,y) \end{array} \right) . \end{aligned}$$

Recall that in the symmetric case the masses are equal, $m_1 = m_2 = m_3 = \frac{1}{3}$. Setting $m = \frac{1}{3}$, the primaries are located at $P_1,P_2,P_3$ given by

$$\begin{aligned} P_1 = \left( -\frac{\sqrt{3}}{3}, 0 \right) , \quad P_2 = \left( \frac{\sqrt{3}}{6}, -\frac{1}{2} \right) , \quad P_3 = \left( \frac{\sqrt{3}}{6}, \frac{1}{2} \right) , \end{aligned}$$

and note that $\left| \left| P_1 \right| \right| = \left| \left| P_2 \right| \right| = \left| \left| P_3 \right| \right| = \frac{1}{\sqrt{3}}$. Moreover, $P_1,P_2,P_3$ are vertices of an equilateral triangle and a direct computation shows that $\eta $ acts as a cyclic permutation on the primary bodies in configuration space given by the cycle $\pi = (1,2,3)$. Recalling that $r_i(x,y) = \sqrt{(x-x_i)^2 + (y-y_i)^2} = \left| \left| (x,y) - P_i \right| \right| $, it follows from this symmetry that for $i \in \{1,2,3\}$ we have

$$\begin{aligned} r_i \circ \eta (x,y) = \left| \left| \eta (x,y) - P_i \right| \right| = \left| \left| (x,y) - P_{\pi ^{-1}(i)} \right| \right| = r_{\pi ^{-1}(i)}. \end{aligned}$$

(17)
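The two facts used here, that $\eta $ permutes the primaries by the cycle $\pi = (1,2,3)$ and that the distances satisfy Eq. (17), are easy to sanity-check numerically. The following Python sketch is an illustration only and plays no role in the proof. It assumes NumPy, and the names `ETA`, `PRIMARIES`, and `distances` are hypothetical helpers introduced for this check.

```python
import numpy as np

theta = 2.0 * np.pi / 3.0
# Planar rotation eta by the angle theta.
ETA = np.array([[np.cos(theta), -np.sin(theta)],
                [np.sin(theta), np.cos(theta)]])

# Primaries for the equal mass case, stored as the rows P_1, P_2, P_3.
PRIMARIES = np.array([[-np.sqrt(3) / 3, 0.0],
                      [np.sqrt(3) / 6, -0.5],
                      [np.sqrt(3) / 6, 0.5]])


def distances(point):
    """Return (r_1, r_2, r_3) evaluated at a point of the (x, y) plane."""
    return np.linalg.norm(point - PRIMARIES, axis=1)


# eta should map P_1 -> P_2 -> P_3 -> P_1, i.e. act as the cycle pi = (1,2,3).
rotated = PRIMARIES @ ETA.T   # row i holds eta(P_{i+1})
pi_index = [1, 2, 0]          # zero-based image of each index under pi
primaries_permuted = np.allclose(rotated, PRIMARIES[pi_index])

# Check r_i(eta(x, y)) = r_{pi^{-1}(i)}(x, y) at an arbitrary sample point.
sample = np.array([0.37, -1.21])
pi_inverse_index = [2, 0, 1]  # zero-based image of each index under pi^{-1}
distances_permuted = np.allclose(distances(ETA @ sample),
                                 distances(sample)[pi_inverse_index])

print(primaries_permuted, distances_permuted)
```

Both flags should evaluate to true up to floating point tolerance. The sample point is arbitrary, and any point away from the primaries serves equally well.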

Now, we recall that in the symmetric case, the CRFBP vector field is given by

$$\begin{aligned} f(x,\dot{x},y,\dot{y}) = \left( \begin{array}{c} \dot{x} \\ 2\dot{y} + x - \frac{1}{3}\sum _{i=1}^{3} \frac{x - x_i}{r_i^3} \\ \dot{y} \\ -2\dot{x} + y - \frac{1}{3}\sum _{i=1}^{3} \frac{y - y_i}{r_i^3} \end{array} \right) , \end{aligned}$$

which we write in scalar coordinates as $f = \left( f_1,f_2,f_3,f_4\right) $. Similarly, write $\varphi = \left( \varphi _1,\varphi _2,\varphi _3,\varphi _4 \right) $ and note that $(\varphi _1(\hat{x}),\varphi _3(\hat{x})) = \eta (x,y)$. Now, we check that $f_i \circ \varphi = \varphi _i \circ f$ holds for each $i \in \{1,2,3,4\}$. For $i =1$, we have the direct computation

$$\begin{aligned} \varphi _1 \circ f \left( \hat{x}\right) = \dot{x} \cos (\theta ) - \dot{y} \sin (\theta ) = f_1 \circ \varphi \left( \hat{x}\right) . \end{aligned}$$

Now, for $i = 2$ we first compute each expression

$$\begin{aligned} \varphi _2 \circ f \left( \hat{x}\right)&= \left( 2 \dot{y} + x - \frac{1}{3}\sum _{i=1}^{3} \frac{x - x_i}{r_i(x,y)^3}\right) \cos (\theta ) - \left( -2\dot{x} + y - \frac{1}{3} \sum _{i=1}^{3} \frac{y - y_i}{r_i(x,y)^3}\right) \sin (\theta ) \\ f_2 \circ \varphi \left( \hat{x}\right)&= 2(\dot{x} \sin (\theta ) + \dot{y} \cos (\theta )) + x \cos (\theta ) - y \sin (\theta ) - \frac{1}{3} \sum _{i = 1}^{3} \frac{\eta _1(x,y) - x_i}{\left( r_i \circ \eta (x,y)\right) ^3}. \end{aligned}$$

After canceling like terms in each expression, we are left to prove the following equality

$$\begin{aligned} \sum _{i = 1}^{3} \frac{\eta _1(x,y) - x_i}{\left( r_i \circ \eta (x,y)\right) ^3} = \cos (\theta ) \sum _{i = 1}^{3} \frac{x - x_i}{r_i(x,y)^3} - \sin (\theta ) \sum _{i=1}^{3} \frac{y - y_i}{r_i(x,y)^3}. \end{aligned}$$

(18)

Applying the result from (17) to the left side, we have

$$\begin{aligned} \sum _{i = 1}^{3} \frac{\eta _1(x,y) - x_i}{r_{\pi ^{-1}(i)}(x,y)^3} = \frac{\eta _1(x,y) - x_1}{r_3(x,y)^3} + \frac{\eta _1(x,y) - x_2}{r_1(x,y)^3} + \frac{\eta _1(x,y) - x_3}{r_2(x,y)^3}, \end{aligned}$$

so that for each $i \in \{1,2,3\}$, the numerator for $r_i$ is given by $\eta _1(x,y) - x_{\pi (i)}$. Now, we compute the numerators for $r_i(x,y)$ on the right-hand side as

$$\begin{aligned} \cos (\theta )(x-x_i) - \sin (\theta )(y-y_i) = \eta _1(x,y) - \eta _1(x_i,y_i) = \eta _1(x,y) - x_{\pi (i)}. \end{aligned}$$

We conclude that the numerators for each $r_i$ are equal, and therefore, the equality in (18) holds which proves that $\varphi _2 \circ f = f_2 \circ \varphi $. The proofs for the $i = 3,4$ cases are computationally similar to the corresponding proofs for $i = 1,2$ which concludes the proof that $f \circ \varphi = \varphi \circ f$, or equivalently, $\tilde{\mathbf {x}}$ is a trajectory for f.

To prove the second claim, fix $i \in \{0,4,5,6\}$ and suppose $\mathbf {x}(t) \rightarrow L_i$ as $t \rightarrow \infty $ implying that $\mathbf {x} \subset W^s(L_i)$. Let $\tilde{\mathbf {x}} = \varphi (\mathbf {x})$, and note that $L_i$ is an equilibrium solution for f implying that $\mathbf {x}_{2,4}(t) \rightarrow 0$. Noting that $\eta $ is a unitary operator, it follows that $\tilde{\mathbf {x}}_{2,4}(t) \rightarrow 0$ as well. Moreover, $\varphi $ is a dynamical conjugacy implying that in configuration space we have

$$\begin{aligned} \lim \limits _{t \rightarrow \infty } \tilde{\mathbf {x}}_{1,3}(t) = \lim \limits _{t \rightarrow \infty } \eta \left( x(t),y(t)\right) = \eta (L_i). \end{aligned}$$

Taken together it follows that $\eta (L_i)$ is again an equilibrium solution for f. Thus, $\eta $ acts as a permutation on equilibria. A direct computation shows that $\eta (L_i) = L_{\sigma (i)}$ where $\sigma $ is the permutation given by $\sigma = (0)(4,5,6)$. The preceding argument applies equally well to the unstable manifold of each equilibrium with $t \rightarrow -\infty $ which completes the proof of the second claim. $\square $
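The commutation relation $f \circ \varphi = \varphi \circ f$ established above can also be tested numerically at sample points. The sketch below is a hedged illustration and not a substitute for the proof. It assumes NumPy; the function names `crfbp_field` and `equivariance_defect`, the sampling box, and the random seed are illustrative choices; and the gravitational terms are written in the usual Newtonian form $(x - x_i)/r_i^3$ with equal masses $1/3$.

```python
import numpy as np

theta = 2.0 * np.pi / 3.0
c, s = np.cos(theta), np.sin(theta)

# phi rotates (x, y) and (xdot, ydot) by theta; coordinates are (x, xdot, y, ydot).
PHI = np.array([[c, 0.0, -s, 0.0],
                [0.0, c, 0.0, -s],
                [s, 0.0, c, 0.0],
                [0.0, s, 0.0, c]])

# Primaries for the equal mass case, stored as the rows P_1, P_2, P_3.
PRIMARIES = np.array([[-np.sqrt(3) / 3, 0.0],
                      [np.sqrt(3) / 6, -0.5],
                      [np.sqrt(3) / 6, 0.5]])


def crfbp_field(state, mass=1.0 / 3.0):
    """Equal mass CRFBP vector field in the coordinates (x, xdot, y, ydot)."""
    x, xdot, y, ydot = state
    dx = x - PRIMARIES[:, 0]
    dy = y - PRIMARIES[:, 1]
    r_cubed = (dx**2 + dy**2) ** 1.5
    return np.array([xdot,
                     2.0 * ydot + x - mass * np.sum(dx / r_cubed),
                     ydot,
                     -2.0 * xdot + y - mass * np.sum(dy / r_cubed)])


def equivariance_defect(state):
    """Relative size of f(phi(state)) - phi(f(state))."""
    lhs = crfbp_field(PHI @ state)
    rhs = PHI @ crfbp_field(state)
    return np.linalg.norm(lhs - rhs) / max(1.0, np.linalg.norm(rhs))


rng = np.random.default_rng(0)
samples = rng.uniform(-2.0, 2.0, size=(200, 4))
worst_defect = max(equivariance_defect(state) for state in samples)

# The origin is the only point fixed by the rotation, so the equilibrium
# fixed by sigma must sit there; the field should vanish at the origin.
origin_residual = np.linalg.norm(crfbp_field(np.zeros(4)))

print(worst_defect, origin_residual)
```

Both printed quantities should be at the level of floating point roundoff. A relative defect is used so that samples falling close to a primary, where the field is large, do not distort the comparison.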

Power series manipulation, automatic differentiation, and the radial gradient

Our local invariant manifold computations are based on formal power series manipulations. The main technical challenge is to compute $f \circ P$ with P an arbitrary power series and f the vector field for the CRFBP. As usual in gravitational N-body problems, the nonlinearity contains terms raised to the minus three halves power.

Consider two formal power series $P, Q :\mathbb {C}^2 \rightarrow \mathbb {C}$ given by

$$\begin{aligned} P(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty a_{m,n} z_1^m z_2^n, \quad \quad \text{ and } \quad \quad Q(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty b_{m,n} z_1^m z_2^n, \end{aligned}$$

where $a_{m,n}, b_{m,n} \in \mathbb {C}$ for all $(m,n) \in \mathbb {N}^2$. The collection of all formal power series forms a complex vector space, so that for any $\alpha , \beta \in \mathbb {C}$ we have that

$$\begin{aligned} (\alpha P+ \beta Q)(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty \left( \alpha a_{m,n} + \beta b_{m,n}\right) z_1^m z_2^n. \end{aligned}$$

The collection becomes an algebra when endowed with the Cauchy product

$$\begin{aligned} (P \cdot Q)(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty \left( \sum _{j=0}^m \sum _{k=0}^n a_{m-j,n-k} b_{j,k} \right) \, z_1^m z_2^n. \end{aligned}$$

(19)
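In practice the series are truncated, and Eq. (19) becomes a finite double convolution of coefficient arrays. The following Python sketch shows one way to organize this, assuming NumPy and assuming that both series are stored as arrays of the same shape with entry `[m, n]` holding the coefficient of $z_1^m z_2^n$. The function name `cauchy_product` is a hypothetical helper and not code from the paper.

```python
import numpy as np


def cauchy_product(a, b):
    """Coefficients of P*Q, truncated to the common shape of a and b.

    a[m, n] and b[m, n] multiply z1**m * z2**n.
    """
    rows, cols = a.shape
    c = np.zeros((rows, cols), dtype=np.result_type(a, b))
    for j in range(rows):
        for k in range(cols):
            # a[j, k] * z1^j * z2^k shifts the series b by (j, k);
            # terms pushed past the truncation box are discarded.
            c[j:, k:] += a[j, k] * b[:rows - j, :cols - k]
    return c


# Example: square P = 1 + z1 + z2, keeping degrees 0..2 in each variable.
p = np.zeros((3, 3))
p[0, 0] = p[1, 0] = p[0, 1] = 1.0
p_squared = cauchy_product(p, p)
print(p_squared)
```

Because the coefficient of $z_1^m z_2^n$ in a product depends only on coefficients with indices at most $(m,n)$, every entry inside the truncation box is exact up to roundoff.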

We evaluate elementary functions of formal power series using a technique that many authors call automatic differentiation. Suppose, for example, we are given a formal series

$$\begin{aligned} P(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty p_{m,n} z_1^m z_2^n, \end{aligned}$$

with $p_{0,0} \ne 0$. We seek the formal series coefficients $q_{m,n}$ of the function

$$\begin{aligned} Q(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty q_{m,n} z_1^m z_2^n = P(z_1, z_2)^{\alpha }, \quad \quad \quad \alpha \in \mathbb {R}. \end{aligned}$$

Our approach follows the discussion given by Haro et al. (2016). Consider the first-order partial differential operator

$$\begin{aligned} \nabla _{\mathrm{rad}} P(z_1, z_2) = \nabla P(z_1, z_2) \left( \begin{array}{c} z_1 \\ z_2 \end{array} \right) = z_1 \frac{\partial }{\partial z_1} P(z_1, z_2) + z_2 \frac{\partial }{\partial z_2} P(z_1, z_2), \end{aligned}$$

which is referred to as the radial gradient of P. Evaluating on the level of formal power series leads to

$$\begin{aligned} \nabla _{\mathrm{rad}} P(z_1, z_2) = \sum _{m=0}^\infty \sum _{n=0}^\infty (m+n) p_{m,n} z_1^m z_2^n. \end{aligned}$$
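On the level of coefficient arrays, the radial gradient is just an entrywise multiplication by the total degree $m+n$. A minimal Python sketch follows, assuming NumPy and the storage convention that entry `[m, n]` holds the coefficient of $z_1^m z_2^n$. The name `radial_gradient` is a hypothetical helper.

```python
import numpy as np


def radial_gradient(p):
    """Coefficients of the radial gradient: p[m, n] is scaled by (m + n)."""
    rows, cols = p.shape
    m, n = np.meshgrid(np.arange(rows), np.arange(cols), indexing="ij")
    return (m + n) * p


# Example: P = z1^2 + 3*z1*z2 + z2^2 is homogeneous of degree 2.
p = np.zeros((3, 3))
p[2, 0], p[1, 1], p[0, 2] = 1.0, 3.0, 1.0
grad = radial_gradient(p)
print(np.allclose(grad, 2.0 * p))
```

The example reflects Euler's relation for homogeneous polynomials: the radial gradient of a homogeneous polynomial of degree $d$ equals $d$ times the polynomial.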

Observe that

$$\begin{aligned} \nabla _{\mathrm{rad}} Q(z_1, z_2)&= \nabla Q(z_1, z_2) \left( \begin{array}{c} z_1 \\ z_2 \end{array} \right) \\&= \nabla \left( P(z_1, z_2)^\alpha \right) \left( \begin{array}{c} z_1 \\ z_2 \end{array} \right) \\&= \alpha P(z_1, z_2)^{\alpha - 1} \nabla P(z_1, z_2) \left( \begin{array}{c} z_1 \\ z_2 \end{array} \right) . \end{aligned}$$

Multiplying both sides of the equation by P, we obtain

$$\begin{aligned} P(z_1, z_2) \nabla Q(z_1, z_2) \left( \begin{array}{c} z_1 \\ z_2 \end{array} \right) = \alpha Q(z_1, z_2) \nabla P(z_1, z_2) \left( \begin{array}{c} z_1 \\ z_2 \end{array} \right) . \end{aligned}$$

(20)

Here the fractional power is replaced by operations involving only differentiation and multiplication. This is the virtue of the radial gradient in automatic differentiation schemes. Plugging the power series expansions into Eq. (20) leads to

$$\begin{aligned}&\left( \sum _{m=0}^\infty \sum _{n=0}^\infty p_{m,n} z_1^m z_2^n \right) \left( \sum _{m=0}^\infty \sum _{n=0}^\infty (m+n) q_{m,n} z_1^m z_2^n \right) \\&\quad =\left( \sum _{m=0}^\infty \sum _{n=0}^\infty \alpha q_{m,n} z_1^m z_2^n \right) \left( \sum _{m=0}^\infty \sum _{n=0}^\infty (m+n) p_{m,n} z_1^m z_2^n \right) , \end{aligned}$$

and taking Cauchy products gives

$$\begin{aligned}&\sum _{m=0}^\infty \sum _{n=0}^\infty \sum _{j=0}^m \sum _{k=0}^n (j+k) p_{m-j, n-k} q_{j,k} z_1^m z_2^n\\&\quad = \sum _{m=0}^\infty \sum _{n=0}^\infty \sum _{j=0}^m \sum _{k=0}^n \alpha (j+k) q_{m-j, n-k} p_{j,k} z_1^m z_2^n. \end{aligned}$$

Match like powers to get

$$\begin{aligned} \sum _{j=0}^m \sum _{k=0}^n (j+k) p_{m-j, n-k} q_{j,k} = \sum _{j=0}^m \sum _{k=0}^n \alpha (j+k) q_{m-j, n-k} p_{j,k}, \end{aligned}$$

or

$$\begin{aligned}&(m+n) p_{0,0} q_{m,n} + \sum _{j=0}^m \sum _{k=0}^n \hat{\delta }_{j,k}^{m,n} (j+k) p_{m-j, n-k} q_{j,k}\\&\quad = \alpha (m+n) q_{0,0} p_{m,n} + \sum _{j=0}^m \sum _{k=0}^n \hat{\delta }_{j,k}^{m,n} \alpha (j+k) q_{m-j, n-k} p_{j,k}, \end{aligned}$$

for $m + n \ge 1$. Here

$$\begin{aligned} \hat{\delta }_{j,k}^{m,n} := {\left\{ \begin{array}{ll} 0 &{}\quad \text{ if } \,\,j = m \text{ and } k = n \\ 0 &{}\quad \text{ if } \,\,j = 0 \text{ and } k = 0 \\ 1 &{}\quad \text{ otherwise } \end{array}\right. }. \end{aligned}$$

The $\hat{\delta }$ appears to remind us that terms of order (m, n) are extracted from the sum. Isolating $q_{m,n}$ gives

$$\begin{aligned} q_{m,n} = \alpha p_{0,0}^{\alpha -1} p_{m,n} + \frac{1}{(m+n) p_{0,0}} \sum _{j=0}^m \sum _{k=0}^n \hat{\delta }_{j,k}^{m,n} (j+k) \left( \alpha q_{m-j, n-k} p_{j,k} - p_{m-j, n-k} q_{j,k} \right) , \end{aligned}$$

(21)

for $m + n \ge 1$. Note that $q_{0,0} = p_{0,0}^{\alpha }$, and since $p_{0,0} \ne 0$ by hypothesis, the coefficients $q_{m,n}$ are formally well defined to all orders. Using the recursion given in Eq. (21) we can compute the formal series coefficients for Q for the cost of a Cauchy product. This allows us to compute power series representations for the nonlinear terms in f(P) and Df(P) in the CRFBP. Another approach, which converts the CRFBP field to a higher-dimensional polynomial field, is discussed in Kepley and Mireles James (2018).
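The recursion in Eq. (21) translates directly into a loop over coefficient indices. The Python sketch below is one possible transcription under stated assumptions and is not the implementation used for the computations in the paper. It assumes NumPy, real coefficients with $p_{0,0} > 0$ so that the real power is defined, and truncated series stored as arrays with entry `[m, n]` holding the coefficient of $z_1^m z_2^n$. The names `series_power` and `cauchy_product` are hypothetical helpers.

```python
import numpy as np


def cauchy_product(a, b):
    """Truncated Cauchy product of two coefficient arrays of equal shape."""
    rows, cols = a.shape
    c = np.zeros((rows, cols))
    for j in range(rows):
        for k in range(cols):
            c[j:, k:] += a[j, k] * b[:rows - j, :cols - k]
    return c


def series_power(p, alpha):
    """Coefficients of Q = P**alpha via the radial gradient recursion (21)."""
    rows, cols = p.shape
    q = np.zeros((rows, cols))
    q[0, 0] = p[0, 0] ** alpha
    # Row-major order guarantees every q[j, k] needed below is already known.
    for m in range(rows):
        for n in range(cols):
            if m + n == 0:
                continue
            total = 0.0
            for j in range(m + 1):
                for k in range(n + 1):
                    if (j, k) == (0, 0) or (j, k) == (m, n):
                        continue  # the hat-delta factor vanishes here
                    total += (j + k) * (alpha * q[m - j, n - k] * p[j, k]
                                        - p[m - j, n - k] * q[j, k])
            q[m, n] = (alpha * p[0, 0] ** (alpha - 1.0) * p[m, n]
                       + total / ((m + n) * p[0, 0]))
    return q


# Example: Q = P**(-3/2) for P = 1 + z1 + z2, the power arising in the CRFBP.
p = np.zeros((4, 4))
p[0, 0] = p[1, 0] = p[0, 1] = 1.0
q = series_power(p, -1.5)

# Consistency check: Q^2 * P^3 should equal the constant series 1.
p_cubed = cauchy_product(cauchy_product(p, p), p)
identity_residual = cauchy_product(cauchy_product(q, q), p_cubed)
identity_residual[0, 0] -= 1.0
print(np.max(np.abs(identity_residual)))
```

The printed residual should be at the level of roundoff. The nested loops over $(j,k)$ for each $(m,n)$ have the same structure as the Cauchy product in Eq. (19), which is the sense in which the recursion costs about as much as one product.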

About this article

Cite this article

Kepley, S., Mireles James, J.D. Homoclinic dynamics in a restricted four-body problem: transverse connections for the saddle-focus equilibrium solution set. Celest Mech Dyn Astr 131, 13 (2019). https://doi.org/10.1007/s10569-019-9890-8

Received: 05 August 2018
Revised: 12 February 2019
Accepted: 12 February 2019
Published: 04 March 2019
DOI: https://doi.org/10.1007/s10569-019-9890-8
