1 Introduction

Kepler’s laws describe the way planets move in their orbits about the Sun. Geometrically, they state that the planets move in planar elliptical orbits with eccentricity \(e\in [0,1)\) and that the area swept by the line joining the planet and the Sun increases linearly with time, which leads immediately to Kepler’s equation \(E-e\sin (E)=M\), relating mean and eccentric anomalies. The mean anomaly is a fictitious angle \(M\) that increases linearly with time at a rate \(M=2\pi t/T\), where \(T\) is the orbital period, and the eccentric anomaly \(E\) gives the coordinates of the planet in its orbit plane as \((x,y)=(a\cos (E), b\sin (E))\). Here, the \(xy\)-plane has origin at the center of the ellipse with the \(x\)-axis pointing to the perihelion, and the values \(a\) and \(b\) are the semi-major and semi-minor axis of the ellipse. Therefore, finding the exact location of a planet at a given time requires solving an instance of Kepler’s equation for some \(M\), assuming that the values \(a\), \(b\), \(e\) and \(T\) are known (actually, only \(a\) and \(e\) are needed, since \(b=a\sqrt{1-e^2}\) and \(T\) can be obtained from \(a\) using the third law). For a derivation of these formulas and a detailed introduction to Kepler’s equation, see Battin (1987).

By a symmetry argument, the equation can be easily reduced to the case \(M\in [0,\pi ]\). The existence and uniqueness of solution \(E\in [0,\pi ]\) follows from the fact that the function \(f_{e,M}:[0,\pi ]\rightarrow [0,\pi ]\) given by \(f_{e,M}(E)=E-e\sin (E)-M\) is strictly increasing.

Several solutions to the problem have been proposed since it was stated 400 years ago. Some authors have tried non-iterative methods to solve the equation up to a fixed predetermined accuracy (Markley 1995; Mortari and Clochiatti 2007). However, we want to calculate the solution with arbitrary precision, hence our interest in iterative techniques.

Kepler himself proposed to use a fixed-point iteration to solve the equation (Chap. 1 of Colwell 1993), i.e. guess \(E_0\), an approximation of the exact solution \(E\), and then iterate \(E_{n+1}=M+e\sin (E_n)\). This sequence converges to \(E\), since \(|E_{n+1}-E|=|M+e\sin (E_n)-E|=e|\sin (E_n)-\sin (E)|\le e|E_n-E|\), which implies that \(|E_n-E|\le e^n|E_0-E|\longrightarrow 0\) as \(n\rightarrow \infty \). The problem with this approach is that the convergence is slow for values of \(e\) near \(1\). For the orbit of Mercury, which has \(e\approx 0.2\), about five iterations are needed to reduce the error by a factor of \(10^{-3}\), while for values of eccentricity \(e>0.5\) the fixed-point iteration is even slower than a bisection method.

Although the fixed-point iteration does not provide an efficient solution to Kepler’s equation, it exhibits the structure of most of the current methods to solve it: first, guess an approximation \(\tilde{E}\) of the solution (called starter), and then use some iterative technique to produce a sequence quickly converging to the actual solution (see Danby 1987; Danby and Burkardt 1983; Mortari and Elipe 2014; Palacios 2002). For the second part, Newton’s method seems to be the most used iteration, mainly due to its conceptual simplicity, generality and fast convergence. The guessing part, however, requires some specific understanding on the equation and has been the subject of many recent papers (Calvo et al. 2013; Mikkola 1987; Ng 1979; Nijenhuis 1991; Odell and Gooding 1986; Taff and Brennan 1989).

Starters have been compared (and optimized) using different criteria, such as the number of iterations needed to reach certain precision, the distance to the actual solution, the number of floating point operations needed for its computation, etc. For this purpose, we adopt a criterion which is very specific to Newton’s method and guarantees that the iterations reduce the error at quadratic speed. More precisely, we will only accept an approximate solution \(\tilde{E}\) of the equation \(f_{e,M}(E)=0\) if Newton’s method starting at \(E_0=\tilde{E}\) produces a sequence \(E_n\) such that \(|E_n-E|\le (\frac{1}{2})^{2^n-1}|\tilde{E}-E|\) for all \(n\ge 0\).

Taking one of these starters satisfying \(\tilde{E} \in [0,\pi ]\), the initial error is at most \(\pi \), so we obtain an accuracy \(10^{-N}\) after only \(n= \lceil \log _2 \left( 1+\log _{2} (\pi )+ \log _{2} (10) N \right) \rceil \) iterations. In particular, ten iterations of Newton’s method starting from \(\tilde{E}\) give an error less than \(10^{-307}\) for any input value of \(e\) and \(M\).

We will use a simple test, due to Smale (1986) and later improved by Wang and Han (1990), which depends only on the starter \(\tilde{E}\) and guarantees the speed of convergence that we claim.

Definition 1.1

(Smale’s \(\alpha \)-test) We say that \(\widetilde{E}\) is an approximate zero of \(f_{e,M}\) if it satisfies the following condition

$$\begin{aligned} \alpha (f_{e,M},\widetilde{E})=\beta (f_{e,M},\widetilde{E}) \cdot \gamma (f_{e,M},\widetilde{E}) <\alpha _0, \end{aligned}$$

where

$$\begin{aligned} \beta (f_{e,M},\widetilde{E})=\left| \frac{f_{e,M}(\widetilde{E})}{f_{e,M}'(\widetilde{E})} \right| ,\quad \gamma (f_{e,M},\widetilde{E})=\sup _{k \ge 2} \left| \dfrac{f_{e,M}^{(k)}(\widetilde{E})}{k!f_{e,M}'(\widetilde{E})}\right| ^{\frac{1}{k-1}} \end{aligned}$$

and \(\alpha _0=3-2\sqrt{2}\approx 0.1715728\).

Odell and Gooding (1986) compiled a list of starters that have been proposed in the literature by many authors. Table 1 provides a formula for those that will be studied in this paper.

Table 1 Classical starters

In Sect. 2 we present an analytical study of the starters \(\tilde{E}=0, \pi , M, \frac{M}{1-e}\) using the notion of approximate zero. More precisely, for each of these starters, we obtain in Theorems 2.2, 2.3, 2.4 and 2.5 regions where they satisfy Smale’s \(\alpha \)-test, thus providing approximate solutions. We also show in Theorem 2.6 that Ng’s starter \(S_{10}\) (Eq. 9 of Ng 1979), which is obtained by solving a cubic equation, gives an approximate solution on the entire domain.

Similarly, in Sect. 3 we compare the remaining starters \(S_2, \ldots ,S_{9}\), and the improved \(S_7\) starter obtained by Calvo et al. (Prop. 1, 2013). More precisely, we check numerically where those starters satisfy Smale’s \(\alpha \)-test on a very fine grid of points in \([0,1) \times [0,\pi ]\).

In Sect. 4 we prove Theorem 1.2, showing a simple starter \(\tilde{E}=\tilde{E}(e,M)\) that satisfies the \(\alpha \)-test for all \(e\in [0,1)\) and \(M\in [0,\pi ]\). The starter is a piecewise-defined function that requires a single cubic root in a small part of the region close to the corner \(e=1\), \(M=0\). Apart from that root, the rest of the expressions involved are constant or rational functions that can be computed with at most two arithmetic operations. The highlights of this starter are its computational simplicity and the fact that it is formally proven to converge at quadratic speed since the first iteration, thus providing arbitrary precision with a very few Newton’s method steps. It should be noted that reducing the initial error (i.e. the distance from the starter to the exact solution) is not our design goal.

Theorem 1.2

The starter

$$\begin{aligned} \tilde{E}(e,M)=\left\{ \begin{array}{ll} M &{} \text {if } e \le \frac{1}{2}\text { or } M \ge \frac{2\pi }{3} \\ \frac{2\pi }{3} &{} \text {if } e \ge \frac{1}{2}\text { and } \frac{\pi }{4}\le M \le \frac{2\pi }{3} \\ \frac{\pi }{2} &{} \text {if } e \ge \frac{1}{2}\text { and } \frac{\pi }{7}\le M \le \frac{\pi }{4} \\ \frac{M}{1-e} &{} \text {if } e \ge \frac{1}{2},\;M\le \frac{\pi }{7} \text { and } M <\frac{\root 4 \of {12\alpha }(1-e)^{\frac{3}{2}}}{\sqrt{e}} \\ \frac{\root 3 \of {6Me^2}}{e}-\frac{2(1-e)}{\root 3 \of {6Me^2}} &{} \text {otherwise} \end{array} \right. \end{aligned}$$

is an approximate zero of \(f_{e,M}\) for all \(e\in [0,1)\) and \(M\in [0,\pi ]\).

This way of constructing an approximate solution by a piecewise function (see Fig. 1) can be compared to Ng’s approach (see Figure 2 of Ng 1979). However, our function is computationally simpler because Ng’s formula outside the corner uses rational functions involving many terms and near the corner uses \(S_{10}\), which requires at least a cubic and a square root for its computation.

Fig. 1
figure 1

The points where \(\tilde{E}=M\), \(\tilde{E}=\frac{3\pi }{2}\) and \(\tilde{E}=\frac{\pi }{2}\) satisfy the \(\alpha \)-test for \(f_{e,M}(E)\) are shown in blue, red and green respectively. The ones of \(\tilde{E}=\frac{M}{1-e}\) and \(\tilde{E}=\frac{\root 3 \of {6Me^2}}{e}-\frac{2(1-e)}{\root 3 \of {6Me^2}}\) appear in yellow and orange

The region near the \((1,0)\) corner where a cubic root is needed can be reduced as much as desired but cannot be completely avoided, as can be seen in Theorems 1.3 and 1.4, which will be proven in Sect. 5. Other authors have found similar obstructions in handling values of the eccentricity near \(1\) (Mikkola 1987; Ng 1979; Nijenhuis 1991).

Theorem 1.3

For any \(\varepsilon >0\), there is a piecewise constant function \(\tilde{E}\) defined in \(([0,1) \times [0,\pi ]) \setminus ([1-\varepsilon ,1] \times [0,\arccos (1-\varepsilon )])\) that satisfies the \(\alpha \)-test.

Theorem 1.4

Let \(\tilde{E}\) be a piecewise rational function in \([0,1) \times [0,\pi ]\) with a finite number of branches defined by polynomial inequalities. Then there exists \((e_0,M_0)\) such that \(\tilde{E}(e_0,M_0)\) is not an approximate zero of \(f_{e_0,M_0}\).

The starter defined in Theorem 1.3 can be extended if \(\varepsilon < 1-\cos (\frac{\pi }{7})\) to the whole region by using \(\frac{M}{1-e}\) and \(\frac{\root 3 \of {6Me^2}}{e}-\frac{2(1-e)}{\root 3 \of {6Me^2}}\) in the corner, as in Theorem 1.2. This result is the basis for constructing lookup tables of starters.

Finally, Theorem 1.4 and Remark 5.1 show that the classical starters \(S_1, \ldots ,\) \(S_8\) and the improved \(S_7\) of Calvo et al. (2013) will necessarily fail near the corner \((1,0)\), as Figs. 3, 4, 5 and 6 will later illustrate. Our theorem also excludes the possibility of using truncated power series (with integer exponents) for approximate zeros near the corner.

2 Analytical study of classical starters via \(\alpha \)-theory

In this section we find regions where the starters \(\tilde{E}=0,\pi ,M,\frac{M}{1-e}\) are approximate zeros of Kepler’s equation in Theorems 2.2, 2.3, 2.4 and 2.5. We compare these with the regions computed numerically on a fine grid in Figs. 2 and 3. We also show that Ng’s starter \(S_{10}\) works in the entire region in Theorem 2.6.

Fig. 2
figure 2

The regions of Theorems 2.2 and 2.3 are shown in blue. Red color shows the points where \(\tilde{E}=0\) and \(\tilde{E}=\pi \) satisfy the \(\alpha \)-test for \(f_{e,M}(E)\) that are not in the blue region

Fig. 3
figure 3

The regions of Theorems 2.4, and 2.5 are shown in blue. Red color shows the points where \(\tilde{E}=M\) and \(\tilde{E}=\frac{M}{1-e}\) satisfy the \(\alpha \)-test for \(f_{e,M}(E)\) that are not in the blue region

Throughout the paper, we will need the following technical result.

Lemma 2.1

Let \(n\ge 2\) and \(x\ge \frac{n!}{(n+1)^{n-1}}\). Then, the sequence \(\{(\frac{x}{k!})^{\frac{1}{k-1}}\}_{k\ge n}\) is decreasing.

Proof

It is enough to show that \((\frac{x}{k!})^{\frac{1}{k-1}}\ge (\frac{x}{(k+1)!})^{\frac{1}{k}}\) for all \(k\ge n\), which is equivalent to the inequality \((\frac{x}{k!})^k\ge ( \frac{x}{(k+1)!})^{k-1}\), or more simply \(x\ge \frac{k!}{(k+1)^{k-1}}\). Note that the sequence \(\frac{k!}{(k+1)^{k-1}}\) is decreasing, since

$$\begin{aligned} \frac{(k+1)!(k+1)^{k-1}}{k!(k+2)^k}=\frac{(k+1)^k}{(k+2)^k}<1. \end{aligned}$$

In particular, \(x\ge \frac{n!}{(n+1)^{n-1}}\ge \frac{k!}{(k+1)^{k-1}}\) for all \(k\ge n\), as we needed. \(\square \)

Theorem 2.2

\(\tilde{E}=0\) is an approximate zero of \(f_{e,M}(E)\) in the region \(R_1\cup R_2\), where

$$\begin{aligned} \begin{aligned} R_1&= \left\{ 0\le M\le 4\alpha _0(1-e),\, 0\le e\le \frac{3}{11} \right\} ,\\ R_2&= \left\{ 0\le M\le \frac{\sqrt{6}\alpha _0(1-e)^{\frac{3}{2}}}{\sqrt{e}},\, \frac{3}{11}\le e<1 \right\} . \end{aligned} \end{aligned}$$

Proof

It is enough to show that \(\alpha (f_{e,M},0)<\alpha _0\), which is equivalent to

$$\begin{aligned} \frac{M}{1-e}\sup _{\small \begin{array}{c} {k\ge 3}\\ {k\;\text {odd}} \end{array}}\left( \frac{e}{k!(1-e)}\right) ^{\frac{1}{k-1}}<\alpha _0, \end{aligned}$$

since \(f(0)=-M\), \(f'(0)=1-e\), \(f^{(\text {even})}(0)=0\) and \(f^{(\text {odd})}(0)=\pm e\). When \(e\in [\frac{3}{11},1)\), we have \(\frac{e}{1-e}\ge \frac{3}{8}\), and by Lemma 2.1,

$$\begin{aligned} \sup _{\small \begin{array}{c} {k\ge 3}\\ {k\;\text {odd}} \end{array}}\left( \frac{e}{k!(1-e)}\right) ^{\frac{1}{k-1}} = \sqrt{\frac{e}{6(1-e)}}. \end{aligned}$$

In this case, Smale’s \(\alpha \)-test translates into \(\frac{M\sqrt{e}}{\sqrt{6}(1-e)^{3/2}}<\alpha _0\), which corresponds to the region \(R_2\). For the remaining case, \(e\in [0,\frac{3}{11}]\), we have that \(\frac{e}{1-e}\le \frac{3}{8}\), so

$$\begin{aligned} \left( \frac{e}{k!(1-e)}\right) ^{\frac{1}{k-1}}\le \left( \frac{1}{16}\right) ^{\frac{1}{k-1}} \le \frac{1}{4}\quad \forall \,k\ge 3. \end{aligned}$$

This means that Smale’s condition is implied by \(\frac{M}{4(1-e)}<\alpha _0\), which corresponds to the region \(R_1\). \(\square \)

Theorem 2.3

\(\tilde{E}=\pi \) is an approximate zero of \(f_{e,M}(E)\) in the region \(R_3\cup R_4\), where

$$\begin{aligned} \begin{aligned} R_3&= \left\{ \pi -4\alpha _0(1+e)<M\le \pi ,\; 0\le e\le \frac{3}{5} \right\} ,\\ R_4&= \left\{ \pi -\frac{\sqrt{6}\alpha _0(1+e)^{\frac{3}{2}}}{\sqrt{e}}<M\le \pi ,\;\frac{3}{5}\le e<1 \right\} . \end{aligned} \end{aligned}$$

Proof

Since \(f(\pi )=\pi -M\), \(f'(\pi )=1+e\), \(f^{(\text {even})}(\pi )=0\) and \(f^{(\text {odd})}(\pi )=\pm e\), Smale’s \(\alpha \)-test is equivalent to

$$\begin{aligned} \frac{\pi -M}{1+e}\sup _{\small \begin{array}{c} {k\ge 3}\\ {k\;\text {odd}} \end{array}} \left( \frac{e}{k!(1+e)}\right) ^{\frac{1}{k-1}}<\alpha _0. \end{aligned}$$

For any \(e\in [0,\frac{3}{5}]\), we have \(\frac{e}{1+e}\le \frac{3}{8}\). This gives the following estimate for the supremum:

$$\begin{aligned} \left( \frac{e}{k!(1+e)}\right) ^{\frac{1}{k-1}}\le \left( \frac{\frac{3}{8}}{k!}\right) ^{\frac{1}{k-1}}\le \left( \frac{1}{16}\right) ^{\frac{1}{k-1}}\le \frac{1}{4},\quad \forall \,k\ge 3. \end{aligned}$$

This means that Smale’s condition is implied by \(\frac{\pi -M}{4(1+e)}<\alpha _0\), which corresponds exactly to the region \(R_3\). For the other case, where \(e\in [\frac{3}{5},1)\), the supremum is \(\sqrt{\frac{e}{6(1-e)}}\) by Lemma 2.1, so the \(\alpha \)-condition is reduced to

$$\begin{aligned} \frac{(\pi -M)\sqrt{e}}{\sqrt{6}(1+e)^{\frac{3}{2}}} < \alpha _0, \end{aligned}$$

which corresponds to the region \(R_4\). \(\square \)

Theorem 2.4

\(\tilde{E}=M\) is an approximate zero of \(f_{e,M}(E)\) in the region

$$\begin{aligned} \left\{ 0\le e\le \frac{1}{2} \right\} \cup \left\{ \frac{2\pi }{3}\le M\le \pi \right\} \cup R_2, \end{aligned}$$

where \(R_2\) is defined as in Theorem 2.2.

Proof

Consider first the strip \(M\ge \frac{2\pi }{3}\).

$$\begin{aligned} \beta (f_{e,M},M)=\left| \frac{e\sin (M)}{1-e\cos (M)}\right| \le \left| \frac{\sin (M)}{1-\cos (M)}\right| =\cot \left( \frac{M}{2} \right) \le \cot \left( \frac{\pi }{3} \right) =\frac{1}{\sqrt{3}}. \end{aligned}$$

By Lemma 2.1, we have that for any even integer \(k\ge 2\),

$$\begin{aligned} \left| \frac{e\sin (M)}{k!(1-e\cos (M))}\right| ^{\frac{1}{k-1}}\le \left| \frac{\frac{1}{\sqrt{3}}}{k!}\right| ^{\frac{1}{k-1}}\le \frac{1}{2\sqrt{3}}, \end{aligned}$$

and for any odd integer \(k\ge 3\),

$$\begin{aligned} \left| \frac{e\cos (M)}{k!(1-e\cos (M))}\right| ^{\frac{1}{k-1}}\le \left| \frac{\frac{1}{2}}{k!}\right| ^{\frac{1}{k-1}}\le \frac{1}{2\sqrt{3}}. \end{aligned}$$

The last two inequalities together imply \(\gamma (f_{e,M},M)\le \frac{1}{2\sqrt{3}}\) and \(\alpha (f_{e,M},M)\le \frac{1}{6}<\alpha _0\). This proves that the starter \(\tilde{E}=M\) satisfies the \(\alpha \)-test in the strip \(M\ge \frac{2\pi }{3}\).

In the region \(\left\{ \frac{\pi }{2} \le M \le \frac{2\pi }{3}, 0 \le e \le \frac{1}{2} \right\} \), we have that \(\sin (M) \in [\frac{\sqrt{3}}{2},1]\) and \(\cos (M) \in [-\frac{1}{2},0]\), so

$$\begin{aligned} \beta (f_{e,M},M)=\left| \frac{f(M)}{f'(M)} \right| =\frac{e \sin (M)}{1-e \cos (M)} \le \frac{1}{2}. \end{aligned}$$

On the other hand, using Lemma 2.1 gives us

$$\begin{aligned} \begin{aligned} \sup _{\small \begin{array}{c} {k\ge 2}\\ {k\;\text {even}} \end{array}} \left| \frac{f^{(k)}(M)}{k! f'(M)} \right| ^{\frac{1}{k-1}}&\le \sup _{\small \begin{array}{c} {k\ge 2}\\ {k\;\text {even}} \end{array}} \left| \frac{1}{2 k! } \right| ^{\frac{1}{k-1}} =\max \left\{ \frac{1}{4}, \sup _{\small \begin{array}{c} {k\ge 4}\\ {k\;\text {even}} \end{array}} \left| \frac{1}{2 k! } \right| ^{\frac{1}{k-1}} \right\} \\&=\max \left\{ \frac{1}{4}, \frac{1}{\root 3 \of {48}} \right\} =\frac{1}{\root 3 \of {48}} \approx 0.2752, \end{aligned} \end{aligned}$$
$$\begin{aligned} \begin{aligned} \sup _{\small \begin{array}{c} {k\ge 3}\\ {k\;\text {odd}} \end{array}} \left| \frac{f^{(k)}(M)}{k! f'(M)} \right| ^{\frac{1}{k-1}}&\le \sup _{\small \begin{array}{c} {k\ge 3}\\ {k\;\text {odd}} \end{array}} \left| \frac{1}{4 k! } \right| ^{\frac{1}{k-1}} =\max \left\{ \frac{1}{\sqrt{24}}, \sup _{\small \begin{array}{c} {k\ge 4}\\ {k\;\text {even}} \end{array}} \left| \frac{1}{4 k! } \right| ^{\frac{1}{k-1}} \right\} \\&=\max \left\{ \frac{1}{\sqrt{24}}, \frac{1}{\root 4 \of {480}} \right\} =\frac{1}{\root 4 \of {480}} \approx 0.2136. \end{aligned} \end{aligned}$$

Therefore, \(\gamma (f_{e,M},M) \le \frac{1}{\root 3 \of {48}}\) and the \(\alpha \)-test holds because \(\frac{1}{2}\frac{1}{\root 3 \of {48}} < \alpha _0\).

In the region \(\left\{ 0 \le M \le \frac{\pi }{2}, 0 \le e \le \frac{1}{2} \right\} \),

$$\begin{aligned} \frac{e \sin (M)}{1-e \cos (M)} \le \frac{\frac{1}{2} \sin (M)}{1-\frac{1}{2} \cos (M)}\le \frac{1}{\sqrt{3}} \end{aligned}$$
(1)

and using Lemma 2.1 we obtain that

$$\begin{aligned} \begin{aligned} \sup _{\small \begin{array}{c} {k\ge 2}\\ {k\;\text {even}} \end{array}} \left| \frac{f^{(k)}(M)}{k! f'(M)} \right| ^{\frac{1}{k-1}}&\le \max \left\{ g_2,g_4,\sup _{\small \begin{array}{c} {k\ge 6}\\ {k\;\text {even}} \end{array}} \left| \frac{\frac{1}{2} \sin (M)}{k!(1-\frac{1}{2} \cos (M))} \right| ^{\frac{1}{k-1}} \right\} \\&\le \max \left\{ g_2,g_4,\sup _{\small \begin{array}{c} {k\ge 6}\\ {k\;\text {even}} \end{array}} \left| \frac{1}{k!} \right| ^{\frac{1}{k-1}} \right\} = \max \left\{ g_2,g_4,\root 5 \of {\frac{1}{6!}} \right\} , \end{aligned} \end{aligned}$$

where \(g_k=\left( \frac{\frac{1}{2} \sin (M)}{k!(1-\frac{1}{2} \cos (M))} \right) ^{\frac{1}{k-1}}\) for \(k=2,4\). Similarly,

$$\begin{aligned} \begin{aligned} \sup _{\small \begin{array}{c} {k\ge 3}\\ {k\;\text {odd}} \end{array}} \left| \frac{f^{(k)}(M)}{k! f'(M)} \right| ^{\frac{1}{k-1}}&\le \max \left\{ g_3,g_5,\sup _{\small \begin{array}{c} {k\ge 7}\\ {k\;\text {odd}} \end{array}} \left| \frac{\frac{1}{2} \cos (M)}{k!(1-\frac{1}{2} \cos (M))} \right| ^{\frac{1}{k-1}} \right\} \\&\le \max \left\{ g_3,g_5, \sup _{\small \begin{array}{c} {k\ge 7}\\ {k\;\text {odd}} \end{array}} \left| \frac{1}{k!} \right| ^{\frac{1}{k-1}} \right\} =\max \left\{ g_3,g_5, \root 6 \of {\frac{1}{7!}} \right\} , \end{aligned} \end{aligned}$$

where \(g_k=\left( \frac{\frac{1}{2} \cos (M)}{k!(1-\frac{1}{2} \cos (M))} \right) ^{\frac{1}{k-1}}\) for \(k=3,5\). Therefore,

$$\begin{aligned} \gamma (f_{e,M},M) \le \max \left\{ g_2,g_3,g_4,g_5,\root 5 \of {\frac{1}{6!}}, \root 6 \of {\frac{1}{7!}} \right\} =\max \left\{ g_2,g_3,g_4,g_5,\root 5 \of {\frac{1}{6!}} \right\} . \end{aligned}$$

As an immediate consequence of the second inequality in (1), we get \(g_2 <g_4\), \(\frac{\frac{1}{2} \sin (M)}{1-\frac{1}{2} \cos (M)} g_4 <\alpha _0\) and \(\frac{\frac{1}{2} \sin (M)}{1-\frac{1}{2} \cos (M)} \root 5 \of {\frac{1}{6!}} <\alpha _0\). It remains to see that \(\frac{\frac{1}{2} \sin (M)}{1-\frac{1}{2} \cos (M)} g_k \le \alpha _0\) for \(k=3,5\), which is equivalent to proving

$$\begin{aligned} \frac{\sin ^3(M) \cos (M)}{\left( 1-\frac{1}{2} \cos (M) \right) ^3} <48 \alpha _0^2 \approx 1.41, \text { and } \frac{\sin ^4(M) \cos (M)}{\left( 1-\frac{1}{2} \cos (M) \right) ^5} <3840 \alpha _0^4 \approx 3.33. \end{aligned}$$

In both cases, the left-hand side function has a maximum and the inequalities are true at it.

Finally, note that \(f_{e,M}(M)=-e\sin M\le 0\) and \(f_{e,M}\) is increasing, so \(0\le M\le E\), where \(E\) represents the exact solution of Kepler’s equation. In particular, \(M\) is always closer to \(E\) than \(0\), hence for any point in \(R_2\), the starter \(\tilde{E}=M\) gives an approximate solution. \(\square \)

Theorem 2.5

\(\tilde{E}=\frac{M}{1-e}\) is an approximate zero of \(f_{e,M}(E)\) in the region \(R_5\cup R_6\), where

$$\begin{aligned} \begin{aligned} R_5&= \left\{ 0\le M< \min \left\{ \root 4 \of {12\alpha _0}\frac{(1-e)^{\frac{3}{2}}}{e^{\frac{1}{2}}}, \root 3 \of {24\alpha _0}\frac{(1-e)^{\frac{4}{3}}}{e^{\frac{1}{3}}} \right\} ,\, 0\le e\le \frac{3}{11} \right\} ,\\ R_6&= \left\{ 0\le M< \root 4 \of {12\alpha _0} \frac{(1-e)^{\frac{3}{2}}}{e^{\frac{1}{2}}} ,\, \frac{3}{11} \le e<1 \right\} . \end{aligned} \end{aligned}$$

This region contains the region of Theorem 2.2.

Proof

In this case we have

$$\begin{aligned} |f(\tilde{E})|=e\left| \frac{M}{1-e}-\sin \left( \frac{M}{1-e}\right) \right| \le \frac{eM^3}{6(1-e)^3}, \end{aligned}$$

\(|f'(\tilde{E})|\ge 1-e\) and \(|f^{(k)}(\tilde{E})|\le e\) for all \(k\ge 2\). Besides,

$$\begin{aligned} \gamma \left( f_{e,M}, \frac{M}{1-e} \right) \le \max \left\{ \frac{e \frac{M}{1-e}}{2(1-e)},\, \sup _{k\ge 3}\left| \frac{e}{k!(1-e)}\right| ^{\frac{1}{k-1}} \right\} . \end{aligned}$$

In particular, Smale’s \(\alpha \)-test is satisfied if

$$\begin{aligned} \frac{M^4 e^2}{12(1-e)^6} < \alpha _0 \ \ \text { and } \ \ \frac{eM^3}{6(1-e)^4}\sup _{k\ge 3}\left| \frac{e}{k!(1-e)}\right| ^{\frac{1}{k-1}}<\alpha _0. \end{aligned}$$

The first condition is equivalent to \(M<\frac{\root 4 \of {12\alpha _0} (1-e)^{\frac{3}{2}}}{e^{\frac{1}{2}}}\), which is true in both \(R_5\) and \(R_6\). The second inequality needs to be discussed depending on the value of \(e\).

When \(e\in [\frac{3}{11},1)\), we have by Lemma 2.1 that

$$\begin{aligned} \sup _{k\ge 3}\left| \frac{e}{k!(1-e)}\right| ^{\frac{1}{k-1}}=\sqrt{\frac{e}{6(1-e)}}, \end{aligned}$$

so the second inequality becomes \(M<\sqrt{6} \root 3 \of {\alpha _0} \frac{(1-e)^{3/2}}{e^{1/2}}\), which is automatically true in \(R_6\) since \(\sqrt{6} \root 3 \of {\alpha _0} > \root 4 \of {12\alpha _0}\).

In the other case, i.e. when \(e\in [0,\frac{3}{11}]\), we have \(\frac{e}{1-e}\le \frac{3}{8}\). In particular, we can estimate the supremum from above as follows:

$$\begin{aligned} \sup _{k\ge 3}\left| \frac{e}{k!(1-e)}\right| ^{\frac{1}{k-1}}\le \sup _{k\ge 3}\left| \frac{3}{8k!}\right| ^{\frac{1}{k-1}}= \frac{1}{4}, \end{aligned}$$

where we have used Lemma 2.1. Therefore, in the case \(e\in [0,\frac{3}{11}]\), the \(\alpha \)-test is satisfied when

$$\begin{aligned} M<\frac{\root 4 \of {12\alpha _0} (1-e)^{\frac{3}{2}}}{e^{\frac{1}{2}}} \ \ \text { and } \ \ M < \root 3 \of {24 \alpha _0} \frac{(1-e)^{\frac{4}{3}}}{e^{\frac{1}{3}}}, \end{aligned}$$

which is the definition of the region \(R_5\).

Finally, the inclusion \(R_2 \subseteq R_6\) follows immediately from \(\sqrt{6} \alpha _0 < \root 4 \of {12 \alpha _0}\) and \(R_1 \subseteq R_5\) from the fact that \(4 \alpha _0 (1-e) < \root 4 \of {12 \alpha _0} \frac{(1-e)^{\frac{3}{2}}}{e^{\frac{1}{2}}}\) and \(4 \alpha _0 (1-e) < \root 3 \of {24 \alpha _0} \frac{(1-e)^{\frac{4}{3}}}{e^{\frac{1}{3}}}\) for all \(e \in [0,\frac{3}{11}]\). \(\square \)

Theorem 2.6

The exact solution of the cubic equation \(\tilde{E}(1-e)+e\frac{\tilde{E}^3}{6}-M=0\) is an approximate zero of \(f_{e,M}(E)\) in the entire region \([0,1)\times [0,\pi ]\).

Proof

First, note that the derivative of the left-hand side of the equation is \((1-e)+e\tilde{E}^2/2>0\), so the expression is increasing. This means that the cubic has only one real root. Moreover, the values of the cubic at \(0\) and \(\pi \) are \(-M\le 0\) and \(\pi (1-e)+e\frac{\pi ^3}{6}-M\ge \pi -M\ge 0\) respectively, so the real root \(\tilde{E}\) must be in \([0,\pi ]\). In particular, we have that \(\tilde{E}< \sqrt{42}\), so

$$\begin{aligned} |f(\tilde{E})| =|\tilde{E}-e \sin (\tilde{E})-M|=\left| \tilde{E}(1-e)+e \left( \frac{\tilde{E}^3}{3!}-\frac{\tilde{E}^5}{5!}+\cdots \right) -M \right| \le e\frac{\tilde{E}^5}{120}. \end{aligned}$$

Let us now consider two different cases depending on the value of \(\tilde{E}\).

If \(\tilde{E}\le \frac{\pi }{2}\), we have that \(f'(\tilde{E}) \ge 1-\cos (\tilde{E})=2 \sin ^2(\frac{\tilde{E}}{2})\ge \frac{4}{\pi ^2} \tilde{E}^2\) and

$$\begin{aligned} \gamma (f_{e,M},\tilde{E}) \le \sup _{k\ge 2} \left| \frac{1}{k!(1-\cos (\tilde{E})}\right| ^{\frac{1}{k-1}} \le \sup _{k\ge 2} \left| \frac{\pi ^2}{4 k! \tilde{E}^2 }\right| ^{\frac{1}{k-1}} =\frac{\pi ^2}{8 \tilde{E}^2} \end{aligned}$$

by Lemma 2.1. Therefore, the \(\alpha \)-test follows if we prove

$$\begin{aligned} \frac{e\frac{\tilde{E}^5}{120}}{ \frac{4}{\pi ^2} \tilde{E}^2} \frac{\pi ^2}{8 \tilde{E}^2} <\frac{\pi ^4 \tilde{E}}{3840} < \alpha _0 \Leftrightarrow \tilde{E} <\frac{3840\alpha _0}{\pi ^4} \approx 6.76, \end{aligned}$$

which is always true in this region.

If \(\tilde{E}> \frac{\pi }{2}\), then \( \gamma (f_{e,M},\tilde{E}) =\max \{ g_2,g_3,g_4,g_5 \}\), where

$$\begin{aligned} \begin{aligned} g_2&= \frac{1 }{2 (1-e \cos (\tilde{E}))}, \ g_3=\sqrt{\frac{|\cos (\tilde{E})|}{6(1-e\cos (\tilde{E}))}}, \\ g_4&=\sup _{\small \begin{array}{c} {k\ge 4}\\ {k\;\text {even}} \end{array}} \left| \frac{1}{k!(1-e \cos (\tilde{E}))}\right| ^{\frac{1}{k-1}}= \root 3 \of {\frac{1}{24(1-e \cos (\tilde{E}))}},\\ g_5&=\sup _{\small \begin{array}{c} {k\ge 5}\\ {k\;\text {odd}} \end{array}} \left| \frac{1}{k!(1-e \cos (\tilde{E}))}\right| ^{\frac{1}{k-1}} = \root 4 \of {\frac{1}{120(1-e \cos (\tilde{E}))}} \le g_4. \end{aligned} \end{aligned}$$

Therefore, the \(\alpha \)-test is satisfied if \(\frac{e\tilde{E}^5}{120 (1-e \cos (\tilde{E}))} g_i < \alpha _0\) for \(i=2,3,4\).

Since \(g_2\), \(g_3\) and \(g_4\), are increasing in \(M\), it is enough to prove the inequalities when \(M=\pi \). Moreover, \(\tilde{E}(e,\pi )\) is decreasing, so \(\tilde{E}(e,\pi ) \in [\root 3 \of {6 \pi },\pi ]\) and \(1-e\cos (\tilde{E}(e,\pi )) \ge 1-e\cos (\root 3 \of {6\pi })\).

We also have that \(\pi =e \frac{\tilde{E}^3(e,\pi )}{6} +(1-e)\tilde{E}(e,\pi ) \ge e \frac{\tilde{E}^3(e,\pi )}{6} +(1-e)\root 3 \of {6\pi }\), hence

$$\begin{aligned} \tilde{E}(e,\pi ) \le \root 3 \of {\frac{6 \left( \pi -(1-e)\root 3 \of {6\pi } \right) }{e}}. \end{aligned}$$
(2)

Let us now study the three different cases.

When \(i=2\), it is enough to prove that

$$\begin{aligned} \frac{e\tilde{E}^5}{120 (1-e \cos (\tilde{E}))} g_2 <\frac{e\tilde{E}(e,\pi )^5}{240 (1-e \cos (\root 3 \of {6\pi }))^2} < \alpha _0, \end{aligned}$$

which is true using that \(\tilde{E} \le \pi \) in \(e \in [0,0.17]\), \(\tilde{E}(e,\pi ) \le 2.92\) in \(e \in [0.17,0.3]\), \(\tilde{E}(e,\pi ) \le 2.84\) in \(e \in [0.3,0.4]\) and Eq. (2) in \(e\in [0.4,1]\).

When \(i=3\), it suffices to show that

$$\begin{aligned} \frac{e\tilde{E}^5}{120 (1-e \cos (\tilde{E}))} g_3 <\frac{e\tilde{E}^5(e,\pi ) \sqrt{ |cos(\tilde{E}(e,\pi ))| }}{120\sqrt{6} (1-e \cos (\root 3 \of {6\pi }))^{\frac{3}{2}} } < \alpha _0, \end{aligned}$$

which is true using that

  • \(\tilde{E} \le \pi \) and \(\sqrt{|cos(\tilde{E}(e,\pi )|} \le 1\) in \(e \in [0,0.2]\),

  • Eq. (2) and \(\sqrt{|cos(\tilde{E}(e,\pi )|} \le 1\) in \(e \in [0.2,0.7]\),

  • Eq. (2) and \(\sqrt{|cos(\tilde{E}(e,\pi )|} <0.91 \) in \(e\in [0.7,1]\).

Lastly, the case \(i=4\) follows by using \(\tilde{E} \le \pi \) in \(e \in [0,0.2]\) and Eq. (2) in \(e\in [0.2,1]\). \(\square \)

3 Numerical comparison of classical starters via \(\alpha \)-theory

We tested numerically the \(\alpha \)-condition on a fine grid (dividing each axis in \(1000\) points) for the starters \(S_2,\ldots ,S_9\), defined in Odell and Gooding (1986), and the improved \(S_7\) starter obtained in (Prop. 1 of Calvo et al. 2013), which we denote \(S_{CEMR}\). Note in Figs. 4, 5 and 6 that none of the starters produce approximate zeros near the corner \((1,0)\).

Fig. 4
figure 4

The regions of \(S_2\), \(S_3\) and \(S_4\)

Fig. 5
figure 5

The regions of \(S_5\), \(S_6\) and \(S_7\)

Fig. 6
figure 6

The regions of \(S_8\), \(S_9\) and \(S_{CEMR}\)

4 A simple new starter that covers the entire region

We devote this section to proving Theorem 1.2. We study each branch separately.

Theorem 4.1

\(\tilde{E}=\frac{2\pi }{3}\) is an approximate zero of \(f_{e,M}(E)\) in the region

$$\begin{aligned} \left\{ \frac{\pi }{4}\le M\le \frac{2\pi }{3} ,\, \frac{1}{2} \le e<1 \right\} . \end{aligned}$$

Proof

First of all, we have that

$$\begin{aligned} \beta \left( f_{e,M},\frac{2\pi }{3} \right) \le \frac{\frac{2\pi }{3}-\frac{\sqrt{3}}{2} e-\frac{\pi }{4}}{1+\frac{e}{2}}=\frac{\frac{5\pi }{12}-\frac{\sqrt{3}}{2} e}{1+\frac{e}{2}}. \end{aligned}$$

On the other hand,

$$\begin{aligned} \begin{aligned} \gamma \left( f_{e,M},\frac{2\pi }{3} \right)&=\max \left\{ \sup _{\small \begin{array}{c} {k \ge 2}\\ {k\;\text {even}} \end{array}} \left| \frac{e \frac{\sqrt{3}}{2}}{k! (1+\frac{e}{2})} \right| ^{\frac{1}{k-1}}, \sup _{\small \begin{array}{c} {k \ge 3}\\ {k\;\text {odd}} \end{array}} \left| \frac{ \frac{e}{2}}{k! (1+\frac{e}{2})} \right| ^{\frac{1}{k-1}} \right\} . \end{aligned} \end{aligned}$$

Since \(\frac{ \frac{e}{2}}{1+\frac{e}{2}} \in [\frac{1}{3},\frac{1}{5}]\), we can apply Lemma 2.1 for \(n=4\) and \(n=5\):

$$\begin{aligned} \begin{aligned} \sup _{\small \begin{array}{c} {k\ge 2}\\ {k\;\text {even}} \end{array}} \left| \frac{e \frac{\sqrt{3}}{2}}{k! (1+\frac{e}{2})} \right| ^{\frac{1}{k-1}}&= \max \left\{ \frac{e \frac{\sqrt{3}}{2}}{2! (1+\frac{e}{2})} , \left( \frac{e \frac{\sqrt{3}}{2}}{4! (1+\frac{e}{2})} \right) ^{\frac{1}{3}}\right\} ,\\ \sup _{\small \begin{array}{c} {k\ge 3}\\ {k\;\text {odd}} \end{array}} \left| \frac{\frac{e}{2}}{k! (1+\frac{e}{2})} \right| ^{\frac{1}{k-1}}&= \max \left\{ \left( \frac{\frac{e}{2}}{3! (1+\frac{e}{2})} \right) ^\frac{1}{2}, \left( \frac{\frac{e}{2}}{5! (1+\frac{e}{2})} \right) ^{\frac{1}{4}}\right\} . \end{aligned} \end{aligned}$$

Comparing the four functions, we obtain

$$\begin{aligned} \gamma \left( f_{e,M},\frac{2\pi }{3} \right) = \left( \frac{e \frac{\sqrt{3}}{2}}{4! (1+\frac{e}{2})} \right) ^{\frac{1}{3}}. \end{aligned}$$

Therefore, the \(\alpha \)-test is satisfied if

$$\begin{aligned} \frac{\frac{5\pi }{12}-\frac{\sqrt{3}}{2} e}{1+\frac{e}{2}} \left( \frac{e \frac{\sqrt{3}}{2}}{4! (1+\frac{e}{2})} \right) ^{\frac{1}{3}} < \alpha _0. \end{aligned}$$

Taking derivatives, it can be shown that the left-hand side of the inequality is a decreasing function of \(e\). Also, its value at \(e=\frac{1}{2}\) is approximately \(0.1706\), which is less than \(\alpha _0\). \(\square \)

Theorem 4.2

\(\tilde{E}=\frac{\pi }{2}\) is an approximate zero of \(f_{e,M}(E)\) in the region

$$\begin{aligned} \left\{ \frac{\pi }{7}\le M\le \frac{\pi }{4} ,\, \frac{1}{2}\le e<1 \right\} . \end{aligned}$$

Proof

We have that \(f_{e,M}(\frac{\pi }{2} ) =\frac{\pi }{2}-e-M \le \frac{\pi }{2}-e-\frac{\pi }{7}=\frac{5\pi }{14}-e\) and \(f_{e,M}'(\frac{\pi }{2} )=1\). Moreover, \(f^{(\text {odd})}(\frac{\pi }{2})=0\), hence

$$\begin{aligned} \gamma \left( f_{e,M},\frac{\pi }{2}\right) =\sup _{\small \begin{array}{c} {k \ge 2}\\ {k\;\text {even}} \end{array}} \left| \frac{e }{k!} \right| ^{\frac{1}{k-1}} =\max \left\{ \frac{e}{2}, \sup _{\small \begin{array}{c} {k \ge 4}\\ {k\;\text {even}} \end{array}} \left| \frac{e }{k!} \right| ^{\frac{1}{k-1}} \right\} =\max \left\{ \frac{e}{2}, \root 3 \of {\frac{e }{24}} \right\} \end{aligned}$$

by Lemma 2.1. The \(\alpha \)-test is satisfied because

$$\begin{aligned} \begin{aligned} \left( \frac{5\pi }{14}-e\right) \frac{e}{2}&\le \left( \frac{5\pi }{14}-\frac{5\pi }{28}\right) \frac{\frac{5\pi }{28}}{2} \approx 0.1573 <\alpha _0,\\ \left( \frac{5\pi }{14}-e\right) \root 3 \of {\frac{e}{24}}&\le \left( \frac{5\pi }{14}-\frac{1}{2}\right) \root 3 \of {\frac{\frac{1}{2}}{24}} \approx 0.1711 <\alpha _0, \end{aligned} \end{aligned}$$

which ends the proof. \(\square \)

Theorem 4.3

\(\tilde{E}=\frac{\root 3 \of {6Me^2}}{e}-\frac{2(1-e)}{\root 3 \of {6Me^2}} \) is an approximate zero of \(f_{e,M}(E)\) in the region \(R_7\), where

$$\begin{aligned} \begin{aligned} R_7&= \left\{ \frac{8(1-e)^{3/2}}{27\sqrt{6} \alpha _0 e^{1/2}} < M \le \frac{\pi }{7}, \frac{3}{11} \le e <1 \right\} . \end{aligned} \end{aligned}$$

Proof

The first condition we have to impose is that \(\tilde{E}\ge 0\), which is equivalent to \(M \ge \frac{\sqrt{2}(1-e)^{\frac{3}{2}}}{3 \alpha _0 e^{\frac{1}{2}}}\) and true in \(R_7\). We also show that \(\tilde{E} \le \frac{\pi }{2}\) in \([0,\frac{\pi }{7}] \times [0,1)\), which includes \(R_7\).

Indeed, \(\tilde{E} \le \frac{\pi }{2}\) is equivalent to

$$\begin{aligned} h(e,M)= \frac{\root 3 \of {36} e^{\frac{1}{3}} M^{\frac{2}{3}}}{2(1-e)} -\frac{\pi \root 3 \of {6} e^{\frac{2}{3}} M^{\frac{1}{3}}}{4(1-e)} \le 1. \end{aligned}$$
(3)

For a fixed \(e\), the function \(h\) has a minimum at \(M=\frac{\pi ^3}{384}e\) and no other critical points. Therefore, the inequality (3) holds if and only if \(h(e,0) \le 1\) and \(h \left( e,\frac{\pi }{7} \right) \le 1\). The first one is trivial since \(h(e,0)=0\) and the second one is equivalent to

$$\begin{aligned} 2 \root 3 \of {36} \left( \frac{\pi }{7}\right) ^{\frac{2}{3}} e^{\frac{1}{3}} -\pi \root 3 \of {6} \left( \frac{\pi }{7}\right) ^{\frac{1}{3}} e^{\frac{2}{3}} -4(1-e) <0. \end{aligned}$$

The substitution \(e=x^3\) transforms the inequality above into

$$\begin{aligned} 2 \root 3 \of {36} \left( \frac{\pi }{7}\right) ^{\frac{2}{3}} x -\pi \root 3 \of {6} \left( \frac{\pi }{7}\right) ^{\frac{1}{3}} x^2 -4(1-x^3) <0, \end{aligned}$$

which is verified for all \(x \in [0,1]\) since the expression in \(x\) is increasing and the inequality is true at \(x=1\).

Substituting the expression for \(\tilde{E}\) and using the Taylor expansion of \(\sin ({\tilde{E}})\), we obtain

$$\begin{aligned} \begin{aligned} |f(\tilde{E})|&=|\tilde{E}-e \sin (\tilde{E})-M|=\left| \tilde{E}(1-e)+e \left( \frac{\tilde{E}^3}{3!}-\frac{\tilde{E}^5}{5!}+\cdots \right) -M \right| \\&\le \left| \tilde{E}(1-e)+e\frac{\tilde{E}^3}{6}-M \right| +\left| \frac{\tilde{E}^5}{120} \right| =\frac{2(1-e)^3}{9eM}+\left| \frac{\tilde{E}^5}{120} \right| , \end{aligned} \end{aligned}$$

where we have bounded the alternating series using Leibniz’s criterion (possible because \(\tilde{E} < \sqrt{42}\)).

Since \(\tilde{E}\le \frac{\pi }{2}\), we have both \(f'(\tilde{E}) \ge 1-e\) and \(f'(\tilde{E}) \ge 1-\cos (\tilde{E})=2 \sin ^2(\frac{\tilde{E}}{2})\ge \frac{4}{\pi ^2} \tilde{E}^2\). Therefore, the \(\alpha \)-test follows if we prove the stronger conditions

$$\begin{aligned} \frac{2(1-e)^2}{9eM} \gamma (f_{e,M},\tilde{E}) < \frac{3\alpha _0}{4} \, \text { and } \, \left| \frac{\tilde{E}^3\pi ^2}{480} \right| \gamma (f_{e,M},\tilde{E}) < \frac{\alpha _0}{4}. \end{aligned}$$
(4)

The second one holds because

$$\begin{aligned} \gamma (f_{e,M},\tilde{E}) \le \sup _{k\ge 2} \left| \frac{1}{k!(1-\cos (\tilde{E})}\right| ^{\frac{1}{k-1}} \le \sup _{k\ge 2} \left| \frac{\pi ^2}{4 k! \tilde{E}^2 }\right| ^{\frac{1}{k-1}} =\frac{\pi ^2}{8 \tilde{E}^2}, \end{aligned}$$

by Lemma 2.1, and

$$\begin{aligned} \left| \frac{\tilde{E}^3\pi ^2}{480} \right| \frac{\pi ^2}{8 \tilde{E}^2} =\frac{\pi ^4 \tilde{E}}{3840} < \frac{\alpha _0}{4} \Leftrightarrow \tilde{E} <\frac{960\alpha _0}{\pi ^4} \approx 1.69, \end{aligned}$$

which is true since \(\tilde{E} \le \frac{\pi }{2}\) in \(R_7\).

For the first inequality in (4), we need

$$\begin{aligned} \begin{aligned} \gamma (f_{e,M},\tilde{E})&\le \max \left\{ \frac{e \sin (\tilde{E}) }{2! (1-e)}, \sup _{k\ge 3} \left| \frac{e}{k!(1-e)}\right| ^{\frac{1}{k-1}} \right\} \\&\le \max \left\{ \frac{e \tilde{E} }{2! (1-e)}, \left| \frac{e}{3!(1-e)}\right| ^{\frac{1}{2}} \right\} , \end{aligned} \end{aligned}$$

true by Lemma 2.1 when \(e \ge \frac{3}{11}\). Therefore,

$$\begin{aligned} \frac{2(1-e)^2}{9eM} \left| \frac{e}{3!(1-e)}\right| ^{\frac{1}{2}} <\frac{3\alpha _0}{4} \Leftrightarrow M > \frac{8(1-e)^{\frac{3}{2}}}{27\sqrt{6} \alpha e^{\frac{1}{2}}}, \end{aligned}$$

which is one of the conditions of the region \(R_7\).

It only remains to show that

$$\begin{aligned} \frac{2(1-e)^2}{9eM} \frac{e \tilde{E} }{2! (1-e)}=\frac{\tilde{E}(1-e)}{9M} <\frac{3\alpha _0}{4}, \end{aligned}$$

which is equivalent to

This is true for every \(e \in [0,1)\) and \(M \in [0,\pi ]\) because, if we fix \(e\), the function \(g\) has a minimum at \(M=\sqrt{\frac{48}{27^3 \alpha _0^3}}\) and

$$\begin{aligned} g \left( e,\sqrt{\frac{48}{27^3 \alpha _0^3}} \right) =-\frac{24}{27^2 \alpha _0^2} > -\frac{8}{27 \alpha _0}. \end{aligned}$$

\(\square \)

Proof of Theorem 1.2

It follows immediately from Theorems 2.4, 2.5, 4.1, 4.2 and 4.3, and the inequality \(\root 4 \of {12 \alpha _0} > \frac{8}{27 \sqrt{6} \alpha _0}\) that implies that the “otherwise” region is included in the one from Theorem 4.3. \(\square \)

5 Approximate solutions near \(e=1\) and \(M=0\)

In this section we will prove Theorems 1.3 and 1.4.

Proof of Theorem 1.3

Given \(\varepsilon >0\), let us take a natural number \(N\) such that \(N> \frac{\pi +2}{2 \alpha _0 \varepsilon ^2}\). Given two integers \(i \in \{ 0,\ldots ,N-1 \}\) and \(j \in \{ 0,\ldots , N \}\), we define the constants \(E_{ij}^{\text {low}}=\frac{\pi j}{N}\) and \(E_{ij}^{\text {up}}=\pi \), which satisfy

$$\begin{aligned} E_{ij}^{\text {low}}-\frac{i}{N} \sin (E_{ij}^{\text {low}})-\frac{\pi j}{N}&=-\frac{i}{N} \sin \left( \frac{\pi j}{N} \right) \le 0,\\ E_{ij}^{\text {up}}-\frac{i}{N} \sin (E_{ij}^{\text {up}})-\frac{\pi j}{N}&=\pi -\frac{\pi j}{N} \ge 0, \end{aligned}$$

respectively. By the bisection method, we can thus find \(E_{ij}\) such that

$$\begin{aligned} \frac{\pi j}{N}=E_{ij}^{\text {low}} \le E_{ij} \le E_{ij}^{\text {up}}=\pi \; \text { and } \; \left| E_{ij}-\frac{i}{N} \sin (E_{ij})-\frac{\pi j}{N} \right| <\frac{1}{N}. \end{aligned}$$

Given \((e,M) \in ([0,1) \times [0,\pi ]) \setminus ([1-\varepsilon ,1] \times [0,\arccos (1-\varepsilon )])\), we now define \(\tilde{E}(e,M)=E_{ij}\), where \(i=\lfloor Ne\rfloor \in \{ 0,\ldots ,N-1 \}\) and \(j=\left\lceil \frac{M N}{\pi } \right\rceil \in \{ 0,\ldots , N \}\). Therefore, \(\tilde{E}\) is a piecewise constant function and it only remains to show that it satisfies the \(\alpha \)-test.

Indeed, we have that

$$\begin{aligned} |f(\tilde{E})|&=|E_{ij} -e \sin (E_{ij})-M|\\&=\left| \left( E_{ij}-\frac{i}{N} \sin (E_{ij})-\frac{\pi j}{N} \right) -\left( e-\frac{i}{N} \right) \sin (E_{ij})-\left( M-\frac{\pi j}{N} \right) \right| \\&< \frac{1}{N}+\left| e-\frac{i}{N} \right| +\left| M-\frac{\pi j }{N} \right| \le \frac{\pi +2}{N}. \end{aligned}$$

On the other hand, \(|f'(\tilde{E})|=1-e\cos (\tilde{E}) \ge \varepsilon \) because

$$\begin{aligned} |f'(\tilde{E})| \ge \left\{ \begin{array}{ll} 1-e \ge \varepsilon &{} \text { if } e \in [0,1-\varepsilon ],\\ 1-\cos (E_{ij}) \ge 1-\cos (M) \ge \varepsilon &{} \text { if } \tilde{E} \in [0,\frac{\pi }{2}], M \ge \arccos (1-\varepsilon ),\\ 1 \ge \varepsilon &{} \text { if } \tilde{E} \in [\frac{\pi }{2},\pi ], \end{array} \right. \end{aligned}$$

where we have used that \(E_{ij} \ge E_{ij}^{\text {low}}=\frac{\pi j}{N} = \frac{\pi \lceil \frac{MN}{\pi }\rceil }{N}\ge M\).

Since \(|f^{(k)}(\tilde{E})| \le 1\), we obtain using Lemma 2.1 and the hypothesis over \(N\) that

$$\begin{aligned} \alpha (f_{e,M},\tilde{E}) \le \frac{\pi +2}{N \varepsilon } \sup _{k\ge 2} \left| \frac{1}{k! \varepsilon } \right| ^{\frac{1}{k-1}} \le \frac{\pi +2}{2 N \varepsilon ^2} < \alpha _0, \end{aligned}$$

which ends the proof. \(\square \)

Proof of Theorem 1.4

We proceed by contradiction, i.e. we assume that \(\tilde{E}(e,M)\) is an approximate zero of \(f_{e,M}\) for all \(e\in [0,1)\) and \(M\in [0,\pi ]\). Since the branches of \(\tilde{E}\) are given by polynomial inequalities, there is an open set \(U\subseteq \mathbb {R}^2\) and \(\varepsilon >0\) such that \(\overline{U}\supset \{1\}\times [0,\varepsilon ]\) and \(\tilde{E}\) is a rational function on \(U\cap ([0,1)\times [0,\pi ])\). We also assume that \(U\subseteq [\frac{1}{2},1)\times [0,0.0001]\).

By definition of approximate zero, we have that

It can be readily verified that \(B\ge 0.14433\) for all \(e\in [\frac{1}{2},1)\) and any \(\tilde{E}\in \mathbb {R}\), so \(|f(\tilde{E})|<\frac{\alpha _0}{0.14433}\le 1.1888\) in \(U\). By the triangle inequality, this implies that \(|\tilde{E}|< 1.1888+e+M<2.1889\) in \(U\). Repeating the argument, but using that \(|\tilde{E}|<2.1889\), it can be shown that \(B\ge 0.176\), so \(|\tilde{E}|<\frac{\alpha _0}{0.176}+e+M\le 1.975\) in \(U\). Doing this one more time, gives \(B\ge 0.2368\) and the estimate \(|\tilde{E}|<1.725\) in \(U\).

Since \(\tilde{E}\) is bounded in \(U\), it can be extended analytically to \(\{1\}\times (0,\delta )\) for some \(0<\delta <\varepsilon \le 0.0001\). To show this, recall that \(\tilde{E}(e,M)=\frac{p(e,M)}{q(e,M)}\) for some polynomials \(p\) and \(q\) with no common factors. Now, if \(q(1,M)\) were zero (as a polynomial), then \(q\) would be divisible by \(e-1\) and \(p\) would not, so \(\tilde{E}\) would not be bounded, in contradiction with our previous result. This proves that \(q(1,M)\not \equiv 0\), so we can take \(\delta >0\) small enough to ensure that \(q(1,M)\) has no roots in \((0,\delta )\), hence \(\tilde{E}(1,M)\) is well defined.

Denote \(\tilde{E}_1(M)=\tilde{E}(1,M)\) for \(M\in (0,\delta )\). Using that \(B\ge \frac{e |\sin (\tilde{E})|}{2(1-e\cos \tilde{E})^2}\), we get

$$\begin{aligned} |\tilde{E}-e\sin \tilde{E}-M|\le \frac{2\alpha _0(1-e\cos \tilde{E})^2}{e |\sin (\tilde{E})|}. \end{aligned}$$

Taking limit as \(e\rightarrow 1^-\), we obtain

$$\begin{aligned} |\tilde{E}_1-\sin \tilde{E}_1-M|\le \frac{2\alpha _0(1-\cos \tilde{E}_1)^2}{|\sin (\tilde{E}_1)|}= \frac{4\alpha _0|\sin (\frac{\tilde{E}_1}{2})|^3}{|\cos (\frac{\tilde{E}_1}{2})|}\le \frac{\alpha _0|\tilde{E}_1|^3}{2|\cos (\frac{\tilde{E}_1}{2})|}< 0.133 |\tilde{E}_1|^3 \end{aligned}$$

for all \(M\in (0,\delta )\). By the power series expansion of \(\sin (\tilde{E}_1)\),

$$\begin{aligned} \left| \frac{\tilde{E}_1^3}{3!}-\frac{\tilde{E}_1^5}{5!}+\cdots -M\right| < 0.133 |\tilde{E}_1^3|. \end{aligned}$$

By the triangle inequality,

$$\begin{aligned} \begin{aligned} \left| \frac{\tilde{E}_1^3}{6}-M\right|&\le 0.133 |\tilde{E}_1^3| + \left| \frac{\tilde{E}_1^5}{5!}-\frac{\tilde{E}_1^7}{7!}+\ldots \right| \\&\le |\tilde{E}_1^3|\left( 0.133+\frac{\tilde{E}_1^2}{120} \left( 1+\frac{\tilde{E}_1^2}{6 \cdot 7}+\frac{\tilde{E}_1^4}{6 \cdot 7 \cdot 8 \cdot 9}+\ldots \right) \right) \\&\le |\tilde{E}_1^3|\left( 0.133+\frac{1.725^2}{120} \left( 1+\frac{1.725^2}{6^2}+\frac{1.725^4}{6^4}+\ldots \right) \right) \\&\le 0.161|\tilde{E}_1^3|, \end{aligned} \end{aligned}$$

for all \(M\in (0,\delta )\). This implies that \((\frac{1}{6}-0.161)|\tilde{E}_1^3|\le M\), or equivalently,

$$\begin{aligned} |\tilde{E}_1|\le \root 3 \of {\frac{|M|}{\frac{1}{6}-0.161}}\xrightarrow [M\rightarrow 0^+]{}0. \end{aligned}$$

This shows that \(\tilde{E}_1\) has a removable singularity at \(M=0\), so it can be extended analytically to \([0,\delta )\) with \(\tilde{E}_1(0)=0\). Moreover, \(\tilde{E}_1(M)=Mr(M)\) for some analytic function \(r(M)\) in \([0,\delta )\), since the power series of \(\tilde{E}_1\) cannot have a non-zero constant term.

Finally, by definition of approximate zero,

$$\begin{aligned} \begin{aligned} \alpha _0&> \frac{|f(\tilde{E})|}{1-e\cos (\tilde{E})}\max \left\{ \frac{e |\sin (\tilde{E})|}{2(1-e\cos \tilde{E})}, \sqrt{\frac{e|\cos (\tilde{E})|}{6(1-e\cos \tilde{E})}} \right\} \\&\ge \frac{|f(\tilde{E})|}{1-e\cos (\tilde{E})}\max \left\{ \frac{e|\sin (\tilde{E})|}{\sqrt{6(1-e\cos (\tilde{E}))}}, \frac{e|\cos (\tilde{E})|}{\sqrt{6(1-e\cos (\tilde{E}))}} \right\} \\&= \frac{e|f(\tilde{E})|}{\sqrt{6}(1-e\cos (\tilde{E}))^{\frac{3}{2}}}\max \{ |\sin \tilde{E}|, |\cos \tilde{E}|\}\ge \frac{e|f(\tilde{E})|}{\sqrt{12}(1-e\cos (\tilde{E}))^{\frac{3}{2}}}, \end{aligned} \end{aligned}$$

and taking limit as \(e\rightarrow 1^-\),

$$\begin{aligned} \begin{aligned} |\tilde{E}_1-\sin \tilde{E}_1-M|&\le \sqrt{12}\alpha _0(1-\cos (\tilde{E}_1))^{\frac{3}{2}}\\&=\sqrt{96}\alpha _0\left| \sin ^3 \left( \frac{\tilde{E}_1}{2} \right) \right| \le \frac{\sqrt{96}\alpha _0|\tilde{E}_1^3|}{8}=\sqrt{\frac{3}{2}}\alpha _0|\tilde{E}_1|^3. \end{aligned} \end{aligned}$$

Dividing by \(M\), using that \(\tilde{E}_1(M)=Mr(M)\) and taking limits as \(M\rightarrow 0^+\),

$$\begin{aligned} \left| r(M)-\frac{\sin (M r(M))}{M}-1\right| \le \sqrt{\dfrac{3}{2}}\alpha _0 M^2 |r(M)|^3, \end{aligned}$$

which gives us the contradiction \(1 \le 0\). \(\square \)

Remark 5.1

Note that in the proof of Theorem 1.4 we use the rationality of the function only to show that it can be analytically extended to a small segment \(\{1\}\times [0,\varepsilon ]\) for some \(\varepsilon >0\). If we start with an analytic function defined on \([0,1] \times [0,\pi ]\), this step is not necessary and the same contradiction is obtained.

This shows that the classical starters \(S_1,\ldots ,S_8\), as well as \(S_{CEMR}\), are not approximate zeros in the entire domain, as Figs. 3, 4, 5 and 6 illustrate. The same argument can be used to show that no starter which is a linear combination of the former ones (for instance, \(M+ke\) with \(0\le k \le 1\)) gives an approximate zero in the entire region.

6 Conclusions

This paper proposes to quantify the efficiency of a starter for Kepler’s equation \(E-e\sin (E)=M\) by using Smale’s \(\alpha \)-theory. We certify analytically regions where certain classical starters give quadratic convergence under Newton’s method. We also study numerically how efficient the classical starters are.

Our main contribution is the construction of a simple starter which converges quadratically in the entire domain. This starter is given by a piecewise-defined function that uses constant and rational functions everywhere except for a small part of the domain near the corner \(e=1\), \(M=0\), where a single cubic root is needed.

In the final section, we provide a technical analysis of the difficulties of solving Kepler’s equation. More precisely, we show that it is possible to find approximate solutions given by piecewise-constant functions everywhere except for an arbitrarily small region near the corner \((1,0)\). Additionally, we prove that no starter satisfying our efficiency criterion in the entire region can be obtained if only rational (or analytic) functions are used, thus showing that the cubic root in our proposed starter is unavoidable.