Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

We are very glad to dedicate this paper to Professor S. Smale. The works of Smale in the theory of dynamical systems played a great role in the development of this important field and led to the appearance of new concepts and methods. We wish Professor Smale a very good health and many new important results.

The usual bifurcation theory deals with one-parameter families of smooth maps or vector fields. In this situation fixed points or periodic orbits become functions of this parameter. Bifurcations appear when their linearized spectrum changes its structure. The main role in the theory is played by the so-called versal deformations, i.e. by special families such that arbitrary families can be represented as some projections of versal deformations (see, for example [1]). In this approach the positions of bifurcating orbits and their dependence on the parameter are known.

In this paper we consider a dynamical system generated by the 2-dimensional Navier–Stokes System and deformations are produced by solutions of this system. Certainly, this is a very special case of a much more general problem in which Navier–Stokes System is replaced by linear or non-linear PDE for which strong existence and uniqueness results are known. The next step is to choose fixed points or periodic orbits and sometimes this can be a difficult problem. In our case this is done under the assumption of an additional symmetry of the problem.

We write Navier–Stokes System for the stream function \(\psi = \psi (\tilde{{x}}_{1},\tilde{{x}}_{2},t)\) on the 2-dimensional square \(0 \leq \tilde{ {x}}_{1},\,\tilde{{x}}_{2} \leq \pi \):

$$\frac{\partial \psi } {\partial t} + {\Delta }^{-1}\left ( \frac{\partial \psi } {\partial \tilde{{x}}_{1}} \cdot \frac{\partial \Delta \psi } {{\partial }_{\tilde{{x}}_{2}}} - \frac{\partial \psi } {\partial \tilde{{x}}_{2}} \cdot \frac{\partial \Delta \psi } {\partial \tilde{{x}}_{1}} \right ) = \Delta \psi.$$
(1)

In (1) the viscosity is taken to be 1 and the external forcing terms are absent. The velocity of the fluid u = (u 1, u 2) is expressed from ψ through the relations

$${u}_{1} = -\frac{\partial \psi } {\partial \tilde{{x}}_{2}},\quad {u}_{2} = \frac{\partial \psi } {\partial \tilde{{x}}_{1}}$$
(2)

which show that u is a local function of ψ. This is one of the advantages of ψ. Moreover, the velocity u given by (2) always satisfies incompressibility condition

$$\text{ div}(u) = \frac{\partial {u}_{1}} {\partial \tilde{{x}}_{1}} + \frac{\partial {u}_{2}} {\partial {x}_{2}} = 0.$$

We consider the space of functions ψ written as a series

$$\psi (\tilde{{x}}_{1},\tilde{{x}}_{2},t) =\sum\limits_{{m}^{2}+{n}^{2}\neq 0}{f}_{\mathit{mn}}\sin m\tilde{{x}}_{1}\sin n\tilde{{x}}_{2}.$$
(3)

The coefficients f mn are odd functions of its arguments and decay fast enough so that all appearing series converge. In Sect. 2 we reproduce the proof of the theorem from [4] in which we show that the space of such ψ is invariant under the dynamics generated by (1).

In (1) the operator Δ  − 1 has the form

$${\Delta }^{-1}\psi = -\sum\limits_{{m}^{2}+{n}^{2}\neq 0} \frac{1} {{m}^{2} + {n}^{2}}\sin m\tilde{{x}}_{1}\sin n\tilde{{x}}_{2}.$$

The formulas (2) and (3) show that on the boundary the velocity vector u is directed along the boundary. This situation is called the slip boundary condition. From the physical point of view it is not so natural but it is quite satisfactory as a mathematical model.

Let us write down an infinite-dimensional system of ODE for the coefficients f mn which follows from (1) and actually is equivalent to (1)

$$\begin{array}{rcl} & & \frac{{\mathit{df }}_{\mathit{mn}}} {\mathit{dt}} - \frac{1} {{m}^{2} + {n}^{2}}\sum\limits_{{ {m}^{{\prime}}+{m}^{{\prime\prime}}=m \atop {n}^{{\prime}}+{n}^{{\prime\prime}}=n} }{f}_{{m}^{{\prime}}{n}^{{\prime}}}{f}_{{m}^{{\prime\prime}}{n}^{{\prime\prime}}}\cdot ({({m}^{{\prime\prime}})}^{2} + {({n}^{{\prime\prime}})}^{2}) \cdot ({m}^{{\prime}}{n}^{{\prime\prime}}- {m}^{{\prime\prime}}{n}^{{\prime}}) \\ & & \quad = -({m}^{2} + {n}^{2}){f}_{\mathit{ mn}}. \end{array}$$
(4)

Introduce the vorticity

$$\omega = \Delta \psi =\sum\limits_{m,n}{\omega }_{\mathit{mn}}\sin m\tilde{{x}}_{1}\sin n\tilde{{x}}_{2}$$

which shows that \({\omega }_{\mathit{mn}} = -({m}^{2} + {n}^{2}){f}_{\mathit{mn}}\). For the coefficients \({\omega }_{\mathit{mn}}\) we have even a simpler system of ODE equivalent to (4)

$$\begin{array}{rcl} & & \frac{d{\omega }_{\mathit{mn}}} {\mathit{dt}} +\sum\limits_{{ {m}^{{\prime}}+{m}^{{\prime\prime}}=m \atop {n}^{{\prime}}+{n}^{{\prime\prime}}=n} }{\omega }_{{m}^{{\prime}}{n}^{{\prime}}}{\omega }_{{m}^{{\prime\prime}}{n}^{{\prime\prime}}}\cdot \frac{{m}^{{\prime}}{n}^{{\prime\prime}}- {m}^{{\prime\prime}}{n}^{{\prime}}} {{({m}^{{\prime}})}^{2} + {({n}^{{\prime}})}^{2}} \\ & & \quad = -({m}^{2} + {n}^{2}){\omega }_{\mathit{ mn}}. \end{array}$$
(5)

In [24], the following theorem was proven

Theorem 1 (Global wellposedness and decay). 

Let γ > 1, A > 0 and

$$\vert {\omega }_{\mathit{mn}}(0)\vert \leq \frac{A} {{({m}^{2} + {n}^{2})}^{\frac{\gamma } {2} }} ,$$
(6)

for all m,n, m 2 + n 2 ≠0. Then for some absolute constant K 1 > 0 and all t > 0,

$$\vert {\omega }_{\mathit{mn}}(t)\vert \leq \frac{A{K}_{1}} {{({m}^{2} + {n}^{2})}^{\frac{\gamma } {2} }}.$$
(7)

The proof of Theorem 1 is given in Sect. 2. In the periodic case it was given in [3] and [4] and extended to other boundary conditions in [2]. The inequality (7) implies that for the stream function

$$\vert {f}_{\mathit{mn}}(t)\vert \leq \frac{A{K}_{1}} {{({m}^{2} + {n}^{2})}^{\frac{\gamma } {2} +1}},\quad \forall \,m,n.$$

We shall take γ to be so large that the decay of f mn will be sufficient for our purposes. Actually the decay of f mn is much faster but we do not dwell on this here.

Remark 1.

Our flow (1)–(3) is closely connected with a special class of 2π-periodic flows on the whole plane. Namely suppose \(\tilde{\psi } =\tilde{ \psi }(\tilde{{x}}_{1},\tilde{{x}}_{2},t)\) is a solution to the Navier–Stokes equation with 2π-periodic boundary condition, and satisfy

$$\tilde{\psi }(-\tilde{{x}}_{1},\tilde{{x}}_{2},t) = -\tilde{\psi }(\tilde{{x}}_{1},\tilde{{x}}_{2},t) =\tilde{ \psi }(\tilde{{x}}_{1},-\tilde{{x}}_{2},t),\quad \forall \,\tilde{{x}}_{1},\tilde{{x}}_{2}.$$
(8)

It is not difficult to check that the special symmetry (8) is preserved under the dynamics of the Navier–Stokes flow. Furthermore if we write

$$\tilde{\psi }(\tilde{{x}}_{1},\tilde{{x}}_{2},t) =\sum\limits_{m,n}\tilde{{f}}_{\mathit{mn}}{e}^{i(m\tilde{{x}}_{1}+n\tilde{{x}}_{2})},$$

then

$$-\tilde{{f}}_{\mathit{mn}} =\tilde{ {f}}_{-m,n} =\tilde{ {f}}_{m,-n},\quad \forall \,m,n.$$

Therefore from a simple computation

$$\tilde{\psi }(\tilde{{x}}_{1},\tilde{{x}}_{2},t) = -\sum\limits_{m,n}\tilde{{f}}_{\mathit{mn}}\sin m\tilde{{x}}_{1}\sin n\tilde{{x}}_{2}$$
(9)

which corresponds exactly to (3) up to a minus sign. This shows that \(\tilde{\psi }\) is also a solution to our problem (1)–(3).

We shall call extremal points of the stream function ψ the points of local minima or maxima of ψ. Near these points the velocity u is tangent to the level sets of ψ (or \(\tilde{\psi }\)) which are closed curves. It is natural to call extremal points of ψ viscous vortices. The main purpose of this paper is to show that these vortices can split or merge.

Now we can formulate our main results of this paper.

Theorem 2 (Existence of bifurcations). 

There exists an open set \(\mathcal{A}\) in the space of stream functions such that the following holds true:

For each stream function \({\psi }_{0} \in \mathcal{A}\) , there is an open neighborhood U of the point \((\tilde{{x}}_{1},\tilde{{x}}_{2}) = (\frac{\pi } {2} , \frac{\pi } {2} )\) , two moments of times 0 < t 1 < t 2 such that the corresponding stream function \(\psi = \psi (\tilde{{x}}_{1},\tilde{{x}}_{2},t)\) solves (1)with initial data ψ 0 and satisfy

  1. 1.

    At t = 0, \((\frac{\pi } {2} , \frac{\pi } {2} )\) is a non-degenerate minimum of ψ in the neighborhood U.

  2. 2.

    For any 0 < t ≤ t 1 , ψ has only one critical point in U given by \((\tilde{{x}}_{1},\tilde{{x}}_{2}) = (\frac{\pi } {2} , \frac{\pi } {2} )\).

  3. 3.

    At t = t 1, \((\tilde{{x}}_{1},\tilde{{x}}_{2}) = (\frac{\pi } {2} , \frac{\pi } {2} )\) is a degenerate local minimum of ψ in U.

  4. 4.

    For t 1 < t ≤ t 2 , ψ has exactly three critical points in U. The point \((\frac{\pi } {2} , \frac{\pi } {2} )\) becomes a saddle. Two other critical points are of the form \((\frac{\pi } {2} + {x}^{{_\ast}}, \frac{\pi } {2} + {y}^{{_\ast}})\), \((\frac{\pi } {2} - {x}^{{_\ast}}, \frac{\pi } {2} - {y}^{{_\ast}})\) where x ≠0, y ≠0 and are local minima.

Remark 2.

Under our conditions the point \((\frac{\pi } {2} , \frac{\pi } {2} )\) is the extremal point of the stream function for all time. This property plays the same role as the knowledge of fixed points or periodic orbits in the usual theory of bifurcations.

Remark 3.

The fact that the extra critical points emerge in the form \((\frac{\pi } {2} + {x}^{{_\ast}}, \frac{\pi } {2} + {y}^{{_\ast}})\), \((\frac{\pi } {2} - {x}^{{_\ast}}, \frac{\pi } {2} - {y}^{{_\ast}})\) is not surprising. As we shall see later in Sect. 3, by the inversion symmetry (18), our stream function ψ is invariant under the reflection about the point \((\frac{\pi } {2} , \frac{\pi } {2} )\).

Our next result is in some sense the reversal of the process described in Theorem 2. For a class of initial data having three critical points near the special point \((\frac{\pi } {2} , \frac{\pi } {2} )\), we show that they merge into one critical point in finite time.

Theorem 3 (Merging of critical points). 

There exists an open set \(\mathcal{A}\) in the space of stream functions such that the following holds true:

For each stream function \({\psi }_{0} \in \mathcal{A}\) , there is an open neighborhood U of the point \((\tilde{{x}}_{1},\tilde{{x}}_{2}) = (\frac{\pi } {2} , \frac{\pi } {2} )\) , two moments of times 0 < t 1 < t 2 such that the corresponding stream function \(\psi = \psi (\tilde{{x}}_{1},\tilde{{x}}_{2},t)\) solves (1)with initial data ψ 0 and satisfy

  1. 1.

    For 0 ≤ t < t 1 , ψ has exactly three critical points in U. The point \((\frac{\pi } {2} , \frac{\pi } {2} )\) is a saddle. Two other critical points are of the form \((\frac{\pi } {2} + {x}^{{_\ast}}, \frac{\pi } {2} + {y}^{{_\ast}})\), \((\frac{\pi } {2} - {x}^{{_\ast}}, \frac{\pi } {2} - {y}^{{_\ast}})\) where x ≠0, y ≠0 and are local minima.

  2. 2.

    At t = t 1, \((\frac{\pi } {2} , \frac{\pi } {2} )\) is a degenerate minimum of ψ in the neighborhood U.

  3. 3.

    For any t 1 < t ≤ t 2 , ψ has only one critical point in U given by \((\tilde{{x}}_{1},\tilde{{x}}_{2}) = (\frac{\pi } {2} , \frac{\pi } {2} )\).

This paper is organized as follows. In Sect. 2 we give the proof of Theorem 1. In Sect. 3 we derive the equation for extremal points and formulate sufficient conditions for bifurcations needed in Theorem 2. Section 4 is devoted to the construction of bifurcations in the degenerate case. In Sect. 5 we prove the existence of bifurcation for non-degenerate initial data by using a perturbation argument. In Sect. 6 we give the construction of stream functions satisfying the needed conditions. In Sects. 7 and 8 we describe the proof of Theorem 3 and construction of initial conditions.

2 Proof of Theorem 1

In this section we give the proof of Theorem 1 using the trapping argument from [4].

We shall use the letter C with or without indices to denote different absolute constants whose values may vary from line to line. The actual value of C does not play any role in our arguments.

To simplify notations, denote \({Z}_{{_\ast}}^{2} =\{ (m,n) \in {\mathbb{Z}}^{2},\,m\neq 0,n\neq 0\}\) and \(r = (m,n) \in {\mathbb{Z}}_{{_\ast}}^{2}\), \({r}^{{\prime}} = ({m}^{{\prime}},{n}^{{\prime}}) \in {\mathbb{Z}}_{{_\ast}}^{2}\), \({r}^{{\prime\prime}} = ({m}^{{\prime\prime}},{n}^{{\prime\prime}}) \in {\mathbb{Z}}_{{_\ast}}^{2}\), and also denote \({\omega }_{r} = {\omega }_{\mathit{mn}}\), \({\omega }_{{r}^{{\prime}}} = {\omega }_{{m}^{{\prime}}{n}^{{\prime}}}\) and so on.

By standard enstrophy inequality, we have

$$\|\omega {(t)\|}_{{L}_{\tilde{{x}}_{ 1}\tilde{{x}}_{2}}^{2}([0,\pi ]\times [0,\pi ])} \leq {\mathcal{E}}_{0},\quad \forall \,t > 0,$$

where \({\mathcal{E}}_{0} > 0\) is the enstrophy at t = 0.

By Fourier transform, this implies

$${ \left (\sum\limits_{r\in {\mathbb{Z}}_{{_\ast}}^{2}}\vert {\omega }_{r}(t){\vert }^{2}\right )}^{\frac{1} {2} } \leq {C}_{1}{\mathcal{E}}_{0},\forall \,t > 0.$$
(10)

Let K 1 > 0 be a constant depending on A which will be taken sufficiently large. By (10), we get

$$\vert {\omega }_{r}(t)\vert \leq \frac{{C}_{1}{K}_{1}{\mathcal{E}}_{0}} {\vert r{\vert }^{\frac{\gamma } {2} }} ,\quad \forall \,\vert r\vert \leq {K}_{1}^{ \frac{2} {\gamma } },\quad \forall \,t > 0.$$

Define the trapping set

$$\Omega ({K}_{1}) = \left \{(\tilde{{\omega }}_{r}) :\, \vert \tilde{{\omega }}_{r}\vert \leq \frac{{C}_{1}{K}_{1}{\mathcal{E}}_{0}} {\vert r{\vert }^{\frac{\gamma } {2} }} ,\quad \forall \,\vert r\vert \geq {K}_{1}^{ \frac{2} {\gamma } }\right \}.$$
(11)

Now we show that for all t > 0 the trajectories of our system remain inside the set Ω(K 1). Indeed at t = 0, by choosing K 1 > 2A (see (6)), we get that our system lies strictly inside Ω(K 1). Assume t 1 > 0 is the first moment of time when our system reaches the boundary \(\partial \Omega ({K}_{1})\).Footnote 1

Then for some \(\vert {r}^{{_\ast}}\vert \geq {K}_{1}^{ \frac{2} {\gamma } }\),

$$\vert {\omega }_{{r}^{{_\ast}}}({t}_{1})\vert = \frac{{C}_{1}{K}_{1}{\mathcal{E}}_{0}} {\vert {r}^{{_\ast}}{\vert }^{\gamma }}.$$

WLOG assume

$${\omega }_{{r}^{{_\ast}}}({t}_{1}) = \frac{{C}_{1}{K}_{1}{\mathcal{E}}_{0}} {\vert {r}^{{_\ast}}{\vert }^{\gamma }}.$$

The case \({\omega }_{{r}^{{_\ast}}}({t}_{1}) = -\frac{{C}_{1}{K}_{1}{\mathcal{E}}_{0}} {\vert {r}^{{_\ast}}{\vert }^{\gamma }}\) is similar and therefore its discussion is omitted.

We then aim to show that

$${\left.{\partial }_{t}{\omega }_{{r}^{{_\ast}}}(t)\right \vert }_{t={t}_{1}} < 0.$$

This will guarantee that the trajectory of our system cannot exit the trapping set Ω(K 1) and will remain inside Ω(K 1).

Recall the vorticity equation

$${\partial }_{t}\omega + {\Delta }^{-1}{\nabla }^{\perp }\omega \cdot \nabla \omega = \Delta \omega.$$
(12)

By using (12), we have

$${\Delta }^{-1}{\nabla }^{\perp }\omega \cdot \nabla \omega =\sum\limits_{(m,n)\in {\mathbb{Z}}_{{_\ast}}^{2}}{N}_{\mathit{mn}}\sin m\tilde{{x}}_{1}\sin n\tilde{{x}}_{2},$$

where

$$\vert {N}_{\mathit{mn}}\vert \leq \sum\limits_{{r}^{{\prime}}+{r}^{{\prime\prime}}=r}\frac{\vert {\omega }_{{r}^{{\prime}}}\vert } {\vert {r}^{{\prime}}\vert } \cdot \vert {r}^{{\prime\prime}}\vert \cdot \vert {\omega }_{{ r}^{{\prime\prime}}}\vert.$$
(13)

There are two cases.

Case 1.

\(\vert {r}^{{\prime}}\vert > \frac{1} {3}\vert r\vert \). Then

$$\frac{\vert {r}^{{\prime\prime}}\vert } {\vert {r}^{{\prime}}\vert } \leq \frac{\vert r\vert + \vert {r}^{{\prime}}\vert } {\vert {r}^{{\prime}}\vert } \leq C.$$

Hence

$$\begin{array}{rcl} \text{ RHS of (13)}& & \leq C\sum\limits_{{ {r}^{{\prime}}+{r}^{{\prime\prime}}=r \atop \vert {r}^{{\prime}}\vert >\frac{1} {3} \vert r\vert } }\vert {\omega }_{{r}^{{\prime}}}\vert \cdot \vert {\omega }_{{r}^{{\prime\prime}}}\vert \\ & &\leq C{\left (\sum\limits_{\vert {r}^{{\prime}}\vert >\frac{1} {3} \vert r\vert }\vert {\omega }_{{r}^{{\prime}}}{\vert }^{2}\right )}^{\frac{1} {2} } \cdot {\left (\sum\limits_{{r}^{{\prime\prime}}}\vert {\omega }_{{r}^{{\prime\prime}}}{\vert }^{2}\right )}^{\frac{1} {2} } \\ & & \leq \frac{C{K}_{1}} {\vert r{\vert }^{\gamma -1}}{\mathcal{E}}_{1}.\end{array}$$

Case 2.

\(\vert {r}^{{\prime\prime}}\vert > \frac{1} {3}\vert r\vert \) and \(\vert {r}^{{\prime}}\vert \leq \frac{1} {3}\vert z\vert \). Then

$$\begin{array}{rcl} \text{ RHS of (12) }& & \leq \frac{C{K}_{1}} {\vert r{\vert }^{\gamma -1}}\sum\limits_{\vert {r}^{{\prime}}\vert \leq \frac{1} {3} \vert r\vert }\frac{\vert {\omega }_{{r}^{{\prime}}}\vert } {\vert {r}^{{\prime}}\vert } \\ & &\leq \frac{C{K}_{1}} {\vert r{\vert }^{\gamma -1}}\log \vert r\vert \cdot {\mathcal{E}}_{1}.\end{array}$$

Concluding from the above cases, we get

$$\vert {N}_{{r}^{{_\ast}}}({t}_{1})\vert \leq \frac{C{K}_{1}{\mathcal{E}}_{1}} {\vert {r}^{{_\ast}}{\vert }^{\gamma -1}} \log \vert {r}^{{_\ast}}\vert $$

and hence by (12)

$$\begin{array}{rcl}{ \left.{\partial }_{t}{\omega }_{{r}^{{_\ast}}}(t)\right \vert }_{t={t}_{1}}& & \leq \frac{C{K}_{1}{\mathcal{E}}_{1}} {\vert {r}^{{_\ast}}{\vert }^{\gamma -1}} \log \vert {r}^{{_\ast}}\vert -\frac{{C}_{1}{K}_{1}{\mathcal{E}}_{1}} {\vert {r}^{{_\ast}}{\vert }^{\gamma -2}} \\ & & < 0, \\ \end{array}$$

if K 1 is sufficiently large (recall that by (11), \(\vert {r}^{{_\ast}}\vert \geq {K}_{1}^{ \frac{2} {\gamma } }\)). This finishes the trapping argument and Theorem 1 is proved.

3 The Equation for Extremal Points

We consider a special class of flows

$$\psi (\tilde{{x}}_{1},\tilde{{x}}_{2},t) =\sum\limits_{m+n\ \mathrm{is\ even}}{f}_{\mathit{mn}}\sin (m\tilde{{x}}_{1})\sin (n\tilde{{x}}_{2}).$$
(14)

It is also invariant under the Navier–Stokes dynamics. If this condition is valid, then on the vertical boundaries, for any \(0\,\leq \,\tilde{{x}}_{2}\,\leq \,\pi \), the velocity vector at the point \((0,\tilde{{x}}_{2})\) has the same magnitude but opposite direction to the velocity at the point \((0,\pi -\tilde{ {x}}_{2})\). In some sense they form a dipole with center at \((\frac{\pi } {2} , \frac{\pi } {2} )\). Similar statements also hold for the horizontal boundaries.

In this paper we study bifurcations of the stream function near the point \((\frac{\pi } {2} , \frac{\pi } {2} )\).

After the change of variables,

$$\tilde{{x}}_{1} = \frac{\pi } {2} + x,\quad \tilde{{x}}_{2} = \frac{\pi } {2} + y,$$
(15)

we shift our coordinate system and define

$$\phi (x,y,t) = \psi (\frac{\pi } {2} + x, \frac{\pi } {2} + y,t).$$
(16)

By (16), (14) and (9), we get

$$\begin{array}{rcl} \phi (x,y,t)& =& -\sum\limits_{m+n\ \mathrm{is\ even}}{f}_{\mathit{mn}}{e}^{i(\frac{m+n} {2} \pi +mx+ny)} \\ & =& -\sum\limits_{m+n\text{ is even}}{f}_{\mathit{mn}}{(-1)}^{\frac{m+n} {2} }{e}^{i(mx+ny)}.\end{array}$$

Since ϕ and f mn are both real-valued, we get

$$\phi (x,y,t) = -\sum\limits_{m+n\text{ is even}}{f}_{\mathit{mn}}{(-1)}^{\frac{m+n} {2} }\cos (mx + ny)$$
(17)
$$= \phi (-x,-y,t),$$
(18)

i.e. ϕ satisfies the inversion symmetry. It implies that at the point (x, y) = (0, 0) the gradient of ϕ vanishes.

Introduce a neighborhood \({U}_{\delta } =\{ (x,y) :\; {x}^{2} + {y}^{2} \leq {\delta }^{2}\}\). Later we shall choose δ to be sufficiently small.

For sufficiently small t 2 > 0 consider the time interval [0, t 2] and write the following expansion of ϕ in the neighborhood U δ:

$$\begin{array}{rcl} \phi (x,y,t)& =& \phi (0,0,t) + {a}_{1}(t){x}^{2} + {a}_{ 2}(t){y}^{2} + {a}_{ 3}(t)xy \\ & & \quad + {b}_{1}(t){x}^{4} + {b}_{ 2}(t){y}^{4} + {b}_{ 3}(t){x}^{3}y + {b}_{ 4}(t){x}^{2}{y}^{2} + {b}_{ 5}(t)x{y}^{3} \\ & & \quad + \epsilon (x,y,t), \end{array}$$
(19)

where the remainder term satisfies the inequalities

$$\epsilon (x,y,t) = O({x}^{6} + {y}^{6}),$$
$$\frac{\partial \epsilon } {\partial x}(x,y,t) = O(\vert x{\vert }^{5} + \vert y{\vert }^{5}),$$
$$\frac{\partial \epsilon } {\partial y}(x,y,t) = O(\vert x{\vert }^{5} + \vert y{\vert }^{5}).$$
(20)

In the expansion (19), terms of odd degree are not present because of the symmetry (18).

The first equation for the critical point takes the form

$${\partial }_{x}\phi = 0.$$

By (19), we get

$$\begin{array}{rcl} & & 2{a}_{1}x + {a}_{3}y + 4{b}_{1}{x}^{3} + 3{b}_{ 3}{x}^{2}y + 2{b}_{ 4}x{y}^{2} \\ & & \qquad + {b}_{5}{y}^{3} + \frac{\partial \epsilon } {\partial x} = 0. \end{array}$$
(21)

Here and later we occasionally suppress the time dependence and write a i (t), b i (t) simply as a i , b i when the context is clear.

Assume for 0 ≤ t ≤ t 2

$${a}_{3}(t) \sim O(1).$$
(22)

More precisely

$$\mathit{const} \leq {a}_{3}(t) \leq \mathit{const}.$$

The values of constants play some role later. This will be clarified below (see (34)).

Equation 21 takes the form

$$\begin{array}{rcl} y& =& -\frac{2{a}_{1}} {{a}_{3}} x -\frac{4{b}_{1}} {{a}_{3}} {x}^{3} -\frac{3{b}_{3}} {{a}_{3}} {x}^{2}y -\frac{2{b}_{4}} {{a}_{3}} x{y}^{2} \\ & & \qquad - \frac{{b}_{5}} {{a}_{3}}{y}^{3} - \frac{1} {{a}_{3}} \frac{\partial \epsilon } {\partial x}. \end{array}$$
(23)

Assume also that in formula (19)

$${b}_{2}(0) = {b}_{3}(0) = {b}_{4}(0) = {b}_{5}(0) = 0.$$
(24)

For sufficiently small t 2 it implies that for 0 ≤ t ≤ t 2,

$${b}_{i}(t) \sim O(t),\quad i = 2,\cdots \,,5.$$
(25)

Write (23) in the form

$$\begin{array}{rcl} y& =& -\frac{2{a}_{1}} {{a}_{3}} x -\frac{4{b}_{1}} {{a}_{3}} {x}^{3} + O(t) \cdot O(\vert x{\vert }^{3} + \vert y{\vert }^{3}) \\ & & \quad + O(\vert x{\vert }^{5} + \vert y{\vert }^{5}). \end{array}$$
(26)

Since (x, y) ∈ U δ, we have the rough estimate

$$y = O(x).$$
(27)

Consider the other critical point equation

$$\frac{\partial \phi } {\partial y} = 0.$$

By (19), we get

$$\begin{array}{rcl} & & 2{a}_{2}y + {a}_{3}x + 4{b}_{2}{y}^{3} + {b}_{ 3}{x}^{3} + 2{b}_{ 4}{x}^{2}y \\ & & \quad + 3{b}_{5}x{y}^{2} + \frac{\partial \epsilon } {\partial y} =0. \end{array}$$

In view of the assumptions (25) and (20), we obtain

$$2{a}_{2}y + {a}_{3}x + O(t) \cdot O(\vert x{\vert }^{3} + \vert y{\vert }^{3}) + O(\vert x{\vert }^{5} + \vert y{\vert }^{5}) = 0.$$

Using (27), we get

$$2{a}_{2}y + {a}_{3}x + O(t) \cdot O(\vert x{\vert }^{3}) + O(\vert x{\vert }^{5}) = 0.$$
(28)

Substituting (26) into (28) and using again (27), we have

$$2{a}_{2}\left (-\frac{2{a}_{1}} {{a}_{3}} x -\frac{4{b}_{1}} {{a}_{3}} {x}^{3}\right ) + {a}_{ 3}x + O(t) \cdot O(\vert x{\vert }^{3}) + O(\vert x{\vert }^{5}) = 0.$$

Or simply,

$$\frac{{a}_{3}^{2} - 4{a}_{1}{a}_{2}} {{a}_{3}} x -\frac{8{a}_{2}{b}_{1}} {{a}_{3}} {x}^{3} + O(t) \cdot O(\vert x{\vert }^{3}) + O(\vert x{\vert }^{5}) = 0.$$
(29)

It is obvious that (29) has a solution x = 0. We now look for other possible solutions in U δ. Dividing both sides of (29) by \(\frac{x} {{a}_{3}}\), we obtain

$$({a}_{3}^{2} - 4{a}_{ 1}{a}_{2}) - 8{a}_{2}{b}_{1}{x}^{2} + O(t) \cdot O({x}^{2}) + O({x}^{4}) = 0.$$
(30)

We shall choose initial data very carefully so that the needed bifurcation happens on the time interval [0, t 2]. This will be done in two stages. At the first stage we consider the degenerate case in which the bifurcation happens immediately for t > 0. In the second stage we perturb the degenerate data so that the bifurcation is “delayed” to a later time 0 < t 1 < t 2. In other words, we show that for sufficiently small (and special) perturbations, the desired bifurcation happens at t = t 1.

4 Stage 1: The Bifurcation in the Degenerate Case

Rewrite (30) as

$$-({a}_{3}^{2} - 4{a}_{ 1}{a}_{2}) + 8{a}_{2}{b}_{1}{x}^{2} + O(t) \cdot O({x}^{2}) + O({x}^{4}) = 0.$$
(31)

Choose ϕ0 = ϕ0(x, y) so that

$${a}_{3}{(0)}^{2} - 4{a}_{ 1}(0){a}_{2}(0) = 0,$$
$${\left. \frac{d} {\mathit{dt}}\left ({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)\right )\right \vert }_{t=0} > 0,$$
$${a}_{2}(0) > 0,\,{a}_{3}(0) > 0,\,{b}_{1}(0) > 0.$$
(32)

In addition, we also need

$${b}_{2}(0) = {b}_{3}(0) = {b}_{4}(0) = {b}_{5}(0) = 0.$$
(33)

The possibility of choosing ϕ0 with properties (32)–(33) will be shown later (see Sect. 6). Assume for the moment that these conditions are met, then for sufficiently small t 2 > 0, we have for 0 < t ≤ t 2,

$$\begin{array}{rcl} {A}_{3}^{{\prime\prime}}& \geq & {a}_{ 3}(t) \geq {A}_{3}^{{\prime}} > 0, \\ {A}_{2}^{{\prime\prime}}& \geq & {a}_{ 2}(t) \geq {A}_{2}^{{\prime}} > 0, \\ {B}_{1}^{{\prime\prime}}& \geq & \frac{d} {\mathit{dt}}\left ({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)\right ) \geq {B}_{1}^{{\prime}} > 0, \\ {B}_{2}^{{\prime\prime}}& \geq & {b}_{ 1}(t) \geq {B}_{2}^{{\prime}} > 0, \end{array}$$
(34)

where \({A}_{i}^{{\prime}}\), \({A}_{i}^{{\prime\prime}}\), \({B}_{i}^{{\prime}}\), \({B}_{i}^{{\prime\prime}}\) are constants.

By (32)–(34), we have for 0 < t ≤ t 2

$$({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)) \sim t,$$

which means that

$$\mathit{const} \cdot t \leq {a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t) \leq \mathit{const} \cdot t.$$

Also we have

$$8{a}_{2}(t){b}_{1}(t) \sim \mathit{const}.$$

It follows that for 0 < t ≤ t 2, the equation (31) is of the form

$$-O(t) + O(1) \cdot {x}^{2} + O(t) \cdot O({x}^{2}) + O({x}^{4}) = 0.$$
(35)

For sufficiently small δ and sufficiently small t 2, the equation (35) has two and only two solutions

$$x = \pm O(\sqrt{t})$$

because O(t) is of order of t, O(1) > 0 and other terms do not play any essential role. In this sense solutions to (31) bifurcates into two solutions for 0 < t ≤ t 2.

Remark that at t = 0, the only solution to (31) is x = 0 due to the conditions \({a}_{3}{(0)}^{2} - 4{a}_{1}(0){a}_{2}(0) = 0\) and \({a}_{2}(0){b}_{1}(0) \sim \mathit{const}\).

5 Stage 2: Bifurcation from Non-degenerate Initial Data, a Perturbation Argument

In stage 2 we finish our construction of bifurcation from non-degenerate initial data. The main idea is to perturb the initial data considered in Stage 1. The perturbation will be chosen so that initially we will have only one local non-degenerate minimum located at (x, y) = (0, 0).

To this end, consider \(\tilde{{\phi }}_{0} =\tilde{ {\phi }}_{0}(x,y) \in {C}^{\infty }\) with the following properties:

$$\tilde{{\phi }}_{0}(x,y) =\tilde{ {\phi }}_{0}(-x,-y),\quad \forall \,x,y,$$
$${\left. \frac{{\partial }^{4}\tilde{{\phi }}_{0}} {\partial {x}^{m}\partial {y}^{n}}\right \vert }_{(x,y)=(0,0)} = 0,\quad \forall \,m + n = 4,0 \leq m \leq 4,$$
$${ \left. \frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial x\partial y}\right \vert }_{(x,y)=(0,0)} = 0,\quad {\left.\frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {x}^{2}} \right \vert }_{(x,y)=(0,0)} > 0,\quad {\left.\frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {y}^{2}} \right \vert }_{(x,y)=(0,0)} > 0.$$
(36)

Fix ϕ0 = ϕ0(x, y) taken from Stage 1 which has the properties (32)–(33). We shall consider the perturbation by \(\tilde{{\phi }}_{0}\) having the form

$$\tilde{{\phi }}_{0}^{\epsilon }(x,y) = {\phi }_{ 0}(x,y) + \epsilon \tilde{{\phi }}_{0}(x,y),$$

where ε > 0 is sufficiently small.

Denote the corresponding solution of the main equation (1) (in the shifted coordinates) by \({\phi }^{\epsilon } = {\phi }^{\epsilon }(x,y,t)\). To simplify the notations, we expand \({\phi }^{\epsilon }(x,y,t)\) in the form corresponding to (19), i.e. we write

$$\begin{array}{rcl}{ \phi }^{\epsilon }(x,y,t)& =& {\phi }^{\epsilon }(0,0,t) + {a}_{ 1}^{\epsilon }(t){x}^{2} + {a}_{ 2}^{\epsilon }{y}^{2} + {a}_{ 3}^{\epsilon }xy \\ & & \quad + {b}_{1}^{\epsilon }(t){x}^{4} + {b}_{ 2}^{\epsilon }(t){y}^{4} + {b}_{ 3}^{\epsilon }(t){x}^{3}y + {b}_{ 4}^{\epsilon }(t){x}^{2}{y}^{2} + {b}_{ 5}^{\epsilon }(t)x{y}^{3} \\ & & \quad +\tilde{ \epsilon }(x,y,t) \end{array}$$
(37)

where \(\tilde{\epsilon }\) satisfies an estimate similar to (20).

We now check the properties of \({\phi }^{\epsilon }(x,y,t)\).

  1. (a)

    At t = 0, the point (x, y) = (0, 0) is the unique extremum of \({\phi }^{\epsilon }(x,y,0)\) in the neighborhood U δ. Also (0, 0) is a non-degenerate local minimum.

    To prove this, we note that due to (32), (33) and (36), the critical point equation (30) still holds for \({\phi }^{\epsilon }(x,y,t)\) for sufficiently small \(\epsilon > 0\) with corresponding coefficients a 1, a 2, a 3, b 1 now replaced by \({a}_{1}^{\epsilon }\), \({a}_{2}^{\epsilon }\), \({a}_{3}^{\epsilon }\), \({b}_{1}^{\epsilon }\). In particular this gives us

    $${({a}_{3}^{\epsilon }(0))}^{2} - 4{a}_{ 1}^{\epsilon }(0){a}_{ 2}^{\epsilon }(0) - 8{a}_{ 2}^{\epsilon }(0){b}_{ 1}^{\epsilon }(0){x}^{2} + O({x}^{4}) = 0.$$
    (38)

    Denote

    $${\left.\tilde{{a}}_{1} = \frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {x}^{2}} \right \vert }_{(x,y)=(0,0)} > 0,$$
    $${\left.\tilde{{a}}_{2} = \frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {y}^{2}} \right \vert }_{(x,y)=(0,0)} > 0.$$

    By (32) and (36), we have

    $$\begin{array}{rcl} & & {({a}_{3}^{\epsilon }(0))}^{2} - 4{a}_{ 1}^{\epsilon }(0){a}_{ 2}^{\epsilon }(0) \\ & & \quad = {a}_{3}{(0)}^{2} + O({\epsilon }^{2}) - 4({a}_{ 1}(0) + \epsilon \tilde{{a}}_{1})({a}_{2}(0) + \epsilon \tilde{{a}}_{2}) \\ & & \quad = -4({a}_{1}(0)\tilde{{a}}_{2} + {a}_{2}(0)\tilde{{a}}_{1})\epsilon + O({\epsilon }^{2}). \end{array}$$
    (39)

    On the other hand, for sufficiently small \(\epsilon > 0\), by using (38) and (36), we have

    $$\begin{array}{rcl}{ a}_{2}^{\epsilon }(0){b}_{ 1}^{\epsilon }(0)& =& ({a}_{ 2}(0) + O(\epsilon )) \cdot ({b}_{1}(0) + O({\epsilon }^{2})) \\ & =& {a}_{2}(0){b}_{1}(0) + O(\epsilon ) \\ & & \sim \mathit{const}. \end{array}$$
    (40)

    Therefore by (39) and (40), the equation (38) takes the form

    $$-O(1)\epsilon - O(1) \cdot {x}^{2} + O({x}^{4}) = 0,$$

    or simply

    $$O(1) \cdot \epsilon + O(1) \cdot O({x}^{2}) = 0.$$

    It is clear that for \(\epsilon > 0\) this equation does not have any real-valued solution in U δ.

    To show that (0, 0) is a non-degenerate local minimum at t = 0, we observe that by (39), for sufficiently small \(\epsilon > 0\),

    $${({a}_{3}^{\epsilon }(0))}^{2} - 4{a}_{ 1}^{\epsilon }(0){a}_{ 2}^{\epsilon }(0) < 0.$$
    (41)

    Also we have by (32)

    $${a}_{1}^{\epsilon }(0) > 0,\quad {a}_{ 2}^{\epsilon }(0) > 0.$$
    (42)

    Equations 41 and 42 show that the Hessian matrix

    $$\left (\begin{array}{cc} {a}_{1}^{\epsilon }(0) &\frac{1} {2}{a}_{3}^{\epsilon }(0) \\ \frac{1} {2}{a}_{3}^{\epsilon }(0)& {a}_{ 2}^{\epsilon }(0)\end{array} \right )$$

    is strictly positive definite. Hence (0, 0) is a non-degenerate local minimum.

  2. (b)

    Consider the function

    $${D}^{\epsilon }(t) = {({a}_{ 3}^{\epsilon }(t))}^{2} - 4{a}_{ 1}^{\epsilon }(t){a}_{ 2}^{\epsilon }(t).$$

    It will be proven below that for sufficiently small ε > 0, the following holds:

    There exists unique \({t}_{1} = {t}_{1}(\epsilon ) > 0\) such that

    $$\begin{array}{rcl} & & {D}^{\epsilon }(t) < 0,\quad \text{ for}0 \leq t < {t}_{ 1}, \\ & & {D}^{\epsilon }(t) = 0,\quad \text{ for}t = {t}_{ 1}, \\ & & {D}^{\epsilon }(t) > 0,\quad \text{ for}{t}_{ 1} < t \leq {t}_{2}. \end{array}$$
    (43)

Furthermore, the reduced critical-point equation (see (30))

$${D}^{\epsilon }(t) - 8{a}_{ 2}^{\epsilon }(t){b}_{ 1}^{\epsilon }(t){x}^{2} + O(t) \cdot O({x}^{2}) + O({x}^{4}) = 0$$
(44)

has

  • No solution for 0 ≤ t < t 1,

  • Exactly one solution given by x = 0 for t = t 1,

  • Two nonzero solutions for t 1 < t ≤ t 2.

To prove (43), we recall the bound (34) , where for 0 ≤ t ≤ t 2

$${B}_{1}^{{\prime\prime}}\geq \frac{d} {\mathit{dt}}\left ({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)\right ) \geq {B}_{1}^{{\prime}} > 0.$$
(45)

Since our initial data are given by

$${\phi }_{0}^{\epsilon }(x,y) = {\phi }_{ 0}(x,y) + \epsilon \tilde{{\phi }}_{0}(x,y),$$

it follows from simple perturbation theory that for sufficiently small ε > 0 , we have

$${ \left \|{\phi }^{\epsilon }(x,y,t) - \phi (x,y,t)\right \|}_{{ H}_{t,x,y}^{m}} \leq \eta (\epsilon ,m),$$
(46)

where \(\eta (\epsilon ,m) \rightarrow 0\) as \(\epsilon \rightarrow 0\) and m is fixed.

The notation H t, x, y m denotes m th Sobolev norms of ψ:

$${\left \|\psi \right \|}_{{H}_{t,x,y}^{m}} =\sum\limits_{0\leq \alpha +\beta +\gamma \leq m}{\left \|{\partial }_{t}^{\alpha }{\partial }_{ x}^{\beta }{\partial }_{ y}^{\gamma }\psi \right \|}_{{ L}^{2}}.$$

Take m to be sufficiently large and then ε sufficiently small. It follows from (45) and (46) that

$$2{B}_{1}^{{\prime\prime}}\geq \frac{d} {\mathit{dt}}\left ({({a}_{3}^{\epsilon }(t))}^{2} - 4{a}_{ 1}^{\epsilon }(t){a}_{ 2}^{\epsilon }(t)\right ) \geq \frac{{B}_{1}^{{\prime}}} {2} > 0,$$
(47)

for any 0 ≤ t ≤ t 2.

This means in particular that D ε(t) is strictly increasing for 0 ≤ t ≤ t 2.

By (39), we have for t = 0 and ε sufficiently small,

$${D}^{\epsilon }(0) < 0.$$
(48)

On the other hand for t = t 2, by using the analysis from Stage 1, we have

$${({a}_{3}({t}_{2}))}^{2} - 4{a}_{ 1}({t}_{2}){a}_{2}({t}_{2}) > 0.$$

Since

$${D}^{\epsilon }({t}_{ 2}) = {({a}_{3}({t}_{2}))}^{2} - 4{a}_{ 1}({t}_{2}){a}_{2}({t}_{2}) + O(\epsilon ),$$

it follows easily that for ε sufficiently small

$${D}^{\epsilon }({t}_{ 2}) > 0.$$
(49)

Now (47)–(49) easily yield (43).

Finally the conclusion after (44) is a simple corollary of the properties of D ε(t) and perturbation theory. We omit the details.

In summary, we have proved the following:

For sufficiently small ε > 0, the function \({\phi }^{\epsilon }(x,y,t)\) has the following properties in the neighborhood U δ:

There exists 0 < t 1 < t 2, such that

  • For 0 ≤ t < t 1, (x, y) = (0, 0) is the only critical point in U δ. Furthermore it is a non-degenerate local minimum.

  • For t = t 1, (x, y) = (0, 0) is the only critical point in U δ.

  • For t 1 < t ≤ t 2, there are three critical points in U δ. The point (x, y) = (0, 0) is a saddle. Two other critical points are of the form (x  ∗ , y  ∗ ), ( − x  ∗ , − y  ∗ ), where x  ∗  > 0, y  ∗  > 0.

Remark that due to our inversion symmetry (18), if (x  ∗ , y  ∗ ) is a critical point with x  ∗ ≠0, then ( − x  ∗ , − y  ∗ ) is also a critical point.

6 Construction of ϕ0 Satisfying (32)–(33)

We now demonstrate the existence of \({\phi }_{0} = {\phi }_{0}(x,y)\) which satisfies conditions (32)–(33) and also has inversion symmetry (18).

By (17), we choose

$${\phi }_{0}(x,y) = -\sum\limits_{{ m+n\mathrm{is\ even} \atop \vert m\vert \leq N,\vert n\vert \leq N} }\tilde{{f}}_{\mathit{mn}}{(-1)}^{\frac{m+n} {2} }\cos (mx + ny).$$
(50)

To simplify matters, we impose the following conditions on \(\tilde{{f}}_{\mathit{mn}}\):

  • \(\tilde{{f}}_{\mathit{mn}}\) is real-valued;

  • \(\tilde{{f}}_{\mathit{mn}} = 0\) if m = 0 or n = 0;

  • \(\tilde{{f}}_{\mathit{mn}}\) are odd in each of its variables m and n.

The above conditions imply that

$$\begin{array}{rcl}{ \phi }_{0}(x,y)& =& -\sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }\tilde{{f}}_{\mathit{mn}} \cdot \left (2{(-1)}^{\frac{m+n} {2} }\cos (mx + ny)\right. \\ & & \qquad \left.-{(-1)}^{\frac{-m+n} {2} }\cos (-mx + ny) - {(-1)}^{\frac{m-n} {2} }\cos (mx - ny)\right ) \\ & =& -\sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }2\tilde{{f}}_{\mathit{mn}}{(-1)}^{\frac{m+n} {2} }\left (\cos (mx + ny) - {(-1)}^{n}\cos (mx - ny)\right ).\end{array}$$
(51)

Define

$${f}_{\mathit{mn}} = -2\tilde{{f}}_{\mathit{mn}}{(-1)}^{\frac{m+n} {2} }.$$

Then we have

$${\phi }_{0}(x,y) =\sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }{f}_{\mathit{mn}}\left (\cos (mx + ny) - {(-1)}^{n}\cos (mx - ny)\right ),$$
(52)

where f mn are the coefficients to be determined.

Now recall the conditions (32) and (33) and choose

$$\begin{array}{rcl} & & {a}_{3}(0) = 2, \\ & & {a}_{1}(0) = {a}_{2}(0) = 1, \\ & & {b}_{1}(0) = \frac{{r}_{1}} {24} > 0, \\ & & {b}_{2}(0) = {b}_{3}(0) = {b}_{4}(0) = {b}_{5}(0) = 0,\end{array}$$
(53)

where r 1 is a parameter whose value will be specified later.

We still have to check the second condition in (32). This condition can be simplified a little bit. By (53),

$$\begin{array}{rcl} & &{ \left. \frac{d} {\mathit{dt}}\left ({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)\right )\right \vert }_{t=0} \\ & & \quad = 4(\dot{{a}}_{3}(0) -\dot{ {a}}_{1}(0) -\dot{ {a}}_{2}(0)) \\ & & \quad = 2\left ( \frac{{\partial }^{3}\phi } {\partial t\partial x\partial y}(0,0,0) -\left ( \frac{\partial } {\partial t}\Delta \phi \right )(0,0,0)\right ).\end{array}$$

By (1), (19), (16) and (53), we have

$$\begin{array}{rcl} \left ( \frac{\partial } {\partial t}\Delta \phi \right )(0,0,0)& =& \frac{\partial } {\partial t}\Delta \psi \left (\frac{\pi } {2} , \frac{\pi } {2} ,0\right ) \\ & =& {\Delta }^{2}{\psi }_{ 0}\left (\frac{\pi } {2} , \frac{\pi } {2} \right ) + ({\nabla }^{\perp }{\psi }_{ 0})\left (\frac{\pi } {2} , \frac{\pi } {2} \right ) \cdot (\nabla \Delta {\psi }_{0})\left (\frac{\pi } {2} , \frac{\pi } {2} \right ) \\ & =& {r}_{1}.\end{array}$$

Similarly

$$\frac{{\partial }^{3}\phi } {\partial t\partial x\partial y}(0,0,0) = \left ({\partial }_{xy}{\Delta }^{-1}({\nabla }^{\perp }{\phi }_{ 0} \cdot \nabla \Delta {\phi }_{0})\right )(0,0).$$

Therefore the condition

$${\left. \frac{d} {\mathit{dt}}\left ({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)\right )\right \vert }_{t=0} > 0$$

is equivalent to

$${\partial }_{xy}{\Delta }^{-1}({\nabla }^{\perp }{\phi }_{ 0} \cdot \nabla \Delta {\phi }_{0})(0,0) > {r}_{1}.$$
(54)

Our goal is to find (f mn ) in (52) such that both (53) and (54) hold. In our formulae below, the summation is understood to be in the region \(\{(m,n)\,:\,1 \leq m,n \leq N\mbox{ and}m + n\mbox{ is even}\}\). In terms of f mn , the conditions (53) now take the form

$$\begin{array}{rcl} & & \sum {f}_{\mathit{mn}} \cdot mn \cdot (1 + {(-1)}^{n}) = -2, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{2} \cdot (1 - {(-1)}^{n}) = -1, \\ & & \sum {f}_{\mathit{mn}} \cdot {n}^{2} \cdot (1 - {(-1)}^{n}) = -1, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{4} \cdot (1 - {(-1)}^{n}) = {r}_{ 1}, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{3}n \cdot (1 + {(-1)}^{n}) = 0, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{2}{n}^{2} \cdot (1 - {(-1)}^{n}) = 0, \\ & & \sum {f}_{\mathit{mn}} \cdot m{n}^{3} \cdot (1 + {(-1)}^{n}) = 0, \\ & & \sum {f}_{\mathit{mn}} \cdot {n}^{4} \cdot (1 - {(-1)}^{n}) = 0. \end{array}$$
(55)

Due to the factors (1 ± ( − 1)n) which can vanish depending on the parity of n in the summation, we distinguish two types of coefficients. We shall say f mn is even if both m and n are even. Otherwise f mn is called odd. Notice that due to the constraint that m + n is even we shall only have either odd or even coefficients.

We consider first the equations for even coefficients. From (55) we only need

$$\begin{array}{rcl} & & \sum\limits_{{ m,n\geq 2 \atop m,n\text{ are even}} }{f}_{\mathit{mn}} \cdot mn = -1, \\ & & \sum\limits_{{ m,n\geq 2 \atop m,n\text{ are even}} }{f}_{\mathit{mn}} \cdot {m}^{3}n = 0, \\ & & \sum\limits_{{ m,n\geq 2 \atop m,n\text{ are even}} }{f}_{\mathit{mn}} \cdot m{n}^{3} = 0,\end{array}$$
(56)

Now we assume that we only have two nonzero even coefficients f 22 and f 44. Then from (56) we get

$${f}_{22} \cdot {2}^{2} + {f}_{ 44} \cdot {4}^{2} = -1,$$
$${f}_{22} \cdot {2}^{4} + {f}_{ 44} \cdot {4}^{4} = 0.$$

A simple computation gives that

$${f}_{22} = -1/3,\quad {f}_{44} = 1/48;$$
(57)

Next we turn to odd coefficients.

From (55), we get

$$\begin{array}{rcl} & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {m}^{2} = -\frac{1} {2}, \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {n}^{2} = -\frac{1} {2}, \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {m}^{4} = \frac{{r}_{1}} {2} , \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {m}^{2}{n}^{2} = 0, \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {n}^{4} = 0, \end{array}$$
(58)

To simplify matters, we assume that the only nonzero odd coefficients are f 11, f 31, f 33, f 15, f 51.

Let r 2 be another parameter whose value will be specified later. We shall choose f 51 = r 2 and add this condition to (58). For the coefficients f 11, f 31, f 33, f 15, f 51 we then have the matrix equation

$$\left (\begin{array}{cccccc} 1 & 1 &{3}^{2} & {3}^{2} & 1 &{5}^{2} \\ 1 &{3}^{2} & 1 &{3}^{2} & {5}^{2} & 1 \\ 1 & 1 &{3}^{4} & {3}^{4} & 1 &{5}^{4} \\ 1 &{3}^{2} & {3}^{2} & {9}^{2} & {5}^{2} & {5}^{2} \\ 1 &{3}^{4} & 1 &{3}^{4} & {5}^{4} & 1\\ 0\quad 0 & 0 & 0 & 0 & 0 & 1\\ \end{array} \right )\left (\begin{array}{c} {f}_{11} \\ {f}_{13} \\ {f}_{31} \\ {f}_{33} \\ {f}_{15} \\ {f}_{51}\\ \end{array} \right ) = \left (\begin{array}{c} -\frac{1} {2} \\ -\frac{1} {2} \\ \frac{{r}_{1}} {2} \\ 0\\ 0 \\ {r}_{2}\\ \end{array} \right )$$
(59)

Choose \({r}_{1} = \frac{1} {10}\) and r 2 =  − 10. From (59), we obtain

$$\left (\begin{array}{c} {f}_{11} \\ {f}_{13} \\ {f}_{31} \\ {f}_{33} \\ {f}_{15} \\ {f}_{51} \end{array} \right ) = \left (\begin{array}{c} - 580.5698\\ 90.0012 \\ 90.0008\\ - 6.6598 \\ - 10.0001\\ - 10.0000 \\ \end{array} \right )$$
(60)

We have completely solved (53). It remains to check the condition (54).

To simplify the computation, we rewrite (52) as

$$\begin{array}{rcl}{ \phi }_{0}(x,y)& =& \sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }{f}_{\mathit{mn}} \cdot \frac{{e}^{i(mx+ny)} + {e}^{-i(mx+ny)}} {2} \\ & & \quad +\sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }{f}_{\mathit{mn}} \cdot {(-1)}^{n+1} \cdot \frac{{e}^{i(mx-ny)} + {e}^{-i(mx-ny)}} {2}. \\ & =& \sum\limits_{\vert m\vert \leq N,\vert n\vert \leq N}{g}_{\mathit{mn}}{e}^{i(mx+ny)}, \end{array}$$
(61)

where the coefficients g mn satisfy

  • g mn  = 0 if (m + n) is not even or m = 0 or n = 0.

  • \({g}_{\mathit{mn}} = \frac{1} {2}{f}_{\vert m\vert ,\vert n\vert }\) if mn > 0.

  • \({g}_{\mathit{mn}} = \frac{1} {2}{f}_{\vert m\vert ,\vert n\vert }\cdot {(-1)}^{n+1}\) if mn < 0.

To find the LHS of (54), we use the coefficients g mn and calculate

$$\begin{array}{rcl}{ \Delta }^{-1}\nabla {\phi }_{ 0}(x,y)& =& \sum\limits_{\vert m\vert \leq N,\vert n\vert \leq N}{g}_{\mathit{mn}} \cdot i \cdot \frac{\left({m atop n}\right)} {(-1)({m}^{2} + {n}^{2})}{e}^{i(mx+ny)}, \\ {\nabla }^{\perp }{\phi }_{ 0}(x,y)& =& \sum\limits_{\vert \tilde{m}\vert \leq N,\vert \tilde{n}\vert \leq N}{g}_{\tilde{m}\tilde{n}} \cdot i \cdot \left({-\tilde{n} atop -\tilde{m}}\right) \cdot {e}^{i(\tilde{m}x+\tilde{n}y)}.\end{array}$$

Hence

$$({\nabla }^{-1}\nabla {\phi }_{ 0}\cdot {\nabla }^{\perp }{\phi }_{ 0})(x,y) =\sum\limits_{{ \vert m\vert \leq N,\vert n\vert \leq N \atop \vert \tilde{m}\vert \leq N,\vert \tilde{n}\vert \leq N} }{g}_{\mathit{mn}}{g}_{\tilde{m}\tilde{n}}\cdot \frac{\tilde{m}n - m\tilde{n}} {{m}^{2} + {n}^{2}} \cdot {e}^{i\left ((m+\tilde{m})x+(n+\tilde{n})y\right )}.$$
(62)

Note that in the summation of the RHS of (62), the zero-th mode is not present since if \(m = -\tilde{m}\), \(n = -\tilde{n}\) then \(\tilde{m}n - m\tilde{n} = 0\).

We then apply the operator \({\partial }_{xy}{\Delta }^{-1}\) to both sides of (62) to obtain

$$\begin{array}{rcl} & &{ \left.{\partial }_{xy}{\Delta }^{-1}\left ({\Delta }^{-1}\nabla {\phi }_{ 0} \cdot {\nabla }^{\perp }{\phi }_{ 0}\right )\right \vert }_{(x,y)=(0,0)} \\ & & \quad =\sum\limits_{{ \vert m\vert \leq N,\vert n\vert \leq N \atop \vert \tilde{m}\vert \leq N,\vert \tilde{n}\vert \leq N} }{g}_{\mathit{mn}}{g}_{\tilde{m}\tilde{n}} \cdot \frac{\tilde{m}n - m\tilde{n}} {{m}^{2} + {n}^{2}} \cdot \frac{(m +\tilde{ m})(n +\tilde{ n})} {{(m +\tilde{ m})}^{2} + {(n +\tilde{ n})}^{2}}.\end{array}$$
(63)

By using (57), (60), (61), and (63) and a tedious calculation, we obtain

$$\text{ LHS of (54)} = 0.1436 > 0.1 = {r}_{1}.$$

Clearly this gives us all the needed estimates.

We have finished the construction of the desired initial data ϕ0 needed in Stage 1. The proof of Theorem 2 is now completed.

7 Proof of Theorem 3

In this section we give the proof of Theorem 3. The argument is similar to the proof of Theorem 2 and is again done in two stages. We sketch the details as follows.

  • Stage 1: degenerate case. Recall the reduced critical point equation,

    $$-({a}_{3}^{2} - 4{a}_{ 1}{a}_{2}) + 8{a}_{2}{b}_{1}{x}^{2} + O(t) \cdot O({x}^{2}) + O({x}^{4}) = 0.$$
    (64)

    Choose \({\phi }_{0} = {\phi }_{0}(x,y)\) so that

    $$\begin{array}{rcl} & & {a}_{3}{(0)}^{2} - 4{a}_{ 1}(0){a}_{2}(0) = 0, \\ & &{ \left. \frac{d} {\mathit{dt}}\left ({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)\right )\right \vert }_{t=0} < 0, \\ & &{a}_{2}(0) > 0,\,{a}_{3}(0) > 0,\,{b}_{1}(0) > 0,\end{array}$$
    (65)

    and also

    $${b}_{2}(0) = {b}_{3}(0) = {b}_{4}(0) = {b}_{5}(0) = 0.$$
    (66)

    The possibility of choosing ϕ0 with properties (65)–(66) will be shown in Sect. 8. Assume for the moment that these conditions are met, then for sufficiently small t 2 > 0, we have for 0 < t ≤ t 2,

    $$\begin{array}{rcl} {A}_{3}^{{\prime\prime}}&\geq & {a}_{ 3}(t) \geq {A}_{3}^{{\prime}} > 0, \\ {A}_{2}^{{\prime\prime}}&\geq & {a}_{ 2}(t) \geq {A}_{2}^{{\prime}} > 0, \\ {B}_{1}^{{\prime\prime}}&\geq & \frac{d} {\mathit{dt}}\left (4{a}_{1}(t){a}_{2}(t) - {a}_{3}^{2}(t)\right ) \geq {B}_{ 1}^{{\prime}} > 0, \\ {B}_{2}^{{\prime\prime}}&\geq & {b}_{ 1}(t) \geq {B}_{2}^{{\prime}} > 0, \end{array}$$
    (67)

    where \({A}_{i}^{{\prime}}\), \({A}_{i}^{{\prime\prime}}\), \({B}_{i}^{{\prime}}\), \({B}_{i}^{{\prime\prime}}\) are constants.

    By (65)–(67), we have for 0 < t ≤ t 2

    $$\mathit{const} \cdot t \leq 4{a}_{1}(t){a}_{2}(t) - {a}_{3}^{2}(t) \leq \mathit{const} \cdot t,$$

    and also

    $$8{a}_{2}(t){b}_{1}(t) \sim \mathit{const}.$$

    It follows that for 0 < t ≤ t 2, the equation (64) is of the form

    $$O(t) + O(1) \cdot {x}^{2} + O(t) \cdot O({x}^{2}) + O({x}^{4}) = 0$$
    (68)

    which clearly has no real-valued solution for 0 < t ≤ t 2.

  • Stage 2: a perturbation argument. In stage 2 we perturb the initial data considered in Stage 1 so that initially we will have three critical points.

    To this end, consider \(\tilde{{\phi }}_{0} =\tilde{ {\phi }}_{0}(x,y) \in {C}^{\infty }\) with the following properties:

    $$\tilde{{\phi }}_{0}(x,y) =\tilde{ {\phi }}_{0}(-x,-y),\quad \forall \,x,y,$$
    $${\left. \frac{{\partial }^{4}\tilde{{\phi }}_{0}} {\partial {x}^{m}\partial {y}^{n}}\right \vert }_{(x,y)=(0,0)} = 0,\quad \forall \,m + n = 4,0 \leq m \leq 4,$$
    $${ \left. \frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial x\partial y}\right \vert }_{(x,y)=(0,0)} = 0,\quad {\left.\frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {x}^{2}} \right \vert }_{(x,y)=(0,0)} > 0,\quad {\left.\frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {y}^{2}} \right \vert }_{(x,y)=(0,0)} > 0.$$
    (69)

    Fix \({\phi }_{0} = {\phi }_{0}(x,y)\) taken from Stage 1 which has the properties (65)–(66) and consider the perturbation by \(\tilde{{\phi }}_{0}\) having the form

    $$\tilde{{\phi }}_{0}^{\epsilon }(x,y) = {\phi }_{ 0}(x,y) - \epsilon \tilde{{\phi }}_{0}(x,y),$$
    (70)

    where ε > 0 is sufficiently small.

    Denote the corresponding solution of the main equation (1) (in the shifted coordinates) by \({\phi }^{\epsilon } = {\phi }^{\epsilon }(x,y,t)\). Expand \({\phi }^{\epsilon }(x,y,t)\) in the form

    $$\begin{array}{rcl}{ \phi }^{\epsilon }(x,y,t)& =& {\phi }^{\epsilon }(0,0,t) + {a}_{ 1}^{\epsilon }(t){x}^{2} + {a}_{ 2}^{\epsilon }{y}^{2} + {a}_{ 3}^{\epsilon }xy \\ & & \quad + {b}_{1}^{\epsilon }(t){x}^{4} + {b}_{ 2}^{\epsilon }(t){y}^{4} + {b}_{ 3}^{\epsilon }(t){x}^{3}y + {b}_{ 4}^{\epsilon }(t){x}^{2}{y}^{2} + {b}_{ 5}^{\epsilon }(t)x{y}^{3} \\ & & \quad +\tilde{ \epsilon }(x,y,t) \end{array}$$
    (71)

    where \(\tilde{\epsilon }\) satisfies an estimate similar to (20).

We now check that \({\phi }^{\epsilon }(x,y,t)\) has the desired properties needed in Theorem 3.

  1. (a)

    At t = 0, \({\phi }^{\epsilon }(x,y,0)\) has three critical points in the neighborhood U δ. Also (0, 0) is a saddle point.

    To prove this, we note that due to (65), (66) and (69), the reduced critical point equation for \({\phi }^{\epsilon }(x,y,t)\) takes the form

    $${({a}_{3}^{\epsilon }(0))}^{2} - 4{a}_{ 1}^{\epsilon }(0){a}_{ 2}^{\epsilon }(0) - 8{a}_{ 2}^{\epsilon }(0){b}_{ 1}^{\epsilon }(0){x}^{2} + O({x}^{4}) = 0.$$
    (72)

    Denote

    $$\begin{array}{rcl} \tilde{{a}}_{1}& =&{ \left.\frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {x}^{2}} \right \vert }_{(x,y)=(0,0)} > 0, \\ \tilde{{a}}_{2}& =&{ \left.\frac{{\partial }^{2}\tilde{{\phi }}_{0}} {\partial {y}^{2}} \right \vert }_{(x,y)=(0,0)} > \end{array}$$
    (0.)

    By (65), (69), and (70), we have

    $$\begin{array}{rcl} & & {({a}_{3}^{\epsilon }(0))}^{2} - 4{a}_{ 1}^{\epsilon }(0){a}_{ 2}^{\epsilon }(0) \\ & & \quad = {a}_{3}{(0)}^{2} + O({\epsilon }^{2}) - 4({a}_{ 1}(0) - \epsilon \tilde{{a}}_{1})({a}_{2}(0) - \epsilon \tilde{{a}}_{2}) \\ & & \quad = 4({a}_{1}(0)\tilde{{a}}_{2} + {a}_{2}(0)\tilde{{a}}_{1})\epsilon + O({\epsilon }^{2}). \end{array}$$
    (73)

    On the other hand, for sufficiently small ε > 0, by using (72), (69), and (70), we have

    $$\begin{array}{rcl}{ a}_{2}^{\epsilon }(0){b}_{ 1}^{\epsilon }(0)& =& ({a}_{ 2}(0) - O(\epsilon )) \cdot ({b}_{1}(0) + O({\epsilon }^{2})) \\ & =& {a}_{2}(0){b}_{1}(0) - O(\epsilon ) \\ & & \sim \mathit{const}. \end{array}$$
    (74)

    Therefore by (73) and (74), the equation (72) takes the form

    $$O(1)\epsilon - O(1) \cdot {x}^{2} + O({x}^{4}) = 0,$$

    or simply

    $$O(1) \cdot \epsilon - O(1) \cdot O({x}^{2}) = 0.$$

    It is clear that for ε > 0 sufficiently small this equation has two real-valued solutions in U δ.

    To verify that (0, 0) is a saddle point at t = 0, we observe that by (73), for sufficiently small ε > 0,

    $${({a}_{3}^{\epsilon }(0))}^{2} - 4{a}_{ 1}^{\epsilon }(0){a}_{ 2}^{\epsilon }(0) > 0.$$
    (75)

    Also we have by (65)

    $${a}_{1}^{\epsilon }(0) > 0,\quad {a}_{ 2}^{\epsilon }(0) > 0.$$
    (76)

    Equations 75 and 76 show that the Hessian matrix

    $$\left (\begin{array}{cc} {a}_{1}^{\epsilon }(0) &\frac{1} {2}{a}_{3}^{\epsilon }(0) \\ \frac{1} {2}{a}_{3}^{\epsilon }(0)& {a}_{ 2}^{\epsilon }(0)\end{array} \right )$$

    has one positive eigen-value and one negative eigen-value. Hence (0, 0) is a saddle.

  2. (b)

    Consider the function

    $${D}^{\epsilon }(t) = {({a}_{ 3}^{\epsilon }(t))}^{2} - 4{a}_{ 1}^{\epsilon }(t){a}_{ 2}^{\epsilon }(t).$$

    It will be proven below that for sufficiently small ε > 0, the following holds:

There exists unique \({t}_{1} = {t}_{1}(\epsilon ) > 0\) such that

$$\begin{array}{rcl} & & {D}^{\epsilon }(t) > 0,\quad \text{ for}0 \leq t < {t}_{ 1}, \\ & & {D}^{\epsilon }(t) = 0,\quad \text{ for}t = {t}_{ 1}, \\ & & {D}^{\epsilon }(t) < 0,\quad \text{ for}{t}_{ 1} < t \leq {t}_{2}.\end{array}$$
(77)

Furthermore, the reduced critical-point equation

$${D}^{\epsilon }(t) - 8{a}_{ 2}^{\epsilon }(t){b}_{ 1}^{\epsilon }(t){x}^{2} + O(t) \cdot O({x}^{2}) + O({x}^{4}) = 0$$
(78)

has

  • Two nonzero solutions for 0 ≤ t < t 1,

  • Exactly one solution given by x = 0 for t = t 1,

  • No solutions for t 1 < t ≤ t 2.

To prove (77), we recall the bound (67), where for 0 ≤ t ≤ t 2

$${B}_{1}^{{\prime\prime}}\geq \frac{d} {\mathit{dt}}\left ({a}_{3}^{2}(t) - 4{a}_{ 1}(t){a}_{2}(t)\right ) \geq {B}_{1}^{{\prime}} > 0.$$
(79)

Since our initial data are given by

$${\phi }_{0}^{\epsilon }(x,y) = {\phi }_{ 0}(x,y) + \epsilon \tilde{{\phi }}_{0}(x,y),$$

it follows from simple perturbation theory that

$$2{B}_{1}^{{\prime\prime}}\geq \frac{d} {\mathit{dt}}\left (4{a}_{1}^{\epsilon }(t){a}_{ 2}^{\epsilon }(t) - {({a}_{ 3}^{\epsilon }(t))}^{2}\right ) \geq \frac{{B}_{1}^{{\prime}}} {2} > 0,$$
(80)

for any 0 ≤ t ≤ t 2.

This means in particular that D ε(t) is strictly decreasing for 0 ≤ t ≤ t 2.

By (73), we have for t = 0 and ε sufficiently small,

$${D}^{\epsilon }(0) > 0.$$
(81)

On the other hand for t = t 2, by using the analysis from Stage 1, we have

$${({a}_{3}({t}_{2}))}^{2} - 4{a}_{ 1}({t}_{2}){a}_{2}({t}_{2}) < 0.$$

Since

$${D}^{\epsilon }({t}_{ 2}) = {({a}_{3}({t}_{2}))}^{2} - 4{a}_{ 1}({t}_{2}){a}_{2}({t}_{2}) + O(\epsilon ),$$

it follows easily that for ε sufficiently small

$${D}^{\epsilon }({t}_{ 2}) < 0.$$
(82)

Now (80)–(82) easily yield (77).

8 Construction of ϕ0 Satisfying (65)–(66)

We now demonstrate the existence of ϕ0 = ϕ0(x, y) which satisfies conditions (65)–(66). The construction is similar to the one in Sect. 6 and therefore we shall only sketch the details.

Choose ϕ0 in the form

$${\phi }_{0}(x,y) =\sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }{f}_{\mathit{mn}}\left (\cos (mx + ny) - {(-1)}^{n}\cos (mx - ny)\right ),$$
(83)

where f mn are the coefficients to be determined.

Now recall the conditions (65) and (66) and set

$$\begin{array}{rcl} & & {a}_{3}(0) = 2, \\ & & {a}_{1}(0) = {a}_{2}(0) = 1, \\ & & {b}_{1}(0) = \frac{{r}_{1}} {24} > 0, \\ & & {b}_{2}(0) = {b}_{3}(0) = {b}_{4}(0) = {b}_{5}(0) = 0,\end{array}$$
(84)

where r 1 is a parameter whose value will be specified later.

The second condition in (65) simplifies to

$${\partial }_{xy}{\Delta }^{-1}({\nabla }^{\perp }{\phi }_{ 0} \cdot \nabla \Delta {\phi }_{0})(0,0) < {r}_{1}.$$
(85)

Our goal is to find (f mn ) in (83) such that both (84) and (85) hold. In our formulae below, the summation is understood to be in the region \(\{(m,n)\,:\,1 \leq m,n \leq N\mbox{ and}m + n\mbox{ is even}\}\). In terms of f mn , the conditions (84) now take the form

$$\begin{array}{rcl} & & \sum {f}_{\mathit{mn}} \cdot mn \cdot (1 + {(-1)}^{n}) = -2, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{2} \cdot (1 - {(-1)}^{n}) = -1, \\ & & \sum {f}_{\mathit{mn}} \cdot {n}^{2} \cdot (1 - {(-1)}^{n}) = -1, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{4} \cdot (1 - {(-1)}^{n}) = {r}_{ 1}, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{3}n \cdot (1 + {(-1)}^{n}) = 0, \\ & & \sum {f}_{\mathit{mn}} \cdot {m}^{2}{n}^{2} \cdot (1 - {(-1)}^{n}) = 0, \\ & & \sum {f}_{\mathit{mn}} \cdot m{n}^{3} \cdot (1 + {(-1)}^{n}) = 0, \\ & & \sum {f}_{\mathit{mn}} \cdot {n}^{4} \cdot (1 - {(-1)}^{n}) = 0. \end{array}$$
(86)

Due to the factors (1 ± ( − 1)n) which can vanish depending on the parity of n in the summation, we distinguish two types of coefficients. We shall say f mn is even if both m and n are even. Otherwise f mn is called odd. Notice that due to the constraint that m + n is even we shall only have either odd or even coefficients.

Consider first the equations for even coefficients. From (86) we only need

$$\begin{array}{rcl} & & \sum\limits_{{ m,n\geq 2 \atop m,n\text{ are even}} }{f}_{\mathit{mn}} \cdot mn = -1, \\ & & \sum\limits_{{ m,n\geq 2 \atop m,n\text{ are even}} }{f}_{\mathit{mn}} \cdot {m}^{3}n = 0, \\ & & \sum\limits_{{ m,n\geq 2 \atop m,n\text{ are even}} }{f}_{\mathit{mn}} \cdot m{n}^{3} = 0,\end{array}$$
(87)

Now we assume that we only have two nonzero even coefficients f 22 and f 44. Then from (87) we get

$$\begin{array}{rcl} & & {f}_{22} \cdot {2}^{2} + {f}_{ 44} \cdot {4}^{2} = -1, \\ & & {f}_{22} \cdot {2}^{4} + {f}_{ 44} \cdot {4}^{4} = \end{array}$$
(0.)

A simple computation gives that

$${f}_{22} = -1/3,\quad {f}_{44} = 1/48;$$
(88)

Next we turn to odd coefficients.

From (86), we get

$$\begin{array}{rcl} & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {m}^{2} = -\frac{1} {2}, \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {n}^{2} = -\frac{1} {2}, \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {m}^{4} = \frac{{r}_{1}} {2} , \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {m}^{2}{n}^{2} = 0, \\ & & \sum\limits_{{ 1\leq m,n\leq N \atop m,n\text{ are odd}} }{f}_{\mathit{mn}} \cdot {n}^{4} = 0, \end{array}$$
(89)

To simplify matters, we assume that the only nonzero odd coefficients are f 11, f 31, f 33, f 15, f 51.

Let r 2 be another parameter whose value will be specified later. We shall choose f 51 = r 2 and add this condition to (89). For the coefficients f 11, f 31, f 33, f 15, f 51 we then have the matrix equation

$$\left (\begin{array}{cccccc} 1 & 1 &{3}^{2} & {3}^{2} & 1 &{5}^{2} \\ 1 &{3}^{2} & 1 &{3}^{2} & {5}^{2} & 1 \\ 1 & 1 &{3}^{4} & {3}^{4} & 1 &{5}^{4} \\ 1 &{3}^{2} & {3}^{2} & {9}^{2} & {5}^{2} & {5}^{2} \\ 1 &{3}^{4} & 1 &{3}^{4} & {5}^{4} & 1\\ 0\quad 0 & 0 & 0 & 0 & 0 & 1 \end{array} \right )\left (\begin{array}{c} {f}_{11} \\ {f}_{13} \\ {f}_{31} \\ {f}_{33} \\ {f}_{15} \\ {f}_{51} \end{array} \right ) = \left (\begin{array}{c} -\frac{1} {2} \\ -\frac{1} {2} \\ \frac{{r}_{1}} {2} \\ 0\\ 0 \\ {r}_{2} \end{array} \right )$$
(90)

Choose r 1 = r 2 = 1. From (90), we obtain

$$\left (\begin{array}{c} {f}_{11} \\ {f}_{13} \\ {f}_{31} \\ {f}_{33} \\ {f}_{15} \\ {f}_{51} \end{array} \right ) = \left (\begin{array}{c} 57.3646\\ - 8.9883 \\ - 8.9922\\ 0.6727 \\ 0.9987\\ 1.0000 \end{array} \right )$$
(91)

We have completely solved (84). It remains to check the condition (85).

For this purpose, we rewrite (83) as

$$\begin{array}{rcl}{ \phi }_{0}(x,y)& =& \sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }{f}_{\mathit{mn}} \cdot \frac{{e}^{i(mx+ny)} + {e}^{-i(mx+ny)}} {2} \\ & & \quad +\sum\limits_{{ 1\leq m,n\leq N \atop m+n\text{ is even}} }{f}_{\mathit{mn}} \cdot {(-1)}^{n+1} \cdot \frac{{e}^{i(mx-ny)} + {e}^{-i(mx-ny)}} {2}. \\ & =& \sum\limits_{\vert m\vert \leq N,\vert n\vert \leq N}{g}_{\mathit{mn}}{e}^{i(mx+ny)}, \end{array}$$
(92)

where the coefficients g mn satisfy

  • g mn  = 0 if (m + n) is not even or m = 0 or n = 0.

  • \({g}_{\mathit{mn}} = \frac{1} {2}{f}_{\vert m\vert ,\vert n\vert }\) if mn > 0.

  • \({g}_{\mathit{mn}} = \frac{1} {2}{f}_{\vert m\vert ,\vert n\vert }\cdot {(-1)}^{n+1}\) if mn < 0.

In terms of the coefficients g mn , the LHS of (85) takes the form

$$({\nabla }^{-1}\nabla {\phi }_{ 0}\cdot {\nabla }^{\perp }{\phi }_{ 0})(x,y) =\sum\limits_{{ \vert m\vert \leq N,\vert n\vert \leq N \atop \vert \tilde{m}\vert \leq N,\vert \tilde{n}\vert \leq N} }{g}_{\mathit{mn}}{g}_{\tilde{m}\tilde{n}}\cdot \frac{\tilde{m}n - m\tilde{n}} {{m}^{2} + {n}^{2}} \cdot {e}^{i\left ((m+\tilde{m})x+(n+\tilde{n})y\right )}.$$

By a tedious calculation, we obtain

$$\text{ LHS of (85)} = -0.1420 < 1 = {r}_{1}.$$

Clearly this gives us all the needed estimates.

We have finished the construction of the desired initial data ϕ0 needed in Stage 1 of Sect. 7. The proof of Theorem 3 is now completed.