Abstract
We prove the existence of small amplitude quasi-periodic solutions for quasi-linear and fully nonlinear forced perturbations of the linear Airy equation. For Hamiltonian or reversible nonlinearities we also prove their linear stability. The key analysis concerns the reducibility of the linearized operator at an approximate solution, which provides a sharp asymptotic expansion of its eigenvalues. For quasi-linear perturbations this cannot be directly obtained by a KAM iteration. Hence we first perform a regularization procedure, which conjugates the linearized operator to an operator with constant coefficients plus a bounded remainder. These transformations are obtained by changes of variables induced by diffeomorphisms of the torus and pseudo-differential operators. At this point we implement a Nash–Moser iteration (with second order Melnikov non-resonance conditions) which completes the reduction to constant coefficients.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
A challenging and open question in the theory of quasi-periodic motions for PDEs concerns its possible extension to quasi-linear and fully nonlinear equations, namely PDEs whose nonlinearities contain derivatives of the same order as the linear operator. Besides its mathematical interest, this question is also relevant in view of applications to physical real world nonlinear models, for example in fluid dynamics and elasticity.
The goal of this paper is to make the first step in this direction, developing a KAM theory for quasi-periodically forced perturbations of the linear Airy equation
In (1.1) the modulus of the frequency vector \(\omega \) is used as a parameter in the problem, see (1.2).
First, in Theorem 1.1 we prove an existence result of quasi-periodic solutions for a large class of quasi-linear nonlinearities \( f \). Then for Hamiltonian or reversible nonlinearities, we also prove the linear stability of the solutions, see Theorems 1.2, 1.3. Theorem 1.3 also holds for fully nonlinear perturbations. The precise meaning of stability is stated in Theorem 1.5. The key analysis is the reduction to constant coefficients of the linearized Airy equation, see Theorem 1.4. These results are presented in [3]. To the best of our knowledge, these are the first KAM results for quasi-linear or fully nonlinear PDEs. We reserve to a future work the study of autonomous, parameter independent, perturbations of KdV, which also requires the analysis of the frequency-to-amplitude map arising from the nonlinearity. We think it is worth to split these difficulties.
Let us outline a short history of the subject. KAM and Nash–Moser theory for PDEs, which counts nowadays on a wide literature, started with the pioneering works of Kuksin [29] and Wayne [43], and was developed in the 1990s by Craig–Wayne [17], Bourgain [12, 13], Pöschel [37] (see also [16, 31, 32] for more references). These papers concern wave and Schrödinger equations with bounded Hamiltonian nonlinearities.
The first KAM results for unbounded perturbations have been obtained by Kuksin [30, 31], and, then, Kappeler–Pöschel [27], for Hamiltonian, analytic perturbations of KdV. Here the highest constant coefficients linear operator is \(\partial _{xxx}\) and the nonlinearity contains one space derivative \(\partial _x\). This means that the Hamiltonian density is a functions of \(x\) and \(u\) (it could also depend on \( |\partial _x|^{1/2}u\)). The key idea is to work with a variable coefficients normal form. The corresponding homological equations are solved thanks to the so called “Kuksin lemma”, see Chapter 5 in [27]. Their approach has been recently improved by Liu–Yuan [34] who proved a stronger version of the Kuksin lemma. Then in [35] (see also Zhang et al. [44]) they applied it to \(1\)-dimensional derivative NLS (DNLS) and Benjamin–Ono equations, where the highest order constant coefficients linear operator is \( \partial _{xx}\) and the nonlinearity contains one derivative \(\partial _x\). These methods apply to dispersive PDEs with derivatives like KdV, DNLS, the Duffing oscillator (see Bambusi–Graffi [4]), but not to derivative wave equations (DNLW) which contain first order derivatives \(\partial _x , \partial _t \) in the nonlinearity.
For DNLW, KAM theorems have been recently proved by Berti–Biasco–Procesi for both Hamiltonian [6] and reversible [7] equations. The key ingredient is an asymptotic expansion of the perturbed eigenvalues that is sufficiently accurate to impose the second order Melnikov non-resonance conditions. This is achieved introducing the notion of “quasi-Töplitz” vector field, which is inspired to the concept of “quasi-Töplitz” and “Töplitz–Lipschitz” Hamiltonians, developed, respectively, in Procesi–Xu [39] and Eliasson–Kuksin [19, 20] (see also Geng et al. [21], Grébert–Thomann [23], Procesi–Procesi [38]).
Existence of quasi-periodic solutions of PDEs can also be proved by imposing only the first order Melnikov conditions. This approach has been developed by Bourgain [12–15] extending the work of Craig–Wayne [17] for periodic solutions. It is especially convenient for PDEs in higher space dimension, because of the high multiplicity of the eigenvalues: see also the recent results by Wang [42], Berti–Bolle [9, 10] (and [11, 22] for periodic solutions). This method does not provide information about the stability of the quasi-periodic solutions, because the linearized equations have variable coefficients.
All the aforementioned results concern “semilinear” PDEs, namely equations in which the nonlinearity contains strictly less derivatives than the linear differential operator. For quasi-linear or fully nonlinear PDEs the perturbative effect is much stronger, and the possibility of extending KAM theory in this context is doubtful, see [16, 27, 35], because of the possible phenomenon of formation of singularities outlined in Lax [33], Klainerman and Majda [28]. For example, Kappeler–Pöschel [27] (remark 3, page 19) wrote: “It would be interesting to obtain perturbation results which also include terms of higher order, at least in the region where the KdV approximation is valid. However, results of this type are still out of reach, if true at all”. The study of this important issue is at its first steps.
For quasi-linear and fully nonlinear PDEs, the literature concerns, so far, only existence of periodic solutions. We quote the classical bifurcation results of Rabinowitz [40, 41] for fully nonlinear forced wave equations with a small dissipation term. More recently, Baldi [1] proved existence of periodic forced vibrations for quasi-linear Kirchhoff equations. Here the quasi-linear perturbation term depends explicitly only on time. Both these results are proved via Nash–Moser methods.
For the water waves equations, which are a fully nonlinear PDE, we mention the pioneering work of Iooss et al. [24] about existence of time periodic standing waves, and of Iooss–Plotnikov [25, 26] for 3-dimensional traveling water waves. The key idea is to use diffeomorphisms of the torus \({\mathbb {T}}^2\) and pseudo-differential operators, in order to conjugate the linearized operator to one with constant coefficients plus a sufficiently smoothing remainder. This is enough to invert the whole linearized operator by Neumann series. Very recently Baldi [2] has further developed the techniques of [24], proving the existence of periodic solutions for fully nonlinear autonomous, reversible Benjamin–Ono equations.
These approaches do not imply the linear stability of the solutions and, unfortunately, they do not work for quasi-periodic solutions, because stronger small divisors difficulties arise, see the comment 1.2 below.
We finally mention that, for quasi-linear Klein–Gordon equations on spheres, Delort [18] has proved long time existence results via Birkhoff normal form methods.
The key analysis of the present paper concerns the linearized operator (1.16) obtained at any step of the Nash–Moser iteration. Its reduction to constant coefficients can not be obtained by the KAM schemes [27, 30, 35]. The reason is that the perturbation in (1.1) is unbounded of order three (i.e. \( O(\partial _{xxx}) \)) and the homological equation (solved by the Kuksin lemma) gains only two space derivatives (thanks to the cubic dispersion relation of KdV). Therefore the scheme does not converge. Our idea is to perform, before starting with the KAM iteration, some preliminary transformations which decrease the \( \partial _x \)-order of the perturbation, but not its size. We use changes of variables, like quasi-periodic time-dependent diffeomorphisms of the space variable \( x \), a quasi-periodic reparametrization of time, multiplication operators and Fourier multipliers, which reduce the linearized operator to constant coefficients up to a bounded remainder, see (1.24). These transformations, which are inspired by [2, 24], are very different from the usual KAM transformations. At this point, we start a KAM reducibility scheme à la Eliasson-Kuksin which reduces the size of the perturbation quadratically, and completely diagonalizes the linearized operator (actually, since we work with finite differentiability, we implement a Nash–Moser scheme). For reversible or Hamiltonian perturbations we get that the eigenvalues of this diagonal operator are purely imaginary, i.e. we prove the linear stability. In Sect. 1.2 we present the main ideas of the proof.
1.1 Main results
We consider problem (1.1) where \( \varepsilon > 0 \) is a small parameter, the nonlinearity is quasi-periodic in time with diophantine frequency vector
and \( f(\varphi , x, z )\), \(\varphi \in {\mathbb {T}}^{\nu }\), \( z := (z_0, z_1, z_2, z_3) \in {\mathbb {R}}^4 \), is a finitely many times differentiable function, namely
for some \( q \in {\mathbb {N}}\) large enough. For simplicity we fix in (1.2) the diophantine exponent \( \tau _0 := \nu \). The only “external” parameter in (1.1) is \( \lambda \), which is the length of the frequency vector (this corresponds to a time scaling). We consider the following questions:
-
For \( \varepsilon \) small enough, do there exist quasi-periodic solutions of (1.1) for positive measure sets of \( \lambda \in \Lambda \)?
-
Are these solutions linearly stable?
Clearly, if \( f(\varphi ,x, 0)\) is not identically zero, then \( u = 0 \) is not a solution of (1.1) for \( \varepsilon \ne 0 \). Thus we look for non-trivial \( (2 \pi )^{\nu +1}\)-periodic solutions \(u(\varphi ,x) \) of the Airy equation
in the Sobolev space
where
From now on, we fix \( {{\mathfrak {s}}}_0 := (\nu + 2) / 2 > (\nu +1 ) / 2 \), so that for all \(s \ge \mathfrak {s}_0\) the Sobolev space \(H^s\) is a Banach algebra, and it is continuously embedded \( H^s ({\mathbb {T}}^{\nu +1} ) \hookrightarrow C({\mathbb {T}}^{\nu +1} ) \).
We need some assumptions on the perturbation \( f(\varphi , x,u, u_x, u_{xx}, u_{xxx}) \). We suppose that
-
Type (F). The fully nonlinear perturbation has the form
$$\begin{aligned} f (\varphi , x, u, u_x, u_{xxx}) , \end{aligned}$$(1.6)
namely it is independent of \( u_{xx} \) (note that the dependence on \( u_{xxx} \) may be nonlinear). Otherwise, we require that
-
Type (Q). The perturbation is quasi-linear, namely
$$\begin{aligned} f = f_0 (\varphi , x, u, u_x, u_{xx}) + f_1 (\varphi , x,u,u_x, u_{xx}) u_{xxx} \end{aligned}$$is affine in \( u_{xxx} \), and it satisfies (naming the variables \( z_0 = u \), \( z_1 = u_x \), \( z_2 = u_{xx} \), \( z_3 = u_{xxx} \))
$$\begin{aligned} \partial _{z_2} f = \alpha (\varphi ) \left( \partial ^2_{z_3 x} f + z_1 \partial ^2_{z_3 z_0} f + z_2 \partial ^2_{z_3 z_1} f + z_3 \partial ^2_{z_3 z_2} f \right) \end{aligned}$$(1.7)for some function \( \alpha (\varphi ) \) (independent on \( x \)).
The Hamiltonian nonlinearities in (1.11) satisfy the above assumption (Q), see remark 3.2. In comment 3 after Theorem 1.5 we explain the reason for assuming either condition (F) or (Q).
The following theorem is an existence result of quasi-periodic solutions.
Theorem 1.1
(Existence) There exist \( s := s( \nu ) > 0\), \( q := q( \nu ) \in {\mathbb {N}}\), such that:
For every quasi-linear nonlinearity \( f \in C^q \) of the form
satisfying the (Q)-condition (1.7), for all \(\varepsilon \in (0, \varepsilon _0)\), where \(\varepsilon _0 := \varepsilon _0 (f, \nu ) \) is small enough, there exists a Cantor set \( \mathcal{C}_\varepsilon \subset \Lambda \) of asymptotically full Lebesgue measure, i.e.
such that \( \forall \lambda \in \mathcal{C}_\varepsilon \) the perturbed equation (1.4) has a solution \( u( \varepsilon , \lambda ) \in H^s \) with \( \Vert u(\varepsilon , \lambda ) \Vert _s \rightarrow 0 \) as \( \varepsilon \rightarrow 0 \).
We may ensure the linear stability of the solutions requiring further conditions on the nonlinearity, see Theorem 1.5 for the precise statement. The first case is that of Hamiltonian equations
which have the form (1.1), (1.8) with
The phase space of (1.10) is
endowed with the non-degenerate symplectic form
where \( \partial _x^{-1} u \) is the periodic primitive of \( u \) with zero average, see (3.19). As proved in Remark 3.2, the Hamiltonian nonlinearity \( f \) in (1.11) satisfies also the (Q)-condition (1.7). As a consequence, Theorem 1.1 implies the existence of quasi-periodic solutions of (1.10). In addition, we also prove their linear stability.
Theorem 1.2
(Hamiltonian case) For all Hamiltonian quasi-linear equations (1.10) the quasi-periodic solution \(u(\varepsilon ,\lambda )\) found in Theorem 1.1 is linearly stable (see Theorem 1.5).
The stability of the quasi-periodic solutions also follows by the reversibility condition
Actually (1.13) implies that the infinite-dimensional non-autonomous dynamical system
is reversible with respect to the involution
namely
In this case it is natural to look for “reversible” solutions of (1.4), that is
Theorem 1.3
(Reversible case) There exist \( s := s( \nu ) > 0\), \( q := q( \nu ) \in {\mathbb {N}}\), such that:
For every nonlinearity \( f \in C^q \) that satisfies
-
(i)
the reversibility condition (1.13), and
-
(ii)
either the (F)-condition (1.6) or the (Q)-condition (1.7), for all \(\varepsilon \in (0, \varepsilon _0)\), where \(\varepsilon _0 := \varepsilon _0 (f, \nu ) \) is small enough, there exists a Cantor set \( \mathcal{C}_\varepsilon \subset \Lambda \) with Lebesgue measure satisfying (1.9), such that for all \( \lambda \in \mathcal{C}_\varepsilon \) the perturbed Airy equation (1.4) has a solution \( u (\varepsilon , \lambda ) \in H^s \) that satisfies (1.14), with \( \Vert u (\varepsilon , \lambda ) \Vert _s \rightarrow 0 \) as \( \varepsilon \rightarrow 0 \). In addition, \(u(\varepsilon ,\lambda )\) is linearly stable.
Let us make some comments on the results.
-
1.
The quasi-periodic solutions of Theorem 1.1 could be unstable because the nonlinearity \( f \) has no special structure and some eigenvalues of the linearized operator at the solutions could have non zero real part (partially hyperbolic tori). In any case, we reduce to constant coefficients the linearized operator (Theorem 1.4) and we may compute its eigenvalues (i.e. Lyapunov exponents) with any order of accuracy. With further conditions on the nonlinearity—like reversibility or in the Hamiltonian case—the eigenvalues are purely imaginary, and the torus is linearly stable. The present situation is very different with respect to [9, 10, 12–15, 17] and also [2, 24–26], where the lack of stability information is due to the fact that the linearized equation has variable coefficients.
-
2.
One cannot expect the existence of quasi-periodic solutions of (1.4) for any perturbation \( f \). Actually, if \( f = m \ne 0 \) is a constant, then, integrating (1.4) in \( (\varphi ,x) \) we find the contradiction \( \varepsilon m = 0 \). This is a consequence of the fact that
$$\begin{aligned} \mathrm{Ker}(\omega \cdot \partial _\varphi + \partial _{xxx}) = {\mathbb {R}}\end{aligned}$$(1.15)is non trivial. Both the condition (1.8) (which is satisfied by the Hamiltonian nonlinearities) and the reversibility condition (1.13) allow to overcome this obstruction, working in a space of functions with zero average. The degeneracy (1.15) also reflects in the fact that the solutions of (1.4) appear as a \(1\)-dimensional family \( c + u_c( \varepsilon , \lambda ) \) parametrized by the “average” \( c \in {\mathbb {R}}\). We could also avoid this degeneracy by adding a “mass” term \( + m u \) in (1.1), but it does not seem to have physical meaning.
-
3.
In Theorem 1.1 we have not considered the case in which \( f \) is fully nonlinear and satisfies condition (F) in (1.6), because any nonlinearity of the form (1.8) is automatically quasi-linear (and so the first condition in (1.7) holds) and (1.6) trivially implies the second condition in (1.7) with \( \alpha (\varphi ) = 0 \).
-
4.
The solutions \( u \in H^s \) have the same regularity in both variables \( (\varphi ,x) \). This functional setting is convenient when using changes of variables that mix the time and space variables, like the composition operators \({\mathcal {A}}\), \(\mathcal {T}\) in Sects. 3.1, 3.4.
-
5.
In the Hamiltonian case (1.10), the nonlinearity \(f\) in (1.11) satisfies the reversibility condition (1.13) if and only if \( F( -\varphi , -x, z_0, -z_1) = F( \varphi , x, z_0, z_1) \).
Theorems 1.1–1.3 are based on a Nash–Moser iterative scheme. An essential ingredient in the proof—which also implies the linear stability of the quasi-periodic solutions—is the reducibility of the linear operator
obtained by linearizing (1.4) at any approximate (or exact) solution \( u \), where the coefficients \( a_i (\varphi , x) \) are defined in (3.2). Let \( H^s_x := H^s ({\mathbb {T}}) \) denote the usual Sobolev spaces of functions of \( x \in {\mathbb {T}}\) only.
Theorem 1.4
(Reducibility) There exist \( \bar{\sigma }> 0 \), \( q \in {\mathbb {N}}\), depending on \( \nu \), such that:
For every nonlinearity \( f \in C^q \) that satisfies the hypotheses of Theorems 1.1 or 1.3, for all \(\varepsilon \in (0, \varepsilon _0)\), where \(\varepsilon _0 := \varepsilon _0 (f, \nu ) \) is small enough, for all \(u\) in the ball \(\Vert u \Vert _{ { {\mathfrak {s}}}_0 + \bar{\sigma }} \le 1\), there exists a Cantor like set \( \Lambda _\infty (u) \subset \Lambda \) such that, for all \( \lambda \in \Lambda _\infty (u) \):
-
i)
for all \( s \in ({{\mathfrak {s}}}_0, q - \bar{\sigma }) \), if \(\Vert u \Vert _{ s + \bar{\sigma }} < + \infty \) then there exist linear invertible bounded operators \( W_1 \), \( W_2 : H^s ({\mathbb {T}}^{\nu +1})\rightarrow H^s ( {\mathbb {T}}^{\nu +1} ) \) (see (4.72)) with bounded inverse, that semi-conjugate the linear operator \( \mathcal{L}(u) \) in (1.16) to the diagonal operator \( \mathcal{L}_\infty \), namely
$$\begin{aligned} \mathcal{L}(u) = W_1 \mathcal{L}_\infty W_2^{-1} , \quad \mathcal{L}_\infty := {\omega \cdot \partial _{\varphi }}+ \mathcal{D}_\infty \end{aligned}$$(1.17)where
$$\begin{aligned}&\mathcal{D}_\infty := \mathrm{diag}_{j \in {\mathbb {Z}}} \{ \mu _j \}, \quad \mu _j := \mathrm{i} (-m_3 j^3 + m_1 j) + r_j , \quad m_3, m_1 \in {\mathbb {R}},\nonumber \\&\sup _j |r_j | \le C \varepsilon . \end{aligned}$$(1.18) -
ii)
For each \( \varphi \in {\mathbb {T}}^\nu \) the operators \( W_i \) are also bounded linear bijections of \( H^s_x \) (see notation (2.18))
$$\begin{aligned} W_i ( \varphi ) , W_i^{-1} ( \varphi ) : H^s_x \rightarrow H^s_x , \quad i = 1,2. \end{aligned}$$A curve \( h(t) = h(t, \cdot ) \in H^{s}_x \) is a solution of the quasi-periodically forced linear equation
$$\begin{aligned} \partial _t h + (1 + a_3(\omega t,x)) \partial _{xxx}h + a_2(\omega t,x) \partial _{xx}h + a_1(\omega t,x) \partial _xh + a_0 (\omega t,x)h = 0\nonumber \\ \end{aligned}$$(1.19)if and only if the transformed curve
$$\begin{aligned} v(t) := v(t, \cdot ) := W_2^{-1} ( \omega t ) [h(t)] \in H^{s}_x \end{aligned}$$is a solution of the constant coefficients dynamical system
$$\begin{aligned} \partial _t v + \mathcal{D}_\infty v = 0 \, , \quad {\dot{v}}_j = - \mu _j v_j , \ \ \forall j \in {\mathbb {Z}}. \end{aligned}$$(1.20)In the reversible or Hamiltonian case all the \( \mu _j \in \mathrm{i} {\mathbb {R}}\) are purely imaginary.
The operator \( W_1 \) differs from \( W_2 \) (see (4.72)) only for the multiplication by the function \( \rho \) in (3.26) which comes from the re-parametrization of time of Sect. 3.2. As explained in Sect. 2.2 this does not affect the dynamical consequence of Theorem 1.4-\(ii\)).
The exponents \( \mu _j \) can be effectively computed. All the solutions of (1.20) are
If the \( \mu _j \) are purely imaginary—as in the reversible or the Hamiltonian cases—all the solutions of (1.20) are almost periodic in time (in general) and the Sobolev norm
is constant in time. As a consequence we have:
Theorem 1.5
(Linear stability) Assume the hypothesis of Theorem 1.4 and, in addition, that \( f \) is Hamiltonian (see (1.11)) or it satisfies the reversibility condition (1.13). Then, \( \forall s \in ( \mathfrak {s}_0, q - \bar{\sigma }- {\mathfrak {s}}_0) \), \( \Vert u \Vert _{s+ {\mathfrak {s}}_0 + \bar{\sigma }} < + \infty \), there exists \( K_0 > 0 \) such that for all \(\lambda \in \Lambda _\infty (u) \), \(\varepsilon \in (0,\varepsilon _0)\), all the solutions of (1.19) satisfy
and, for some \( \mathtt a \in (0,1) \),
Theorems 1.1–1.5 are proved in Sect. 5.1 collecting all the information of Sects. 2–5.
1.2 Ideas of the proof
The proof of Theorems 1.1–1.3 is based on a Nash–Moser iterative scheme in the scale of Sobolev spaces \( H^s \). The main issue concerns the invertibility of the linearized operator \( \mathcal{L} \) in (1.16), at each step of the iteration, and the proof of the tame estimates (5.7) for its right inverse. This information is obtained in Theorem 4.3 by conjugating \( \mathcal{L} \) to constant coefficients. This is also the key which implies the stability results for the Hamiltonian and reversible nonlinearities, see Theorems 1.4–1.5.
We now explain the main ideas of the reducibility scheme. The term of \( \mathcal{L} \) that produces the strongest perturbative effects to the spectrum (and eigenfunctions) is \( a_3 (\varphi ,x) \partial _{xxx} \), and, then, \( a_2 (\varphi ,x) \partial _{xx} \). The usual KAM transformations are not able to deal with these terms because they are “too close” to the identity. Our strategy is the following. First, we conjugate the operator \( {\mathcal {L}}\) in (1.16) to a constant coefficients third order differential operator plus a zero order remainder
(see (3.55)), via changes of variables induced by diffeomorphisms of the torus, a reparametrization of time, and pseudo-differential operators. This is the goal of Sect. 3. All these transformations could be composed into one map, but we find it more convenient to split the regularization procedure into separate steps (Sects. 3.1–3.5), both to highlight the basic ideas, and, especially, in order to derive estimates on the coefficients in Sect. 3.6. Let us make some comments on this procedure.
-
1.
In order to eliminate the space variable dependence of the highest order perturbation \( a_3 (\varphi ,x) \partial _{xxx} \) (see (3.20)) we use, in Sect. 3.1, \(\varphi \)-dependent changes of variables of the form
$$\begin{aligned} (\mathcal{A} h)(\varphi , x) := h(\varphi , x + \beta (\varphi , x)). \end{aligned}$$These transformations converge pointwise to the identity if \( \beta \rightarrow 0 \) but not in operatorial norm. If \( \beta \) is odd, \({\mathcal {A}}\) preserves the reversible structure, see Remark 3.4. On the other hand for the Hamiltonian equation (1.10) we use the modified transformation
$$\begin{aligned} (\mathcal{A}h)(\varphi ,x)&:= (1+ \beta _x(\varphi , x)) \, h(\varphi , x + \beta (\varphi , x))\nonumber \\&= \frac{d}{dx} \big \{ ({\partial _x}^{-1} h )(\varphi , x+ \beta (\varphi ,x)) \big \} \end{aligned}$$(1.25)for all \( h(\varphi , \cdot ) \in H^1_0 ({\mathbb {T}}) \). This map is canonical, for each \( \varphi \in {\mathbb {T}}^\nu \), with respect to the KdV-symplectic form (1.12), see Remark 3.3. Thus (1.25) preserves the Hamiltonian structure and also eliminates the term of order \( \partial _{xx} \), see Remark 3.5.
-
2.
In the second step of Sect. 3.2 we eliminate the time dependence of the coefficients of the highest order spatial derivative operator \( \partial _{xxx} \) by a quasi-periodic time re-parametrization. This procedure preserves the reversible and the Hamiltonian structure, see Remark 3.6 and 3.7.
-
3.
Assumptions (Q) (see (1.7)) or (F) (see (1.6)) allow to eliminate terms like \( a (\varphi , x) \partial _{xx} \) along this reduction procedure, see (3.41). This is possible, by a conjugation with multiplication operators (see (3.34)), if (see (3.40))
$$\begin{aligned} \int _{\mathbb {T}}\frac{a_2(\varphi ,x)}{1 + a_3(\varphi ,x)} \, dx = 0. \end{aligned}$$(1.26)If (F) holds, then the coefficient \( a_2(\varphi ,x) = 0 \) and (1.26) is satisfied. If (Q) holds, then an easy computation shows that \( a_2(\varphi ,x) = \alpha (\varphi ) \, \partial _x a_3(\varphi ,x) \) (using the explicit expression of the coefficients in (3.2)), and so
$$\begin{aligned} \int _{\mathbb {T}}\frac{a_2(\varphi ,x)}{1 + a_3(\varphi ,x)} \, dx = \int _{\mathbb {T}}\alpha (\varphi ) \, \partial _x \left( \log [ 1+a_3(\varphi ,x)] \right) \, dx = 0 . \end{aligned}$$In both cases (Q) and (F), condition (1.26) is satisfied. In the Hamiltonian case there is no need of this step because the symplectic transformation (1.25) also eliminates the term of order \( \partial _{xx} \), see Remark 3.7. We note that without assumptions (Q) or (F) we may always reduce \({\mathcal {L}}\) to a time dependent operator with \( a (\varphi ) \partial _{xx} \). If \( a(\varphi ) \) were a constant, then this term would even simplify the analysis, killing the small divisors. The pathological situation that we avoid by assuming (Q) or (F) is when \( a(\varphi ) \) changes sign. In such a case, this term acts as a friction when \( a(\varphi ) < 0 \) and as an amplifier when \( a(\varphi ) > 0 \).
-
4.
In Sects. 3.4–3.5, we are finally able to conjugate the linear operator to another one with a coefficient in front of \( \partial _x \) which is constant, i.e. obtaining (1.24). In this step we use a transformation of the form \( I + w(\varphi ,x) \partial _x^{-1} \), see (3.49). In the Hamiltonian case we use the symplectic map \( e^{\pi _0 w(\varphi , x) \partial _x^{-1}} \), see Remark 3.13.
-
5.
We can iterate the regularization procedure at any finite order \( k = 0, 1, \ldots \), conjugating \( \mathcal{L} \) to an operator of the form \({\mathfrak D} + \mathcal{R }\), where
$$\begin{aligned} {\mathfrak D} = \omega \cdot \partial _\varphi + {\mathcal {D}}, \quad {\mathcal {D}}= m_3 \partial _{x}^3 + m_1 \partial _x + \cdots + m_{-k} \partial _x^{-k} , \quad m_{i} \in {\mathbb {R}}, \end{aligned}$$has constant coefficients, and the rest \( \mathcal{R } \) is arbitrarily regularizing in space, namely
$$\begin{aligned} \partial _x^{k} \circ {\mathcal {R}}= \text {bounded}. \end{aligned}$$(1.27)However, one cannot iterate this regularization infinitely many times, because it is not a quadratic scheme, and therefore, because of the small divisors, it does not converge. This regularization procedure is sufficient to prove the invertibility of \( \mathcal{L} \), giving tame estimates for the inverse, in the periodic case, but it does not work for quasi-periodic solutions. The reason is the following. In order to use Neumann series, one needs that \({\mathfrak D} ^{-1}{\mathcal {R}}= ({\mathfrak D}^{-1}\partial _x^{-k}) (\partial _x^{k} {\mathcal {R}})\) is bounded, namely, in view of (1.27), that \( {\mathfrak D} ^{-1}\partial _x^{-k} \) is bounded. In the region where the eigenvalues \((\mathrm{i} \omega \cdot l + {\mathcal {D}}_j)\) of \({\mathfrak D} \) are small, space and time derivatives are related, \(|\omega \cdot l| \sim |j|^3\), where \(l\) is the Fourier index of time, \(j\) is that of space, and \({\mathcal {D}}_j = - \mathrm{i} m_3 j^3 + \mathrm{i} m_1 j + \cdots \) are the eigenvalues of \({\mathcal {D}}\). Imposing the first order Melnikov conditions \(|\mathrm{i} \omega \cdot l + {\mathcal {D}}_j| > \gamma |l|^{-\tau }\), in that region, \(( {\mathfrak D} ^{-1}\partial _x^{-k})\) has eigenvalues
$$\begin{aligned} \left| \frac{1}{(\mathrm{i} \omega \cdot l + {\mathcal {D}}_j) j^{k}}\, \right| < \frac{|l|^\tau }{\gamma |j|^{k}} \, < \frac{C |l|^\tau }{|\omega \cdot l|^{k/3}}. \end{aligned}$$In the periodic case, \(\omega \in {\mathbb {R}}\), \(l \in {\mathbb {Z}}\), \(|\omega \cdot l| = |\omega | |l|\), and this determines the order of regularization that is required by the procedure: \( k \ge 3 \tau \). In the quasi-periodic case, instead, \(|l|\) is not controlled by \(|\omega \cdot l|\), and the argument fails.
Once (1.24) has been obtained, we implement a quadratic reducibility KAM scheme to diagonalize \( \mathcal{L}_5 \), namely to conjugate \( \mathcal{L}_5 \) to the diagonal operator \( \mathcal{L}_\infty \) in (1.17). Since we work with finite regularity, we perform a Nash–Moser smoothing regularization (time-Fourier truncation). We use standard KAM transformations, in order to decrease, quadratically at each step, the size of the perturbation \({\mathcal {R}}\), see Sect. 4.1.1. This iterative scheme converges (Theorem 4.2) because the initial remainder \( \mathcal{R}_0 \) is a bounded operator (of the space variable \(x\)), and this property is preserved along the iteration. This is the reason for performing the regularization procedure of Sects. 3.1–3.5. The second order Melnikov non-resonance conditions required by the reducibility scheme (see (4.17)) are verified thanks to the good control of the eigenvalues
We underline that the goal of the Töplitz–Lipschitz [19, 21, 23] and quasi-Töplitz property [6, 7, 38, 39] is precisely to provide an asymptotic expansion of the perturbed eigenvalues sharp enough to verify the second order Melnikov conditions.
Note that the above eigenvalues \( \mu _j \) could be not purely imaginary, i.e. \( r_j \) could have a non-zero real part which depends on the nonlinearity (unlike the reversible or Hamiltonian case, where \( r_j \in \mathrm{i} {\mathbb {R}}\)). In such a case, the invariant torus could be (partially) hyperbolic. Since we do not control the real part of \( r_j \) (i.e. the hyperbolicity may vanish), we perform the measure estimates proving the diophantine lower bounds of the imaginary part of the small divisors.
The final comment concerns the dynamical consequences of Theorem 1.4-\(ii\)). All the above transformations (both the changes of variables of Sects. 3.1–3.5 as well as the KAM matrices of the reducibility scheme) are time-dependent quasi-periodic maps of the phase space (of functions of \( x\) only), see Sect. 2.2. It is thanks to this “Töplitz-in-time” structure that the linear equation (1.19) is transformed into the dynamical system (1.20) as explained in Sect. 2.2. Note that in [24] (and also [9, 10, 15]) the analogous transformations have not this Töplitz-in-time structure and stability informations are not obtained.
2 Functional setting
For a function \(f : \Lambda _o \rightarrow E\), \(\lambda \mapsto f(\lambda )\), where \((E, \Vert \ \Vert _E)\) is a Banach space and \( \Lambda _o \) is a subset of \({\mathbb {R}}\), we define the sup-norm and the Lipschitz semi-norm
and, for \( \gamma > 0 \), the Lipschitz norm
If \( E = H^s \) we simply denote \( \Vert f \Vert ^{\mathrm{{Lip}(\gamma )}}_{H^s} := \Vert f \Vert ^{\mathrm{{Lip}(\gamma )}}_s \).
As a notation, we write
for some constant \( C(s) \). For \( s = {\mathfrak {s}}_0 := (\nu +2) \slash 2 \) we only write \( a \,\lessdot \,b \). More in general the notation \( a \lessdot b \) means \( a \le C b \) where the constant \( C \) may depend on the data of the problem, namely the nonlinearity \( f \), the number \( \nu \) of frequencies, the diophantine vector \( \bar{\omega }\), the diophantine exponent \( \tau > 0 \) in the non-resonance conditions in (4.6). Also the small constants \( \delta \) in the sequel depend on the data of the problem.
2.1 Matrices with off-diagonal decay
Let \( b \in {\mathbb {N}}\) and consider the exponential basis \(\{ e_i : i \in {\mathbb {Z}}^b \} \) of \(L^2({\mathbb {T}}^b) \), so that \(L^2({\mathbb {T}}^b)\) is the vector space \(\{ u = \sum u_i e_i\), \(\sum |u_i |^2 < \infty \}\). Any linear operator \(A : L^2 ({\mathbb {T}}^b) \rightarrow L^2 ({\mathbb {T}}^b) \) can be represented by the infinite dimensional matrix
We now define the \( s \)-norm (introduced in [9]) of an infinite dimensional matrix.
Definition 2.1
The \(s\)-decay norm of an infinite dimensional matrix \( A := (A_{i_1}^{i_2} )_{i_1, i_2 \in {\mathbb {Z}}^b } \) is
For parameter dependent matrices \( A := A(\lambda ) \), \(\lambda \in \Lambda _o \subseteq {\mathbb {R}}\), the Definitions (2.1) and (2.2) become
Clearly, the matrix decay norm (2.3) is increasing with respect to the index \( s \), namely
The \( s \)-norm is designed to estimate the polynomial off-diagonal decay of matrices, actually it implies
and, on the diagonal elements,
We now list some properties of the matrix decay norm proved in [9].
Lemma 2.1
(Multiplication operator) Let \( p = \sum _i p_i e_i \in H^s({\mathbb {T}}^b)\). The multiplication operator \( h \mapsto p h\) is represented by the Töplitz matrix \( T_i^{i'} = p_{i - i'} \) and
Moreover, if \(p = p(\lambda )\) is a Lipschitz family of functions,
The \(s\)-norm satisfies classical algebra and interpolation inequalities.
Lemma 2.2
(Interpolation) For all \(s \ge s_0 > b/2 \) there are \( C(s) \ge C(s_0) \ge 1 \) such that
In particular, the algebra property holds
If \(A = A(\lambda )\) and \(B = B(\lambda )\) depend in a Lipschitz way on the parameter \(\lambda \in \Lambda _o \subset {\mathbb {R}}\), then
For all \(n \ge 1\), using (2.8) with \( s = s_0 \), we get
Moreover (2.10) implies that (2.11) also holds for Lipschitz norms \(| \ |_s^\mathrm{{Lip}(\gamma )}\).
The \( s \)-decay norm controls the Sobolev norm, also for Lipschitz families:
Lemma 2.3
Let \( \Phi = I + \Psi \) with \(\Psi := \Psi (\lambda )\), depending in a Lipschitz way on the parameter \(\lambda \in \Lambda _o \subset {\mathbb {R}}\), such that \( C(s_0) | \Psi |_{s_0}^{\mathrm{{Lip}(\gamma )}} \le 1/ 2 \). Then \( \Phi \) is invertible and, for all \( s \ge s_0 > b / 2 \),
If \( \Phi _i = I + \Psi _i \), \( i = 1,2 \), satisfy \( C(s_0) | \Psi _i |_{s_0}^{\mathrm{{Lip}(\gamma )}} \le 1/ 2 \), then
Proof
Estimates (2.13) follow by Neumann series and (2.11). To prove (2.14), observe that
and use (2.7), (2.13). \(\square \)
2.1.1 Töplitz-in-time matrices
Let now \( b := \nu + 1 \) and
An important sub-algebra of matrices is formed by the matrices Töplitz in time defined by
whose decay norm (2.3) is
These matrices are identified with the \( \varphi \)-dependent family of operators
which act on functions of the \(x\)-variable as
We still denote by \( | A(\varphi ) |_s \) the \( s \)-decay norm of the matrix in (2.17).
Lemma 2.4
Let \( A \) be a Töplitz matrix as in (2.15), and \({\mathfrak {s}}_0 := (\nu + 2)/2\) (as defined above). Then
Proof
For all \( \varphi \in {\mathbb {T}}^\nu \) we have
whence the lemma follows. \(\square \)
Given \( N \in {\mathbb {N}}\), we define the smoothing operator \(\Pi _N\) as
Lemma 2.5
The operator \( \Pi _N^\bot := I - \Pi _N \) satisfies
where in the second inequality \(A := A(\lambda )\) is a Lipschitz family \(\lambda \in \Lambda \).
2.2 Dynamical reducibility
All the transformations that we construct in Sects. 3 and 4 act on functions \( u(\varphi , x ) \) (of time and space). They can also be seen as:
- (a):
-
transformations of the phase space \(H^s_x\) that depend quasi-periodically on time (Sects. 3.1, 3.3–3.5 and 4);
- (b):
-
quasi-periodic reparametrizations of time (Sect. 3.2).
This observation allows to interpret the conjugacy procedure from a dynamical point of view.
Consider a quasi-periodic linear dynamical system
We want to describe how (2.21) changes under the action of a transformation of type \((a)\) or \((b)\).
Let \(A(\omega t)\) be of type \((a)\), and let \(u = A(\omega t)v\). Then (2.21) is transformed into the linear system
The transformation \(A(\omega t)\) may be regarded to act on functions \( u(\varphi , x) \) as
and one can check that \( ({\tilde{A}}^{-1} u)(\varphi ,x) = A^{-1}(\varphi ) u(\varphi , x) \). The operator associated to (2.21) (on quasi-periodic functions)
transforms under the action of \( {\tilde{A}} \) into
which is exactly the linear system in (2.22), acting on quasi-periodic functions.
Now consider a transformation of type \((b)\), namely a change of the time variable
where \(\alpha = \alpha (\varphi )\), \(\varphi \in {\mathbb {T}}^\nu \), is a \(2\pi \)-periodic function of \(\nu \) variables (in other words, \( t \mapsto t + \alpha (\omega t) \) is the diffeomorphism of \({\mathbb {R}}\) induced by the transformation \(B\)). If \( u(t) \) is a solution of (2.21), then \( v(\tau ) \), defined by \( u = Bv\), solves
We may regard the associated transformation on quasi-periodic functions defined by
as in step 3.2, where we calculate
(2.27) is nothing but the linear system (2.26), acting on quasi-periodic functions.
2.3 Real, reversible and Hamiltonian operators
We consider the space of real functions
and of even (in space-time), respectively odd, functions
Definition 2.2
An operator \( R \) is
-
1.
real if \( R : Z \rightarrow Z \)
-
2.
reversible if \( R : X \rightarrow Y \)
-
3.
reversibility-preserving if \( R : X \rightarrow X \), \( R : Y \rightarrow Y \).
The composition of a reversible and a reversibility-preserving operator is reversible.
The above properties may be characterized in terms of matrix elements.
Lemma 2.6
We have
For the Hamiltonian equation (1.10) the phase space is \( H^1_0 := \{ u \in H^1 ({\mathbb {T}}) \, : \, \int _{{\mathbb {T}}} u(x) dx = 0 \} \).
Definition 2.3
A time dependent linear vector field \( X(t) : H_0^1 \rightarrow H_0^1\) is Hamiltonian if \( X(t) = \partial _x G(t) \) for some real linear operator \( G(t) \) which is self-adjoint with respect to the \( L^2 \) scalar product.
If \( G(t) = G(\omega t)\) is quasi-periodic in time, we say that the associated operator \( \omega \cdot \partial _{\varphi } - \partial _x G( \varphi ) \) (see (2.24)) is Hamiltonian.
Definition 2.4
A map \( A : H_0^1 \rightarrow H_0^1\) is symplectic if
where the symplectic 2-form \( \Omega \) is defined in (1.12). Equivalently \( A^T \partial _x^{-1} A = \partial _x^{-1} \).
If \( A (\varphi ) \), \( \forall \varphi \in {\mathbb {T}}^\nu \), is a family of symplectic maps we say that the corresponding operator in (2.23) is symplectic.
Under a time dependent family of symplectic transformations \( u = \Phi (t) v \) the linear Hamiltonian equation
transforms into the equation
with Hamiltonian
Note that \(E(t)\) is self-adjoint with respect to the \(L^2\) scalar product because \(\Phi ^T \partial _x^{-1} \Phi _t + \Phi _t^T \partial _x^{-1} \Phi = 0\).
3 Regularization of the linearized operator
Our existence proof is based on a Nash–Moser iterative scheme. The main step concerns the invertibility of the linearized operator (see (1.16))
obtained linearizing (1.4) at any approximate (or exact) solution \( u \). The coefficients \(a_i = a_i(\varphi ,x) = a_i(u,\varepsilon )(\varphi ,x)\) are periodic functions of \((\varphi ,x)\), depending on \(u,\varepsilon \). They are explicitly obtained from the partial derivatives of \(\varepsilon f(\varphi ,x,z)\) as
The operator \({\mathcal {L}}\) depends on \(\lambda \) because \(\omega = \lambda \bar{\omega }\). Since \(\varepsilon \) is a (small) fixed parameter, we simply write \({\mathcal {L}}(\lambda ,u)\) instead of \({\mathcal {L}}(\lambda ,u,\varepsilon )\), and \(a_i(u)\) instead of \(a_i(u,\varepsilon )\). We emphasize that the coefficients \(a_i\) do not depend explicitly on the parameter \(\lambda \) (they depend on \( \lambda \) only through \( u(\lambda ) \)).
In the Hamiltonian case (1.11) the linearized operator (3.1) has the form
where
and it is generated by the quadratic Hamiltonian
Remark 3.1
In the reversible case, i.e. the nonlinearity \( f\) satisfies (1.13) and \( u \in X \) (see (2.29), (1.14)) the coefficients \( a_i \) satisfy the parity
and \({\mathcal {L}}\) maps \(X\) into \(Y\), namely \({\mathcal {L}}\) is reversible, see Definition 2.2.
Remark 3.2
In the Hamiltonian case (1.11), assumption (Q)-(1.7) is automatically satisfied (with \( \alpha (\varphi ) = 2 \)) because
where
and so
The coefficients \(a_i\), together with their derivative \(\partial _u a_i(u)[h]\) with respect to \(u\) in the direction \(h\), satisfy tame estimates:
Lemma 3.1
Let \( f \in C^q \), see (1.3). For all \( {\mathfrak {s}}_{0} \le s \le q - 2 \), \( \Vert u \Vert _{{\mathfrak {s}}_0 + 3} \le 1 \), we have, for all \(i = 0,1,2,3\),
If, moreover, \( \lambda \mapsto u(\lambda ) \in H^s \) is a Lipschitz family satisfying \( \Vert u \Vert _{{\mathfrak {s}}_0 + 3}^{\mathrm{{Lip}(\gamma )}} \le 1 \) (see (2.2)), then
Proof
The tame estimate (3.4) follows by Lemma 6.2\((i)\) applied to the function \(\partial _{z_i}f\), \(i=0,\ldots ,3 \), which is valid for \(s+1 \le q\). The tame bound (3.5) for
follows by (6.5) and applying Lemma 6.2\((i)\) to the functions \(\partial ^2_{z_k z_i}f\), which gives
for \(s+2 \le q\). The Lipschitz bound (3.6) follows similarly. \(\square \)
3.1 Step 1. Change of the space variable
We consider a \( \varphi \)-dependent family of diffeomorphisms of the \( 1 \)-dimensional torus \( {\mathbb {T}}\) of the form
where \( \beta \) is a (small) real-valued function, \(2\pi \) periodic in all its arguments. The change of variables (3.7) induces on the space of functions the linear operator
The operator \( \mathcal{A} \) is invertible, with inverse
where \( y \mapsto y + {\tilde{\beta }}(\varphi ,y) \) is the inverse diffeomorphism of (3.7), namely
Remark 3.3
In the Hamiltonian case (1.11) we use, instead of (3.8), the modified change of variable (1.25) which is symplectic, for each \( \varphi \in {\mathbb {T}}^\nu \). Indeed, setting \( U := \partial _x^{-1} u \) (and neglecting to write the \( \varphi \)-dependence)
where \( c \) is the average of \( U(x+ \beta (x) ) \) in \( {\mathbb {T}}\). The inverse operator of (1.25) is \( (\mathcal{A}^{-1} v) (\varphi , y) = (1+ {\tilde{\beta }}_y (\varphi , y)) v( y + \tilde{\beta }(\varphi , y)) \) which is also symplectic.
Now we calculate the conjugate \( \mathcal{A}^{-1} \mathcal{L} \mathcal{A} \) of the linearized operator \({\mathcal {L}}\) in (3.1) with \( \mathcal{A} \) in (3.8).
The conjugate \( \mathcal{A} ^{-1}a \mathcal{A} \) of any multiplication operator \(a : h(\varphi ,x) \mapsto a(\varphi ,x) h(\varphi ,x)\) is the multiplication operator \(( \mathcal{A} ^{-1}a)\) that maps \(v(\varphi ,y) \mapsto ( \mathcal{A} ^{-1}a)(\varphi ,y) \, v(\varphi ,y)\). By conjugation, the differential operators become
where all the coefficients \(\{ A^{-1}(\ldots ) \}\) are periodic functions of \((\varphi ,y)\). Thus (recall (3.1))
where
We look for \(\beta (\varphi ,x)\) such that the coefficient \(b_3(\varphi ,y)\) of the highest order derivative \(\partial _{yyy}\) in (3.11) does not depend on \(y\), namely
for some function \(b(\varphi )\) of \(\varphi \) only. Since \(\mathcal{A}\) changes only the space variable, \(\mathcal{A}b = b\) for every function \(b(\varphi )\) that is independent on \(y\). Hence (3.14) is equivalent to
namely
The equation (3.16) has a solution \(\beta \), periodic in \( x \), if and only if \( \int _{{\mathbb {T}}}{\rho _0(\varphi ,x) \, dx} = 0 \). This condition uniquely determines
Then we fix the solution (with zero average) of (3.16),
where \( \partial _x^{-1} \) is defined by linearity as
In other words, \(\partial _x^{-1}h\) is the primitive of \(h\) with zero average in \(x \).
With this choice of \( \beta \), we get (see (3.11), (3.14))
where \( b_3(\varphi ) := b(\varphi ) \) is defined in (3.17).
Remark 3.4
In the reversible case, \( \beta \in Y \) because \(a_3 \in X\), see (3.3). Therefore the operator \(\mathcal{A} \) in (3.8), as well as \( \mathcal{A}^{-1} \) in (3.9), maps \( X \rightarrow X \) and \( Y \rightarrow Y \), namely it is reversibility-preserving, see Definition 2.2. By (3.3) the coefficients of \({\mathcal {L}}_1\) (see (3.12), (3.13)) have parity
and \({\mathcal {L}}_1\) maps \(X \rightarrow Y\), namely it is reversible.
Remark 3.5
In the Hamiltonian case (1.11) the resulting operator \( \mathcal{L}_1 \) in (3.20) is Hamiltonian and \( b_2 (\varphi , y) = 2 \partial _y b_3 (\varphi ) \equiv 0 \). Actually, by (2.31), the corresponding Hamiltonian has the form
for some function \( B_0 (\varphi ,y) \).
3.2 Step 2. Time reparametrization
The goal of this section is to make constant the coefficient of the highest order spatial derivative operator \(\partial _{yyy}\) of \( \mathcal{L}_1 \) in (3.20), by a quasi-periodic reparametrization of time. We consider a diffeomorphism of the torus \( {\mathbb {T}}^{\nu } \) of the form
where \( \alpha \) is a (small) real valued function, \( 2\pi \)-periodic in all its arguments. The induced linear operator on the space of functions is
whose inverse is
where \( \varphi = \vartheta + \omega {\tilde{\alpha }}(\vartheta ) \) is the inverse diffeomorphism of \( \vartheta = \varphi + \omega \alpha (\varphi ) \). By conjugation, the differential operators become
Thus, see (3.20),
We look for \(\alpha (\varphi )\) such that the (variable) coefficients of the highest order derivatives (\({\omega \cdot \partial _{\vartheta }}\) and \(\partial _{yyy}\)) are proportional, namely
for some constant \(m_3 \in {\mathbb {R}}\). Since \( B \) is invertible, this is equivalent to require that
Integrating on \({\mathbb {T}}^\nu \) determines the value of the constant \(m_3\),
Thus we choose the unique solution of (3.29) with zero average
where \( ({\omega \cdot \partial _{\varphi }})^{-1}\) is defined by linearity
With this choice of \( \alpha \) we get (see (3.27), (3.28))
where
Remark 3.6
In the reversible case, \(\alpha \) is odd because \(b_3\) is even (see (3.21)), and \( B \) is reversibility preserving. Since \(\rho \) (defined in (3.26)) is even, the coefficients \( c_3, c_1 \in X\), \( c_2, c_0 \in Y \) and \({\mathcal {L}}_2 : X \rightarrow Y \) is reversible.
Remark 3.7
In the Hamiltonian case, the operator \( {\mathcal {L}}_2 \) is still Hamiltonian (the new Hamiltonian is the old one at the new time, divided by the factor \( \rho \)). The coefficient \( c_2 (\vartheta , y) \equiv 0 \) because \( b_2 \equiv 0 \), see Remark 3.5.
3.3 Step 3. Descent method: step zero
The aim of this section is to eliminate the term of order \(\partial _{yy}\) from \( \mathcal{L}_2 \) in (3.32).
Consider the multiplication operator
where the function \(v\) is periodic in all its arguments. Calculate the difference
where
To eliminate the factor \(T_2\), we need
Equation (3.37) has the periodic solution
provided that
Let us prove (3.39). By (3.33), (3.26), for each \(\vartheta = \varphi + \omega \alpha (\varphi )\) we get
By the definition (3.13) of \(b_2\) and changing variable \( y = x + \beta (\varphi ,x) \) in the integral (recall (3.8))
The first integral in (3.40) is zero because \(\beta _{xx} / (1 + \beta _x) = \partial _x \log (1 + \beta _x)\). The second one is zero because of assumptions (Q)-(1.7) or (F)-(1.6), see (1.26). As a consequence (3.39) is proved, and (3.37) has the periodic solution \(v\) defined in (3.38). Note that \( v \) is close to \( 1 \) for \( \varepsilon \) small. Hence the multiplication operator \( \mathcal{M} \) defined in (3.34) is invertible and \( \mathcal{M} ^{-1} \) is the multiplication operator by \( 1 / v \). By (3.35) and since \(T_2 = 0\), we deduce
Remark 3.8
In the reversible case, since \(c_2\) is odd (see Remark 3.6 ) the function \(v\) is even, then \( \mathcal{M} \), \( \mathcal{M} ^{-1}\) are reversibility preserving and by (3.36) and (3.41) \(d_1 \in X\) and \(d_0 \in Y\), which implies that \({\mathcal {L}}_3 : X \rightarrow Y\).
Remark 3.9
In the Hamiltonian case, there is no need to perform this step because \( c_2 \equiv 0 \), see Remark 3.7.
3.4 Step 4. Change of space variable (translation)
Consider the change of the space variable
which induces the operators
The differential operators become
Thus, by (3.41),
where
Now we look for \(p(\vartheta )\) such that the average
for some constant \(m_1 \in {\mathbb {R}}\) (independent of \( \vartheta \)). Equation (3.44) is equivalent to
The equation (3.45) has a periodic solution \(p(\vartheta )\) if and only if \(\int _{{\mathbb {T}}^{\nu }}V(\vartheta ) \, d \vartheta = 0\). Hence we have to define
and
With this choice of \(p\), after renaming the space-time variables \(z = x\) and \(\vartheta = \varphi \), we have
Remark 3.10
By (3.45), (3.47) and since \( d_1 \in X \) (see Remark 3.8), the function \(p\) is odd. Then \( \mathcal{T} \) and \( \mathcal{T}^{-1}\) defined in (3.42) are reversibility preserving and the coefficients \( e_1, e_0 \) defined in (3.43) satisfy \(e_1 \in X\), \(e_0 \in Y\). Hence \({\mathcal {L}}_4 : X \rightarrow Y\) is reversible.
Remark 3.11
In the Hamiltonian case the operator \( {\mathcal {L}}_4 \) is Hamiltonian, because the operator \( \mathcal{T} \) in (3.42) is symplectic (it is a particular case of the change of variables (1.25) with \( \beta (\varphi ,x) = p( \varphi ) \)).
3.5 Step 5. Descent method: conjugation by pseudo-differential operators
The goal of this section is to conjugate \( {\mathcal {L}}_4 \) in (3.48) to an operator of the form \( \omega \cdot \partial _{\varphi } + m_3 \partial _{xxx} + m_1 \partial _{x} + {\mathcal {R}}\) where the constants \(m_3\), \(m_1\) are defined in (3.30), (3.46), and \({\mathcal {R}}\) is a pseudo-differential operator of order \(0\).
Consider an operator of the form
where \(w : {\mathbb {T}}^{\nu + 1}\rightarrow {\mathbb {R}}\) and the operator \(\partial _{x}^{-1}\) is defined in (3.19). Note that \(\partial _x^{-1} \partial _x = \partial _x \partial _x^{-1} = \pi _0\), where \( \pi _0 \) is the \( L^2 \)-projector on the subspace \( H_0 := \{ u(\varphi ,x) \in L^2 ({\mathbb {T}}^{\nu +1})\, : \, \int _{{\mathbb {T}}} u(\varphi , x) \, dx = 0 \} \).
A direct computation shows that the difference
where (using \( \partial _x \pi _0 = \pi _0 \partial _x = \partial _x \), \( \partial _x^{-1} \partial _{xxx} = \partial _{xx} \))
We look for a periodic function \( w (\varphi , x )\) such that \( r_1 = 0\). By (3.51) and (3.44) we take
For \( \varepsilon \) small enough the operator \( \mathcal{S} \) is invertible and we obtain, by (3.50),
Remark 3.12
In the reversible case, the function \(w \in Y\), because \(e_1 \in X\), see Remark 3.10. Then \({\mathcal {S}}\), \({\mathcal {S}}^{-1}\) are reversibility preserving. By (3.52) and (3.53), \(r_0 \in Y\) and \(r_{-1} \in X\). Then the operators \( {\mathcal {R}}, {\mathcal {L}}_5 \) defined in (3.55) are reversible, namely \({\mathcal {R}}, {\mathcal {L}}_5 : X \rightarrow Y\).
Remark 3.13
In the Hamiltonian case, we consider, instead of (3.49), the modified operator
which, for each \( \varphi \in {\mathbb {T}}^\nu \), is symplectic. Actually \( {\mathcal {S}}\) is the time one flow map of the Hamiltonian vector field \( \pi _0 w(\varphi , x) \partial _{x}^{-1} \) which is generated by the Hamiltonian
The corresponding \( {\mathcal {L}}_5 \) in (3.55) is Hamiltonian. Note that the operators (3.56) and (3.49) differ only for pseudo-differential smoothing operators of order \( O( \partial _{x}^{-2} ) \) and of smaller size \( O( w^2 ) = O(\varepsilon ^2) \).
3.6 Estimates on \({\mathcal {L}}_5\)
Summarizing the steps performed in the previous Sects. 3.1–3.5, we have (semi)-conjugated the operator \( {\mathcal {L}}\) defined in (3.1) to the operator \({\mathcal {L}}_5 \) defined in (3.55), namely
(where \( \rho \) means the multiplication operator for the function \( \rho \) defined in (3.26)).
In the next lemma we give tame estimates for \({\mathcal {L}}_5\) and \(\Phi _1, \Phi _2\). We define the constants
where \( \tau _0 \) is defined in (1.2) and \( \nu \) is the number of frequencies.
Lemma 3.2
Let \( f \in C ^q \), see (1.3), and \( {\mathfrak {s}}_0 \le s \le q - \sigma \). There exists \(\delta > 0 \) such that, if \( \varepsilon \gamma _0 ^{-1}< \delta \) (the constant \( \gamma _0 \) is defined in (1.2)), then, for all
\((i)\) the transformations \({\Phi }_1, {\Phi }_2\) defined in (3.57) are invertible operators of \(H^s({\mathbb {T}}^{\nu +1})\), and satisfy
for \(i = 1, 2\). Moreover, if \(u(\lambda )\), \(h(\lambda )\) are Lipschitz families with
then
\((ii)\) The constant coefficients \(m_3, m_1\) of \({\mathcal {L}}_5\) defined in (3.55) satisfy
Moreover, if \(u(\lambda )\) is a Lipschitz family satisfying (3.61), then
\((iii)\) The operator \({\mathcal {R}}\) defined in (3.55) satisfies:
where \( \sigma > \sigma ' \) are defined in (3.58). Moreover, if \(u(\lambda )\) is a Lipschitz family satisfying (3.61), then
Finally, in the reversible case, the maps \(\Phi _i, \Phi _i^{-1}\), \(i=1,2\) are reversibility preserving and \({\mathcal {R}}, {\mathcal {L}}_5 : X \rightarrow Y\) are reversible. In the Hamiltonian case the operator \( {\mathcal {L}}_5 \) is Hamiltonian.
Proof
The proof is elementary. It is based only on a repeated use of the tame estimates of the Lemmata in the Appendix. \(\square \)
In the same way we get the following lemma.
Lemma 3.3
In the same hypotheses of Lemma 3.2, for all \(\varphi \in {\mathbb {T}}^\nu \), the operators \(\mathcal{A}(\varphi )\), \( \mathcal{M} (\varphi )\), \( \mathcal{T} (\varphi )\), \({\mathcal {S}}(\varphi )\) are invertible operators of the phase space \(H^s_x := H^s({\mathbb {T}})\), with
4 Reduction of the linearized operator to constant coefficients
The goal of this section is to diagonalize the linear operator \( {\mathcal {L}}_5 \) obtained in (3.55), and therefore to complete the reduction of \( \mathcal{L} \) in (3.1) into constant coefficients. For \( \tau > \tau _0 \) (see (1.2)) we define the constant
Theorem 4.1
Let \( f \in C^q \), see (1.3). Let \( \gamma \in (0,1) \) and \( {\mathfrak {s}}_0 \le s \le q - \sigma - \beta \) where \( \sigma \) is defined in (3.58), and \( \beta \) in (4.1). Let \(u(\lambda ) \) be a family of functions depending on the parameter \(\lambda \in \Lambda _o \subset \Lambda := [1/2, 3/2]\) in a Lipschitz way, with
Then there exist \( \delta _{0} \), \( C \) (depending on the data of the problem) such that, if
then:
- (i) :
-
(Eigenvalues) \(\forall \lambda \in \Lambda \) there exists a sequence
$$\begin{aligned} \begin{aligned} \mu _j^\infty (\lambda )&:= \mu _j^\infty (\lambda , u) = {\tilde{\mu }}^{0}_j(\lambda ) + r_j^\infty (\lambda ) ,\\ {\tilde{\mu }}^0_j(\lambda )&:= \mathrm{i} \left( - {\tilde{m}}_3 ( \lambda ) j^3 + {\tilde{m}}_1(\lambda ) j \right) , \ j \in {\mathbb {Z}}, \end{aligned} \end{aligned}$$(4.4)where \( {\tilde{m}}_3, {\tilde{m}}_1\) coincide with the coefficients of \( \mathcal{L}_5 \) in (3.55) for all \( \lambda \in \Lambda _o \), and the corrections \(r_j^\infty \) satisfy
$$\begin{aligned} | {\tilde{m}}_3 - 1 |^{\mathrm{{Lip}(\gamma )}} + | {\tilde{m}}_1 |^{\mathrm{{Lip}(\gamma )}} + | r^{\infty }_j |^{\mathrm{{Lip}(\gamma )}}_{\Lambda }&\le \varepsilon C , \ \ \forall j \in {\mathbb {Z}}. \end{aligned}$$(4.5)Moreover, in the reversible case (i.e. (1.13) holds) or Hamiltonian case (i.e. (1.11) holds), all the eigenvalues \(\mu _j^{\infty }\) are purely imaginary.
- (ii) :
-
(Conjugacy). For all \(\lambda \) in
$$\begin{aligned} \Lambda _\infty ^{2\gamma } := \Lambda _\infty ^{2\gamma } (u)&:= \left\{ \lambda \in \Lambda _o \, : \, | \mathrm{i} \lambda \bar{\omega }\cdot l + \mu ^{\infty }_j (\lambda ) - \mu ^{\infty }_{k} (\lambda ) |\right. \nonumber \\&\ge \left. 2 \gamma | j^{3} - k^{3} | \langle l \rangle ^{-\tau }, \ \forall l \in {\mathbb {Z}}^{\nu }, \, j ,k \in {\mathbb {Z}}\right\} \end{aligned}$$(4.6)there is a bounded, invertible linear operator \(\Phi _\infty (\lambda ) : H^s \rightarrow H^s\), with bounded inverse \(\Phi _\infty ^{-1}(\lambda )\), that conjugates \({\mathcal {L}}_5\) in (3.55) to constant coefficients, namely
$$\begin{aligned} \begin{aligned} \mathcal{L}_{\infty }(\lambda )&:= \Phi _{\infty }^{-1}(\lambda ) \circ {\mathcal {L}}_5(\lambda ) \circ \Phi _{\infty }(\lambda ) = \lambda \bar{\omega }\cdot \partial _{\varphi } + \mathcal{D}_{\infty }(\lambda ), \\ \mathcal{D}_{\infty }(\lambda )&:= \mathrm{diag}_{j \in {\mathbb {Z}}} \mu ^{\infty }_{j}(\lambda ). \end{aligned} \end{aligned}$$(4.7)The transformations \(\Phi _\infty , \Phi _\infty ^{-1}\) are close to the identity in matrix decay norm, with estimates
$$\begin{aligned} | \Phi _{\infty } (\lambda ) - I |_{s,\Lambda _\infty ^{2\gamma }}^\mathrm{{Lip}(\gamma )}+ | \Phi _{\infty }^{- 1} (\lambda ) - I |_{s,\Lambda _\infty ^{2\gamma }}^\mathrm{{Lip}(\gamma )}\le \varepsilon \gamma ^{-1} C(s) \left( 1 + \Vert u \Vert _{s + \sigma + \beta ,\Lambda _o }^\mathrm{{Lip}(\gamma )}\right) .\nonumber \\ \end{aligned}$$(4.8)For all \(\varphi \in {\mathbb {T}}^\nu \), the operator \(\Phi _\infty (\varphi ) : H^s_x \rightarrow H^s_x \) is invertible (where \(H^s_x := H^s({\mathbb {T}})\)) with inverse \( (\Phi _\infty (\varphi ))^{-1} = \Phi _\infty ^{-1}(\varphi )\), and
$$\begin{aligned} \Vert (\Phi _\infty ^{\pm 1}(\varphi ) - I) h \Vert _{H^s_x}&\le \varepsilon \gamma ^{-1} C(s) \left( \Vert h \Vert _{H^s_x} + \Vert u \Vert _{s + \sigma + \beta + {\mathfrak {s}}_0} \Vert h \Vert _{H^1_x} \right) . \end{aligned}$$(4.9)In the reversible case \(\Phi _{\infty }, \Phi _{\infty }^{-1} : X \rightarrow X \), \(Y \rightarrow Y\) are reversibility preserving, and \({\mathcal {L}}_\infty : X \rightarrow Y\) is reversible. In the Hamiltonian case the final \({\mathcal {L}}_\infty \) is Hamiltonian.
An important point of Theorem 4.1 is to require only the bound (4.2) for the low norm of \( u \), but it provides the estimate for \( \Phi _\infty ^{\pm 1} - I \) in (4.8) also for the higher norms \( | \cdot |_s \), depending also on the high norms of \( u \). From Theorem 4.1 we shall deduce tame estimates for the inverse linearized operators in Theorem 4.3.
Note also that the set \( \Lambda _{\infty }^{2 \gamma } \) in (4.6) depend only on the final eigenvalues, and it is not defined inductively as in usual KAM theorems. This characterization of the set of parameters which fulfill all the required Melnikov non-resonance conditions (at any step of the iteration) was first observed in [5, 8] in an analytic setting. Theorem 4.1 extends this property also in a differentiable setting. A main advantage of this formulation is that it allows to discuss the measure estimates only once and not inductively: the Cantor set \( \Lambda _{\infty }^{2 \gamma } \) in (4.6) could be empty (actually its measure \( |\Lambda _{\infty }^{2 \gamma } | = 1 - O(\gamma ) \) as \( \gamma \rightarrow 0 \)) but the functions \( \mu ^\infty _j (\lambda ) \) are anyway well defined for all \( \lambda \in \Lambda \), see (4.4). In particular we shall perform the measure estimates only along the nonlinear iteration, see Sect. 5.
Theorem 4.1 is deduced from the following iterative Nash–Moser reducibility theorem for a linear operator of the form
where \(\omega = \lambda \bar{\omega }\),
the \( m_3(\lambda ,u(\lambda )), m_1 (\lambda ,u(\lambda )) \in {\mathbb {R}}\) and \( u(\lambda ) \) is defined for \( \lambda \in \Lambda _o \subset \Lambda \). Clearly \({\mathcal {L}}_5\) in (3.55) has the form (4.10). Define
(then \( N_{\nu +1} = N_{\nu }^\chi \), \( \forall \nu \ge 0 \)) and
where \(\sigma \) is defined in (3.58) and \(\beta \) is defined in (4.1).
Theorem 4.2
(KAM reducibility) Let \( q > \sigma + {\mathfrak {s}}_0 + \beta \). There exist \(C_0 > 0 \), \( N_{0} \in {\mathbb {N}}\) large, such that, if
then, for all \( \nu \ge 0 \):
- \(\mathbf{(S1)_{\nu }}\) :
-
There exists an operator
$$\begin{aligned} \mathcal{L}_\nu&:= \omega \cdot \partial _{\varphi } + \mathcal{D}_\nu + \mathcal{R}_\nu \quad where \quad \mathcal{D}_\nu = \mathrm{diag}_{j \in {\mathbb {Z}}} \{ \mu ^{\nu }_{j}(\lambda ) \}\end{aligned}$$(4.15)$$\begin{aligned} \mu _j^{\nu }(\lambda )&= \mu _j^0(\lambda ) + r_j^{\nu }(\lambda ), \mu _j^0(\lambda ) := - \mathrm{i} \left( m_3(\lambda ,u(\lambda )) j^3 - m_1(\lambda ,u(\lambda )) j \right) , \nonumber \\&j \in {\mathbb {Z}}, \end{aligned}$$(4.16)defined for all \( \lambda \in \Lambda _{\nu }^{\gamma }(u)\), where \(\Lambda _{0}^{\gamma }(u) := \Lambda _o \) (is the domain of \( u \)), and, for \(\nu \ge 1\),
$$\begin{aligned} \Lambda _{\nu }^{\gamma } := \Lambda _{\nu }^{\gamma }(u)&:= \left\{ \lambda \in \Lambda _{\nu - 1}^{\gamma } : \left| \mathrm{i} \omega \cdot l + \mu ^{\nu -1}_{j}(\lambda ) - \mu ^{\nu - 1}_{k}(\lambda ) \right| \right. \nonumber \\&\ge \left. \gamma \frac{ |j^{3} - k^{3}|}{\left\langle l\right\rangle ^{\tau }} \ \forall \left| l\right| \le N_{ \nu -1}, \ j, k \in {\mathbb {Z}}\right\} . \end{aligned}$$(4.17)For \(\nu \ge 0\), \(r_j^{\nu } = \overline{r_{-j}^{\nu }}\), equivalently \( \mu _j^{\nu } = \overline{\mu _{-j}^{\nu }}\), and
$$\begin{aligned} |r_j^{\nu }|^{\mathrm{{Lip}(\gamma )}} := |r_j^{\nu }|^{\mathrm{{Lip}(\gamma )}}_{\Lambda _{\nu }^\gamma } \le \varepsilon C . \end{aligned}$$(4.18)The remainder \( \mathcal{R}_\nu \) is real (Definition 2.2) and, \( \forall s \in [ {\mathfrak {s}}_0, q - \sigma - \beta ] \),
$$\begin{aligned} \left| \mathcal{R}_{\nu }\right| _{s}^{\mathrm{{Lip}(\gamma )}} \le \left| \mathcal{R}_{0}\right| _{s+\beta }^{\mathrm{{Lip}(\gamma )}}N_{\nu - 1}^{-\alpha } , \quad \left| \mathcal{R}_{\nu }\right| _{s + \beta }^{\mathrm{{Lip}(\gamma )}} \le \left| \mathcal{R}_{0}\right| _{s+\beta }^{\mathrm{{Lip}(\gamma )}}\,N_{\nu - 1}. \end{aligned}$$(4.19)Moreover, for \( \nu \ge 1 \),
$$\begin{aligned} \mathcal{L}_{\nu } = \Phi _{\nu -1}^{-1} \mathcal{L}_{\nu -1} \Phi _{\nu -1} , \quad \Phi _{\nu -1} := I + \Psi _{\nu -1} , \end{aligned}$$(4.20)where the map \( \Psi _{\nu -1} \) is real, Töplitz in time \( \Psi _{\nu -1} := \Psi _{\nu -1}(\varphi ) \) (see (2.17)), and satisfies
$$\begin{aligned} \left| \Psi _{\nu -1} \right| _{s}^{\mathrm{{Lip}(\gamma )}} \le |\mathcal{R}_{0} |_{s+\beta }^{\mathrm{{Lip}(\gamma )}} \gamma ^{-1} N_{\nu -1}^{2 \tau +1} N_{\nu - 2}^{- \alpha } . \end{aligned}$$(4.21)In the reversible case, \({\mathcal {R}}_{\nu } : X \rightarrow Y\), \(\Psi _{\nu - 1}, \Phi _{\nu - 1}, \Phi _{\nu - 1}^{-1} \) are reversibility preserving. Moreover, all the \( \mu ^\nu _{j}(\lambda ) \) are purely imaginary and \( \mu ^{\nu }_{j} = - \mu ^{\nu }_{-j} \), \( \forall j \in {\mathbb {Z}}\).
- \(\mathbf{(S2)_{\nu }}\) :
-
For all \( j \in {\mathbb {Z}}\), there exist Lipschitz extensions \( \widetilde{\mu }_{j}^{\nu }(\cdot ): \Lambda \rightarrow {\mathbb {R}}\) of \( \mu _{j}^{\nu }(\cdot ) : \Lambda _\nu ^\gamma \rightarrow {\mathbb {R}}\) satisfying, for \(\nu \ge 1\),
$$\begin{aligned} |\widetilde{\mu }_{j}^{\nu } - \widetilde{\mu }_{j}^{\nu -1} |^{\mathrm{{Lip}(\gamma )}} \le | \mathcal{R}_{\nu -1} |^{\mathrm{{Lip}(\gamma )}}_{{\mathfrak {s}}_0}. \end{aligned}$$(4.22) - \(\mathbf{(S3)_{\nu }}\) :
-
Let \( u_1(\lambda )\), \( u_2(\lambda )\), be Lipschitz families of Sobolev functions, defined for \(\lambda \in \Lambda _o\) and such that conditions (4.2), (4.14) hold with \( \mathcal{R}_0 := \mathcal{R}_0 ( u_i) \), \( i = 1,2 \), see (4.11). Then, for \( \nu \ge 0 \), \(\forall \lambda \in \Lambda _{\nu }^{\gamma _1}(u_1) \cap \Lambda _{\nu }^{\gamma _2}(u_2)\), with \( \gamma _1, \gamma _2 \in [\gamma /2, 2\gamma ]\),
$$\begin{aligned} \begin{aligned} |\mathcal{R}_{\nu }(u_2) - {\mathcal {R}}_{\nu }(u_1)|_{{\mathfrak {s}}_{0}}&\le \varepsilon N_{\nu - 1}^{-\alpha } \Vert u_1 - u_2 \Vert _{{\mathfrak {s}}_0 + \sigma _2},\\ |\mathcal{R}_{\nu }(u_2) - {\mathcal {R}}_{\nu }(u_1)|_{\mathfrak s_{0}+\beta }&\le \varepsilon N_{\nu - 1} \Vert u_1 - u_2 \Vert _{\mathfrak s_0 + \sigma _2}. \end{aligned} \end{aligned}$$(4.23)Moreover, for \(\nu \ge 1\), \( \forall s \in [\mathfrak s_{0},{\mathfrak {s}}_{0}+\beta ] \), \(\forall j \in {\mathbb {Z}}\),
$$\begin{aligned} \big |\!\left( r_{j}^{\nu }(u_2) \!-\! r_{j}^{\nu }(u_1)\right) - \left( r_{j}^{\nu -1}(u_2) \!-\! r_{j}^{\nu -1}(u_1)\right) \big |\!&\le \! \vert \mathcal{R}_{\nu -1}(u_2) \!-\! {\mathcal {R}}_{\nu -1}(u_1) \vert _{{\mathfrak {s}}_0} ,\nonumber \\\end{aligned}$$(4.24)$$\begin{aligned} | r_j^{\nu }(u_2) \!-\! r_j^{\nu }(u_1) | \!&\le \! \varepsilon C \Vert u_1 \!-\! u_2 \Vert _{{\mathfrak {s}}_0 + \sigma _2}. \end{aligned}$$(4.25) - \(\mathbf{(S4)_{\nu }}\) :
-
Let \(u_1, u_2\) like in \((\mathbf{S3})_\nu \) and \( 0 < \rho < \gamma / 2 \). For all \( \nu \ge 0 \) such that
$$\begin{aligned} \varepsilon C N_{\nu - 1}^{\tau } \Vert u_1 - u_2 \Vert _{{\mathfrak {s}}_0 + \sigma _2}^\mathrm{sup} \le \rho \Longrightarrow \Lambda _{\nu }^{\gamma }(u_1) \subseteq \Lambda _{\nu }^{\gamma - \rho }(u_2). \end{aligned}$$(4.26)
Remark 4.1
In the Hamiltonian case \( \Psi _{\nu -1}\) is Hamiltonian and, instead of (4.20) we consider the symplectic map
The corresponding operators \( {\mathcal {L}}_\nu \), \(\mathcal{R}_\nu \) are Hamiltonian. Note that the operators (4.27) and (4.20) differ for an operator of order \(\Psi _{\nu - 1}^2\).
The proof of Theorem 4.2 is postponed in Subsection 4.1. We first give some consequences.
Corollary 4.1
(KAM transformation) \( \forall \lambda \in \cap _{\nu \ge 0} \Lambda _{\nu }^{\gamma } \) the sequence
converges in \( |\cdot |_{s}^{\mathrm{{Lip}(\gamma )}}\) to an operator \(\Phi _{\infty }\) and
In the reversible case \(\Phi _\infty \) and \(\Phi _{\infty }^{-1}\) are reversibility preserving.
Proof
To simplify notations we write \(|\cdot |_s \) for \(|\cdot |_s^{\mathrm{{Lip}(\gamma )}}\). For all \( \nu \ge 0 \) we have \( \widetilde{\Phi }_{\nu + 1} = \widetilde{\Phi }_{\nu }\circ \Phi _{\nu + 1} = \widetilde{\Phi }_{\nu } + \widetilde{\Phi }_{\nu }\Psi _{\nu + 1} \) (see (4.20)) and so
where \( \varepsilon _\nu := C' |\mathcal{R}_{0} |_{{\mathfrak {s}}_0 +\beta }^{\mathrm{{Lip}(\gamma )}} \gamma ^{-1} N_{\nu +1}^{2 \tau +1} N_{\nu }^{- \alpha } \). Iterating (4.30) we get, for all \( \nu \),
using (4.21) (with \( \nu =1 \), \( s = {\mathfrak {s}}_0 \)) to estimate \( | \Phi _0 |_{{\mathfrak {s}}_0} \) and (4.14). The high norm of \( \widetilde{\Phi }_{\nu + 1} = \widetilde{\Phi }_{\nu } + \widetilde{\Phi }_{\nu }\Psi _{\nu + 1} \) is estimated by (2.10), (4.31) (for \( {\widetilde{\Phi }}_\nu \)), as
Iterating the above inequality and using \( \Pi _{j \ge 0} (1+ \varepsilon _j^{(0)}) \le 2 \), we get
using \( |\Phi _0 |_s \le 1+ C(s) | \mathcal{R}_0|_{s+ \beta } \gamma ^{-1} \). Finally, \(\widetilde{\Phi }_{j}\) is a Cauchy sequence in norm \( | \cdot |_s \) because
Hence \(\widetilde{\Phi }_{\nu } \mathop {\rightarrow }\limits ^{\left| \cdot \right| _ s} \Phi _{\infty } \). The bound for \( \Phi _\infty - I \) in (4.29) follows by (4.33) with \( m = \infty \), \( \nu = 0 \) and \( |\widetilde{\Phi }_0 - I |_s = \) \( |\Psi _0|_s \lessdot \gamma ^{-1} | \mathcal{R}_0|_{s+\beta } \). Then the estimate for \( \Phi _\infty ^{-1} - I \) follows by (2.13).
In the reversible case all the \(\Phi _\nu \) are reversibility preserving and so \(\widetilde{\Phi }_\nu \), \(\Phi _{\infty }\) are reversibility preserving. \(\square \)
Remark 4.2
In the Hamiltonian case, the transformation \(\widetilde{\Phi }_\nu \) in (4.28) is symplectic, because \(\Phi _\nu \) is symplectic for all \(\nu \) (see Remark 4.1). Therefore \(\Phi _\infty \) is also symplectic.
Let us define for all \(j \in {\mathbb {Z}}\)
It could happen that \( \Lambda _{\nu _0}^\gamma = \emptyset \) (see (4.17)) for some \( \nu _0 \). In such a case the iterative process of Theorem 4.2 stops after finitely many steps. However, we can always set \( \widetilde{\mu }_{j}^{\nu } := \widetilde{\mu }_{j}^{\nu _0} \), \( \forall \nu \ge \nu _0 \), and the functions \( \mu ^{\infty }_{j} : \Lambda \rightarrow {\mathbb {R}}\) are always well defined.
Corollary 4.2
(Final eigenvalues) For all \( \nu \in {\mathbb {N}}\), \( j \in {\mathbb {Z}}\)
Proof
The bound (4.34) follows by (4.22) and (4.19) by summing the telescopic series. \(\square \)
Lemma 4.1
(Cantor set)
Proof
Let \( \lambda \in \Lambda _{\infty }^{2\gamma } \). By definition \( \Lambda _{\infty }^{2\gamma } \subset \Lambda _0^\gamma := \Lambda _o \). Then for all \( \nu > 0 \), \( | l | \le N_{\nu } \), \( j \ne k \)
because \( \gamma |j^{3} - k^{3} | \langle l \rangle ^{-\tau } \ge \gamma N_\nu ^{-\tau } \mathop {\ge }\limits ^{(4.14)}2 C | \mathcal{R}_0|_{{\mathfrak {s}}_0+ \beta } N_{\nu -1}^{-\alpha } \). \(\square \)
Lemma 4.2
For all \(\lambda \in \Lambda _{\infty }^{2\gamma } (u) \) ,
and in the reversible case
Actually in the reversible case \( \mu _j^{\infty }(\lambda ) \) are purely imaginary for all \( \lambda \in \Lambda \).
Proof
Formula (4.36) and (4.37) follow because, for all \( \lambda \in \Lambda _\infty ^{2\gamma } \subseteq \cap _{\nu \ge 0} \Lambda _\nu ^\gamma \) (see (4.35)), we have \( \mu _j^{\nu } = \overline{\mu _{-j}^{\nu }} \), \( r_j^{\nu } = \overline{r_{-j}^{\nu }} \), and, in the reversible case, the \( \mu _j^{\nu } \) are purely imaginary and \( \mu _j^{\nu } = - \mu _{-j}^{\nu } \), \( r_j^{\nu } = - r_{-j}^{\nu } \). The final statement follows because, in the reversible case, the \( \mu _j^\nu (\lambda ) \in \mathrm{i} {\mathbb {R}}\) as well as its extension \( {\widetilde{\mu }}_j^\nu (\lambda ) \). \(\square \)
Remark 4.3
In the reversible case, (4.37) imply that \(\mu _0^\infty = r_0^\infty = 0\).
Proof of Theorem 4.1
We apply Theorem 4.2 to the linear operator \( \mathcal{L}_0 := \mathcal{L}_5\) in (3.55), where \( {\mathcal {R}}_0 = \mathcal{R }\) defined in (4.11) satisfies
Then the smallness condition (4.14) is implied by (4.3) taking \(\delta _0:= \delta _0(\nu )\) small enough.
For all \( \lambda \in \Lambda _\infty ^{2\gamma } \subset \cap _{\nu \ge 0} \Lambda _\nu ^\gamma \) (see (4.35)), the operators
because
Applying (4.20) iteratively we get \( \mathcal{L}_{\nu } = {{\widetilde{\Phi }}_{\nu -1}}^{-1} \mathcal{L}_0 {\widetilde{\Phi }}_{\nu -1} \) where \( {\widetilde{\Phi }}_{\nu -1} \) is defined by (4.28) and \( {\widetilde{\Phi }}_{\nu -1} \rightarrow {\Phi }_\infty \) in \( | \ |_s \) (Corollary 4.1). Passing to the limit we deduce (4.7). Moreover (4.34) and (4.38) imply (4.5). Then (4.29), (3.68) (applied to \( \mathcal{R}_0 = \mathcal{R} \)) imply (4.8).
Estimate (4.9) follows from (2.12) (in \( H^s_x ({\mathbb {T}}) \)), Lemma 2.4, and the bound (4.8).
In the reversible case, since \(\Phi _\infty \), \(\Phi _{\infty }^{-1}\) are reversibility preserving (see Corollary 4.1), and \({\mathcal {L}}_0\) is reversible (see Remark 3.12 and Lemma 3.2), we get that \({\mathcal {L}}_\infty \) is reversible too. The eigenvalues \( \mu _j^{\infty } \) are purely imaginary by Lemma 4.2.
In the Hamiltonian case, \( {\mathcal {L}}_0 \equiv {\mathcal {L}}_5 \) is Hamiltonian, \(\Phi _{\infty }\) is symplectic, and therefore \(\mathcal{L}_{\infty } = \Phi _{\infty }^{-1} {\mathcal {L}}_5 \Phi _{\infty }\) (see (4.7)) is Hamiltonian, namely \({\mathcal {D}}_\infty \) has the structure \( {\mathcal {D}}_\infty = \partial _x \mathcal {B} \), where \(\mathcal {B} = \mathrm {diag}_{j \ne 0} \{ b_j \}\) is self-adjoint. This means that \(b_j \in {\mathbb {R}}\), and therefore \(\mu _j^\infty = \mathrm{i} j b_j\) are all purely imaginary.
4.1 Proof of Theorem 4.2
Proof of \(\mathbf{({S}i)}_{0}\), \(i=1,\ldots ,4\). Properties (4.15)–(4.19) in \(\mathbf{({S}1)}_0\) hold by (4.10)–(4.11) with \( \mu _j^0 \) defined in (4.16) and \( r_j^0(\lambda ) = 0 \) (for (4.19) recall that \( N_{-1} := 1 \), see (4.12)). Moreover, since \(m_1\), \(m_3\) are real functions, \(\mu _j^0\) are purely imaginary, \(\mu _j^0 = \overline{{\mu }_{-j}^0}\) and \(\mu _j^0 = - \mu _{-j}^0\). In the reversible case, Remark 3.12 implies that \({\mathcal {R}}_0 := {\mathcal {R}}\), \({\mathcal {L}}_0 := {\mathcal {L}}_5\) are reversible operators. Then there is nothing else to verify.
\(\mathbf{({S}2)}_0 \) holds extending from \( \Lambda ^\gamma _0 := \Lambda _o \) to \( \Lambda \) the eigenvalues \(\mu _{j}^0 (\lambda ) \), namely extending the functions \( m_1 (\lambda ) \), \( m_3 (\lambda ) \) to \( {\tilde{m}}_1 (\lambda ) \), \( {\tilde{m}}_3 (\lambda ) \), preserving the sup norm and the Lipschitz semi-norm, by Kirszbraun theorem, see e.g. [37]-Lemma A.2, or [32].
\(\mathbf{({S}3)}_0 \) follows by (3.67), for \(s = \mathfrak s_0 , {\mathfrak {s}}_0 + \beta \), and (4.2), (4.13).
\(\mathbf{({S}4)}_0 \) is trivial because, by definition, \(\Lambda _0^\gamma (u_1) = \Lambda _o = \Lambda _0^{\gamma -\rho }(u_2)\).
4.1.1 The reducibility step
We now describe the generic inductive step, showing how to define \( \mathcal{L}_{\nu +1 } \) (and \( \Phi _\nu \), \( \Psi _\nu \), etc). To simplify notations, in this section we drop the index \( \nu \) and we write \( + \) for \( \nu + 1\). We have
where \([\mathcal{D}, \Psi ] := \mathcal{D} \Psi - \Psi \mathcal{D} \) and \(\Pi _{N}\mathcal{R}\) is defined in (2.19).
Remark 4.4
The application of the smoothing operator \( \Pi _N \) is necessary since we are performing a differentiable Nash–Moser scheme. Note also that \( \Pi _N \) regularizes only in time (see (2.19)) because the loss of derivatives of the inverse operator is only in \( \varphi \) (see (4.44) and the bound on the small divisors (4.17)).
We look for a solution of the homological equation
Lemma 4.3
(Homological equation) For all \( \lambda \in {\Lambda }_{\nu +1}^{\gamma } \), (see (4.17)) there exists a unique solution \( \Psi := \Psi (\varphi ) \) of the homological equation (4.41). The map \( \Psi \) satisfies
Moreover if \(\gamma / 2 \le \gamma _1, \gamma _2 \le 2\gamma \) and if \( u_1(\lambda )\), \( u_2(\lambda ) \) are Lipschitz functions, then \(\forall s \in [\mathfrak s_0, {\mathfrak {s}}_0 + \beta ] \), \(\lambda \in \Lambda _{\nu + 1}^{\gamma _1}(u_1) \cap \Lambda _{\nu + 1}^{\gamma _2}(u_2)\)
where we define \( \Delta _{12} \Psi := \Psi ( u_1) -\Psi (u_2) \).
In the reversible case, \( \Psi \) is reversibility-preserving.
Proof
Since \(\mathcal{D} := \mathrm{diag}_{j \in {\mathbb {Z}}} (\mu _{j})\) we have \( [\mathcal{D}, \Psi ]_{j}^{k} = (\mu _j - \mu _k) \Psi _{j}^{k}(\varphi ) \) and (4.41) amounts to
whose solutions are \( \Psi _{j}^{k}(\varphi ) = \sum _{l \in {\mathbb {Z}}^\nu } \Psi _{j}^{k}(l) e^{\mathrm{i} l \cdot \varphi } \) with coefficients
Note that, for all \( \lambda \in \Lambda _{\nu + 1}^{\gamma } \), by (4.17) and (1.2), if \( j \ne k \) or \( l \ne 0 \) the divisors \( \delta _{ljk}(\lambda ) \ne 0 \). Recalling the definition of the \( s \)-norm in (2.3) we deduce by (4.44), (4.17), (1.2), that
For \( \lambda _{1}, \lambda _{2} \in \Lambda _{\nu + 1}^{\gamma } \),
and, since \(\omega = \lambda \bar{\omega }\),
because
Hence, for \( j \ne k \), \( \varepsilon \gamma ^{-1} \le 1 \),
for \( |l| \!\le \! N \). Finally, recalling (2.3), the bounds (4.46), (4.49) and (4.45) imply (4.42). Now we prove (4.43). By (4.44), for any \( \lambda \!\in \! \Lambda _{\nu + 1}^{\gamma _1} (u_1) \cap \Lambda _{\nu + 1}^{\gamma _2} (u_2) \), \( l \!\in \! {\mathbb {Z}}^{\nu } \), \( j \!\ne \! k \), we get
where
Then (4.50), (4.51), \(\varepsilon \gamma ^{-1} \le 1\), \(\gamma _{1}^{-1}, \gamma _{2}^{-1} \le \gamma ^{-1}\) imply
and so (4.43) (in fact, (4.43) holds with \(2\tau \) instead of \(2\tau +1\)).
In the reversible case \( \mathrm{i} \omega \cdot l + \mu _j - \mu _k \in \mathrm{i} {\mathbb {R}}\), \(\overline{{\mu }_{-j}} = \mu _j\) and \( \mu _{-j} = - \mu _j \). Hence Lemma 2.6 and (4.44) imply
and so \( \Psi \) is real, again by Lemma 2.6. Moreover, since \( \mathcal{R} : X \rightarrow Y \),
which implies \( \Psi : X \rightarrow X \) by Lemma 2.6. Similarly we get \( \Psi : Y \rightarrow Y \). \(\square \)
Remark 4.5
In the Hamiltonian case \( \mathcal{R} \) is Hamiltonian and the solution \( \Psi \) in (4.44) of the homological equation is Hamiltonian, because \( \overline{ \delta _{l,j,k} } = \delta _{-l,k,j} \) and, in terms of matrix elements, an operator \(G(\varphi )\) is self-adjoint if and only if \( \overline{ G_j^k(l) } = G_k^j(-l) \).
Let \( \Psi \) be the solution of the homological equation (4.41) which has been constructed in Lemma 4.3. By Lemma 2.3, if \( C({\mathfrak {s}}_0) | \Psi |_{{\mathfrak {s}}_0} < 1 /2 \) then \( \Phi := I + \Psi \) is invertible and by (4.40) (and (4.41)) we deduce that
where
Note that \({\mathcal {L}}_+\) has the same form of \( \mathcal{L} \), but the remainder \( {\mathcal {R}}_+ \) is the sum of a quadratic function of \( \Psi , \mathcal{R} \) and a remainder supported on high modes.
Lemma 4.4
(New diagonal part) The eigenvalues of
satisfy \(\mu _{j}^{+} = \overline{{\mu }^{+}_{-j}}\) and
Moreover if \( u_1 (\lambda )\), \(u_2 (\lambda )\) are Lipschitz functions, then for all \(\lambda \in \Lambda _{\nu }^{\gamma _1}(u_1) \cap \Lambda _{\nu }^{\gamma _2}(u_2)\)
In the reversible case, all the \(\mu _j^{+}\) are purely imaginary and satisfy \(\mu ^{+}_{j} = - \mu ^{+}_{-j}\) for all \(j \in {\mathbb {Z}}\).
Proof
The estimates (4.54)–(4.55) follow using (2.4) because \( | \mathcal{R}^{j}_{j}(0) |^\mathrm{lip} = \) \( |\mathcal{R}^{(l,j)}_{(l,j)} |^\mathrm{lip}\le |\mathcal{R} |_0^\mathrm{lip} \le |\mathcal{R} |_{{\mathfrak {s}}_0}^\mathrm{lip} \) and
Since \( \mathcal{R} \) is real, by Lemma 2.6,
and so \( \mu _{j}^+ = \overline{{\mu }_{-j}^{+}} \). If \({\mathcal {R}}\) is also reversible, by Lemma 2.6,
We deduce that \( \mathcal{R}^{j}_{j}(0) = - \mathcal{R}^{-j}_{-j}(0) \), \( \mathcal{R}^{j}_{j}(0) \in \mathrm{i} {\mathbb {R}}\) and therefore, \( \mu ^{+}_{j} = - \mu ^{+}_{-j} \) and \(\mu _{j}^{+} \in \mathrm{i} {\mathbb {R}}\). \(\square \)
Remark 4.6
In the Hamiltonian case, \({\mathcal {D}}_\nu \) is Hamiltonian, namely \( {\mathcal {D}}_\nu = \partial _x \mathcal {B} \) where \(\mathcal {B} = \mathrm {diag}_{j \ne 0} \{ b_j \}\) is self-adjoint. This means that \(b_j \in {\mathbb {R}}\), and therefore all \(\mu _j^\nu = \mathrm{i} j b_j \) are purely imaginary.
4.1.2 The iteration
Let \(\nu \ge 0\), and suppose that the statements \(\mathbf{({S}i)_{\nu }}\) are true. We prove \((\mathbf{Si})_{\nu +1}\), \(i=1,\ldots ,4\). To simplify notations we write \(|\cdot |_s\) instead of \(|\cdot |_s^{\mathrm{{Lip}(\gamma )}}\).
Proof of \((\mathbf{S1})_{\nu + 1}\). By \(\mathbf{(S1)_\nu } \), the eigenvalues \(\mu _j^\nu \) are defined on \(\Lambda _\nu ^\gamma \). Therefore the set \(\Lambda _{\nu +1}^\gamma \) is well-defined. By Lemma 4.3, for all \( \lambda \in \Lambda _{\nu +1}^{\gamma } \) there exists a real solution \( \Psi _{\nu } \) of the homological equation (4.41) which satisfies, \( \forall s \in [{\mathfrak {s}}_0, q- \sigma - \beta ] \),
which is (4.21) at the step \( \nu +1 \). In particular, for \( s = {\mathfrak {s}}_0 \),
for \( N_0 \) large enough. Then the map \( \Phi _{\nu } := I + \Psi _\nu \) is invertible and, by (2.13),
Hence (4.52)–(4.53) imply \( \mathcal{L}_{\nu + 1} := \) \( \Phi _{\nu }^{-1} \mathcal{L}_{\nu } \Phi _{\nu } = \) \( \omega \cdot \partial _{\varphi } + \mathcal{D}_{\nu + 1} + \mathcal{R}_{\nu + 1} \) where (see Lemma 4.4)
with \(\mu _j^{\nu + 1} = \overline{\mu _{-j}^{\nu + 1}} \) and
In the reversible case, \({\mathcal {R}}_\nu : X \rightarrow Y\), therefore, by Lemma 4.3, \(\Psi _\nu \), \(\Phi _\nu \), \(\Phi _{\nu }^{-1}\) are reversibility preserving, and then, by formula (4.60), also \({\mathcal {R}}_{\nu + 1} : X \rightarrow Y\).
Let us prove the estimates (4.19) for \( \mathcal{R}_{\nu + 1} \). For all \( s \in [{\mathfrak {s}}_0, q - \sigma - \beta ] \) we have
which shows that the iterative scheme is quadratic plus a super-exponentially small term. In particular
(\( \chi = 3 / 2 \)) which is the first inequality of (4.19) at the step \( \nu +1 \). The next key step is to control the divergence of the high norm \( | \mathcal{R}_{\nu +1} |_{s+\beta } \). By (4.61) (with \( s + \beta \) instead of \( s \)) we get
(the difference with respect to (4.62) is that we do not apply to \( | \Pi _{N_\nu }^\bot \mathcal{R}_{\nu } |_{s+\beta } \) any smoothing). Then (4.63), (4.19), (4.14), (4.13) imply the inequality
whence, iterating,
for \( N_0 := N_0 (s,\beta ) \) large enough, which is the second inequality of (4.19) with index \( \nu +1 \).
By Lemma 4.4 the eigenvalues \( \mu _j^{\nu + 1} := \mu _j^0 + r_j^{\nu + 1} \), defined on \( \Lambda _{\nu +1}^{\gamma } \), satisfy \(\mu _j^{\nu + 1} = \overline{{\mu }_{-j}^{\nu + 1}} \), and, in the reversible case, the \(\mu _{j}^{\nu + 1}\) are purely imaginary and \(\mu _j^{\nu + 1} = - \mu _{-j}^{\nu + 1}\).
It remains only to prove (4.18) for \(\nu +1\), which is proved below.
Proof of \(\mathbf{({S}2)}_{\nu + 1} \). By (4.54),
By Kirszbraun theorem, we extend the function \( \mu _j^{\nu + 1} - \mu _j^{\nu } = r_j^{\nu + 1} - r_j^{\nu } \) to the whole \( \Lambda \), still satisfying (4.64). In this way we define \( \tilde{\mu }_j^{\nu + 1}\). Finally (4.18) follows summing all the terms in (4.64) and using (3.68).
Proof of \(\mathbf{({S}3)}_{\nu + 1} \). Set, for brevity,
which are all operators defined for \(\lambda \in \Lambda _{\nu }^{\gamma _1}(u_1) \cap \Lambda _{\nu }^{\gamma _2}(u_2) \). By Lemma 4.3 one can construct \(\Psi _{\nu }^{i}:= \Psi _{\nu }(u_i)\), \(\Phi _{\nu }^i := \Phi _{\nu }(u_i)\), \(i = 1, 2\), for all \(\lambda \in \Lambda _{\nu + 1}^{\gamma _1}(u_1) \cap \Lambda _{\nu + 1}^{\gamma _2}(u_2)\). One has
for \( \varepsilon \gamma ^{-1} \) small (and (4.13)). By (2.14), applied to \(\Phi := \Phi _{\nu }\), and (4.65), we get
which implies for \(s = {\mathfrak {s}}_0\), and using (4.21), (4.14), (4.65)
Let us prove the estimates (4.23) for \(\Delta _{12}{\mathcal {R}}_{\nu + 1}\), which is defined on \(\lambda \in \Lambda _{\nu + 1}^{\gamma _1}(u_1) \cap \Lambda _{\nu + 1}^{\gamma _2}(u_2)\). For all \(s \in [ {{\mathfrak {s}}}_{0}, {\mathfrak s}_{0}+\beta ]\), using the interpolation (2.7) and (4.60),
We estimate the above terms separately. Set for brevity \( A^\nu _{s} := | {\mathcal {R}}_\nu (u_1) |_s + | {\mathcal {R}}_\nu (u_2) |_s \). By (4.60) and (2.7),
Estimating the four terms in the right hand side of (4.68) in the same way, using (4.66), (4.60), (4.42), (4.43), (4.21), (4.67), (4.58), (4.69), (4.19), we deduce
Specializing (4.70) for \( s = {\mathfrak {s}}_0 \) and using (3.68), (2.20), (4.19), (4.23), we deduce
for \(N_{0}\) large and \(\varepsilon \gamma ^{-1}\) small. Next by (4.70) with \( s = {\mathfrak {s}}_0 + \beta \)
for \(N_{0}\) large enough. Finally note that (4.24) is nothing but (4.55).
Proof of \(\mathbf{({S}4)}_{\nu + 1} \). We have to prove that, if \(C \varepsilon N_{\nu }^{\tau } \Vert u_1 - u_2 \Vert _{{\mathfrak {s}}_0 + \sigma _2} \le \rho \), then
Let \( \lambda \in \Lambda _{\nu +1}^{\gamma }(u_1) \). Definition (4.17) and \(\mathbf{({S}4)_{\nu }}\) (see (4.26)) imply that \( \Lambda _{\nu +1}^{\gamma }(u_1) \subseteq \Lambda _{\nu }^{\gamma }(u_1) \subseteq \Lambda _{\nu }^{\gamma - \rho }(u_2) \). Hence \( \lambda \in \Lambda _{\nu }^{\gamma - \rho }(u_2) \subset \Lambda _{\nu }^{\gamma /2}(u_2) \). Then, by \(\mathbf{({S}1)_{\nu }}\), the eigenvalues \(\mu _{j}^{\nu }(\lambda , u_2(\lambda ))\) are well defined. Now (4.16) and the estimates (3.64), (4.25) (which holds because \( \lambda \in \Lambda _{\nu }^{\gamma }(u_1) \cap \Lambda _{\nu }^{\gamma /2}(u_2) \)) imply that
Then we conclude that for all \(|l| \le N_{\nu }\), \(j \ne k\), using the definition of \(\Lambda _{\nu +1}^\gamma (u_1)\) (which is (4.17) with \(\nu +1\) instead of \(\nu \)) and (4.71),
provided \(C \varepsilon N_{\nu }^{\tau } \Vert u_1 - u_2 \Vert _{{\mathfrak {s}}_0 + \sigma _2} \le \rho \). Hence \( \lambda \in \Lambda ^{\gamma - \rho }_{\nu +1} ( u_2 ) \). This proves (4.26) at the step \( \nu + 1 \).
4.2 Inversion of \(\mathcal{L}(u)\)
In (3.57) we have conjugated the linearized operator \( {\mathcal {L}}\) to \({\mathcal {L}}_5\) defined in (3.55), namely \({\mathcal {L}}= \Phi _1 {\mathcal {L}}_5 \Phi _2^{-1}\). In Theorem 4.1 we have conjugated the operator \({\mathcal {L}}_5\) to the diagonal operator \({\mathcal {L}}_{\infty }\) in (4.7), namely \( {\mathcal {L}}_5 = \Phi _{\infty } {\mathcal {L}}_{\infty } \Phi _{\infty }^{-1}\). As a consequence
We first prove that \(W_1, W_2 \) and their inverses are linear bijections of \(H^{s}\). We take
Lemma 4.5
Let \( {\mathfrak {s}}_{0} \le s \le q - \sigma - \beta -3 \) where \( \beta \) is defined in (4.1) and \( \sigma \) in (3.58). Let \(u:= u(\lambda )\) satisfy \( \Vert u \Vert _{{\mathfrak {s}}_0 + \sigma + \beta + 3}^{\mathrm{{Lip}(\gamma )}} \le 1 \), and \( \varepsilon \gamma ^{-1} \le \delta \) be small enough. Then \( W_i \), \( i = 1, 2 \), satisfy, \( \forall \lambda \in \Lambda _{\infty }^{2\gamma }(u) \),
In the reversible case (i.e. (1.13) holds), \( W_i \), \( W_i^{-1} \), \( i = 1, 2 \) are reversibility-preserving.
Proof
The bound (4.74), resp. (4.75), follows by (4.8), (3.60), resp. (3.62), (2.12) and Lemma 6.5. In the reversible case \( W_i^{\pm 1} \) are reversibility preserving because \( \Phi _i^{\pm 1} \), \( \Phi _\infty ^{\pm 1} \) are reversibility preserving. \(\square \)
By (4.72) we are reduced to show that, \( \forall \lambda \in \Lambda ^{2\gamma }_{\infty }(u) \), the operator
is invertible, assuming (1.8) or the reversibility condition (1.13).
We introduce the following notation:
If (1.8) holds, then the linearized operator \( \mathcal{L} \) in (3.1) satisfies
(for \( {\mathfrak {s}}_0 \le s \le q-1 \)). In the reversible case (1.13)
Lemma 4.6
Assume either (1.8) or the reversibility condition (1.13). Then the eigenvalue
Proof
Assume (1.8). If \( r_0^\infty \ne 0 \) then there exists a solution of \( \mathcal{L}_\infty w = 1 \), which is \( w = 1 / r_0^\infty \). Therefore, by (4.72),
which is a contradiction because \( \Pi _C W_1 [1] \ne 0 \), for \( \varepsilon \gamma ^{-1} \) small enough, but the average \( \Pi _C \mathcal{L} W_2 [1 / r^\infty _0] = 0 \) by (4.77). In the reversible case \( r^\infty _0 = 0 \) was proved in Remark 4.3. \(\square \)
As a consequence of (4.79), the definition of \( \Lambda _\infty ^{2 \gamma } \) in (4.6) (just specializing (4.6) with \( k = 0 \)), and (1.2) (with \(\gamma \) and \(\tau \) as in (4.73)), we deduce also the first order Melnikov non-resonance conditions
Lemma 4.7
(Invertibility of \(\mathcal{L}_\infty \) ) For all \( \lambda \in \Lambda _\infty ^{2 \gamma } (u) \), for all \( g \in H^s_{00} \) the equation \( \mathcal{L}_\infty w = g \) has the unique solution with zero average
For all Lipschitz family \( g := g(\lambda ) \in H^s_{00} \) we have
In the reversible case, if \( g \in Y \) then \( \mathcal{L}_\infty ^{-1} g \in X \).
Proof
For all \(\lambda \in \Lambda _\infty ^{2\gamma } (u) \), by (4.80), formula (4.81) is well defined and
Now we prove the Lipschitz estimate. For \( \lambda _1 , \lambda _2 \in \Lambda _\infty ^{2\gamma } (u) \)
By (4.83)
Now we estimate the second term of (4.84). We simplify notations writing \( g := g(\lambda _{2}) \) and \( \delta _{lj} := \mathrm{i} \lambda \bar{\omega }\cdot l + \mu _j^\infty \).
The bound (4.5) imply \( \vert \mu _{j}^{\infty } \vert ^\mathrm{lip} \lessdot \varepsilon \gamma ^{-1} | j |^{3} \lessdot | j |^{3} \) and, using also (4.80),
Then (4.86) and (4.87) imply \( \gamma \Vert (\mathcal{L}_{\infty }^{-1}(\lambda _2) - \mathcal{L}_{\infty }^{-1}(\lambda _1) )g \Vert _s \lessdot \gamma ^{-1} \Vert g \Vert _{s + 2\tau + 1}^{\mathrm{{Lip}(\gamma )}} |\lambda _2 - \lambda _1 | \) that, finally, with (4.83), (4.85), prove (4.82). The last statement follows by the property (4.37). \(\square \)
In order to solve the equation \( \mathcal{L} h = f \) we first prove the following lemma.
Lemma 4.8
Let \({\mathfrak {s}}_0 + \tau + 3 \le s \le q - \sigma - \beta - 3 \). Under the assumption (1.8) we have
Proof
It is sufficient to prove that \( W_1 (H^s_{00}) = H^s_{00} \) because the second equality of (4.88) follows applying the isomorphism \( W_1^{-1} \). Let us give the proof of the inclusion
(which is essentially algebraic). For any \( g \in H^s_{00}\), let \( w(\varphi ,x) := \mathcal{L}_\infty ^{-1} g \in H^{s - \tau }_{00} \) defined in (4.81). Then \( h := W_2 w \in H^{s-\tau } \) satisfies
By (4.77) we deduce that \(W_1 g = {\mathcal {L}}h \in H^{s - \tau - 3}_{00} \). Since \( W_1 g \in H^s \) by Lemma 4.5, we conclude \( W_1 g \in H^s \cap H^{s - \tau - 3}_{00} = H^s_{00}\). The proof of (4.89) is complete.
It remains to prove that \(H^s_{00} {\setminus } W_1(H^s_{00}) = \emptyset \). By contradiction, let \( f \in H^s_{00} {\setminus } W_1(H^s_{00}) \). Let \( g := W_1^{-1}f \in H^s \) by Lemma 4.5. Since \(W_1 g = f \notin W_1(H^s_{00})\), it follows that \( g \notin H^s_{00} \) (otherwise it contradicts (4.89)), namely \(c := \Pi _C g \ne 0\). Decomposing \( g = c + {\mathbb {P}}g \) (recall (4.76)) and applying \(W_1\), we get \( W_1 g = c W_1[1] + W_1 {\mathbb {P}}g \). Hence
because \(W_1 g = f \in H^s_{00}\) and \(W_1 {\mathbb {P}}g \in W_1(H^s_{00}) \subseteq H^s_{00}\) by (4.89). However, \( \Pi _C W_1[1] \ne 0 \), a contradiction. \(\square \)
Remark 4.7
In the Hamiltonian case (which always satisfies (1.8)), the \( W_i (\varphi ) \) are maps of (a subspace of) \( H^1_0 \) so that Lemma 4.8 is automatic, and there is no need of Lemma 4.6.
We may now prove the main result of Sects. 3 and 4.
Theorem 4.3
(Right inverse of \( \mathcal{L}\) ) Let
where \( \sigma \), \(\beta \) are defined in (3.58), (4.1) respectively. Let \( u ( \lambda ) \), \( \lambda \in \Lambda _o \subseteq \Lambda \), be a Lipschitz family with
Then there exists \( \delta \) (depending on the data of the problem) such that if
and condition (1.8), resp. the reversibility condition (1.13), holds, then for all \( \lambda \in \Lambda _\infty ^{2 \gamma }(u)\) defined in (4.6), the linearized operator \({\mathcal {L}}:= {\mathcal {L}}(\lambda , u(\lambda ))\) (see (3.1)) admits a right inverse on \( H^s_{00} \), resp. \( Y \cap H^s \). More precisely, for \(\mathfrak s_0 \le s \le q - \mu \), for all Lipschitz family \( f(\lambda ) \in H^s_{00} \), resp. \( Y \cap H^s \), the function
is a solution of \( \mathcal{L} h = f \). In the reversible case, \( \mathcal{L}^{-1} f \in X \). Moreover
Proof
Given \(f \in H^s_{00}\), resp. \( f \in Y \cap H^s \), with \( s \) like in Lemma 4.8, the equation \( {\mathcal {L}}h = f \) can be solved for \( h \) because \(\Pi _C f = 0 \). Indeed, by (4.72), the equation \( \mathcal{L} h = f \) is equivalent to \( {\mathcal {L}}_\infty W_2^{-1}h = W_1^{-1}f \) where \(W_1^{-1}f \in H^s_{00} \) by Lemma 4.8, resp. \( W_1^{-1}f \in Y \cap H^s \) being \( W_1^{-1} \) reversibility-preserving (Lemma 4.5). As a consequence, by Lemma 4.7, all the solutions of \( \mathcal{L} h = f \) are
The solution (4.92) is the one with \( c = 0 \). In the reversible case, the fact that \( \mathcal{L}^{-1} f \in X \) follows by (4.92) and the fact that \( W_i \), \( W_i^{-1}\) are reversibility-preserving and \( \mathcal{L}_\infty ^{-1} : Y \rightarrow X \), see Lemma 4.7.
Finally (4.75), (4.82), (4.91) imply
and (4.93) follows using (6.2) with \( b_0 = {\mathfrak {s}}_0 \), \( a_0 := {\mathfrak {s}}_0 + 2 \tau + \sigma + \beta + 7 \), \( q = 2 \tau + 7 \), \( p = s - {\mathfrak {s}}_0 \). \(\square \)
In the next section we apply Theorem 4.3 to deduce tame estimates for the inverse linearized operators at any step of the Nash–Moser scheme. The approximate solutions along the iteration will satisfy (4.91).
5 The Nash–Moser iteration
We define the finite-dimensional subspaces of trigonometric polynomials
where \( N_n := N_0^{\chi ^n}\) (see (4.12)) and the corresponding orthogonal projectors
The following smoothing properties hold: for all \(\alpha , s \ge 0\),
where the function \(u(\lambda )\) depends on the parameter \(\lambda \) in a Lipschitz way. The bounds (5.1) are the classical smoothing estimates for truncated Fourier series, which also hold with the norm \(\Vert \cdot \Vert ^\mathrm{{Lip}(\gamma )}_s \) defined in (2.2).
Let
We define the constants
where \(\mu \) is the loss of regularity in (4.90).
Theorem 5.1
(Nash–Moser) Assume that \( f \in C^q \), \( q \ge {\mathfrak {s}}_0 + \mu + \beta _1 \), satisfies the assumptions of Theorem 1.1 or Theorem 1.3. Let \( 0 < \gamma \le \mathrm{min}\{\gamma _0, 1/48 \} \), \( \tau > \nu + 1 \). Then there exist \( \delta > 0 \), \( C_* > 0 \), \( N_0 \in {\mathbb {N}}\) (that may depend also on \( \tau \)) such that, if \( \varepsilon \gamma ^{-1} < \delta \), then, for all \( n \ge 0 \):
- \((\mathcal{P}1)_{n}\) :
-
there exists a function \(u_n : {\mathcal {G}}_n \subseteq \Lambda \rightarrow H_n\), \(\lambda \mapsto u_n(\lambda )\), with \( \Vert u_{n} \Vert _{{\mathfrak {s}}_0 + \mu }^{\mathrm{{Lip}(\gamma )}} \le 1 \), \( u_0 := 0\), where \(\mathcal{G}_{n} \) are Cantor like subsets of \( \Lambda := [1/2, 3/2] \) defined inductively by: \( \mathcal{G}_{0} := \Lambda \),
$$\begin{aligned} \mathcal{G}_{n+1}&:= \left\{ \lambda \in \mathcal{G}_{n} \, : \, |\mathrm{i} \omega \cdot l + \mu _j^\infty (u_{n}) - \mu _k^\infty (u_{n})| \right. \nonumber \\&\qquad \qquad \qquad \quad \ge \left. \frac{2\gamma _{n} |j^{3}-k^{3}|}{\left\langle l\right\rangle ^{\tau }}, \ \forall j , k \in {\mathbb {Z}}, \ l \in {\mathbb {Z}}^{\nu } \right\} \end{aligned}$$(5.4)where \( \gamma _{n}:=\gamma (1 + 2^{-n}) \). In the reversible case, namely (1.13) holds, then \( u_n (\lambda ) \in X \). The difference \(h_n := u_{n} - u_{n-1}\), where, for convenience, \(h_0 := 0\), satisfies
$$\begin{aligned} \Vert h_{n} \Vert _{{\mathfrak {s}}_0 + \mu }^\mathrm{{Lip}(\gamma )}\le C_* \varepsilon \gamma ^{-1} N_{n}^{-\sigma _1} , \quad \sigma _1 := 18 + 2 \mu . \end{aligned}$$(5.5) - \((\mathcal{P}2)_{n}\) :
-
\( \Vert F(u_n) \Vert _{{\mathfrak {s}}_{0}}^{\mathrm{{Lip}(\gamma )}} \le C_* \varepsilon N_{n}^{- \kappa }\).
- \((\mathcal{P}3)_{n}\) :
-
(High norms). \( \Vert u_n \Vert _{\mathfrak s_{0}+ \beta _1}^{\mathrm{{Lip}(\gamma )}} \le C_* \varepsilon \gamma ^{-1} N_{n}^{\kappa }\) and \( \Vert F(u_n ) \Vert _{{\mathfrak {s}}_{0}+\beta _1}^{\mathrm{{Lip}(\gamma )}} \le C_* \varepsilon N_{n}^{\kappa }\).
- \((\mathcal{P}4)_{n}\) :
-
(Measure). The measure of the Cantor like sets satisfies
(5.6)
All the Lip norms are defined on \( \mathcal{G}_{n} \).
Proof
The proof of Theorem 5.1 is split into several steps. For simplicity, we denote \( \Vert \ \Vert ^\mathrm{Lip} \) by \( \Vert \ \Vert \).
Step 1: prove \(({\mathcal {P}}1,2,3)_0\). \(({\mathcal {P}}1)_0\) and the first inequality of \(({\mathcal {P}}3)_0\) are trivial because \(u_0 = h_0 = 0\). \(({\mathcal {P}}2)_0\) and the second inequality of \(({\mathcal {P}}3)_0\) follow with \( C_* \ge \) \( \max \{ \Vert f(0)\Vert _{{\mathfrak {s}}_0} N_0^\kappa , \) \( \Vert f(0)\Vert _{{\mathfrak {s}}_0+ \beta _1} N_0^{-\kappa } \} \).
Step 2: assume that \(({\mathcal {P}}1,2,3)_n\) hold for some \(n \ge 0\), and prove \(({\mathcal {P}}1,2,3)_{n+1}\). By \(({\mathcal {P}}1)_n\) we know that \( \Vert u_n \Vert _{{\mathfrak {s}}_{0} + \mu } \le 1 \), namely condition (4.91) is satisfied. Hence, for \( \varepsilon \gamma ^{-1}\) small enough, Theorem 4.3 applies. Then, for all \(\lambda \in \mathcal{G}_{n+1} \) defined in (5.4), the linearized operator
(see (3.1)) admits a right inverse for all \( h \in H^s_{00} \), if condition (1.8) holds, respectively for \( h \in Y \cap H^s \) if the reversibility condition (1.13) holds. Moreover (4.93) gives the estimates
(use (5.1) and \( \Vert u_n \Vert _{{\mathfrak {s}}_{0} + \mu } \le 1 \)), for all Lipschitz map \(h(\lambda )\). Then, for all \(\lambda \in \mathcal{G}_{n+1} \), we define
which is well defined because, if condition (1.8) holds, then \( \Pi _{n + 1} F(u_n) \in H^s_{00} \), and, respectively, if (1.13) holds, then \( \Pi _{n + 1} F(u_{n}) \in Y \cap H^s \) (hence in both cases \( \mathcal{L}_n^{-1} \Pi _{n + 1} F(u_n) \) exists). Note also that in the reversible case \( h_{n + 1} \in X \) and so \( u_{n + 1} \in X \).
Recalling (5.2) and that \({\mathcal {L}}_n := F'(u_n) \), we write
where
With this definition,
where we have gained an extra \(\varepsilon \) from the commutator
Lemma 5.1
Set
There exists \(C_0 := C ( \tau _1, \mu , \nu , \beta _1) > 0 \) such that
Proof
The operators \({\mathcal {N}}'(u_n)\) and \(Q(u_n,\cdot )\) satisfy the following tame estimates:
where \(h(\lambda )\) depends on the parameter \(\lambda \) in a Lipschitz way. The bounds (5.14) and (5.16) follow by Lemma 6.2\((i)\) and Lemma 6.3. (5.15) is simply (5.14) at \(s = {\mathfrak {s}}_0\), using that \(\Vert u_n \Vert _{{\mathfrak {s}}_0 + 3} \le 1\), \(u_n, h_{n+1} \in H_{n+1}\) and the smoothing (5.1).
By (5.7) and (5.16), the term (in (5.11)) \( R_n := [ \mathcal{N}' (u_n), \Pi _{n+1}^\bot ] \mathcal{L}_n^{-1} \Pi _{n+1} F(u_n) \) satisfies, using also that \( u_n \in H_n \) and (5.1),
because \(\mu \ge \tau _1 + 3\). In proving (5.17) and (5.18), we have simply estimated \({\mathcal {N}}'(u_n) \Pi _{n+1}^\perp \) and \(\Pi _{n+1}^\perp {\mathcal {N}}'(u_n)\) separately, without using the commutator structure.
From the definition (5.9) of \(h_{n+1}\), using (5.7), (5.8) and (5.1), we get
because \(\mu \ge \tau _1\). Then
Formula (5.11) for \(F(u_{n+1})\), and (5.18), (5.15), (5.20), \(\varepsilon \gamma ^{-1}\le 1\), (5.1), imply
Similarly, using the “high norm” estimates (5.17), (5.14), (5.19), (5.20), \(\varepsilon \gamma ^{-1}\le 1\) and (5.1),
By (5.21), (5.22) and (5.23) we deduce (5.13). \(\square \)
By \( (\mathcal{P}2)_n \) we deduce, for \( \varepsilon \gamma ^{-1} \) small, that (recall the definition on \( w_n \) in (5.12))
Then, by the second inequality in (5.13), (5.24), \( (\mathcal{P}3)_n \) (recall the definition on \( U_n \) in (5.12)) and the choice of \( \kappa \) in (5.3), we deduce \( U_{n+1} \le C_* \varepsilon \gamma ^{-1} N_{n+1}^\kappa \), for \( N_0 \) large enough. This proves \( (\mathcal{P}3)_{n+1} \).
Next, by the first inequality in (5.13), (5.24), \( (\mathcal{P}2)_n \) (recall the definition on \( w_n \) in (5.12)) and (5.3), we deduce \( w_{n+1} \le C_* \varepsilon \gamma ^{-1} N_{n+1}^\kappa \), for \( N_0 \) large, \( \varepsilon \gamma ^{-1}\) small. This proves \( (\mathcal{P}2)_{n+1} \).
The bound (5.5) at the step \( n +1\) follows by (5.20) and \((\mathcal{P}2)_n \) (and (5.3)). Then
for \( \varepsilon \gamma ^{-1} \) small enough. As a consequence \(({\mathcal {P}}1,2,3)_{n+1}\) hold.
Step 3: prove \(({\mathcal {P}}4)_n\), \(n \ge 0\). For all \(n \ge 0\),
where
Notice that, by the definition (5.26), \(R_{ljk} (u_{n}) = \emptyset \) for \(j = k\). Then we can suppose in the sequel that \(j \ne k\). We divide the estimate into some lemmata.
Lemma 5.2
For \( \varepsilon \gamma ^{-1}\) small enough, for all \(n \ge 0\), \(|l|\le N_n\),
Proof
We claim that, for all \( j , k \in {\mathbb {Z}}\),
where \(\mu _{j}^{\infty }(u_{n}) := \mu _{j}^{\infty }(\lambda , u_{n}(\lambda ))\) and \( \alpha \) is defined in (4.13). Before proving (5.28) we show how it implies (5.27). For all \( j \ne k\), \(|l| \le N_{n}\), \(\lambda \in {\mathcal {G}}_n\), by (5.28)
for \(C \varepsilon \gamma ^{-1}N_{n}^{\tau -\alpha }\, 2^{n+1} \le 1\) (recall that \(\gamma _n := \gamma (1 + 2^{-n})\)), which implies (5.27).
Proof of (5.28)
By (4.4),
where \( m_3 (u_{n}) := m_3(\lambda , u_{n}(\lambda ))\) and similarly for \( m_1, r_{j}^{\infty }\). We first apply Theorem 4.2-\(\mathbf{(S4)_{\nu }}\) with \( \nu = n + 1 \), \( \gamma = \gamma _{n-1} \), \( \gamma - \rho = \gamma _n \), and \( u_1 \), \( u_2 \), replaced, respectively, by \( u_{n-1} \), \( u_n \), in order to conclude that
The smallness condition in (4.26) is satisfied because \(\sigma _2 < \mu \) (see definitions (4.13), (4.90)) and so
for \(\varepsilon \gamma ^{-1}\) small enough, because \( \sigma _1 > \tau \) (see (5.5), (4.90)). Then, by the definitions (5.4) and (4.6), we have
Next, for all \( \lambda \in \mathcal{G}_n \subset \Lambda _{n+1}^{\gamma _{n-1}}(u_{n-1}) \cap \Lambda _{n+1}^{\gamma _n}(u_n) \) both \( r_j^{n+1} (u_{n-1}) \) and \( r_j^{n+1} (u_{n}) \) are well defined, and we deduce by Theorem 4.2-\(\mathbf{(S3)}_\nu \) with \( \nu = n+1 \), that
Moreover (4.34) (with \( \nu = n+1 \)) and (3.66) imply that
because \( \sigma + \beta < \mu \) and \( \Vert u_{n-1} \Vert _{{\mathfrak {s}}_0 + \mu } + \) \( \Vert u_n \Vert _{{\mathfrak {s}}_0 + \mu } \le 2 \) by \( \mathbf{(S1)}_{n-1} \) and \( \mathbf{(S1)}_n \). Therefore, for all \(\lambda \in \mathcal{G}_{n}\), \( \forall j \in {\mathbb {Z}}\),
because \( \sigma _1 > \alpha \) (see (4.13), (5.5)). Finally (5.29), (5.33), (3.64), \(\Vert u_n \Vert _{{\mathfrak {s}}_0 + \mu }\le 1\), imply (5.28). \(\square \)
By definition, \( R_{ljk}(u_n) \subset \mathcal{G}_n \) (see (5.26)) and, by (5.27), for all \( |l| \le N_n \), we have \( R_{ljk} (u_n) \subseteq R_{ljk} (u_{n-1}) \). On the other hand \( R_{ljk}(u_{n-1}) \cap \mathcal{G}_n = \emptyset \), see (5.4). As a consequence, \( \forall |l| \le N_n \), \( R_{ljk} (u_n) = \emptyset \), and
Lemma 5.3
Let \(n \ge 0\). If \(R_{ljk}(u_{n}) \ne \emptyset \), then \(|j^{3}-k^{3}| \le 8 |\bar{\omega }\cdot l|\).
Proof
If \(R_{ljk}(u_{n})\,\ne \,\emptyset \) then there exists \(\lambda \in \Lambda \) such that \( |\mathrm{i} \lambda \bar{\omega }\cdot l + \mu _{j}^{\infty }(\lambda ,u_{n}(\lambda ))- \mu _{k}^{\infty }(\lambda ,u_{n}(\lambda )) | < \) \( 2 \gamma _{n} |j^{3}-k^{3} | \langle l \rangle ^{-\tau } \) and, therefore,
Moreover, by (4.4), (3.63), (4.5), for \(\varepsilon \) small enough,
if \(j \ne k\). Since \(\gamma _n \le 2\gamma \) for all \(n \ge 0\), \(\gamma \le 1/ 48\), by (5.35) and (5.36) we get
proving the Lemma. \(\square \)
Lemma 5.4
For all \(n \ge 0\),
Proof
Consider the function \( \phi : \Lambda \rightarrow {\mathbb {C}}\) defined by
where \( {\tilde{m}}_3 (\lambda ) \), \( {\tilde{m}}_1 (\lambda ) \), \( r^{\infty }_j (\lambda ) \), \( \mu _{j}^{\infty }(\lambda )\), are defined for all \( \lambda \in \Lambda \) and satisfy (4.5) by \( \Vert u_n \Vert ^{\mathrm{{Lip}(\gamma )}}_{\mathfrak s_0 + \mu , {\mathcal {G}}_n} \le 1\) (see \((\mathcal{P}1)_n\)). Recalling \( | \cdot |^\mathrm{lip}\le \gamma ^{-1}| \cdot |^\mathrm{{Lip}(\gamma )}\) and using (4.5)
Moreover Lemma 5.3 implies that, \(\forall \lambda _1, \lambda _2 \in \Lambda \),
for \(\varepsilon \gamma ^{-1}\) small enough. Hence
which is (5.37). \(\square \)
Now we prove \(({\mathcal {P}}4)_0\). We observe that, for each fixed \(l\), all the indices \(j,k\) such that \(R_{ljk}(0) \ne \emptyset \) are confined in the ball \(j^2 + k^2 \le 16 |\bar{\omega }| |l|\), because
and \(|j^{3}-k^{3}| \le 8 |\bar{\omega }| |l|\) by Lemma 5.3. As a consequence
if \(\tau > \nu + 1\). Thus the first estimate in (5.6) is proved, taking a larger \(C_*\) if necessary.
Finally, \(({\mathcal {P}}4)_n\) for \(n \ge 1\), follows by
and (5.6) is proved. The proof of Theorem 5.1 is complete. \(\square \)
5.1 Proof of Theorems 1.1, 1.2, 1.3, 1.4 and 1.5
Proof of Theorems 1.1, 1.2, 1.3
Assume that \( f \in C^q \) satisfies the assumptions in Theorem 1.1 or in Theorem 1.3 with a smoothness exponent \( q := q(\nu ) \ge {\mathfrak {s}}_0 + \mu + \beta _1 \) which depends only on \( \nu \) once we have fixed \( \tau := \nu + 2 \) (recall that \( {\mathfrak {s}}_0 := (\nu + 2 ) \slash 2 \), \( \beta _1 \) is defined in (5.3) and \( \mu \) in (4.90)).
For \( \gamma = \varepsilon ^a \), \( a \in (0,1) \) the smallness condition \( \varepsilon \gamma ^{- 1} = \varepsilon ^{1- a} < \delta \) of Theorem 5.1 is satisfied. Hence on the Cantor set \(\mathcal{G}_{\infty } := \cap _{n \ge 0} \mathcal{G}_{n} \), the sequence \( u_{n}(\lambda ) \) is well defined and converges in norm \( \Vert \cdot \Vert _{{\mathfrak {s}}_{0}+\mu , {\mathcal {G}}_\infty }^{\mathrm{{Lip}(\gamma )}}\) (see (5.5)) to a solution \(u_{\infty }(\lambda )\) of
namely \(u_{\infty }(\lambda )\) is a solution of the perturbed equation (1.4) with \( \omega = \lambda \bar{\omega }\). Moreover, by (5.6), the measure of the complementary set satisfies
proving (1.9). The proof of Theorem 1.1 is complete. In order to finish the proof of Theorems 1.2 or 1.3, it remains to prove the linear stability of the solution, namely Theorem 1.5.
Proof of Theorem 1.4
Part \((i) \) follows by (4.72), Lemma 4.5, Theorem 4.1 (applied to the solution \( u_\infty (\lambda ) \)) with the exponents \( \bar{\sigma }:= \sigma + \beta + 3 \), \( \Lambda _\infty (u) := \Lambda _\infty ^{2\gamma } (u) \), see (4.6). Part (\(ii\)) follows by the dynamical interpretation of the conjugation procedure, as explained in Sect. 2.2. Explicitly, in Sects. 3 and 4, we have proved that
By the arguments in Sect. 2.2 we deduce that a curve \(h(t)\) in the phase space \(H^s_x\) is a solution of the dynamical system (1.19) if and only if the transformed curve
(see notation (2.18), Lemma 3.3, (4.9)) is a solution of the constant coefficients dynamical system (1.20).
Proof of Theorem 1.5
If all \( \mu _j \) are purely imaginary, the Sobolev norm of the solution \( v(t) \) of (1.20) is constant in time, see (1.21). We now show that also the Sobolev norm of the solution \( h(t) \) in (5.39) does not grow in time. For each \(t \in {\mathbb {R}}\), \( \mathcal{A}(\omega t) \) and \(W(\omega t)\) are transformations of the phase space \(H^s_x \) that depend quasi-periodically on time, and satisfy, by (3.69), (3.71), (4.9),
where the constant \(C(s)\) depends on \(\Vert u \Vert _{s + \sigma + \beta + {\mathfrak {s}}_0} < + \infty \). Moreover, the transformation \(B\) is a quasi-periodic reparametrization of the time variable (see (2.25)), namely
where \(\tau = \psi (t) := t + \alpha (\omega t)\), \(t = \psi ^{-1}(\tau ) = \tau + \tilde{\alpha }(\omega \tau )\) and \(\alpha \), \(\tilde{\alpha }\) are defined in Sect. 3.2. Thus
having chosen \(\tau _0 := \psi (0) = \alpha (0)\) (in the reversible case, \(\alpha \) is an odd function, and so \(\alpha (0) = 0\)). Hence (1.22) is proved. To prove (1.23), we collect the estimates (3.70), (3.72), (4.9) into
where the constant \(C(s)\) depends on \(\Vert u \Vert _{s + \sigma + \beta + {\mathfrak {s}}_0}\). Thus
Applying the same chain of inequalities at \( \tau = \tau _0 \), \( t = 0 \), we get that the last term is
proving the second inequality in (1.23) with \( \mathtt a := 1 - a \). The first one follows similarly.
References
Baldi, P.: Periodic solutions of forced Kirchhoff equations. Ann. Scuola Norm. Sup. Pisa. Cl. Sci. (5) 8, 117–141 (2009)
Baldi, P.: Periodic solutions of fully nonlinear autonomous equations of Benjamin–Ono type. Ann. I. H. Poincaré (C) Anal. Non Linéaire 30(1), 33–77 (2013)
Baldi, P., Berti, M., Montalto, R.: A note on KAM theory for quasi-linear and fully nonlinear KdV. Rend. Lincei Mat. Appl. 24, 437–450 (2013)
Bambusi, D., Graffi, S.: Time quasi-periodic unbounded perturbations of Schrödinger operators and KAM methods. Commun. Math. Phys. 219, 465–480 (2001)
Berti, M., Biasco, L.: Branching of Cantor manifolds of elliptic tori and applications to PDEs. Commun. Math. Phys 305(3), 741–796 (2011)
Berti, M., Biasco, P., Procesi, M.: KAM theory for the Hamiltonian DNLW. Ann. Sci. Éc. Norm. Supér. (4), t. 46 (2013), fascicule 2, 301–272
Berti, M., Biasco, L., Procesi, M.: KAM for the reversible derivative wave equations. Arch. Ration. Mech. Anal (to appear)
Berti, M., Bolle, P.: Cantor families of periodic solutions for completely resonant nonlinear wave equations. Duke Math. J. 134, 359–419 (2006)
Berti, M., Bolle, P.: Quasi-periodic solutions with Sobolev regularity of NLS on \(\mathbb{T}^d\) with a multiplicative potential. Eur. J. Math. 15, 229–286 (2013)
Berti, M., Bolle, P.: Sobolev quasi periodic solutions of multidimensional wave equations with a multiplicative potential. Nonlinearity 25, 2579–2613 (2012)
Berti, M., Bolle, P., Procesi, M.: An abstract Nash–Moser theorem with parameters and applications to PDEs. Ann. I. H. Poincaré 27, 377–399 (2010)
Bourgain, J.: Construction of quasi-periodic solutions for Hamiltonian perturbations of linear equations and applications to nonlinear PDE. Int. Math. Res. Notices, no. 11 (1994)
Bourgain, J.: Quasi-periodic solutions of Hamiltonian perturbations of \(2D\) linear Schrödinger equations. Ann. Math. 148, 363–439 (1998)
Bourgain, J.: Periodic solutions of nonlinear wave equations, Harmonic analysis and partial differential equations. Chicago Lectures in Math., pp. 69–97. Univ. Chicago Press, Chicago (1999)
Bourgain, J.: Green’s function estimates for lattice Schrödinger operators and applications. In: Annals of Mathematics Studies, vol. 158. Princeton University Press, Princeton (2005)
Craig, W.: Problèmes de petits diviseurs dans les équations aux dérivées partielles. In: Panoramas et Synthèses, vol. 9. Société Mathématique de France, Paris (2000)
Craig, W., Wayne, C.E.: Newton’s method and periodic solutions of nonlinear wave equation. Commun. Pure Appl. Math. 46, 1409–1498 (1993)
Delort, J.-M.: A quasi-linear Birkhoff normal forms method. Application to the quasi-linear Klein–Gordon equation on \(\mathbb{S}^1\). Astérisque 341 (2012)
Eliasson, L.H., Kuksin, S.: KAM for non-linear Schrödinger equation. Ann. Math. 172, 371–435 (2010)
Eliasson, L.H., Kuksin, S.: On reducibility of Schrödinger equations with quasiperiodic in time potentials. Commun. Math. Phys. 286, 125–135 (2009)
Geng, J., Xu, X., You, J.: An infinite dimensional KAM theorem and its application to the two dimensional cubic Schrödinger equation. Adv. Math. 226(6), 5361–5402 (2011)
Gentile, G., Procesi, M.: Periodic solutions for a class of nonlinear partial differential equations in higher dimension. Commun. Math. Phys. 289(3), 863–906 (2009)
Grebert, B., Thomann, L.: KAM for the quantum harmonic oscillator. Commun. Math. Phys. 307(2), 383–427 (2011)
Iooss, G., Plotnikov, P.I., Toland, J.F.: Standing waves on an infinitely deep perfect fluid under gravity. Arch. Rational Mech. Anal. 177(3), 367–478 (2005)
Iooss, G., Plotnikov, P.I.: Small divisor problem in the theory of three-dimensional water gravity waves. Mem. Am. Math. Soc. 200, no. 940 (2009)
Iooss, G., Plotnikov, P.I.: Asymmetrical three-dimensional travelling gravity waves. Arch. Rational Mech. Anal. 200(3), 789–880 (2011)
Kappeler, T., Pöschel, J.: KAM and KdV. Springer, Berlin (2003)
Klainermann, S., Majda, A.: Formation of singularities for wave equations including the nonlinear vibrating string. Commun. Pure Appl. Math. 33, 241–263 (1980)
Kuksin, S.: Hamiltonian perturbations of infinite-dimensional linear systems with imaginary spectrum, Funktsional. Anal. i Prilozhen. 21(3), 22–37, 95 (1987)
Kuksin, S.: A KAM theorem for equations of the Korteweg–de Vries type. Rev. Math. Math Phys. 10(3), 1–64 (1998)
Kuksin, S.: Analysis of Hamiltonian PDEs. In: Oxford Lecture Series in Mathematics and its Applications, vol. 19. Oxford University Press, Oxford (2000)
Kuksin, S., Pöschel, J.: Invariant Cantor manifolds of quasi-periodic oscillations for a nonlinear Schrödinger equation. Ann. Math. (2) 143, 149–179 (1996)
Lax, P.: Development of singularities of solutions of nonlinear hyperbolic partial differential equations. J. Math. Phys. 5, 611–613 (1964)
Liu, J., Yuan, X.: Spectrum for quantum Duffing oscillator and small-divisor equation with large-variable coefficient. Commun. Pure Appl. Math. 63(9), 1145–1172 (2010)
Liu, J., Yuan, X.: A KAM theorem for hamiltonian partial differential equations with unbounded perturbations. Commun. Math. Phys. 307, 629–673 (2011)
Moser, J.: A rapidly convergent iteration method and non-linear partial differential equations-I. Ann. Sc. Norm. Sup. Pisa Cl. Sci. (3) 20(2), 265–315 (1966)
Pöschel, J.: A KAM-theorem for some nonlinear partial differential equations. Ann. Sci. Norm. Sup. Pisa Cl. Sci. (4) 23, 119–148 (1996)
Procesi, C., Procesi, M.: A KAM algorithm for the completely resonant nonlinear Schrödinger equation (2012, preprint)
Procesi, M., Xu, X.: Quasi-Töplitz Functions in KAM Theorem. SIAM J. Math. Anal. 45(4), 2148–2181 (2013)
Rabinowitz, P.H.: Periodic solutions of nonlinear hyperbolic partial differential equations, Part I. Commun. Pure Appl. Math. 20, 145–205 (1967)
Rabinowitz, P.H.: Periodic solutions of nonlinear hyperbolic partial differential equations. Part II. Commun. Pure Appl. Math. 22, 15–39 (1969)
Wang, W.M.: Supercritical nonlinear Schrödinger equations I: quasi-periodic solutions (2010, preprint)
Wayne, E.: Periodic and quasi-periodic solutions of nonlinear wave equations via KAM theory. Commun. Math. Phys. 127, 479–528 (1990)
Zhang, J., Gao, M., Yuan, X.: KAM tori for reversible partial differential equations. Nonlinearity 24, 1189–1228 (2011)
Acknowledgments
We warmly thank W. Craig for many discussions about the reduction approach of the linearized operators and the reversible structure, and P. Bolle for deep observations about the Hamiltonian case. We also thank T. Kappeler, M. Procesi for many useful comments.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research was supported by the European Research Council under FP7 and partially by the PRIN2009 grant “Critical Point Theory and Perturbative Methods for Nonlinear Differential Equations”.
Appendix: Tame and Lipschitz estimates
Appendix: Tame and Lipschitz estimates
In this Appendix we present standard tame and Lipschitz estimates for composition of functions and changes of variables which are used in the paper.
Let \( H^s := H^{s}({\mathbb {T}}^d,{\mathbb {C}}) \) (with norm \( \Vert \ \Vert _s \)) and \( W^{s, \infty } := W^{s, \infty }({\mathbb {T}}^d,{\mathbb {C}}) \), \( d \ge 1\).
Lemma 6.1
Let \( s_0 > d/2\). Then
-
(i)
Embedding. \(\Vert u \Vert _{L^\infty } \le C(s_0) \Vert u \Vert _{s_0}\) for all \(u \in H^{s_0} \).
-
(ii)
Algebra. \(\Vert uv \Vert _{s_0} \le C(s_0) \Vert u \Vert _{s_0} \Vert v \Vert _{s_0}\) for all \(u, v \in H^{s_0}\).
-
(iii)
Interpolation. For \(0 \le s_1 \le s \le s_2\), \(s = \lambda s_1 + (1-\lambda ) s_2\),
$$\begin{aligned} \Vert u \Vert _{s} \le \Vert u \Vert _{s_1}^\lambda \Vert u \Vert _{s_2}^{1-\lambda } , \quad \forall u \in H^{s_2}. \end{aligned}$$(6.1)Let \( a_0, b_0 \ge 0\) and \( p,q > 0 \). For all \( u \in H^{a_0 + p + q} \), \( v \in H^{b_0 + p + q} \),
$$\begin{aligned} \Vert u \Vert _{a_0 + p} \Vert v \Vert _{b_0 + q} \le \Vert u \Vert _{a_0 + p + q} \Vert v \Vert _{b_0} + \Vert u \Vert _{a_0} \Vert v \Vert _{b_0 + p + q}. \end{aligned}$$(6.2)Similarly, for the \(|u|_{s, \infty } := \sum _{|\beta | \le s} | D^\beta u |_{L^\infty } \) norm,
$$\begin{aligned} | u |_{s, \infty } \le C(s_1, s_2) | u |_{s_1, \infty }^\lambda | u |_{s_2, \infty }^{1-\lambda } , \quad \forall u \in W^{s_2, \infty } , \end{aligned}$$(6.3)and \( \forall u \in W^{a_0 + p + q, \infty } \), \( v \in W^{b_0 + p + q, \infty } \),
$$\begin{aligned}&| u |_{a_0 + p, \infty } | v |_{b_0 + q, \infty }\nonumber \\&\quad \le C(a_0, b_0, p,q)\left( | u |_{a_0 + p + q, \infty } | v |_{b_0, \infty } + | u |_{a_0, \infty } | v |_{b_0 + p + q, \infty }\right) . \end{aligned}$$(6.4) -
(iv)
Asymmetric tame product. For \(s \ge s_0\),
$$\begin{aligned} \Vert uv \Vert _s \le C(s_0) \Vert u\Vert _s \Vert v\Vert _{s_0} + C(s) \Vert u\Vert _{s_0} \Vert v \Vert _s , \quad \forall u,v \in H^s. \end{aligned}$$(6.5) -
(v)
Asymmetric tame product in \(W^{s,\infty }\). For \(s \ge 0\), \(s \in {\mathbb {N}}\),
$$\begin{aligned} | uv |_ {s, \infty } \le \tfrac{3}{2} \, | u |_ {L^\infty } |v|_ {s, \infty } + C(s) |u|_ {s, \infty } |v|_ {L^\infty } , \quad \forall u,v \in W^{s,\infty }. \end{aligned}$$(6.6) -
(vi)
Mixed norms asymmetric tame product. For \(s \ge 0\), \(s \in {\mathbb {N}}\),
$$\begin{aligned} \Vert uv \Vert _s \le \tfrac{3}{2} \, |u|_ {L^\infty } \Vert v \Vert _s + C(s)| u |_ {s, \infty } \Vert v \Vert _0 , \quad \forall u \in W^{s,\infty } , \ v \in H^s . \end{aligned}$$(6.7)If \(u := u(\lambda )\) and \(v := v(\lambda )\) depend in a Lipschitz way on \(\lambda \in \Lambda \subset {\mathbb {R}}\), all the previous statements hold if we replace the norms \(\Vert \cdot \Vert _s\), \(\vert \cdot \vert _ {s, \infty } \) with the norms \(\Vert \cdot \Vert _s^{\mathrm{{Lip}(\gamma )}}\), \(\vert \cdot \vert _ {s, \infty }^{\mathrm{{Lip}(\gamma )}}\).
Proof
The interpolation estimate (6.1) for the Sobolev norm (1.5) follows by Hölder inequality, see also [36], page 269. Let us prove (6.2). Let \( a = a_0 \lambda + a_1 (1-\lambda ) \), \( b = b_0 (1-\lambda ) + b_1 \lambda \), \( \lambda \in [0,1] \). Then (6.1) implies
by Young inequality. Applying (6.8) with \( a = a_0 + p \), \( b = b_0 + q \), \( a_1 = a_0 + p + q \), \( b_1 = b_0 + p + q \), then \( \lambda = q \slash (p+q) \) and we get (6.2). Also the interpolation estimates (6.3) are classical (see [11]) and (6.3) implies (6.4) as above.
\((iv)\): see the Appendix of [11]. \((v)\): we write, in the standard multi-index notation,
Using \( |(D^\beta u)(D^\gamma v)|_ {L^\infty } \le |D^\beta u|_ {L^\infty } |D^\gamma v|_ {L^\infty } \le |u|_{|\beta |, \infty } |v|_{|\gamma |, \infty } \), and the interpolation inequality (6.3) for every \( \beta \ne 0\) with \(\lambda := |\beta | / |\alpha | \in (0,1]\) (where \( |\alpha | \le s \)), we get, for any \(K > 0\),
Then (6.6) follows by (6.9), (6.10) taking \( K := K(s) \) large enough. \((vi)\): same proof as \((v)\), using the elementary inequality \(\Vert (D^\beta u)(D^\gamma v) \Vert _0 \le |D^\beta u|_{L^\infty } \Vert D^\gamma v \Vert _0 \). \(\square \)
We now recall classical tame estimates for composition of functions, see [36], section 2, pages 272–275, and [40, 41]-I, Lemma 7 in the Appendix, pages 202–203.
A function \( f : {\mathbb {T}}^d \times B_1 \rightarrow {\mathbb {C}}\), where \(B_1 := \{ y \in {\mathbb {R}}^m : |y| < 1\} \), induces the composition operator
where \(D^k u(x)\) denotes the partial derivatives \(\partial _x^\alpha u(x)\) of order \(|\alpha |=k\) (the number \( m \) of \( y \)-variables depends on \(p, d\)).
Lemma 6.2
(Composition of functions) Assume \( f \in C^r ({\mathbb {T}}^d \times B_1)\). Then
-
(i)
For all \( u \in H^{r+p} \) such that \( |u |_{p, \infty } < 1 \), the composition operator (6.11) is well defined and
$$\begin{aligned} \Vert \tilde{f}(u) \Vert _r \le C \Vert f \Vert _{C^r} (\Vert u\Vert _{r+p} + 1) \end{aligned}$$where the constant \(C \) depends on \( r,d,p \). If \( f \in C^{r+2} \), then, for all \( |u|_{p, \infty } \), \( | h |_{p, \infty } < 1 / 2 \),
$$\begin{aligned} \big \Vert \tilde{f}(u+h) - \tilde{f} (u) \big \Vert _r&\le C \Vert f \Vert _{C^{r+1}} \, ( \Vert h \Vert _{r+p} + | h |_{p,\infty } \Vert u \Vert _{r+p}) , \\ \big \Vert \tilde{f}(u+h) - \tilde{f} (u) - \tilde{f}'(u) [h] \big \Vert _r&\le C \Vert f \Vert _{C^{r+2}} \, | h |_{p,\infty } ( \Vert h \Vert _{r+p} \!+\! | h |_{p,\infty } \Vert u \Vert _{r+p}). \end{aligned}$$ -
(ii)
The previous statement also holds replacing \(\Vert \ \Vert _r\) with the norms \(| \ |_{r, \infty } \).
Lemma 6.3
(Lipschitz estimate on parameters) Let \(d \in {\mathbb {N}}\), \(d/2 < s_0 \le s\), \(p \ge 0\), \(\gamma >0\). Let \( F \) be a \( C^1 \)-map satisfying the tame estimates: \( \forall \Vert u \Vert _{s_0+p} \le 1 \), \( h \in H^{s+p} \),
For \(\Lambda \subset {\mathbb {R}}\), let \( u(\lambda ) \) be a Lipschitz family of functions with \( \Vert u \Vert _{s_0 +p}^{\mathrm{{Lip}(\gamma )}} \le 1 \) (see (2.2)). Then
The same statement also holds when all the norms \(\Vert \ \Vert _s \) are replaced by \(| \ |_{s, \infty } \).
Proof
By (6.12) we get \( \sup _\lambda \Vert F(u(\lambda )) \Vert _s \le C(s) ( 1 + \Vert u \Vert _{s+p}^{\mathrm{{Lip}(\gamma )}}) \). Then, denoting \( u_1 := u(\lambda _1)\) and \(h := u(\lambda _2) - u(\lambda _1)\), we have
whence
because \( \Vert u \Vert _{s_0 +p}^{\mathrm{{Lip}(\gamma )}} \le 1 \), and the lemma follows. \(\square \)
The next lemma is also classical, see for example [24], Appendix G. The present version is proved in [2], except for the part on the Lipschitz dependence on a parameter, which is proved here below.
Lemma 6.4
(Change of variable) Let \(p:{\mathbb {R}}^d \rightarrow {\mathbb {R}}^d\) be a \(2\pi \)-periodic function in \(W^{s,\infty }\), \( s \ge 1\), with \( |p|_{1, \infty } \le 1/2 \). Let \(f(x) = x + p(x)\). Then:
-
(i)
\(f\) is invertible, its inverse is \(f^{-1}(y) = g(y) = y + q(y)\) where \(q\) is \( 2 \pi \)-periodic, \(q \in W^{s,\infty }({\mathbb {T}}^d,{\mathbb {R}}^d)\), and \(|q|_{s, \infty } \le C |p|_{s, \infty } \). More precisely,
$$\begin{aligned} | q |_{L^\infty } \!=\! | p |_{L^\infty }, \quad \! | Dq |_{L^\infty } \le 2 | Dp |_{L^\infty } , \quad \! | Dq |_{s-1, \infty } \le C | Dp |_{s-1, \infty }.\quad \quad \end{aligned}$$(6.14)where the constant \(C\) depends on \(d, s\). Moreover, suppose that \(p = p_\lambda \) depends in a Lipschitz way by a parameter \(\lambda \in \Lambda \subset {\mathbb {R}}\), and suppose, as above, that \(|D_x p_\lambda |_ {L^\infty } \le 1/2\) for all \(\lambda \). Then \(q = q_\lambda \) is also Lipschitz in \(\lambda \), and
$$\begin{aligned} |q|_{s, \infty }^{\mathrm{{Lip}(\gamma )}} \!\le \! C \left( |p|_ {s, \infty } ^{\mathrm{{Lip}(\gamma )}} \!+\! \big \{ \sup _{\lambda \in \Lambda } |p_\lambda |_{s+1, \infty } \big \} \, |p|_{L^\infty }^{\mathrm{{Lip}(\gamma )}} \right) \!\le \! C |p|_{s+1, \infty }^{\mathrm{{Lip}(\gamma )}}.\quad \quad \end{aligned}$$(6.15)The constant \(C\) depends on \(d, s \) (and is independent of \(\gamma \)).
-
(ii)
If \(u \in H^s ({\mathbb {T}}^d,{\mathbb {C}})\), then \(u\circ f(x) = u(x+p(x))\) is also in \(H^s \), and, with the same \(C\) as in \((i)\),
$$\begin{aligned} \Vert u \circ f \Vert _s&\le C (\Vert u\Vert _s + |Dp|_{s-1, \infty } \Vert u\Vert _1), \end{aligned}$$(6.16)$$\begin{aligned} \Vert u \circ f - u \Vert _s&\le C \left( | p |_{L^\infty } \Vert u \Vert _{s + 1} + |p|_{s, \infty } \Vert u \Vert _{2} \right) , \end{aligned}$$(6.17)$$\begin{aligned} \Vert u \circ f \Vert _{s}^{\mathrm{{Lip}(\gamma )}}&\le C \, \left( \Vert u \Vert _{s+1}^{\mathrm{{Lip}(\gamma )}} + |p|_{s, \infty }^{\mathrm{{Lip}(\gamma )}} \Vert u \Vert _2^{\mathrm{{Lip}(\gamma )}} \right) . \end{aligned}$$(6.18) -
(iii)
Part \((ii)\) also holds with \(\Vert \cdot \Vert _k\) replaced by \(| \cdot |_{k, \infty }\), and \(\Vert \cdot \Vert _{s}^{\mathrm{{Lip}(\gamma )}}\) replaced by \(\vert \cdot \vert _{s, \infty }^{\mathrm{{Lip}(\gamma )}}\), namely
$$\begin{aligned} | u \circ f |_{s, \infty }&\le C (|u|_{s, \infty } + |Dp|_{s-1, \infty } |u|_{1, \infty }), \end{aligned}$$(6.19)$$\begin{aligned} | u \circ f |_{s, \infty }^\mathrm{{Lip}(\gamma )}&\le C (|u|_{s+1, \infty }^\mathrm{{Lip}(\gamma )}+ |Dp|_{s-1, \infty }^\mathrm{{Lip}(\gamma )}|u|_{2, \infty }^\mathrm{{Lip}(\gamma )}). \end{aligned}$$(6.20)
Proof
The bounds (6.14), (6.16) and (6.19) are proved in [2], Appendix B. Let us prove (6.15). Denote \(p_\lambda (x) := p(\lambda ,x)\), and similarly for \(q_\lambda , g_\lambda , f_\lambda \). Since \(y = f_\lambda (x) = x + p_\lambda (x)\) if and only if \(x = g_\lambda (y) = y + q_\lambda (y)\), one has
Let \(\lambda _{1}, \lambda _{2} \in \Lambda \), and denote, in short, \(q_1 = q_{\lambda _1}\), \(q_2 = q_{\lambda _2}\), and so on. By (6.21),
where \( G_2 h := h \circ g_2 \), \( G_t h := h \circ \left( g_1 + (t-1) [g_2 - g_1] \right) \), \( t \in [1,2]\). By (6.22), the \( L^\infty \) norm of \((q_2 - q_1)\) satisfies
whence, using the assumption \(|D_{x} p_1|_{L^\infty } \le 1/2\), we get \( |q_2 - q_1|_{L^\infty } \le 2 |p_2 - p_1|_{L^\infty } \).
By (6.22), using (6.6), the \(W^{s,\infty }\) norm of \((q_2 - q_1)\), for \(s \ge 0\), satisfies
Since \(|G_t (D_{x} p_1)|_{L^\infty } = |D_x p_1|_{L^\infty } \le 1/2\),
Using \( |q_2 - q_1|_{L^\infty } \le 2 |p_2 - p_1|_{L^\infty } \), (6.19), (6.4) and (6.14),
and (6.15) follows. The proof of (6.17), (6.18), (6.20) may be obtained similarly. \(\square \)
Lemma 6.5
(Composition) Suppose that for all \(\Vert u \Vert _{s_{0}+ \mu _i} \le 1\) the operator \(\mathcal{Q}_i(u)\) satisfies
Let \(\tau := \mathrm{max}\{ \tau _1, \tau _2 \}\), \(\mu := \mathrm{max}\{ \mu _1, \mu _2 \}\). Then, for all
the composition operator \(\mathcal{Q} := \mathcal{Q}_{1} \circ \mathcal{Q}_2\) satisfies the tame estimate
Moreover, if \(\mathcal{Q}_1\), \(\mathcal{Q}_2\), \(u\) and \(h\) depend in a Lipschitz way on a parameter \(\lambda \), then (6.25) also holds with \(\Vert \cdot \Vert _s\) replaced by \(\Vert \cdot \Vert _{s}^{\mathrm{{Lip}(\gamma )}}\).
Proof
Apply the estimates for (6.23) to \(\mathcal{Q}_1\) first, then to \(\mathcal{Q}_2\), using condition (6.24). \(\square \)
Rights and permissions
About this article
Cite this article
Baldi, P., Berti, M. & Montalto, R. KAM for quasi-linear and fully nonlinear forced perturbations of Airy equation. Math. Ann. 359, 471–536 (2014). https://doi.org/10.1007/s00208-013-1001-7
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00208-013-1001-7