Invariant tori via higher order averaging method: existence, regularity, convergence, stability, and dynamics

Novaes, Douglas D.; Pereira, Pedro C. C. R.

doi:10.1007/s00208-023-02654-2

Invariant tori via higher order averaging method: existence, regularity, convergence, stability, and dynamics

Published: 24 June 2023

Volume 389, pages 543–590, (2024)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Mathematische Annalen Aims and scope Submit manuscript

Invariant tori via higher order averaging method: existence, regularity, convergence, stability, and dynamics

Download PDF

328 Accesses
2 Citations
Explore all metrics

Abstract

Important information about the dynamical structure of a differential system can be revealed by looking into its invariant compact manifolds, such as equilibria, periodic orbits, and invariant tori. This knowledge is significantly increased if asymptotic properties of the trajectories nearby such invariant manifolds can be determined. In this paper, we present a result providing sufficient conditions for the existence of invariant tori in perturbative differential systems. The regularity, convergence, and stability of such tori as well as the dynamics defined on them are also investigated. The conditions are given in terms of their so-called higher order averaged equations. This result is an extension to a wider class of differential systems of theorems due to Krylov, Bogoliubov, Mitropolsky, and Hale.

Invariant Manifold Theory

Computation of Quasiperiodic Normally Hyperbolic Invariant Tori: Rigorous Results

Article 13 May 2017

Asymptotic stability analysis of autonomous systems by applying the method of localization of compact invariant sets

Article 01 July 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction and statement of the main result

The averaging method has been employed by Krylov, Bogoliubov, and Mitropolski [1, 16] to study the existence of invariant tori in the extended phase space of T-periodic non-autonomous perturbative differential equations of the kind ${\dot{\textbf{x}}}=\varepsilon F_1(t,\textbf{x}).$ Those results were generalized by Hale in [11] and [12] and assert that the existence of invariant tori is associated to the existence of limit cycles of the so-called averaged equation,

$$\begin{aligned} {\dot{\textbf{x}}}=\dfrac{1}{T}\int _0^T F_1(t,\textbf{x})dt. \end{aligned}$$

In this paper, we are concerned with a wider class of T-periodic non-autonomous perturbative differential equations of the following kind:

$$\begin{aligned} {\dot{\textbf{x}}}= \sum _{i=1}^N \varepsilon ^i F_i(t, \textbf{x}) + \varepsilon ^{N+1} {\tilde{F}}(t, \textbf{x},\varepsilon ),\quad (t,\textbf{x},\varepsilon )\in \mathbb {R}\times D \times [0,\varepsilon _0], \end{aligned}$$

(1)

where D is an open bounded subset of $\mathbb {R}^n,$ $\varepsilon _0>0,$ and the functions $F_i:\mathbb {R}\times D \rightarrow \mathbb {R}^n,$ $i\in \{1,\ldots ,N\}$, and $\tilde{F}:\mathbb {R}\times D \times [0,\varepsilon _0]\rightarrow \mathbb {R}^n$ are of class $C^r$, $r\ge 2$, and T-periodic in the variable t. Our goal is to extend the mentioned results of Krylov, Bogoliubov, Mitropolsky, and Hale concerning the existence of invariant tori to the differential equation (1). More specifically, we aim to provide sufficient conditions for the existence of invariant tori in the extended phase space of (1) which, due to the periodicity in the variable t, can be seen as a vector field defined on a cylinder:

$$\begin{aligned} \left\{ \begin{array}{@{}l@{}} \tau '=1,\\ \textbf{x}' = \sum _{i=1}^N \varepsilon ^i F_i(\tau , \textbf{x}) + \varepsilon ^{N+1} {\tilde{F}}(\tau , \textbf{x},\varepsilon ), \end{array} \right. \quad (\tau ,\textbf{x})\in \mathbb {S}^1\times D, \end{aligned}$$

(2)

where $\mathbb {S}^1=\mathbb {R}/(T\mathbb {Z})$. In addition, results concerning the regularity, convergence, and stability of such tori as well as information about the dynamics defined on them will also be presented.

1.1 Introduction to the averaging theory

Some notions from the averaging theory will appear in the statement of our main result, Theorem A. Thus, in order to state it, we must provide a brief introduction to the averaging method, with special attention to the concept of higher order averaged functions.

The averaging method or averaging theory stemmed from the works of Clairaut, Lagrange, and Laplace regarding perturbartions of differential equations (see [22, Appendix A]), even though its formalization was only established much later, by Fatou, Krylov, Bogoliubov, and Mitropolsky (see [1, 2, 8, 16]). It is particularly useful in the study of nonlinear oscillating systems which are affected by small perturbations, by providing asymptotic estimates for solutions of non-autonomous differential equations given in the standard form (1).

The estimates provided by the averaging method depend on the averaged functions, $\textbf{g}_i:D\rightarrow \mathbb {R}^n$ for $i\in \{1,\ldots ,N\},$ which appear as solutions of homological equations when transforming system (1) according to the following result.

Theorem 1

([22, Lemma 2.9.1]) There exists a T-periodic near-identity transformation of class $C^r$

$$\begin{aligned} \textbf{x}=U(t,\textbf{z},\varepsilon )=\textbf{z}+\sum _{i=1}^N \varepsilon ^i\, \textbf{u}_i(t,\textbf{z}), \end{aligned}$$

(3)

satisfying $U(0,\textbf{z},\varepsilon )=\textbf{z}$, such that the differential equation (1) is transformed into

$$\begin{aligned} \dot{\textbf{z}}=\sum _{i=1}^N\varepsilon ^i\textbf{g}_i(\textbf{z})+\varepsilon ^{N+1} r_N(t,\textbf{z},\varepsilon ). \end{aligned}$$

The condition $U(0,\textbf{z},\varepsilon )=\textbf{z}$, called stroboscopic condition, ensures that the functions $\textbf{g}_i$ are uniquely determined. In that case, $\textbf{g}_i$ is named the averaged function of order i. One can easily verify that $\textbf{g}_1$ is, indeed, the time-average of $F_1(t,\textbf{x})$, that is,

$$\begin{aligned} \textbf{g}_1(\textbf{z}) = \frac{1}{T} \int _0^T F_1(s,\textbf{z}) \, ds. \end{aligned}$$

(4)

In general terms, the averaging theory guarantees that, for time $\mathcal {O}(1/\varepsilon )$ and $\varepsilon $ small, any solution of (1) remains $\varepsilon ^N$-close to the solution of the truncated averaged equation

$$\begin{aligned} \dot{\textbf{z}}= \sum _{i=1}^N \varepsilon ^i \textbf{g}_i(\textbf{z}), \end{aligned}$$

with the same initial conditions.

In addition to the aforesaid quantitative estimates, the averaging theory has found great success when applied to investigate invariant manifolds; for instance, to guarantee the existence of invariant tori, as mentioned in the introduction of this work, [1, 4, 11, 12]. It has also been successfully applied to the study of simpler compact invariant manifolds, such as periodic solutions (see, for example, [3, 13, 17, 19, 21, 24]).

Recently, the paper [20] provided a general recursive formula for the higher order averaged functions in terms of Melnikov functions. Accordingly, define the Melnikov function of order i, $\textbf{f}_i,$ for $i\in \{0,\ldots ,N\},$ by

$$\begin{aligned} \textbf{f}_0(\textbf{z})=0\quad \text {and}\quad \textbf{f}_i(\textbf{z})=\dfrac{y_i(T,\textbf{z})}{i!}, \end{aligned}$$

(5)

where

$$\begin{aligned} \begin{aligned} y_1(t,\textbf{z})=&\int _0^tF_1(s,\textbf{z})\,ds\,\, \text { and }\\ y_i(t,\textbf{z})=&\int _0^t\bigg (i!F_i(s,\textbf{z})+\sum _{j=1}^{i-1}\sum _{m=1}^j\dfrac{i!}{j!}\partial _{\textbf{x}}^m F_{i-j} (s,\textbf{z})B_{j,m}\big (y_1,\ldots ,y_{j-m+1}\big )(s,\textbf{z})\bigg )ds, \end{aligned} \end{aligned}$$

(6)

for $i\in \{2,\ldots ,N\}.$ In the formulae above, for p and q positive integers, $B_{p,q}$ denotes the partial Bell polynomials (see, for instance, [6]). Roughly speaking, the Melnikov functions determine the N-jet in $\varepsilon $ of the time-T-map of (1), that is, $\varphi (T,\textbf{z},\varepsilon )=\textbf{z}+\sum _{i=1}^N \varepsilon ^i \textbf{f}_i(\textbf{z})+\mathcal {O}(\varepsilon ^{N+1})$, where $\varphi (t,\textbf{z},\varepsilon )$ corresponds to the solution of (1) with initial condition $\varphi (0,\textbf{z},\varepsilon )=\textbf{z}$ (see [17, 18]).

In particular, (4) and (6) ensure that $\textbf{f}_1(\textbf{z})=T\textbf{g}_1(\textbf{z}).$ The next result states that the same holds for higher order averaged functions provided that some conditions are satisfied.

Proposition 1

([20, Corollary A]) Let $\ell \in \{2,\ldots ,N\}$. If either $\textbf{f}_1=\cdots =\textbf{f}_{\ell -1}=0$ or $\textbf{g}_1=\cdots =\textbf{g}_{\ell -1}=0,$ then $\textbf{f}_i=T\,\textbf{g}_i$ for $i\in \{1,\ldots ,\ell \}.$

The relationship established in Proposition 1 allows us to directly calculate the first non-vanishing averaged function, thus motivating the main result of this work, i.e., an extension of the results of Krylov, Bogoliubov, Mitropolsky, and Hale to higher order averaged functions.

1.2 Statement of the main theorem

The existence of invariant tori in a differential system, as in the case of existence of equilibria and periodic orbits, reveals important information about the dynamical structure of the differential system. This knowledge is significantly increased if asymptotic properties of the trajectories nearby such invariant tori can be determined. Thus, before introducing our main result, we must set forth the following definition regarding asymptotic stability of invariant manifolds.

Definition 1

Let

$$\begin{aligned} \dot{\textbf{x}}= F(\textbf{x}) \end{aligned}$$

(7)

be an autonomous differential system in $\mathbb {R}^n$ and let $\textbf{x}(t,\textbf{x}_0)$ be the solution of (7) satisfying $\textbf{x}(0,\textbf{x}_0)=\textbf{x}_0$. Let also M be an m-dimensional invariant manifold of system (7) and V be a neighborhood of the manifold M.

(a)
The local stable set of M with respect to V is
$$\begin{aligned} \mathcal {S}^V_M:=\left\{ \textbf{x}_0 \in V: \textbf{x}(t,\textbf{x}_0) \in V \; \text {for all} \; t>0 \; \text {and} \; \lim _{t \rightarrow \infty } d(\textbf{x}(t,\textbf{x}_0),M)) = 0 \right\} . \end{aligned}$$
(b)
The local unstable set of M with respect to V is
$$\begin{aligned} \mathcal {U}^V_M:=\left\{ \textbf{x}_0 \in V: \textbf{x}(t,\textbf{x}_0) \in V \; \text {for all} \; t<0 \; \text {and} \; \lim _{t \rightarrow -\infty } d(\textbf{x}(t,\textbf{x}_0),M)) = 0 \right\} . \end{aligned}$$

Now, we are ready to provide our main result.

Theorem A

Consider the $C^r$, $r\ge 2$, differential equation (1) and its extension (2). Suppose that, for some $\ell \in \{1,\ldots ,\min (N,r-2)\},$ $\textbf{f}_0=\ldots =\textbf{f}_{\ell -1}=0,$ $\textbf{f}_{\ell }\ne 0$. Assume that the guiding system $\dot{\textbf{z}}= \textbf{g}_\ell (\textbf{z})$ has an $\omega $-periodic hyperbolic limit cycle $\varphi (t)$. Then, there exists $\overline{\varepsilon }>0$ such that, for each $\varepsilon \in (0,\overline{\varepsilon }]$, the following statements hold:

(a)
Existence: The differential system (2) has an invariant torus $M_\varepsilon $. In addition, there exists a neighborhood $V\subset D$ of $\Gamma :=\{\varphi (t):\,t\in \mathbb {R}\}$ such that any invariant compact manifold of (2) contained in $\mathbb {S}^1\times V$ must be contained in $M_{\varepsilon }$. In particular, $M_{\varepsilon }$ is the unique invariant torus in $\mathbb {S}^1\times V$.
(b)
Regularity: The invariant torus $M_\varepsilon $ is of class $C^{r-\ell }$. Furthermore, there is a $C^0$-continuous family of $C^{r-\ell }$ functions $\mathcal {F}_{\varepsilon }:\mathbb {R}^2\rightarrow \mathbb {R}^{n}$, $T-$periodic in the first coordinate and $\omega $-periodic in the second coordinate, such that $M_\varepsilon =\{(\tau ,\mathcal {F}_{\varepsilon }(\tau ,\theta )):\,(\tau ,\theta )\in \mathbb {S}^1\times \mathbb {R}\}.$
(c)
Convergence: There is a continuous function $\delta : [0,\overline{\varepsilon }] \rightarrow \mathbb {R}_+$ satisfying $ \delta (0) = 0$ such that $\Vert \mathcal {F}_\varepsilon (\tau ,\theta ) - U(\tau ,\varphi (\theta ),\varepsilon )\Vert <\delta (\varepsilon )$ for every $(\tau ,\theta )\in \mathbb {R}^2$, where U is the transformation given by Theorem 1. In particular, $M_{\varepsilon }$ converges to $\mathbb {S}^1\times \Gamma $ in the Hausdorff distance as $\varepsilon \rightarrow 0$.
(d)
Stability: Let $k \le n-1$ be the number of characteristic multipliers of $\Gamma $ with modulus less than 1. Then, there are neighborhoods $W_s$, $V_s$, $W_u$, and $V_u$ of $M_\varepsilon $ such that
1. (d.1)
  $\mathcal {S}^{V_s}_{M_\varepsilon } \cap W_s$ is a $(k+2)$-dimensional manifold embedded in $\mathbb {R}^{n+1}$;
2. (d.2)
  $\mathcal {U}^{V_u}_{M_\varepsilon } \cap W_u$ is a $(n-k+1)$-dimensional manifold embedded in $\mathbb {R}^{n+1}$.
(e)
Dynamics: The flow of (2) restricted to $M_{\varepsilon }$ defines a first return map $p_\varepsilon :S_{\varepsilon }\rightarrow S_{\varepsilon }$ where, for $\Sigma =\{(0,\textbf{x}):\, \textbf{x}\in D\}$ a transversal section of (2), $S_{\varepsilon }:=\Sigma \cap M_{\varepsilon }$ is $C^{r-\ell }$ diffeomorphic to the circle $\mathbb {S}^1$. Moreover, $p_\varepsilon $ is of class $C^{r-\ell }$; its rotation number $\rho (\varepsilon )$ is a continuous function on $\varepsilon \in [0,\overline{\varepsilon }]$ satisfying $\rho (\varepsilon )=\varepsilon ^{\ell } T/\omega +\mathcal {O}(\varepsilon ^{\ell +1})$; and, finally, if $r-\ell \ge 4$, then $\rho $ maps zero Lebesgue measure sets to zero Lebesgue measure sets, and there exists a positive Lebesgue measure set $E\subset [0,\overline{\varepsilon }]$ such that, for every $\varepsilon \in E$, $\rho (\varepsilon )$ is irrational and $p_{\varepsilon }$ is $C^{r-\ell -3}$ conjugated to an irrational rotation.

Theorem A is proved in Sect. 3 after establishing some preliminary results in Sect. 2.

Remark 1

Since $p_\varepsilon $ is at least of class $C^2$ (because $r-\ell \ge 2$), if $\rho (\varepsilon )$ is irrational, then $p_\varepsilon $ is topologically conjugate to an irrational rotation (see, for instance, [13, Theorems 2.4 and 2.5]). In this case, the dynamics of (2) on the invariant torus $M_{\varepsilon }$ corresponds to an irrational flow and, therefore, the torus $M_{\varepsilon }$ is a minimal invariant compact manifold, in the sense that there is no other compact invariant manifold of (2) contained in $M_{\varepsilon }$ besides itself. Therefore, since $\rho (\varepsilon )=\varepsilon ^{\ell } T/\omega +\mathcal {O}(\varepsilon ^{\ell +1})$ is continuous, we can always find $\varepsilon ^*\in (0,\overline{\varepsilon }]$ such that $M_{\varepsilon ^*}$ is minimal.

Remark 2

The conclusion provided by statement e) that “if $r-\ell \ge 4$, then $\rho $ maps zero Lebesgue measure sets to zero Lebesgue measure sets” is known as Luzin-N-property of the function $\rho $.

1.3 Application: invariant tori in 4D vector fields

Theorem A provides a means for investigating the existence of invariant tori also in higher dimensional vector fields. For instance, for a positive integer $N\ge 2$, consider the following 4D autonomous differential system

$$\begin{aligned} \begin{aligned}&\dot{x} = -y +\varepsilon ^N f_1(x,y,u,v) + \varepsilon ^{N+1} g_1(x,y,u,v)+\varepsilon ^{N+2} h_1(x,y,u,v,\varepsilon ), \\&\dot{y} = x + \varepsilon ^N f_2(x,y,u,v) + \varepsilon ^{N+1} g_2(x,y,u,v) +\varepsilon ^{N+2} h_2(x,y,u,v,\varepsilon ), \\&\dot{u} = \varepsilon ^N f_3(x,y,u,v) + \varepsilon ^{N+1} g_3(x,y,u,v) +\varepsilon ^{N+2} h_3(x,y,u,v,\varepsilon ), \\&\dot{v} = \varepsilon ^N f_4(x,y,u,v) + \varepsilon ^{N+1} g_4(x,y,u,v)+\varepsilon ^{N+2} h_4(x,y,u,v,\varepsilon ), \end{aligned} \end{aligned}$$

(8)

where $\varepsilon $ is a small positive parameter; $\mu \in \{-1,1\}$; $f_i$, for $i \in \{1,2,3,4\}$, are functions of class $C^r$, $r\ge 4$, satisfying that

$$\begin{aligned} \begin{aligned}&\theta \mapsto \cos (\theta ) f_1(r \cos (\theta ),r \sin (\theta ),u,v)+\sin (\theta ) f_2(r \cos (\theta ),r \sin (\theta ),u,v),\\&\theta \mapsto f_3(r \cos (\theta ),r \sin (\theta ),u,v), \,\, \text {and}\\&\theta \mapsto f_4(r \cos (\theta ),r \sin (\theta ),u,v) \\ \end{aligned} \end{aligned}$$

(9)

have vanishing average over $\theta \in [0,2\pi ]$; $g_i$, for $i \in \{1,2,3,4\}$, are given by

$$\begin{aligned} \begin{aligned}&g_1(x,y,u,v) = \mu x(x^2+y^2), \\&g_2(x,y,u,v) = -\mu y (x^2+y^2)^2,\\&g_3(x,y,u,v) = x^2 (u (-u^2-v^2+1)+v),\\&g_4(x,y,u,v) = y^2(v (-u^2-v^2+1)-u); \end{aligned} \end{aligned}$$

and $h_i$, for $i \in \{1,2,3,4\}$, are $C^r$, $r\ge 4$, functions.

Proposition 2

Assume the conditions above for the differential system (8). Then, for any integer $N\ge 2$ and $\varepsilon >0$ sufficiently small, the differential system (8) has an invariant torus $\mathbb {T}_{\varepsilon }$ converging, as $\varepsilon $ goes to 0, to $\mathbb {T}=\mathbb {S}^1\times \mathbb {S}^1$. Moreover, the invariant torus is asymptotically stable provided that $\mu =1$ and has an unstable direction provided that $\mu =-1$.

Proposition 2 is proven in Sect. 4.

Example 1

Assuming that

$$\begin{aligned}{} & {} f_1(x,y,u,v)=y u,\,\, f_2(x,y,u,v)=-x v,\\ {}{} & {} f_3(x,y,u,v)=x^3,\,\,\text {and}\,\, f_4(x,y,u,v)=y^3,\\ \end{aligned}$$

one can easily see that the functions given in (9) have vanishing average. Thus, Proposition 2 can be applied to provide the existence of an invariant torus for $\varepsilon >0$ sufficiently small. In Fig. 1, assuming $N=2, $ $\mu =1$, $h_i=0,$ $i=1,\ldots ,4$, and $\varepsilon =1/15$, we provide a numeric simulation (performed on Mathematica) of the Poincaré map defined on the section $\Sigma =\{(x,0,u,v):\,x>0\}$ of the differential system (8). The asymptotically stable invariant tori $\mathbb {T}_{\varepsilon }$ corresponds to an asymptotically stable invariant closed curve $\gamma _{\varepsilon }:=\mathbb {T}_{\varepsilon }\cap \Sigma $ for the Poincaré map.

2 Fundamental Lemma

The proof of Theorem A makes use of some results concerning integral manifolds of a class of perturbed differential systems. Such results, and the methods employed for obtaining them, are similar to those established by Hale in [11, Lemmas 2.1, 2.2 and 2.3] (see also [2, Sect. 28, Lemmas 1, 2, and 3] and [12]). In this section, we state and prove those results in the form of a single Lemma, along with Propositions addressing the issue of regularity of the integral manifolds obtained.

Throughout the paper, we will adopt the notation $\text {diag}(A_1,\ldots ,A_n)$ to represent the direct sum $A_1 \oplus \dots \oplus A_n$ of the square matrices $A_i,$ $i\in \{1,\ldots ,n\}$. We will also employ the notation $B_n(p,r)$ for the n-dimensional open ball $\{x \in \mathbb {R}^n: \Vert x-p\Vert <r \}$.

We consider a one-parameter family of differential systems of the form

$$\begin{aligned} \begin{aligned}&\theta ' = 1 + \zeta _0(t,\theta ,y,z,\varepsilon ), \\&y' = H_1\cdot y + \zeta _1(t,\theta ,y,z,\varepsilon ), \\&z' = H_2 \cdot z + \zeta _2(t,\theta ,y,z,\varepsilon ), \end{aligned} \end{aligned}$$

(10)

where $\theta \in \mathbb {R}$, $y \in \mathbb {R}^m$, $z \in \mathbb {R}^n$, $\varepsilon $ is a real parameter, $H_1$ is a real $m \times m$ matrix, $H_2$ is a real $n \times n$ matrix, and the continuous functions $\zeta _0:\mathbb {R} \times \mathbb {R} \times B_m(0,\rho _1) \times B_n(0,\rho _2) \times (0,\varepsilon _0] \rightarrow \mathbb {R}$, $\zeta _1:\mathbb {R} \times \mathbb {R} \times B_m(0,\rho _1) \times B_n(0,\rho _2) \times (0,\varepsilon _0] \rightarrow \mathbb {R}^m$, and $\zeta _2:\mathbb {R} \times \mathbb {R} \times B_m(0,\rho _1) \times B_n(0,\rho _2) \times (0,\varepsilon _0] \rightarrow \mathbb {R}^n$ have Lipschitz continuous partial derivatives with respect to $(\theta ,y,z)$ up to the p-th order, where $p\ge 1$, $\rho _1,\rho _2>0$, and $\varepsilon _0>0$. For conciseness, we define, for each $(\sigma ,\mu )\in (0,\rho _1) \times (0,\rho _2)$, the set

$$\begin{aligned} \Sigma _{\sigma ,\mu }^{\varepsilon _0}:= \mathbb {R}\times \mathbb {R} \times {\bar{B}}_m(0,\sigma ) \times {\bar{B}}_n(0,\mu ) \times (0,\varepsilon _0]. \end{aligned}$$

We suppose that the following hypotheses are satisfied by (10):

(i)
There is $\omega >0$ such that
$$\begin{aligned} \begin{aligned}&\zeta _0(t,\theta +\omega ,y,z,\varepsilon ) = \zeta _0(t,\theta ,y,-z,\varepsilon ), \\&\zeta _1(t,\theta +\omega ,y,z,\varepsilon ) = \zeta _1(t,\theta ,y,-z,\varepsilon ), \\&\zeta _2(t,\theta +\omega ,y,z,\varepsilon ) = -\zeta _2(t,\theta ,y,-z,\varepsilon ). \end{aligned} \end{aligned}$$
(ii)
There is a continuous function $M:[0,\varepsilon _0]\rightarrow \mathbb {R}_+$ such that $M(0)=0$ and the functions $\zeta _i$ satisfy $|\zeta _0(t,\theta ,0,0,\varepsilon )|\le M(\varepsilon )$, $\Vert \zeta _1(t,\theta ,0,0,\varepsilon )\Vert \le M(\varepsilon )$, and $\Vert \zeta _2(t,\theta ,0,0,\varepsilon )\Vert \le M(\varepsilon )$ for all $(t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R} \times (0,\varepsilon _0]$.
(iii)
There is a continuous function $L:(0,\varepsilon _0] \times [0,\rho _1) \times [0,\rho _2) \rightarrow \mathbb {R}_+$ such that
$$\begin{aligned} \lim _{(\varepsilon ,\sigma ,\mu ) \rightarrow (0,0,0)} L(\varepsilon ,\sigma ,\mu )=0, \end{aligned}$$
and, for $(t,\theta _1,y_1,z_1,\varepsilon ),(t,\theta _2,y_2,z_2,\varepsilon ) \in \Sigma _{\sigma ,\mu }^{\varepsilon _0}$, the following inequalities hold true:
$$\begin{aligned} \begin{aligned}&|\zeta _0(t,\theta _1,y_1,z_1,\varepsilon )- \zeta _0(t,\theta _2,y_2,z_2,\varepsilon )|\le L(\varepsilon ,\sigma ,\mu )\Vert (\theta _1,y_1,z_1)-(\theta _2,y_2,z_2)\Vert , \\&\Vert \zeta _1(t,\theta _1,y_1,z_1,\varepsilon )- \zeta _1(t,\theta _2,y_2,z_2,\varepsilon )\Vert \le L(\varepsilon ,\sigma ,\mu )\Vert (\theta _1,y_1,z_1)-(\theta _2,y_2,z_2)\Vert , \\&\Vert \zeta _2(t,\theta _1,y_1,z_1,\varepsilon )- \zeta _2(t,\theta _2,y_2,z_2,\varepsilon )\Vert \le L(\varepsilon ,\sigma ,\mu )\Vert (\theta _1,y_1,z_1)-(\theta _2,y_2,z_2)\Vert . \end{aligned} \end{aligned}$$
(iv)
The eigenvalues of $H_1$ and $H_2$ have non-zero real parts.

Let $\big (\theta (t,t_0,\theta _0,y_0,z_0,\varepsilon ),y(t,t_0,\theta _0,y_0,z_0,\varepsilon ),z(t,t_0,\theta _0,y_0,z_0,\varepsilon )\big )$ denote the solution of (10) with initial conditions $(t_0,\theta _0,y_0,z_0,\varepsilon )$. Having set forth the hypotheses above, we are now ready to state the Lemma.

Lemma 1

Consider system (10) with the hypotheses presented in this section. There are $\varepsilon _1 \in (0,\varepsilon _0)$ and families of continuous functions $f_\varepsilon :\mathbb {R}\times \mathbb {R} \rightarrow \mathbb {R}^m$ and $g_\varepsilon :\mathbb {R}\times \mathbb {R}\rightarrow \mathbb {R}^n$ such that, for each $\varepsilon \in (0,\varepsilon _1]$, the autonomous system

$$\begin{aligned} \begin{aligned}&t'=1, \\&\theta ' = 1 + \zeta _0(t,\theta ,y,z,\varepsilon ), \\&y' = H_1\cdot y + \zeta _1(t,\theta ,y,z,\varepsilon ), \\&z' = H_2 \cdot z + \zeta _2(t,\theta ,y,z,\varepsilon ), \end{aligned} \end{aligned}$$

(11)

has an invariant manifold given by $y = f_\varepsilon (t,\theta )$, $z=g_\varepsilon (t,\theta )$. The families $f_\varepsilon $ and $g_\varepsilon $ also satisfy:

(a)
There is a continuous function $D:[0,\varepsilon _1] \rightarrow \mathbb {R}_+$ such that $D(0)=0$ and
$$\begin{aligned} \begin{aligned}&\Vert f_\varepsilon (t,\theta )\Vert \le D(\varepsilon ),&\Vert g_\varepsilon (t,\theta )\Vert \le D(\varepsilon ) \end{aligned} \end{aligned}$$
for all $(t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R} \times (0,\varepsilon _1]$. Furthermore, there is a constant $C_H>0$, depending only on the matrices $H_1$ and $H_2$, such that $D(\varepsilon ) = C_H M(\varepsilon )$, where $M(\varepsilon )$ is the function appearing in hypothesis ii).
(b)
There is a continuous function $\Delta :[0,\varepsilon _1] \rightarrow \mathbb {R}_+$ such that $\Delta (0)=0$ and
$$\begin{aligned} \begin{aligned}&\Vert f_\varepsilon (t,\theta _1) - f_\varepsilon (t,\theta _2)\Vert \le \Delta (\varepsilon ) |\theta _1-\theta _2|,\\&\Vert g_\varepsilon (t,\theta _1) - g_\varepsilon (t,\theta _2)\Vert \le \Delta (\varepsilon ) |\theta _1-\theta _2| \end{aligned} \end{aligned}$$
for all $t \in \mathbb {R}$, all $\theta _1,\theta _2 \in \mathbb {R}$ and all $\varepsilon \in (0,\varepsilon _1]$.
(c)
$f_\varepsilon $ is $\omega $-periodic in $\theta $ for all $\varepsilon \in (0,\varepsilon _1]$ and $g_\varepsilon $ satisfies $g_\varepsilon (t,\theta +\omega ) = -g_\varepsilon (t,\theta )$ for all $(t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times (0,\varepsilon _1]$;
(d)
If, for a given $\varepsilon \in (0,\varepsilon _1]$, the functions $ \zeta _0(t,\theta ,y,z,\varepsilon )$, $\zeta _1(t,\theta ,y,z,\varepsilon )$, and $\zeta _2(t,\theta ,y,z,\varepsilon )$ are $T_\varepsilon $-periodic in the variable t, then so are $f_\varepsilon $ and $g_\varepsilon $;
(e)
$f_\varepsilon $ and $g_\varepsilon $ have bounded and uniformly continuous derivatives with respect to $\theta $ up to the p-th order for all $\varepsilon \in (0,\varepsilon _1]$;
(f)
Let $\pi _1: \mathbb {R}^m \times \mathbb {R}^n \rightarrow \mathbb {R}^m$ and $\pi _2: \mathbb {R}^m \times \mathbb {R}^n \rightarrow \mathbb {R}^n$ be the canonical projections. If $m_s\le m$ of the eigenvalues of $H_1$ and $n_s\le n$ of the eigenvalues of $H_2$ have negative real parts, there are positive constants r, $\lambda $, C, $\sigma _0$, and $\sigma _1$ such that $r\le \sigma _0$, $D(\varepsilon )<\sigma _0<\sigma _1$, and, for each $(t_0,\theta _0,\varepsilon ) \in \mathbb {R} \times \mathbb {R} \times (0,\varepsilon _1]$, there is in ${\bar{B}}_m(0,\sigma _0) \times {\bar{B}}_n(0,\sigma _0)$ a local $(m_s+n_s)$-dimensional embedded submanifold $S(t_0,\theta _0,\varepsilon )$ of $\mathbb {R}^m \times \mathbb {R}^n$, containing the point $(f_\varepsilon (t_0,\theta _0),g_\varepsilon (t_0,\theta _0))$, and having the following properties:
1. (f.1)
  If $(y_0,z_0) \in {\bar{B}}_m(0,\sigma _0) \times {\bar{B}}_n(0,\sigma _0) {\setminus } S(t_0,\theta _0,\varepsilon )$, there is $t_*>t_0$ for which
  $$\begin{aligned} (y(t_*,t_0,\theta _0,y_0,z_0,\varepsilon ),z(t_*,t_0,\theta _0,y_0,z_0,\varepsilon )) \notin {\bar{B}}_m(0,\sigma _1) \times {\bar{B}}_n(0,\sigma _1). \end{aligned}$$
2. (f.2)
  Reciprocally, if $(y_0,z_0) \in S(t_0,\theta _0,\varepsilon )$, then, for all $t\ge t_0$,
  $$\begin{aligned} (y(t,t_0,\theta _0,y_0,z_0,\varepsilon ),z(t,t_0,\theta _0,y_0,z_0,\varepsilon )) \in {\bar{B}}_m(0,\sigma _1) \times {\bar{B}}_n(0,\sigma _1) \end{aligned}$$
  and the following inequalities hold:
  $$\begin{aligned} \begin{aligned}&\Vert y(t,t_0,\theta _0,y_0,z_0,\varepsilon ) - f_\varepsilon (t,\theta (t,t_0,\theta _0,f_\varepsilon (t_0,\theta _0),g_\varepsilon (t_0,\theta _0),\varepsilon ))\Vert \\ {}&\quad \le C e^{-\lambda (t-t_0)} \Vert y_0-f_\varepsilon (t_0,\theta _0)\Vert , \\&\Vert z(t,t_0,\theta _0,y_0,z_0,\varepsilon ) - g_\varepsilon (t,\theta (t,t_0,\theta _0,f_\varepsilon (t_0,\theta _0),g_\varepsilon (t_0,\theta _0),\varepsilon ))\Vert \\&\quad \le C e^{-\lambda (t-t_0)} \Vert z_0-g_\varepsilon (t_0,\theta _0)\Vert . \end{aligned} \end{aligned}$$
3. (f.3)
  There is a continuous function $\phi ^\varepsilon _1: \mathbb {R} \times \mathbb {R} \times {\bar{B}}_{m_s}(0,r) \times {\bar{B}}_{n_s}(0,r) \rightarrow {\bar{B}}_{m-m_s}(0,\sigma _0)$ such that
  $$\begin{aligned}\pi _1(S(t_0,\theta _0,\varepsilon )) = \{(\phi ^\varepsilon _1(t_0,\theta _0, \xi _1,\xi _2),\xi _1):(\xi _1,\xi _2) \in {\bar{B}}_{m_s}(0,r) \times {\bar{B}}_{n_s}(0,r)\}. \end{aligned}$$
  Similarly, there is a continuous function $\phi ^\varepsilon _2: \mathbb {R} \times \mathbb {R} \times {\bar{B}}_{m_s}(0,r) \times {\bar{B}}_{n_s}(0,r) \rightarrow {\bar{B}}_{n-n_s}(0,\sigma _0)$ such that
  $$\begin{aligned} \pi _2(S(t_0,\theta _0,\varepsilon )) = \{(\phi ^\varepsilon _2(t_0,\theta _0,\xi _1, \xi _2),\xi _2):(\xi _1,\xi _2) \in {\bar{B}}_{m_s}(0,r) \times {\bar{B}}_{n_s}(0,r)\}. \end{aligned}$$
4. (f.4)
  The functions $\phi _1^\varepsilon $ and $\phi _2^\varepsilon $ satisfy
  $$\begin{aligned} \phi _1^\varepsilon (t_0,\theta _0+\omega ,\xi _1,\xi _2) = \phi _1^\varepsilon (t_0,\theta _0,\xi _1,-\xi _2) \end{aligned}$$
  and
  $$\begin{aligned} \phi _2^\varepsilon (t_0,\theta _0+\omega ,\xi _1,\xi _2) = -\phi _2^\varepsilon (t_0,\theta _0,\xi _1,-\xi _2) \end{aligned}$$
  for all $(t_0,\theta _0,\xi _1,\xi _2) \in \mathbb {R} \times \mathbb {R} \times {\bar{B}}_{m_s}(0,r) \times {\bar{B}}_{n_s}(0,r)$.

Proof

The argument is very similar to the one found in [11, Lemmas 2.1, 2.2, and 2.3] (see also [2, Sect. 28, Lemmas 1, 2, and 3]). We will omit computations when analogous ones can be found in those references, simply referring the reader to them.

Without loss of generality, suppose that $H_1=\text {diag}(H_1^+,H_1^-)$ and $H_2=\text {diag}(H_2^+,H_2^-)$, with the eigenvalues of $H_i^+$ and $H_i^-$, $i \in \{1,2\}$, having respectively positive and negative real parts. For each $i \in \{1,2\}$, define

Consider the complete metric space $\mathcal {P}_\omega (D,\Delta )$ of continuous functions $F \in C(\mathbb {R}^2;\mathbb {R}^m)$ satisfying:

$F(t,\theta +\omega ) = F(t,\theta )$ for all $(t,\theta ) \in \mathbb {R}\times \mathbb {R}$;
$\Vert F(t,\theta )\Vert \le D$ for all $(t,\theta ) \in \mathbb {R}\times \mathbb {R}$;
$\Vert F(t,\theta _1) - F(t,\theta _2)\Vert \le \Delta |\theta _2-\theta _1|$ for all $(t,\theta _1),(t,\theta _2) \in \mathbb {R}\times \mathbb {R}$,

where the metric is given by the uniform norm. Similarly, consider the complete metric space $\mathcal {A}_\omega (D,\Delta )$ of continuous functions $G \in C(\mathbb {R}^2;\mathbb {R}^n)$ satisfying:

$G(t,\theta +\omega ) = -G(t,\theta )$ for all $(t,\theta ) \in \mathbb {R}\times \mathbb {R}$;
$\Vert G(t,\theta )\Vert \le D$ for all $(t,\theta ) \in \mathbb {R}\times \mathbb {R}$;
$\Vert G(t,\theta _1) - G(t,\theta _2)\Vert \le \Delta |\theta _2-\theta _1|$ for all $(t,\theta _1),(t,\theta _2) \in \mathbb {R}\times \mathbb {R}$.

For each $(F,G) \in \mathcal {P}_\omega (D,\Delta ) \times \mathcal {A}_\omega (D,\Delta )$, let $T_{F,G}(t,t_0,\theta _0,\varepsilon )$ denote the solution of

$$\begin{aligned} \theta '= 1+ \zeta _0(t,\theta ,F(t,\theta ),G(t,\theta ),\varepsilon ) \end{aligned}$$

satisfying $T_{F,G}(t_0,t_0,\theta _0,\varepsilon ) = \theta _0$. Since $T_{F,G}(t,t_0,\theta _0+\omega ,\varepsilon )$ and $T_{F,G}(t,t_0,\theta _0,\varepsilon )+\omega $ are both solutions of the same initial problem, it follows that

$$\begin{aligned} T_{F,G}(t,t_0,\theta _0+\omega ,\varepsilon )=T_{F,G}(t,t_0,\theta _0,\varepsilon )+\omega . \end{aligned}$$

(12)

For each $\varepsilon \in (0,\varepsilon _0]$, define the function $S^\varepsilon (F,G) = (S_1^\varepsilon (F,G),S_2^\varepsilon (F,G)) \in C(\mathbb {R}^2;\mathbb {R}^m) \times C(\mathbb {R}^2;\mathbb {R}^n)$, acting on the metric space $\mathcal {P}_\omega (D,\Delta ) \times \mathcal {A}_\omega (D,\Delta )$ and given by

$$\begin{aligned} S_1^\varepsilon (F,G)(t,\theta )&= \int _{-\infty }^\infty J_1(x) \zeta _1 (t+x,T_{F,G}(t+x,t,\theta ,\varepsilon ),F(t\nonumber \\&\quad +x,T_{F,G}(t+x,t,\theta ,\varepsilon )),G(t+x,T_{F,G}(t+x,t,\theta ,\varepsilon )),\varepsilon ) dx, \\ S_2^\varepsilon (F,G)(t,\theta )&= \int _{-\infty }^\infty J_2(x) \zeta _2 (t+x,T_{F,G}(t+x,t,\theta ,\varepsilon ),F(t\nonumber \\&\quad +x,T_{F,G}(t+x,t,\theta ,\varepsilon )),G(t+x,T_{F,G}(t+x,t,\theta ,\varepsilon )),\varepsilon )dx. \end{aligned}$$

By performing the change of variable of integration $\tau =x+t$ and differentiating the compositions $F(t,T_{F,G}(t,t_0,\theta _0,\varepsilon ))$ and $G(t,T_{F,G}(t,t_0,\theta _0,\varepsilon ))$ with respect to t, it is easy to see that, if $(f_\varepsilon ,g_\varepsilon )$ is a fixed-point of $S^\varepsilon $, then the manifold given by $y=f_\varepsilon (t,\theta )$, $z=g_\varepsilon (t,\theta )$ is invariant under (11). Thus, the problem is reduced to proving that $S^\varepsilon $ admits a fixed-point.

Following the arguments found in [11, Lemma 2.1], we conclude that it is possible to find $\varepsilon _1 \in (0,\varepsilon _0]$ and define $D(\varepsilon )$ and $\Delta (\varepsilon )$ such that, for $\varepsilon \in (0,\varepsilon _1]$, the function $S^\varepsilon $ is a contraction of $\mathcal {P}_\omega (D(\varepsilon ),\Delta (\varepsilon )) \times \mathcal {A}_\omega (D(\varepsilon ),\Delta (\varepsilon ))$ into itself. The only change compared to the arguments found in the reference is that, in order to guarantee that $S_2^\varepsilon (F,G) \in \mathcal {A}(D(\varepsilon ),\Delta (\varepsilon ))$, we need to show that $S_2^\varepsilon (F,G)$ satisfies $S_2^\varepsilon (F,G)(t,\theta +\omega ) = -S_2^\varepsilon (F,G)(t,\theta )$. However, this is easily seen by considering (12) and the properties of functions $\zeta _1$ and $\zeta _2$. Furthermore, since in [11] it is proved that $\Delta (\varepsilon ) \rightarrow 0$ as $\varepsilon \rightarrow 0$ and that there is $C_H>0$ such that $D(\varepsilon ) = C_H M(\varepsilon )$, properties (a) and (b) are ensured to hold.

It remains to show that properties (c) to (f) are valid. Observe that property (c) follows directly from the fact that $(f_\varepsilon ,g_\varepsilon ) \in \mathcal {P}_\omega (D(\varepsilon ),\Delta (\varepsilon )) \times \mathcal {A}_\omega (D(\varepsilon ),\Delta (\varepsilon ))$. Property (d) follows from the same argument found in [11, Lemma 2.2]. Property (e) is not directly discussed in [11], but it is stated and proved in [2, Sect. 28, Lemma 1]. The proof in our case is essentially the same.

Finally, we proceed to discussing property f). For $t_0,\sigma _0,\nu \in \mathbb {R}$ and $k \in \mathbb {N}$, let $\mathcal {C}^{t_0}_{k}(\sigma _0,\nu )$ be the complete metric space of continuous functions $W: [t_0,\infty ) \times \mathbb {R}\rightarrow \mathbb {R}^k$ satisfying:

$\Vert W(t,\theta )\Vert \le \sigma _0$ for all $(t,\theta ) \in [t_0,\infty ) \times \mathbb {R}$;
$\Vert W(t,\theta _1) - W(t,\theta _2)\Vert \le \nu |\theta _1 -\theta _2|$ for all $(t,\theta _1),(t,\theta _2) \in [t_0,\infty ) \times \mathbb {R}$.

The metric of $\mathcal {C}^{t_0}_{k}(\sigma _0,\nu )$ is given by the uniform norm.

Let $t_0,\sigma _0,\nu \in \mathbb {R}$ be given. For each $b_1 \in \mathbb {R}^m$ and each $b_2 \in \mathbb {R}^n$, define the following functions acting on $\mathcal {C}^{t_0}_m(\sigma _0,\nu )$ and $\mathcal {C}^{t_0}_n( \sigma _0,\nu )$, respectively:

$$\begin{aligned}&S_1^{\varepsilon ,b_1}(W_1,W_2)(t,\theta ) = J_1(t_0-t)b_1\\ {}&\quad + \int _{t_0}^\infty J_1(\tau -t) \zeta _1 (\tau ,T_{W_1,W_2}(\tau ,t,\theta ,\varepsilon ),W_1(\tau ,T_{W_1,W_2}(\tau ,t,\theta ,\varepsilon )),\\&\quad W_2(\tau ,T_{W_1,W_2}(\tau ,t,\theta ,\varepsilon )),\varepsilon ) d\tau , \\&S_2^{\varepsilon ,b_2}(W_1,W_2)(t,\theta ) = J_2(t_0-t)b_2\\&\quad +\int _{t_0}^\infty J_2(\tau -t) \zeta _2 (\tau ,T_{W_1,W_2}(\tau ,t,\theta ,\varepsilon ),W_1(\tau ,T_{W_1,W_2}(\tau ,t,\theta ,\varepsilon )),\\&\quad W_2(\tau ,T_{W_1,W_2}(\tau ,t,\theta ,\varepsilon )),\varepsilon ) d\tau . \end{aligned}$$

Let $S^{\varepsilon ,b_1,b_2}$ act on $\mathcal {C}^{t_0}_m(\sigma _0,\nu ) \times \mathcal {C}^{t_0}_n( \sigma _0,\nu )$ by $S^{\varepsilon ,b_1,b_2}(W_1,W_2) = (S^{\varepsilon ,b_1}_1(W_1,W_2),S^{\varepsilon ,b_2}_2(W_1,W_2))$. Then, following the same procedure as before, we can ensure that, by taking $\varepsilon _1$, $\sigma _0$, $\nu $, and $r<\sigma _0$ sufficiently small, $S^{\varepsilon ,b_1,b_2}$ becomes a contraction of $\mathcal {C}^{t_0}_m(\sigma _0,\nu ) \times \mathcal {C}^{t_0}_n( \sigma _0,\nu )$ into itself if $\Vert b_1\Vert ,\Vert b_2\Vert \le r$.

Define $\Psi _1^{\varepsilon ,t_0}$ and $\Psi _2^{\varepsilon ,t_0}$ to be such that $(t,\theta ) \mapsto (\Psi _1^{\varepsilon ,t_0}(t,\theta ,b_1,b_2),\Psi _2^{\varepsilon ,t_0}(t,\theta ,b_1,b_2))$ is the fixed point of the operator $S^{\varepsilon ,b_1,b_2}$. Then, it is easy to see that there is $C_0>0$ such that

$$\begin{aligned}{} & {} \Vert \Psi _i^{\varepsilon ,t_0} (t,\theta ,b_1,b_2) - \Psi _i^{\varepsilon ,t_0} (t,\tilde{\theta },{\tilde{b}}_1,{\tilde{b}}_2)\Vert \nonumber \\{} & {} \quad \le C_0 e^{\frac{-\alpha }{2}(t-t_0)} \left( \Vert b_1 - {\tilde{b}}_1\Vert + \Vert b_2 - {\tilde{b}}_2\Vert \right) + \nu |\theta - \tilde{\theta }| \end{aligned}$$

(13)

for $i \in \{1,2\}$, $t\in (t_0,+\infty )$, $\theta ,\tilde{\theta } \in \mathbb {R}$, $b_1,{\tilde{b}}_1 \in {\bar{B}}_m(0,r)$, and $b_2,{\tilde{b}}_2 \in {\bar{B}}_n(0,r)$. This ensures, in particular, that, for $i \in \{1,2\}$, $\Psi _i^{\varepsilon ,t_0}$ is continuous if seen as a function on $[t_0,+\infty ) \times \mathbb {R}\times \bar{B}_m(0,r) \times \bar{B}_n(0,r)$.

Following the argument in [2, Sect. 28, Lemma 3], we can prove that, if $\sigma _1\ge \sigma _0$, every solution of (11) satisfying

$y_0 \in {\bar{B}}_m(0,\sigma _0)$ and $y(t,t_0,\theta _0,y_0,z_0,\varepsilon ) \in {\bar{B}}_m(0,\sigma _1)$;
$z_0 \in {\bar{B}}_n(0,\sigma _0)$ and $z(t,t_0,\theta _0,y_0,z_0,\varepsilon ) \in {\bar{B}}_n(0,\sigma _1)$

must be of the form $(t,\theta (t),\Psi _1^\varepsilon (t,\theta (t),b_1,b_2),\Psi _2^\varepsilon (t,\theta (t),b_1,b_2))$ for some $(b_1,b_2) \in B_m(0,r) \times B_n(0,r)$, where $\theta (t)$ denotes $T_{\Psi _1^\varepsilon ,\Psi _2^\varepsilon }(t,t_0,\theta _0,\varepsilon )$. Conversely, every solution of the form given above clearly satisfies the two conditions set forth. Therefore, define

$$\begin{aligned} S(t_0,\theta _0,\varepsilon ): = \left\{ \lim _{t \rightarrow t_0^+} \left( \Psi _1^{\varepsilon ,t_0}(t,\theta _0,b_1,b_2),\Psi _2^{\varepsilon ,t_0}(t,\theta _0,b_1,b_2)\right) : (b_1,b_2) \in {\bar{B}}_m(0,r) \times {\bar{B}}_n(0,r)\right\} . \end{aligned}$$

Then, considering also (13), properties f.1) and f.2) follow immediately.

Observe that, from the definition of the operators $S_1^{\varepsilon ,b_1}$ and $S_2^{\varepsilon ,b_2}$, it follows that the last $m_s$ and $n_s$ elements of the vectors $\Psi _1^{\varepsilon ,t_0}(t_0^+,\theta ,b_1,b_2)$ and $\Psi _2^{\varepsilon ,t_0}(t_0^+,\theta ,b_1,b_2)$ coincide with, respectively, the last $m_s$ and $n_s$ elements of the vectors $b_1$ and $b_2$. Thus, for each $i \in \{1,2\}$, define $\phi _i^\varepsilon $ by

$$\begin{aligned} (\phi _i^\varepsilon (t_0,\theta _0,\xi _1,\xi _2),\xi _i) = \lim _{t \rightarrow t_0^+}\Psi _i^{\varepsilon ,t_0}(t,\theta _0,(0,\xi _1),(0,\xi _2)). \end{aligned}$$

Then, it is clear that $\phi _i^\varepsilon $ is continuous. It is also clear from this definition that property f.3) holds.

Finally, since the change of variables $(t,\theta ,y,z) \rightarrow ({\tilde{t}},\tilde{\theta },{\tilde{y}},{\tilde{z}}) = (t,\theta -\omega ,y,-z)$ carries system (11) into an identical system, it follows from the already proved properties f.1) and f.2) that: a point $(y_*,z_*) \in \mathbb {R}^m \times \mathbb {R}^n$ is in $S(t_0,\theta _0+\omega ,\varepsilon )$ if, and only if, $(y_*,-z_*) \in S(t_0,\theta _0,\varepsilon )$. Hence, it follows that, for each $(\xi _1,\xi _2) \in {\bar{B}}_{m_s}(0,r) \times {\bar{B}}_{n_s}(0,r)$, there is $(\tilde{\xi }_1,\tilde{\xi }_2) \in {\bar{B}}_{m_s}(0,r) \times {\bar{B}}_{n_s}(0,r)$ such that

$$\begin{aligned} (\phi _1^\varepsilon (t_0,\theta _0+\omega ,\xi _1,\xi _2),\xi _1) =(\phi _1^\varepsilon (t_0,\theta _0,\tilde{\xi }_1,\tilde{\xi }_2),\tilde{\xi }_1), \end{aligned}$$

and

$$\begin{aligned} (\phi _2^\varepsilon (t_0,\theta _0+\omega ,\xi _1,\xi _2),\xi _2) = - (\phi _2^\varepsilon (t_0,\theta _0,\tilde{\xi }_1,\tilde{\xi }_2),\tilde{\xi }_2). \end{aligned}$$

Therefore, $\xi _1=\tilde{\xi }_1$ and $\xi _2=-\tilde{\xi }_2$, and property f.4) follows. This concludes the proof of the Lemma. $\square $

The following corollary addresses the issue of uniqueness of the invariant manifold found in the previous Lemma. Its proof will not be presented here, but it follows essentially from the stability property provided by statement f) (see, for instance, [11, Remark 2.2] and, for more details, [2, Remark of page 494]).

Corollary 1

For each $\varepsilon \in (0,\varepsilon _1]$, the invariant manifold given by $y=f_\varepsilon (t,\theta )$, $z=g_\varepsilon (t,\theta )$ is unique in $\mathbb {R}\times \mathbb {R}\times B_m(0,\sigma _0) \times B_n(0,\sigma _0)$, that is, every invariant manifold contained in $\mathbb {R}\times \mathbb {R}\times B_m(0,\sigma _0) \times B_n(0,\sigma _0)$ must be contained in the set given by $y=f_\varepsilon (t,\theta )$, $z=g_\varepsilon (t,\theta )$.

The remainder of this section is devoted to present technical propositions to address issues of regularity. Their proofs are provided in the Appendix. The first proposition is concerned with the regularity of the invariant manifold whose existence was established in the previous Lemma.

Proposition 3

Consider system (10) with the hypotheses presented in this section. Suppose that, for each $\varepsilon \in (0,\varepsilon _1]$ and each $i \in \{0,1,2\}$ fixed, the functions $(t,\theta ,y,z) \mapsto \zeta _i(t,\theta ,y,z,\varepsilon )$ are of class $C^p$. Then, the invariant manifold found in Lemma 1 above, that is, the manifold $M_\varepsilon =\{(t,\theta ,f_\varepsilon (t,\theta ),g_\varepsilon (t,\theta )) \in \mathbb {R}\times \mathbb {R}\times B_m(0,\sigma _0) \times B_n(0,\sigma _0): (t,\theta ) \in \mathbb {R}\times \mathbb {R}\}$, is of class $C^p$. Moreover, for each $\varepsilon \in (0,\varepsilon _1]$, the functions $(t,\theta ) \mapsto f_\varepsilon (t,\theta )$ and $(t,\theta ) \mapsto g_\varepsilon (t,\theta )$ are also of class $C^p$.

The next three propositions consider the issue of regularity of the family $(f_\varepsilon ,g_\varepsilon )$ with respect to the parameter $\varepsilon $. They will be crucial when we discuss the statement concerning dynamics of Theorem A.

Proposition 4

Consider system (10) with the hypotheses presented in this section. Suppose that $\zeta _0$, $\zeta _1$, and $\zeta _2$ are of class $C^2$. Let $c: (0,\varepsilon _1) \rightarrow C(\mathbb {R}^2;\mathbb {R}^m) \times C(\mathbb {R}^2; \mathbb {R}^n) $ be defined by $c(\varepsilon ) = (f_\varepsilon ,g_\varepsilon )$, where $C(\mathbb {R}^2;\mathbb {R}^m)$ and $C(\mathbb {R}^2; \mathbb {R}^n)$ are equipped with the uniform norm. If $\varepsilon _1>0$ is sufficiently small, then c is of class $C^1$.

Proposition 5

Consider system (10) with the hypotheses presented in this section. Suppose that $\zeta _0$, $\zeta _1$, and $\zeta _2$ are of class $C^{p+1}$. Let $T_{F,G}$ be defined as in the proof of Lemma 1. Then, if $\varepsilon _1>0$ is sufficiently small, then the following holds: there is $N_T \in \mathbb {N}$ and, for each compact interval $[a,b] \subset (0,\varepsilon _1]$, there are $C_{[a,b]}>0$ and $M_{[a,b]}>0$ such that

$$\begin{aligned} \left\| \frac{\partial ^q f_\varepsilon }{\partial \theta ^q} \right\| \le C_{[a,b]}, \qquad \left\| \frac{\partial ^q g_\varepsilon }{\partial \theta ^q} \right\| \le C_{[a,b]}, \end{aligned}$$

and

$$\begin{aligned} \left| \frac{\partial ^q T_{f_\varepsilon ,g_\varepsilon }}{\partial \theta _0^q}(t+x,t,\theta ,\varepsilon ) \right| \le M_{[a,b]}\, e^{N_{T} L(\varepsilon )(1+2\Delta (\varepsilon )) |x|} \end{aligned}$$

for all $q \in \{1,\ldots ,p+1\}$ and all $\varepsilon \in [a,b]$.

Proposition 5 admits the following corollary which is a straightforward application of the mean value inequality.

Corollary 2

Consider system (10) with the hypotheses presented in this section. Suppose that $\zeta _0$, $\zeta _1$, and $\zeta _2$ are of class $C^{p+1}$. Then, if $\varepsilon _1>0$ is sufficiently small, then the following holds: for each compact interval $[a,b] \subset (0,\varepsilon _1]$, there is $C_{[a,b]}>0$ such that

$$\begin{aligned}{} & {} \left\| \frac{\partial ^q f_\varepsilon }{\partial \theta ^q} (t,\theta _2) - \frac{\partial ^q f_\varepsilon }{\partial \theta ^q} (t,\theta _1)\right\| \le C_{[a,b]} |\theta _2-\theta _1|,\\ {}{} & {} \qquad \left\| \frac{\partial ^q g_\varepsilon }{\partial \theta ^q} (t,\theta _2) - \frac{\partial ^q g_\varepsilon }{\partial \theta ^q} (t,\theta _1)\right\| \le C_{[a,b]} |\theta _2-\theta _1| \end{aligned}$$

for all $q \in \{1,\ldots ,p\}$, all $t,\theta _1,\theta _2 \in \mathbb {R}$, and all $\varepsilon \in [a,b]$.

Proposition 6

Consider system (10) with the hypotheses presented in this section. Suppose that $\zeta _0$, $\zeta _1$, and $\zeta _2$ are of class $C^{p+1}$. Let $q\le p$ be a non-negative integer. Then, if $\varepsilon _1>0$ is sufficiently small, then the functions

$$\begin{aligned} \varepsilon \mapsto \frac{\partial ^{q}f_\varepsilon }{\partial \theta ^{q}} \quad \text {and} \quad \varepsilon \mapsto \frac{\partial ^{q}g_\varepsilon }{\partial \theta ^{q}} \end{aligned}$$

are locally Lipschitz continuous in the uniform norm for $\varepsilon \in (0,\varepsilon _1]$.

3 Proof of Theorem A

This section is devoted to the proof of Theorem A. In Sect. 3.1, we perform the change of variables that transforms system (1) into a system to which Lemma 1 can be applied. In Sect. 3.2, we apply this Lemma in order to prove the statements regarding existence, regularity, and convergence of Theorem A. In Sect. 3.3, we prove the statement regarding stability, and finally, in Sect. 3.4, the statement regarding the dynamics on the object $M_\varepsilon $ of the same Theorem.

3.1 Change of variables

Consider the differential equation (1). We shall first find a change of coordinates transforming this system into one to which we can apply Lemma 1. Thus, let $\ell \in \{1,\ldots ,\min (N,r-2)\}$ be such that $\textbf{f}_1=\cdots \textbf{f}_{\ell -1}=0$ and $\textbf{f}_{\ell }\ne 0$. By Theorem 1, there exists a T-periodic near-identity transformation (3) that transforms the differential equation (1) into

$$\begin{aligned} \dot{\textbf{z}}=\varepsilon ^{\ell }\textbf{g}_{\ell }(\textbf{z})+\varepsilon ^{\ell +1} r_{\ell }(t,\textbf{z},\varepsilon ). \end{aligned}$$

(14)

Observe that the formulas given in (6) ensure that $\textbf{g}_\ell $ is of class $C^{r-\ell +1}$. Moreover, $r_\ell $ is of class $C^{r-\ell }$.

Consider the $\omega $-periodic hyperbolic limit cycle $\varphi (s)$ of the guiding system $\dot{\textbf{z}}=\textbf{g}_\ell (\textbf{z})$. Also, consider the linear variational equation

$$\begin{aligned} \frac{dy}{dt} = D \textbf{g}_\ell (\varphi (t)) \cdot y. \end{aligned}$$

(15)

Observe that $\varphi '(t)$ is a solution to the linear periodic system (15). Let $\Phi (t)$ denote a fundamental matrix solution of this system. We will use Floquet theory to obtain a useful change of variables in a neighborhood of the limit cycle $\Gamma $.

We remind the reader that the characteristic multipliers of (15) are, for any choice of $\Phi $, the eigenvalues of the monodromy matrix $\Phi ^{-1}(0) \Phi (\omega )$ (see, for instance, [5]). Since $\Gamma $ is hyperbolic, we know that 1 is an eigenvalue of multiplicity exactly 1 of this matrix, all its other eigenvalues being outside the unit circle. By taking into account the real Jordan canonical form of the monodromy matrix, we see that $\Phi (t)$ can be chosen satisfying

$$\begin{aligned} \Phi ^{-1}(0) \Phi (\omega ) = \text {diag}(1,\mathcal {J}_1,\mathcal {J}_2), \end{aligned}$$

(16)

where $\mathcal {J}_1 \in \mathbb {R}^{(n-d-1) \times (n-d-1)}$ and $\mathcal {J}_2 \in \mathbb {R}^{d \times d}$ are matrices in the real Jordan canonical form satisfying the following condition: each Jordan block of $\mathcal {J}_1$ associated to a real negative eigenvalue appears an even number of times, and every Jordan block of $\mathcal {J}_2$ is associated to a real negative eigenvalue and appears only once in this matrix. We remark that, with this choice, the first of column of $\Phi $ must be the only $\omega $-periodic solution of (15), which is given by $\varphi '(t)$.

Let $I_d$ denote the $d \times d$ identity matrix. Under the above-mentioned conditions, by considering the logarithm of the matrices $R_1$ and $R_2$ (see, [7] and [9, page 100]), we know that there exist real matrices $R_1$ and $R_2$ such that $e^{\omega R_1} = \mathcal {J}_1$ and $e^{\omega R_2 + i\pi I_d}= \mathcal {J}_2$. In particular, we remark that the eigenvalues of $R_1$ and $R_2$ all have non-zero real parts. For the same reason, the number of eigenvalues (counting multiplicity) of the Poincaré map defined in a transversal section of $\Gamma $ with modulus less than 1 is equal to the number of eigenvalues (counting multiplicity) of $R:= \text {diag}(R_1,R_2)$ with strictly negative real part.

Define the matrices

$$\begin{aligned} {\tilde{B}}:= \text {diag}\left( 0,R_1,R_2+i\frac{\pi }{\omega }I_d \right) \end{aligned}$$

and

$$\begin{aligned} B:= \text {diag}\left( 0,R_1,R_2\right) = \text {diag}(0,R). \end{aligned}$$

It is easy to see that $e^{\omega {\tilde{B}}} = \Phi ^{-1}(0) \Phi (\omega )$ and $e^{2\omega B} = e^{2\omega {\tilde{B}}} = (\Phi ^{-1}(0) \Phi (\omega ))^2$.

Since $ D \textbf{g}_\ell (\varphi (t))$ is $\omega $-periodic, Floquet’s theorem ensures that there are a $\omega $-periodic matrix function $t \mapsto {\tilde{P}}(t) \in \mathbb {C}^{n \times n}$ and a $2\omega $-periodic matrix function $t\mapsto P(t) \in \mathbb {R}^{n \times n}$, both of class $C^{r-\ell +1}$, such that

$$\begin{aligned} \Phi (t)={\tilde{P}}(t) e^{t{\tilde{B}}} = P(t) e^{tB}. \end{aligned}$$

In particular, since the first column of $\Phi (t)$ is $\varphi '(t)$, it follows that P(t) is of the form

$$\begin{aligned} P(t) = \left[ \varphi '(t) \, | \; Q(t)\right] , \end{aligned}$$

where $t \mapsto Q(t) \in \mathbb {R}^{n \times (n-1)}$ is $2\omega $-periodic. Also, considering that B and ${\tilde{B}}$ clearly commute, it follows that

$$\begin{aligned} P(t+\omega ) = P(t) e^{\omega ({\tilde{B}}-B)} = \left[ \varphi '(t) \; | \; Q(t) A\right] , \end{aligned}$$

where

$$\begin{aligned} A:= \text {diag}(I_{n-d-1},-I_d). \end{aligned}$$

(17)

Thus, it is clear that Q satisfies

$$\begin{aligned} Q(t+\omega ) = Q(t) A \end{aligned}$$

(18)

for all $t \in \mathbb {R}. $

Since $\Phi (t)$ solves (15), it follows that

$$\begin{aligned} P'(t) + P(t) \cdot B = D \textbf{g}_\ell (\varphi ( t)) \cdot P( t). \end{aligned}$$

Thus, by restricting the equality above to the last $n-1$ columns, we obtain

$$\begin{aligned} Q'( t) + Q( t) \cdot R = D \textbf{g}_\ell (\varphi ( t)) \cdot Q( t), \end{aligned}$$

(19)

for all $t \in \mathbb {R}.$

We apply the transformation $\textbf{z}\mapsto (s,\textbf{h} ) \in \mathbb {R}\times \mathbb {R}^{n-1}$ given by

$$\begin{aligned} \textbf{z}= \varphi (s) +Q(s)\cdot \textbf{h}. \end{aligned}$$

(20)

Observe that, by taking $\textbf{h}$ to be sufficiently small and $s \in [0,\omega )$, we can ensure that the transformation $(s,\textbf{h}) \mapsto \textbf{z}$ is injective. Accordingly, we will assume henceforth that $\Vert \textbf{h}\Vert \le 4\rho $, ensuring that our transformation is bijective. Let us find the differential equation in $(s,\textbf{h})$ that is equivalent to (14). In order to do so, we differentiate (20) with respect to t and obtain

$$\begin{aligned} \dot{\textbf{z}}= (\varphi '(s) + Q'(s) \cdot \textbf{h})\, \dot{s} + Q(s) {\dot{\textbf{h}}}. \end{aligned}$$

Thus, by (14), it follows that

$$\begin{aligned} (\varphi '(s) + Q'(s) \cdot \textbf{h})\, \dot{s} + Q(s) \cdot {\dot{\textbf{h}}}= & {} \varepsilon ^\ell \textbf{g}_\ell (\varphi (s) + Q(s) \cdot \textbf{h}) \nonumber \\{} & {} + \varepsilon ^{\ell +1} r(t,\varphi (s) + Q(s) \cdot \textbf{h},\varepsilon ). \end{aligned}$$

(21)

Observe that (19) ensures that

$$\begin{aligned} \begin{aligned}&\varepsilon ^\ell \varphi '(s) + \varepsilon ^\ell Q'(s) \cdot \textbf{h} + \varepsilon ^\ell Q(s) R \cdot \textbf{h} = \varepsilon ^\ell \textbf{g}_\ell (\varphi (s)) + \varepsilon ^\ell D \textbf{g}_\ell (\varphi (s)) \cdot Q(s) \cdot \textbf{h}. \end{aligned}\nonumber \\ \end{aligned}$$

(22)

Let us define the functions

$$\begin{aligned} \begin{aligned}&Y(s,\textbf{h}):= \textbf{g}_{\ell }\left( \varphi (s) +Q(s) \cdot \textbf{h}\right) - \textbf{g}_{\ell }(\varphi (s)) - D \textbf{g}_\ell (\varphi (s)) \cdot Q(s) \cdot \textbf{h}, \\&Z(t,s,\textbf{h}, \varepsilon ):= r_\ell \left( t,\varphi (s) +Q(s) \cdot \textbf{h},\varepsilon \right) . \end{aligned} \end{aligned}$$

By subtracting (22) from (21), we obtain

$$\begin{aligned} \left( \varphi '(s) + Q'(s) \textbf{h}\right) (\dot{s} - \varepsilon ^\ell ) + Q(s) ({\dot{\textbf{h}}} - \varepsilon ^\ell R \cdot \textbf{h}) = \varepsilon ^\ell Y(s,\textbf{h}) + \varepsilon ^{\ell +1}Z(t,s,\textbf{h}, \varepsilon ). \nonumber \\ \end{aligned}$$

(23)

Observe that (23) can be rewritten as:

(24)

Observe that the $C^{r-\ell }$ matrix function

satisfies $C(s,0)=P(s)$ for all $s \in \mathbb {R}$. Since P(s) is non-singular, for sufficiently small values of $\textbf{h}$, the matrix $C(s,\textbf{h})$ can be inverted. Therefore, assuming that $\rho >0$ is sufficiently small, if $\Vert \textbf{h}\Vert \le 4\rho $, then (24) can be transformed into

$$\begin{aligned} \left[ \begin{array}{c} \dot{s} \\ {\dot{\textbf{h}}} \end{array}\right] = \varepsilon ^\ell \left[ \begin{array}{c} 1\\ R \textbf{h} \end{array}\right] + \varepsilon ^{\ell } (C(s,\textbf{h}))^{-1} \cdot Y(s, \textbf{h})+ \varepsilon ^{\ell +1}(C(s,\textbf{h}))^{-1} \cdot Z(t,s,\textbf{h}, \varepsilon ).\nonumber \\ \end{aligned}$$

(25)

Set $\textbf{h} = (\textbf{v},\textbf{w}) \in \mathbb {R}^{n-d-1} \times \mathbb {R}^d$. Define $\Lambda _0(s, \textbf{v},\textbf{w})$ and $\tilde{\Lambda }_0(t,s,\textbf{v},\textbf{w}, \varepsilon )$ to be the first line of the products $(C(s,\textbf{h}))^{-1} \cdot Y(s, \textbf{h})$ and $(C(s,\textbf{h}))^{-1} \cdot Z(t,s,\textbf{h}, \varepsilon )$, respectively. Similarly, define $\Lambda _1(s, \textbf{v},\textbf{w})$ and $\tilde{\Lambda }_1(t,s,\textbf{v},\textbf{w}, \varepsilon )$ to be the next $n-d-1$ lines and $\Lambda _2(s, \textbf{v},\textbf{w})$ and $\tilde{\Lambda }_2(t,s,\textbf{v},\textbf{w}, \varepsilon )$ to be the last d lines of those products. Then, (25) becomes

$$\begin{aligned} \begin{aligned}&\dot{s} = \varepsilon ^\ell + \varepsilon ^\ell \Lambda _0(s,\textbf{v},\textbf{w}) + \varepsilon ^{\ell +1} \tilde{\Lambda }_0(t,s,\textbf{v},\textbf{w}, \varepsilon ), \\&{\dot{\textbf{v}}} = \varepsilon ^\ell R_1\cdot \textbf{v} + \varepsilon ^\ell \Lambda _1(s,\textbf{v},\textbf{w}) + \varepsilon ^{\ell +1} \tilde{\Lambda }_1(t,s,\textbf{v},\textbf{w}, \varepsilon ), \\&{\dot{\textbf{w}}} = \varepsilon ^\ell R_2\cdot \textbf{w} + \varepsilon ^\ell \Lambda _2(s,\textbf{v},\textbf{w}) + \varepsilon ^{\ell +1} \tilde{\Lambda }_2(t,s,\textbf{v},\textbf{w}, \varepsilon ). \end{aligned} \end{aligned}$$

(26)

We apply the time rescaling $\varepsilon ^\ell t = {\tilde{t}}$ to (26) and finally obtain

$$\begin{aligned} \begin{aligned}&s' = 1 + \Lambda _0(s,\textbf{v},\textbf{w}) + \varepsilon \tilde{\Lambda }_0({\tilde{t}}/\varepsilon ^\ell ,s,\textbf{v},\textbf{w}, \varepsilon ), \\&\textbf{v}' = R_1\cdot \textbf{v} + \Lambda _1(s,\textbf{v},\textbf{w}) + \varepsilon \tilde{\Lambda }_1({\tilde{t}}/\varepsilon ^\ell ,s,\textbf{v},\textbf{w}, \varepsilon ), \\&\textbf{w}' = R_2\cdot \textbf{w} + \Lambda _2(s,\textbf{v},\textbf{w}) + \varepsilon \tilde{\Lambda }_2({\tilde{t}}/\varepsilon ^\ell ,s,\textbf{v},\textbf{w}, \varepsilon ). \end{aligned} \end{aligned}$$

(27)

where $\prime $ denotes a derivative with respect to ${\tilde{t}}$. Such differential system is well defined on $({\tilde{t}},s,\textbf{v}, \textbf{w},\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times B_{n-d-1}(0,2\rho ) \times B_{d}(0,2\rho ) \times (0,\varepsilon _0]$.

3.2 Existence, regularity, and convergence

Henceforth, we consider that (27) is defined over $\mathbb {R}\times \mathbb {R}\times B_{n-d-1}(0,\rho ) \times B_{d}(0,\rho ) \times (0,\varepsilon _0]$. Observe that (27) is of the form considered in Lemma 1. We must now show that the hypotheses required for the application of that Lemma hold in our case. Observe that the fact that the parameter $\varepsilon $ appears in the denominator of the first argument of $\tilde{\Lambda }_0$, $\tilde{\Lambda }_1$, and $\tilde{\Lambda }_2$ in (27) will not be an impediment to the application of the Lemma, since it is not required in its hypotheses that the functions appearing in the system be defined at $\varepsilon =0$. In fact, the conditions concerning boundedness, be it of the functions themselves or of their Lipschitz constants, can still be proved by resorting to the periodicity of Z.

For convenience, we will henceforth adopt the following notation

$$\begin{aligned} \begin{aligned}&(C^{-1} \cdot Y) (s,\textbf{h}):= (C (s,\textbf{h}))^{-1} \cdot Y (s,\textbf{h}), \\&(C^{-1} \cdot Z) (t,s,\textbf{h}, \varepsilon ):= (C (s,\textbf{h}))^{-1} \cdot Z (t,s,\textbf{h}, \varepsilon ),\\&(C^{-1} \cdot Y + \varepsilon C^{-1} \cdot Z) (t,s,\textbf{h}, \varepsilon ):= (C (s,\textbf{h}))^{-1} \cdot Y (s,\textbf{h}) + \varepsilon (C (s,\textbf{h}))^{-1} \cdot Z (t,s,\textbf{h}, \varepsilon ). \end{aligned} \end{aligned}$$

With that in mind, we proceed to proving that Lemma 1 can be applied to (27).

As remarked before, the eigenvalues of the matrix R have non-zero real parts, so that it is immediate that hypothesis (iv) holds. Regarding hypothesis (i), observe that, by their definitions, we know that $Y(s+\omega ,\textbf{h}) = Y(s,A\cdot \textbf{h})$ and $Z(t,s+\omega ,\textbf{h}, \varepsilon ) = Z(t,s, A \cdot \textbf{h}, \varepsilon )$. Furthermore,

Thus, since $A^2=I_{n-1}$, it follows that

Hence, it is easily verified that the following conditions hold:

$\Lambda _0(s+\omega ,\textbf{v},\textbf{w}) = \Lambda _0(s,\textbf{v},-\textbf{w})$;
$\tilde{\Lambda }_0 (t,s+\omega , \textbf{v},\textbf{w}, \varepsilon ) = \tilde{\Lambda }_0 (t,s+\omega , \textbf{v},-\textbf{w}, \varepsilon )$;
$\Lambda _1(s+\omega ,\textbf{v},\textbf{w}) = \Lambda _1(s,\textbf{v},-\textbf{w})$;
$\tilde{\Lambda }_1 (t,s+\omega , \textbf{v},\textbf{w}, \varepsilon ) = \tilde{\Lambda }_1 (t,s+\omega , \textbf{v},-\textbf{w}, \varepsilon )$;
$\Lambda _2(s+\omega ,\textbf{v},\textbf{w}) = -\Lambda _2(s,\textbf{v},-\textbf{w})$;
$\tilde{\Lambda }_2 (t,s+\omega , \textbf{v},\textbf{w}, \varepsilon ) = - \tilde{\Lambda }_2 (t,s+\omega , \textbf{v},-\textbf{w}, \varepsilon )$.

This ensures that (i) is valid. It remains to show that hypotheses (ii) and (iii) hold.

In order to do so, let $B_n(p,r)$ denote the open ball $\{x \in \mathbb {R}^n: \Vert x-p\Vert <r \}$. Also, let us define the following functions:

$$\begin{aligned} \begin{aligned}&\alpha _Y(s,\textbf{h}):= \left\| \frac{\partial (C^{-1}\cdot Y)}{\partial (s,\textbf{h})}(s,\textbf{h})\right\| , \\&\alpha _Z{(s,\textbf{h},t,\varepsilon )}: =\left\| \frac{\partial (C^{-1}\cdot {\tilde{Z}})}{\partial (s,\textbf{h})}(t,s,\textbf{h},\varepsilon )\right\| , \end{aligned} \end{aligned}$$

where $\Vert \cdot \Vert $ denotes the operator norm.

Let $\varepsilon _0>0$ be fixed. For $\sigma \in (0,\rho )$, define

$$\begin{aligned} L(\sigma ):= \sup \left\{ \alpha _Y{(s,\textbf{h})}: (s,\textbf{h}) \in \mathbb {R} \times {\bar{B}}_{n-1}(0,\sigma ) \right\} . \end{aligned}$$

Observe that $\lim \nolimits _{\sigma \rightarrow 0} L(\sigma ) = 0$, because

$$\begin{aligned} \alpha _Y(s,\textbf{h}) \le \left\| \frac{\partial (C^{-1}\cdot Y)}{\partial s}(s,\textbf{h})\right\| + \left\| \frac{\partial (C^{-1}\cdot Y)}{\partial \textbf{h}}(s,\textbf{h})\right\| \end{aligned}$$

and $Y(s,0)=\frac{\partial Y}{\partial s} (s,0) = \frac{\partial Y}{\partial \textbf{h}} (s,0) = 0$ for all $s \in \mathbb {R}$. Thus, we can extend L continuously to $[0,\rho )$ by setting $L(0)=0$. Moreover, since $(C^{-1} \cdot Y)$ is also of class $C^{r-\ell }$, with $r-\ell \ge 2$, it follows by the mean value inequality that there is $\mathcal {M}_\rho >0$ such that

$$\begin{aligned} \alpha _Y(s,\textbf{h}) \le \mathcal {M}_\rho \Vert \textbf{h}\Vert \le \mathcal {M}_\rho \sigma \end{aligned}$$

for all $(s,\textbf{h}) \in \mathbb {R}\times {\bar{B}}_{n-1}(0,\sigma )$, where $\sigma \in (0,\rho )$. Then, again by the mean value inequality, we conclude that

$$\begin{aligned} \Vert (C^{-1} \cdot Y)(s_1,\mathbf{h_1}) - (C^{-1} \cdot Y)(s_2,\mathbf{h_2}) \Vert\le & {} L(\sigma ) \Vert (s_1,\mathbf{h_1}) - (s_2, \mathbf{h_2}) \Vert \nonumber \\\le & {} \mathcal {M}_\rho \sigma \Vert (s_1,\mathbf{h_1}) - (s_2, \mathbf{h_2}) \Vert , \end{aligned}$$

(28)

for all $(s_1,\mathbf{h_1}), (s_2, \mathbf{h_2}) \in \mathbb {R} \times {\bar{B}}_{n-1}(0,\sigma )$.

Since $r-\ell \ge 2$, and since the $C^{r-\ell }$ function $\Vert C^{-1}\cdot Z\Vert $ is T-periodic in its first entry and $2\omega $-periodic in its second entry, it follows that there is $M>0$ such that

$$\begin{aligned} \sup \left\{ \left\| (C^{-1}\cdot Z)({\tilde{t}}/\varepsilon ^\ell ,s,\textbf{h},\varepsilon )\right\| : ({\tilde{t}},s,\textbf{h},\varepsilon ) \in \mathbb {R} \times \mathbb {R}\times \{0\} \times (0,\varepsilon _0] \right\} \le M, \end{aligned}$$

and

$$\begin{aligned} \sup \left\{ \alpha _Z{({\tilde{t}}/\varepsilon ^\ell ,s,\textbf{h},\varepsilon )}: ({\tilde{t}},s,\textbf{h},\varepsilon ) \in \mathbb {R} \times \mathbb {R}\times {\bar{B}}_{n-1}(0,\rho ) \times (0,\varepsilon _0] \right\} \le M. \end{aligned}$$

Thus, it follows on the one hand that

$$\begin{aligned}{} & {} \Vert \varepsilon (C^{-1} \cdot Z)({\tilde{t}}/\varepsilon ^\ell ,s,0,\varepsilon )\Vert =\Vert (C^{-1} \cdot Y)(s,0) \nonumber \\{} & {} \quad + \varepsilon (C^{-1} \cdot Z)({\tilde{t}}/\varepsilon ^\ell ,s,0,\varepsilon )\Vert \le \varepsilon M, \end{aligned}$$

(29)

for all $({\tilde{t}},s,\varepsilon ) \in \mathbb {R} \times \mathbb {R} \times (0,\varepsilon _0]$, proving that (ii) is valid with $M(\varepsilon )=\varepsilon M$. On the other hand, the mean value inequality ensures that

$$\begin{aligned}{} & {} \Vert (C^{-1} \cdot Z)({\tilde{t}}/\varepsilon ^\ell ,s_1,\mathbf{h_1},\varepsilon ) - (C^{-1}\cdot Z)({\tilde{t}}/\varepsilon ^\ell ,s_2,\mathbf{h_2},\varepsilon ) \Vert \nonumber \\{} & {} \quad \le M \Vert (s_1,\mathbf{h_1}) - (s_2, \mathbf{h_2}) \Vert , \end{aligned}$$

(30)

for all $(s_1,\mathbf{h_1}), (s_2, \mathbf{h_2}) \in \mathbb {R} \times {\bar{B}}_{n-1}(0,\rho )$ and all $({\tilde{t}},\varepsilon ) \in \mathbb {R} \times (0,\varepsilon _0]$. Hence, combining (28) and (30), we conclude that

$$\begin{aligned}{} & {} \Vert (C^{-1}\cdot Y+ \varepsilon C^{-1} \cdot Z)({\tilde{t}}/\varepsilon ^\ell ,s_1,\mathbf{h_1},\varepsilon ) - (C^{-1}\cdot Y+ \varepsilon C^{-1} \cdot Z)({\tilde{t}}/\varepsilon ^\ell ,s_2,\mathbf{h_2},\varepsilon ) \Vert \nonumber \\ {}{} & {} \quad \le (\mathcal {M}_L \sigma + \varepsilon M) \Vert (s_1,\mathbf{h_1}) - (s_2, \mathbf{h_2}) \Vert , \end{aligned}$$

(31)

for $(s_1,\mathbf{h_1}),(s_2,\mathbf{h_2}) \in \mathbb {R}\times {\bar{B}}_{n-1}(0,\sigma )$ and $({\tilde{t}},\varepsilon ) \in \mathbb {R}\times (0,\varepsilon _0]$, ensuring that (iii) also holds.

Hence, all the hypotheses required for the application of Lemma 1 are valid for system (27). Since $(C^{-1}\cdot Y)$ and $(t,s,\textbf{h}) \mapsto (C^{-1}\cdot Z)(t,s,\textbf{h},\varepsilon )$ are of class $C^{r-\ell }$, Proposition 3 may also be applied with $p=r-\ell $. Applying the above-mentioned results, we obtain $\varepsilon _1>0$ and families of functions $\{f_\varepsilon \in C^{r-\ell }(\mathbb {R}^2; \mathbb {R}^{n-d-1}): {\varepsilon \in (0,\varepsilon _1]}\}$ and $\{g_\varepsilon \in C^{r-\ell }(\mathbb {R}^2; \mathbb {R}^{d}): {\varepsilon \in (0,\varepsilon _1]}\}$ such that

(I)
For each $\varepsilon \in (0,\varepsilon _1]$, the set defined by the relation $\textbf{h} = (f_\varepsilon (\tilde{t},s),g_\varepsilon (\tilde{t},s))$ is an invariant manifold for system
$$\begin{aligned} \left[ \begin{array}{c} s^\prime \\ \mathbf{h'} \end{array}\right] = \left[ \begin{array}{c} 1\\ H \textbf{h} \end{array}\right] + (C^{-1} \cdot Y)(s, \textbf{h})+ \varepsilon (C^{-1} \cdot Z)(\tilde{t}/\varepsilon ^\ell , s,\textbf{h}, \varepsilon ), \qquad \tilde{t}'=1. \end{aligned}$$
(32)
(II)
There is $D(\varepsilon )>0$ such that $\Vert f_\varepsilon \Vert _{C^0}\le D(\varepsilon )$, $\Vert g_\varepsilon \Vert _{C^0}\le D(\varepsilon )$ and $\lim _{\varepsilon \rightarrow 0} D(\varepsilon ) = 0$. Furthermore, there is a constant $C_R>0$, depending only on the matrix R, such that $D(\varepsilon ) = C_R M(\varepsilon ) = C_R \varepsilon M$.
(III)
There is $\Delta (\varepsilon )>0$ such that $f_\varepsilon $ and $g_\varepsilon $ are Lipschitz continuous in s with Lipschitz constant $\Delta (\varepsilon )$ and $\lim _{\varepsilon \rightarrow 0 } \Delta (\varepsilon ) = 0$.
(IV)
$f_\varepsilon $ is $\omega $-periodic in s and $g_\varepsilon $ satisfies $g_\varepsilon (\tilde{t},s+\omega ) = -g(\tilde{t},s)$.
(V)
$f_\varepsilon $ and $g_\varepsilon $ are $\varepsilon ^\ell T$-periodic in $\tilde{t}$.
(VI)
Let $\pi _1: \mathbb {R}^{n-d-1}\times \mathbb {R}^d \rightarrow \mathbb {R}^{n-d-1}$ and $\pi _2: \mathbb {R}^{n-d-1} \times \mathbb {R}^d \rightarrow \mathbb {R}^d$ be the canonical projections. Also, let $k_1\le n-d-1$ and $k_2\le d$ of the eigenvalues of $R_1$ and $R_2$, respectively, have negative real parts. There are positive constants r, $\lambda $, C, $\sigma _0$, and $\sigma _1$ such that $r<\sigma _0$, $D(\varepsilon )<\sigma _0<\sigma _1<\rho $, and, for each $(\tilde{t}_0,s_0,\varepsilon ) \in \mathbb {R} \times \mathbb {R} \times (0,\varepsilon _1]$, there is in $\bar{B}_{n-d-1}(0,\sigma _0) \times \bar{B}_d(0,\sigma _0)$ a local $(k_1+k_2)$-dimensional embedded submanifold $S(t_0,s_0,\varepsilon )$ of $\mathbb {R}^{n-d-1} \times \mathbb {R}^d$, containing the point $(f_\varepsilon (\tilde{t}_0,s_0),g_\varepsilon (\tilde{t}_0,s_0))$, and having the following properties:
1. (VI.1)
  If $\mathbf{h_0} \in \bar{B}_{n-d-1}(0,\sigma _0) \times \bar{B}_d(0,\sigma _0) {\setminus } S(\tilde{t}_0,s_0,\varepsilon )$, there is $\tilde{t}_*>\tilde{t}_0$ for which
  $$\textbf{h}(\tilde{t}_*,\tilde{t}_0,s_0,\mathbf{h_0},\varepsilon ) \notin \bar{B}_{n-d-1}(0,\sigma _1) \times \bar{B}_d(0,\sigma _1).$$
2. (VI.2)
  Reciprocally, if $\mathbf{h_0} \in S(\tilde{t}_0,s_0,\varepsilon )$, then, for all $\tilde{t}\ge \tilde{t}_0$,
  $$\textbf{h}(\tilde{t},\tilde{t}_0,s_0,\mathbf{h_0},\varepsilon ) \in \bar{B}_{n-d-1}(0,\sigma _1) \times \bar{B}_d(0,\sigma _1)$$
  and the following inequality holds:
  $$\begin{aligned} \begin{aligned}&\left\| \textbf{h}(\tilde{t},\tilde{t}_0,s_0,\mathbf{h_0},\varepsilon ) - \textbf{h}(\tilde{t},\tilde{t}_0,s_0,(f_\varepsilon (\tilde{t}_0,s_0),g_\varepsilon (\tilde{t}_0,s_0)),\varepsilon )\right\| \\&\quad \le C e^{-\lambda (\tilde{t}-\tilde{t}_0)} \Vert \mathbf{h_0}-(f_\varepsilon (\tilde{t}_0,s_0),g_\varepsilon (\tilde{t}_0,s_0))\Vert . \end{aligned} \end{aligned}$$
3. (VI.3)
  There is a continuous function $\phi ^\varepsilon _1: \mathbb {R} \times \mathbb {R} \times \bar{B}_{k_1}(0,r) \times \bar{B}_{k_2}(0,r) \rightarrow \bar{B}_{n-d-k_1-1}(0,\sigma _0)$ such that
  $$\pi _1(S(\tilde{t}_0,s_0,\varepsilon )) = \{(\phi ^\varepsilon _1(\tilde{t}_0,s_0, \xi _1,\xi _2),\xi _1):(\xi _1,\xi _2) \in \bar{B}_{k_1}(0,r) \times \bar{B}_{k_2}(0,r)\}.$$
  Similarly, there is a continuous function $\phi ^\varepsilon _2: \mathbb {R} \times \mathbb {R} \times \bar{B}_{k_1}(0,r) \times \bar{B}_{k_2}(0,r) \rightarrow \bar{B}_{d-k_2}(0,\sigma _0)$ such that
  $$\pi _2(S(\tilde{t}_0,s_0,\varepsilon )) = \{(\phi ^\varepsilon _2(\tilde{t}_0,s_0,\xi _1, \xi _2),\xi _2):(\xi _1,\xi _2) \in \bar{B}_{k_1}(0,r) \times \bar{B}_{k_2}(0,r)\}.$$
4. (VI.4)
  The functions $\phi _1^\varepsilon $ and $\phi _2^\varepsilon $ satisfy
  $$\begin{aligned} \phi _1^\varepsilon (\tilde{t}_0,s_0+\omega ,\xi _1,\xi _2) = \phi _1^\varepsilon (\tilde{t}_0,s_0,\xi _1,-\xi _2) \end{aligned}$$
  and
  $$\begin{aligned} \phi _2^\varepsilon (\tilde{t}_0,s_0+\omega ,\xi _1,\xi _2) = -\phi _2^\varepsilon (\tilde{t}_0,s_0,\xi _1,-\xi _2) \end{aligned}$$
  for all $(\tilde{t}_0,s_0,\xi _1,\xi _2) \in \mathbb {R} \times \mathbb {R} \times \bar{B}_{k_1}(0,r) \times \bar{B}_{k_2}(0,r)$.

Let $\mathcal {X}$ be the function associated to the change of coordinates we have performed, that is,

$$\begin{aligned} \mathcal {X}(s,\textbf{h}) = \varphi (s) + Q(s) \cdot \textbf{h}. \end{aligned}$$

Define $w_\varepsilon :\mathbb {R}\times \mathbb {R}\rightarrow \mathbb {R}\times \mathbb {R}^n$ by

$$\begin{aligned} w_\varepsilon (\tau ,s):=\Big (\tau ,\mathcal {X}\left( s,\big (f_\varepsilon (\varepsilon ^\ell \tau ,s),g_\varepsilon (\varepsilon ^\ell \tau ,s)\big )\right) \Big ). \end{aligned}$$

Since $\mathcal {X}$ is injective for $s \in [0,\omega )$ and $\Vert \textbf{h}\Vert \le \rho $, the function $w_\varepsilon $ restricted to $\mathbb {R}\times [0,\omega )$ is injective. It is also clear that $\varepsilon _1$ can be taken sufficiently small as to ensure that $w_\varepsilon $ is an immersion for all $\varepsilon \in (0,\varepsilon _1]$.

It is easy to see that property (IV) above guarantees that $w_\varepsilon $ is $\omega $-periodic in s. In fact, considering (18) and the definition of A given in (17), we have that

$$\begin{aligned} w_\varepsilon (\tau ,s+\omega ) = \Big (\tau ,\varphi (s) + Q(s) \, A \cdot \big (f_\varepsilon (\varepsilon ^\ell \tau ,s),-g_\varepsilon (\varepsilon ^\ell \tau ,s)\big ) \Big ) = w_\varepsilon (\tau ,s). \end{aligned}$$

Thus, $W_\varepsilon :=\{w_\varepsilon (\tau ,s): (\tau ,s ) \in \mathbb {R}\times \mathbb {R}\}\subset \mathbb {R}\times \mathbb {R}^n$ is an embedded cylinder of class $C^{r-\ell }$ that is invariant under the flow of

$$\begin{aligned} \left\{ \begin{array}{@{}l@{}} \textbf{z}' = \displaystyle \varepsilon ^{\ell }\textbf{g}_{\ell }(\textbf{z})+\varepsilon ^{\ell +1} r_{\ell }(\textbf{z},\tau ,\varepsilon ), \\ \tau '=1. \end{array} \right. \end{aligned}$$

(33)

Property (V) ensures that $w_\varepsilon $ satisfies $w_\varepsilon (\tau +T,s) = (T,0) +w_\varepsilon (\tau ,s)$. We can thus consider $\tau $ an angular variable modulo T in (33), and $W_\varepsilon $ becomes an invariant torus in $\mathbb {S}^1 \times \mathbb {R}^n$. Finally, the torus $M_\varepsilon $, invariant under (2), is obtained from $W_\varepsilon $ by reverting the near-identity periodic transformation $\textbf{x}= U(\tau ,\textbf{z},\varepsilon )$ that we employed in the beginning of the proof. This proves the existence of $M_\varepsilon $ stated in Theorem A. The fact that there is a neighborhood V of $\Gamma $ such that any compact manifold that is invariant under (2) and contained in $\mathbb {S}^1 \times V$ must also be contained in $M_\varepsilon $ follows from Corollary 1.

We proceed to proving the statement regarding regularity of $M_\varepsilon $ in Theorem A. Define $\mathcal {F}_\varepsilon $ by

$$\begin{aligned} \mathcal {F}_\varepsilon (\tau ,s) = U\Big (\tau ,\mathcal {X}\left( s,\big (f_\varepsilon (\varepsilon ^\ell \tau ,s),g_\varepsilon (\varepsilon ^\ell \tau ,s)\big )\right) ,\varepsilon \Big ). \end{aligned}$$

Observe that $\{\mathcal {F}_\varepsilon \}_\varepsilon $ is a family of $C^{r-\ell }$ functions that are also $\omega $-periodic in s and T-periodic in $\tau $, and that $M_\varepsilon $ is given by the relation $\textbf{x}= \mathcal {F}_\varepsilon (\tau ,s)$, i.e.,

$$\begin{aligned} M_\varepsilon = \{(\tau ,\mathcal {F}_\varepsilon (\tau ,s)) \in \mathbb {S}^1 \times \mathbb {R}^n: (\tau ,s) \in \mathbb {R}\times \mathbb {R}\}. \end{aligned}$$

Moreover, by Proposition 4, it follows that the family $\{\mathcal {F}_\varepsilon \}_\varepsilon $ is $C^0$-continuous, that is, continuous in the $C^0$-norm, provided that $\varepsilon _1$ is chosen sufficiently small. In fact, this Proposition guarantees that this family is $C^1$ in the $C^0$-norm.

Regarding the statement about convergence, observe that it follows from property (II) that there is $D^*(\varepsilon )$ such that

$$\begin{aligned} \left\| \mathcal {X}\left( s,\big (f_\varepsilon (\varepsilon ^\ell \tau ,s),g_\varepsilon (\varepsilon ^\ell \tau ,s)\big )\right) - \varphi (s)\right\| <D^*(\varepsilon ) \end{aligned}$$

and $\lim _{\varepsilon \rightarrow 0} D'(\varepsilon ) = 0$. Then, considering that U is locally Lipschitz in its second argument and that both functions appearing inside the norm of the inequality above are periodic, it follows that there is $\delta (\varepsilon )\ge 0$ such that $\delta (0)=0$ and $\Vert \mathcal {F}_\varepsilon (\tau ,s) - U(\tau ,\varphi (s),\varepsilon )\Vert <\delta (\varepsilon )$.

3.3 Stability

Let the non-negative integers $k_1\le n-d-1$ and $k_2\le d$ denote the number of eigenvalues with negative real parts of the matrices $R_1$ and $R_2$ respectively. Define the function $q_\varepsilon :\mathbb {R}\times \mathbb {R}\times B_{k_1}(0,r) \times B_{k_2}(0,r) \rightarrow \mathbb {R}\times \mathbb {R}^n$ by

$$\begin{aligned} q_\varepsilon (\tau ,s,\xi _1,\xi _2) = \Big ( \tau , U\Big (\tau ,\mathcal {X}\left( s,\big (\phi _1^\varepsilon (\varepsilon ^\ell \tau ,s,\xi _1,\xi _2),\xi _1,\phi _2^\varepsilon (\varepsilon ^\ell \tau ,s,\xi _1,\xi _2),\xi _2\big )\right) , \,\varepsilon \Big )\Big ). \end{aligned}$$

Let $S_{M_\varepsilon }$ be the image of $q_\varepsilon $. We will show that $S_{M_\varepsilon }$ is an embedded submanifold in $\mathbb {R}\times \mathbb {R}^n$.

For convenience, we denote by $q_\varepsilon |_{I}$ the restriction of $q_\varepsilon $ to the set $\mathbb {R}\times I \times B_{k_1}(0,r) \times B_{k_2}(0,r)$, where $I \subset \mathbb {R}$. Observe that the properties of $\phi _1^\varepsilon $ and $\phi _2^\varepsilon $ given in (VI.4), along with (18), ensure that $S_{M_\varepsilon }$ is contained in the image of $q_\varepsilon |_{[0,\, \omega )}$. Hence, $S_{M_\varepsilon }$ is contained in the union of the images of $q_\varepsilon |_{(0,\omega )}$ and $q_\varepsilon |_{(-\frac{\omega }{2},\frac{\omega }{2})}$.

Now, since $r\le \sigma _0\le \rho $, $\mathcal {X}(s,\textbf{h})$ is injective for $(s, \textbf{h}) \in [0,\omega ) \times {\bar{B}}_{n-1}(0,\sigma _0)$, ensuring that $q_\varepsilon |_{(0,\omega )}$ and $q_\varepsilon |_{(-\frac{\omega }{2},\frac{\omega }{2})}$ are injective. It is then easy to see that $q_\varepsilon |_{(0,\omega )}$ and $q_\varepsilon |_{(-\frac{\omega }{2},\frac{\omega }{2})}$ are homeomorphisms onto their images, proving that $S_{M_\varepsilon }$ is a $(k_1+k_2+2)$-dimensional embedded submanifold of $\mathbb {R}\times \mathbb {R}^n$. As remarked before, in Sect. 3.1, if k is the number of characteristic multipliers of the limit cycle $\Gamma $ whose absolute values are less than 1, then $k=k_1+k_2$. Thus, $S_{M_\varepsilon }$ is k-dimensional.

We will prove that $S_{M_\varepsilon }$ is locally the stable set of $M_\varepsilon $. Suppose that $\mathbf{h_0} \in S({\tilde{t}}_0,s_0,\varepsilon )$. For convenience, let us define $s_*({\tilde{t}}):= s({\tilde{t}},{\tilde{t}}_0,s_0,\mathbf{h_0},\varepsilon )$, $s_{M_\varepsilon }({\tilde{t}}):=s({\tilde{t}},{\tilde{t}}_0,s_0,(f_\varepsilon ({\tilde{t}}_0,s_0),g_\varepsilon ({\tilde{t}}_0,s_0)),\varepsilon )$, $\mathbf{h_*}({\tilde{t}}):=\textbf{h}({\tilde{t}},{\tilde{t}}_0,s_0,\mathbf{h_0},\varepsilon )$, and finally $\textbf{h}_{M_\varepsilon }({\tilde{t}}):=\textbf{h}({\tilde{t}},{\tilde{t}}_0,s_0,(f_\varepsilon ({\tilde{t}}_0,s_0),g_\varepsilon ({\tilde{t}}_0,s_0)),\varepsilon )$. Also, let

$$\begin{aligned} u({\tilde{t}}):=\Vert s_*({\tilde{t}}) - s_b({\tilde{t}})\Vert + \Vert \mathbf{h_*}({\tilde{t}}) - \textbf{h}_{M_\varepsilon }({\tilde{t}})\Vert . \end{aligned}$$

Observe that, considering (27), along with the boundedness and Lipschitz continuity properties that $\Lambda _0$ and $\tilde{\Lambda }_0$ are proved to satisfy, we have that

$$\begin{aligned} u({\tilde{t}}) \le \Vert \mathbf{h_*}({\tilde{t}}) + \textbf{h}_{M_\varepsilon }({\tilde{t}})\Vert +\int _{{\tilde{t}}_0}^{{\tilde{t}}} (\mathcal {M}_L \sigma _1 + \varepsilon M) u(x) dx. \end{aligned}$$

Thus, considering property (VI.2) and applying Grönwall’s inequality, it follows that

$$\begin{aligned} u({\tilde{t}}) \le C e^{(-\lambda +\mathcal {M}_L \sigma _1 + \varepsilon M) ({\tilde{t}}-{\tilde{t}}_0)} \Vert \mathbf{h_0}-(f_\varepsilon ({\tilde{t}}_0,s_0),g_\varepsilon ({\tilde{t}}_0,s_0))\Vert . \end{aligned}$$

Hence, if $\rho $ and $\varepsilon _1$ are chosen sufficiently small, we ensure that $u({\tilde{t}}) \rightarrow 0$ as ${\tilde{t}} \rightarrow \infty $. Thus, it follows that, if $\mathbf{h_0} \in S({\tilde{t}}_0,s_0,\varepsilon )$, then

$$\begin{aligned} \lim _{{\tilde{t}} \rightarrow \infty }\Vert \mathcal {X}(s_*({\tilde{t}}),\mathbf{h_*}({\tilde{t}})) - \mathcal {X}(s_{M_e}({\tilde{t}}),\textbf{h}_{M_\varepsilon }({\tilde{t}})) \Vert =0. \end{aligned}$$

(34)

Since $\sigma _1<\rho $, we know that $\mathcal {X}(s,\textbf{h})$ is injective for $(s, \textbf{h}) \in [0,\omega ) \times {\bar{B}}_{n-1}(0,\sigma _1)$. Consider the following neighborhoods of $M_\varepsilon $:

$$\begin{aligned} \begin{aligned} V_s:= \left\{ \left( \tau ,U\left( \tau ,\mathcal {X}(s,\textbf{h}),\varepsilon \right) \right) : (\tau ,s,\textbf{h}) \in \mathbb {R}\times \mathbb {R}\times B_{n-1}(0,\sigma _1) \right\} , \\ W_s:= \left\{ \left( \tau ,U\left( \tau ,\mathcal {X}(s,\textbf{h}),\varepsilon \right) \right) : (\tau ,s,\textbf{h}) \in \mathbb {R}\times \mathbb {R}\times B_{n-1}(0,\sigma _0) \right\} . \end{aligned} \end{aligned}$$

It is then clear, considering (34) and the fact that $M_\varepsilon $ is an invariant manifold, that the local stable set of $M_\varepsilon $ with respect to $V_s$ satisfies $\mathcal {S}^{V_s}_{M_\varepsilon } \cap W_s = S_{M_\varepsilon }$.

The same argument with time reversed proves the analogous statement for the local unstable set $\mathcal {U}^{V_u}_{M_\varepsilon } \cap W_u$. In this case, the dimension of the manifold obtained is $2+(n-1-k) = n-k+1$, because the number of eigenvalues of R with positive real part is $n-1-k$.

3.4 Dynamics

Let $S_\varepsilon \subset \mathbb {R}^{n+1}$ be defined as the section $\tau =0$ of the torus $M_\varepsilon $, that is, the image of the real 1-periodic function $\Pi _\varepsilon : \theta \mapsto (0,\mathcal {F}_\varepsilon (0,\omega \theta ))$. It is clear that $S_\varepsilon $ is $C^{r-\ell }$-diffeomorphic to the circle $S^1$. Once more, let $t \mapsto (s(t,t_0,s_0,\mathbf{h_0},\varepsilon ),\textbf{h}(t,t_0,s_0,\mathbf{h_0},\varepsilon ))$ be the solution of (25) satisfying $(s(t_0,t_0,s_0,\mathbf{h_0},\varepsilon ),\textbf{h}(t_0,t_0,s_0,\mathbf{h_0},\varepsilon )) = (s_0,\mathbf{h_0})$. Define, for $(\nu ,\theta ) \in \mathbb {R}\times \mathbb {R}$,

$$\begin{aligned} s_\varepsilon (\nu ,\theta ):=s(\nu T,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ). \end{aligned}$$

Since $\tau '=1$ in (2), it follows that the first-return map $p_\varepsilon $ defined on $S_\varepsilon $ under the action of this differential system is well defined. Moreover, it is clear that

$$\begin{aligned} p_\varepsilon \big (\Pi _\varepsilon (\theta ) \big ) = \Pi _\varepsilon \left( \frac{s_\varepsilon (1,\omega \theta )}{\omega }\right) . \end{aligned}$$

Thus, the real function

$$\begin{aligned} {\tilde{p}}_\varepsilon :\theta \mapsto \frac{s_\varepsilon (1,\omega \theta )}{\omega } \end{aligned}$$

is a lift of $p_\varepsilon $ with respect to the covering map $\Pi _\varepsilon :\mathbb {R}\rightarrow S_\varepsilon $. Moreover, this ensures that $p_\varepsilon $ is at least of class $C^{r-\ell }$.

Observe that

$$\begin{aligned} {\tilde{p}}_\varepsilon ^n(\theta ) = \frac{s_\varepsilon (n,\omega \theta )}{\omega } \end{aligned}$$

for all $n \in \mathbb {N}$. Then, it is clear that the rotation number of $p_\varepsilon $ is given by

$$\begin{aligned} \rho (\varepsilon ): = \lim _{n \rightarrow \infty } \frac{{\tilde{p}}_\varepsilon ^n(\theta )-\theta }{n} = \lim _{n \rightarrow \infty } \frac{s_\varepsilon (n,\omega \theta ) - \omega \theta }{n\omega }. \end{aligned}$$

We will rewrite this limit so as to be able to calculate it up to $\ell $-th order of $\varepsilon $.

Integrating the first equation of (26) from $t=0$ to $t=nT$, we obtain

$$\begin{aligned} s_\varepsilon (n,\theta )= & {} \theta + \varepsilon ^\ell nT + \varepsilon ^\ell \int _0^{nT} \Lambda _0\big (s(\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ),\nonumber \\{} & {} \textbf{h} (\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon )\big ) d\tau \nonumber \\{} & {} + \varepsilon ^{\ell +1} \int _0^{nT} \tilde{\Lambda }_0\big (\tau ,s(\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ),\nonumber \\{} & {} \textbf{h} (\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ),\varepsilon )\big ) d\tau . \end{aligned}$$

(35)

Thus, we can define the sequence of functions

$$\begin{aligned} \begin{aligned} G_n(\theta ,\varepsilon )&:=\frac{1}{n} \int _0^{nT} \frac{\Lambda _0}{\varepsilon }\big (s(\tau ,0,\theta (f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ),\textbf{h} (\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon )\big ) d\tau \\&\quad + \frac{1}{n}\int _0^{nT}\tilde{\Lambda }_0\big (\tau ,s(\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\textbf{h} (\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon )\big ) d\tau , \end{aligned} \end{aligned}$$

so that (35) becomes

$$\begin{aligned} s_\varepsilon (n,\theta ) = \theta + \varepsilon ^\ell nT + \varepsilon ^{\ell +1} n \, G_n(\theta ,\varepsilon ). \end{aligned}$$

Since $M_\varepsilon $ is an invariant manifold, it is clear that

$$\begin{aligned} \textbf{h} (\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon )= & {} (f_\varepsilon (\tau ,s(\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon )),\\{} & {} g_\varepsilon (\tau ,s(\tau ,0,\theta , (f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ))). \end{aligned}$$

Then, by changing the variables in the integral, it follows that

$$\begin{aligned} \begin{aligned}&G_n(\theta ,\varepsilon ):= \int _0^{T} \frac{\Lambda _0}{\varepsilon }\big (s(n\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ),\\&\textbf{h} (n\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon )\big ) d\tau \\&\quad + \int _0^{T}\tilde{\Lambda }_0\big (n\tau ,s(n\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ),\\&\textbf{h} (n\tau ,0,\theta ,(f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )),\varepsilon ),\varepsilon )\big ) d\tau . \end{aligned} \end{aligned}$$

Observe that

$$\begin{aligned} \Vert \Lambda _0(s,\textbf{h}) + \varepsilon \tilde{\Lambda }_0(t,s,\textbf{h},\varepsilon )\Vert\le & {} \Vert \Lambda _0(s,\textbf{h}) + \varepsilon \tilde{\Lambda }_0(t,s,\textbf{h},\varepsilon ) - \varepsilon \tilde{\Lambda }_0(t,s,0,\varepsilon )\Vert \nonumber \\{} & {} + \Vert \varepsilon \tilde{\Lambda }_0(t,s,0,\varepsilon )\Vert \end{aligned}$$

for all $(t,s,\textbf{h},\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times B_{n-1}(0,\rho ) \times (0,\varepsilon _0]$. Then, considering that $\Lambda (s,0)=0$, it follows from (29) and (31) that

$$\begin{aligned}{} & {} \Vert \Lambda _0(s,(f_\varepsilon (n\tau ,s),g_\varepsilon (n\tau ,s)))\nonumber \\{} & {} \quad + \tilde{\Lambda }_0(n\tau , s,(f_\varepsilon (n\tau ,s),g_\varepsilon (n\tau ,s)),\varepsilon )\Vert \le \mathcal {M}_L (\Vert f_\varepsilon \Vert + \Vert g_\varepsilon \Vert ) + \varepsilon M \end{aligned}$$

for all $(s,\tau ,\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times (0,\varepsilon _1]$. Then, from property (II), it follows that

$$\begin{aligned} \Vert \Lambda _0(s,(f_\varepsilon (n\tau ,s),g_\varepsilon (n\tau ,s))) + \tilde{\Lambda }_0(n\tau , s,(f_\varepsilon (n\tau ,s),g_\varepsilon (n\tau ,s)),\varepsilon )\Vert \le C_G \varepsilon , \end{aligned}$$

where $C_G:=2\mathcal {M}_L C_R M +M$. Hence, it is easy to see that

$$\begin{aligned} |G_n(\theta ,\varepsilon )| \le C_G T \end{aligned}$$

(36)

for all $\theta \in \mathbb {R}$ and all $\varepsilon \in (0,\varepsilon _1]$.

Considering that

$$\begin{aligned} \frac{s_\varepsilon (n,\omega \theta )-\omega \theta }{n \omega } = \varepsilon ^\ell \frac{T}{\omega } + \varepsilon ^{\ell +1} \frac{G_n(\omega \theta ,\varepsilon )}{\omega }, \end{aligned}$$

and since the limit

$$\begin{aligned} \lim _{n \rightarrow \infty } \frac{s_\varepsilon (n,\omega \theta )-\omega \theta }{n\omega } \end{aligned}$$

corresponding to the rotation number exists and does not depend on $\theta $, it is ensured that

$$\begin{aligned} G(\varepsilon ): = \lim _{n \rightarrow \infty } G_n(\omega \theta ,\varepsilon ) \end{aligned}$$

is well defined. Moreover, from (36), it is clear that $|G(\varepsilon )|\le C_G T$. Hence, it follows at once that

$$\begin{aligned} \rho (\varepsilon ) = \lim _{n \rightarrow \infty } \varepsilon ^\ell \frac{T}{\omega } + \varepsilon ^{\ell +1} \frac{G_n(\omega \theta ,\varepsilon )}{\omega } = \varepsilon ^\ell \frac{T}{\omega } + \varepsilon ^{\ell +1} \frac{G(\varepsilon )}{\omega } = \varepsilon ^\ell \frac{T}{\omega }+ \mathcal {O}(\varepsilon ^{\ell +1}). \end{aligned}$$

By Proposition 6 combined with the definition of ${\tilde{p}}_\varepsilon $, it follows that the family $\{{\tilde{p}}_\varepsilon \}_\varepsilon $ is continuous in the space of homeomorphisms of $\mathbb {S}^1$ with the $C^0$ topology. Hence, $\rho $ is continuous in $(0,\varepsilon _1]$. Since we also know that system (2) becomes $\tau '=1, \textbf{x}'=0$, when $\varepsilon =0$, it follows that $\rho (0)=0$, so that $\rho $ is actually continuous in $[0,\varepsilon _1]$. In particular, the relation $\omega \rho (\varepsilon ) = \varepsilon ^{\ell } T + \varepsilon ^{\ell +1} G(\varepsilon )$ ensures that G is also continuous in $(0,\varepsilon _1]$.

In order to prove the rest of the statement concerning Dynamics of Theorem A, we will make use of the following result, which can be found in [15, Theorem 6.1].

Theorem 2

Let $\gamma \ge 3$ and $D^\gamma (\mathbb {S}^1)$ be the class of $C^{\gamma }$-diffeomorphisms of the circle $\mathbb {S}^1$ endowed with the norm $C^\gamma $. Let $c:[a,b] \rightarrow D^\gamma (\mathbb {S}^1)$ be a continuous path satisfying: c is of class $C^1$ if considered as a function on $D^0(\mathbb {S}^1)$. Let $\rho (\lambda )$ denote the rotation number of $c(\lambda )$, $\lambda \in [a,b]$. If $\rho (a)\ne \rho (b)$, then the Lebesgue measure $\lambda $ of the set

$$\begin{aligned} \{x \in [a,b]: c(x) \;\text {is}\; C^{\gamma -2}\text {-conjugated to an irrational rotation}\} \end{aligned}$$

is strictly positive. Also, $\rho $ maps zero Lebesgue measure sets to zero Lebesgue measure sets.

Suppose that $r -\ell \ge 4$. Then, Propositions 4 and 6 ensure that $\varepsilon \mapsto {\tilde{p}}_\varepsilon $ satisfies the regularity conditions stated in Theorem 2 with $\gamma =r-\ell -1$. Moreover, from the fact that $\omega \rho (\varepsilon ) = \varepsilon ^\ell T + \mathcal {O}(\varepsilon ^{\ell +1})$, it is clear that there is an interval $[a,b] \subset (0,\varepsilon _1]$ such that $\rho (a) \ne \rho (b)$. Hence, there is a subset of $I\subset [a,b]$ of positive Lebesgue measure such that ${\tilde{p}}_\varepsilon $ is $C^{r-\ell -3}$-conjugated to an irrational rotation for all $\varepsilon \in I$.

4 Invariant torus in 4D vector fields

Consider the differential system (8) under the assumptions established in Sect. 1.3. By applying the cylindrical change of coordinates $(x,y,u,v)=(r\cos \theta ,r\sin \theta ,u,v)$, $r>0$, system (8) becomes

$$\begin{aligned} \dot{r}{} & {} = \varepsilon ^N \left( \cos (\theta ) f_1(r \cos (\theta ),r \sin (\theta ),u,v)+ \sin (\theta ) f_2(r \cos (\theta ),r \sin (\theta ),u,v) \right) \nonumber \\{} & {} \quad -\frac{\varepsilon ^{N+1}}{2}\, r^3 \mu \left( r^2-\left( r^2+1\right) \cos (2 \theta )-1\right) + \mathcal {O}(\varepsilon ^{N+2}), \nonumber \\ \dot{\theta }{} & {} = 1+ \varepsilon ^N \left( \frac{ \cos (\theta ) f_2(r \cos (\theta ),r \sin (\theta ),u,v)-\sin (\theta ) f_1(r \cos (\theta ),r \sin (\theta ),u,v)}{r}\right) \nonumber \\{} & {} \quad -\varepsilon ^{N+1} \mu \left( r^2\sin (\theta ) \cos (\theta ) + r^4 \sin (\theta ) \cos (\theta ) \right) + \mathcal {O}(\varepsilon ^{N+2}), \nonumber \\ \dot{u}{} & {} = \varepsilon ^N f_3(r \cos (\theta ),r \sin (\theta ),u,v) + \varepsilon ^{N+1} r^2 \cos ^2 (\theta ) (u-u^3+v-u v^2 ) + \mathcal {O}(\varepsilon ^{N+2}), \nonumber \\ \dot{v}{} & {} = \varepsilon ^N f_4(r \cos (\theta ),r \sin (\theta ),u,v) + \varepsilon ^{N+1} r^2 \sin ^2 (\theta ) (v-u-u^2v -v^3 ) + \mathcal {O}(\varepsilon ^{N+2}).\nonumber \\ \end{aligned}$$

(37)

Since $\dot{\theta }= 1 + \mathcal {O}(\varepsilon ^2)>0$, it follows that $\dot{\theta }>0$ for $\varepsilon $ sufficiently small. Thus, we can take $\theta $ to be the independent variable, and system (37) becomes

$$\begin{aligned} \begin{aligned}&r' = \varepsilon ^N R_N(\theta , r, u,v) + \varepsilon ^{N+1} R_{N+1} (\theta , r, u,v) + \mathcal {O}(\varepsilon ^{N+2}), \\&u' = \varepsilon ^N U_N(\theta ,r,u,v) + \varepsilon ^{N+1} U_{N+1} (\theta ,r,u,v) + \mathcal {O}(\varepsilon ^{N+2}), \\&v' = \varepsilon ^N V_N(\theta ,r,u,v) + \varepsilon ^{N+1} V_{N+1} (\theta ,r,u,v) + \mathcal {O}(\varepsilon ^{N+2}), \end{aligned} \end{aligned}$$

(38)

where $'$ indicates derivative with respect to the variable $\theta $, and the functions $R_i$, $U_i$, and $V_i$, $i \in \{N,N+1\}$, are given by

$$\begin{aligned} \begin{aligned} R_N(\theta ,r,u,v)&= \cos (\theta ) f_1(r \cos (\theta ),r \sin (\theta ),u,v)\\ {}&\quad +\sin (\theta ) f_2(r \cos (\theta ),r \sin (\theta ),u,v); \\ R_{N+1}(\theta ,r,u,v)&= \frac{1}{2} r^3 \mu \left( \left( r^2+1\right) \cos (2 \theta )-r^2+1\right) ; \\ U_N(\theta ,r,u,v)&= f_3(r \cos (\theta ),r \sin (\theta ),u,v); \\ U_{N+1}(\theta ,r,u,v)&= r^2 \cos ^2(\theta ) \left( -u^3-u v^2+u+v\right) ; \\ V_N(\theta , r,u,v)&= f_4(r \cos (\theta ),r \sin (\theta ),u,v); \\ V_{N+1}(\theta , r, u,v)&= -r^2 \sin ^2(\theta ) \left( u^2 v+u+v^3-v\right) . \end{aligned} \end{aligned}$$

(39)

We remark that each of the functions defined above is $2\pi $-periodic in $\theta $. By defining $\textbf{x}=(r,u,v)$, system (38) can be written as

$$\begin{aligned} \textbf{x}' = \varepsilon ^N F_N(\theta ,\textbf{x}) + \varepsilon ^{N+1} F_{N+1}(\theta ,\textbf{x}) + \varepsilon ^{N+2} {\tilde{F}} (\theta ,\textbf{x},\varepsilon ), \end{aligned}$$

(40)

where

$$\begin{aligned} F_i(\theta ,\textbf{x}) = (R_i(\theta ,\textbf{x}),U_i(\theta ,\textbf{x}),V_i(\theta ,\textbf{x})). \end{aligned}$$

(41)

Using formulas (5) and (6), we can calculate the Melnikov function of order N for this system as

$$\begin{aligned} \textbf{f}_N(\textbf{x}) = \int _0^{2\pi } F_N(s,\textbf{x}) ds. \end{aligned}$$

(42)

Since, by hypothesis, the average of functions $R_N$, $U_N$, and $V_N$ over $\theta \in [0,2\pi ]$ vanish identically, then it follows that $\textbf{f}_N=0$, so that formulas (5) and (6) provide

$$\begin{aligned} \begin{aligned} \textbf{f}_{N+1}(\textbf{x}) =&\int _0^{2\pi } F_{N+1}(s,\textbf{x}) ds \\ =&\left( \mu \frac{r^3}{2} \left( 1-r^2\right) , \frac{r^2}{2} \left( -u^3-u v^2+u+v\right) ,-\frac{r^2}{2} \left( u^2 v+u+v^3-v\right) \right) , \end{aligned}\nonumber \\ \end{aligned}$$

(43)

because $F_i=0$ for all $i \in \{1,2,\ldots ,N-1\}$ and $y_1=0$ in this case. Thus, since it is clear that $\textbf{f}_i =0$ for all $i \in \{1,2,\ldots ,N-1\}$, it follows from Proposition 1 that

$$\begin{aligned} \textbf{g}_{N+1} (\textbf{x}) = \frac{1}{2\pi } \textbf{f}_{N+1} (\textbf{x}). \end{aligned}$$

Let us prove that the guiding system $\textbf{x}' = {{{\textbf {g}}}}_{N+1}(\textbf{x})$ has a hyperbolic limit cycle. First, observe that the curve

$$\begin{aligned} \gamma (t)= \left( 1,\cos \left( \frac{t}{4\pi }\right) ,-\sin \left( \frac{t}{4\pi }\right) \right) \end{aligned}$$

satisfies

$$\begin{aligned} \gamma ' (t) = \left( 0, -\frac{1}{4\pi } \sin \left( \frac{t}{4\pi }\right) , -\frac{1}{4\pi } \cos \left( \frac{t}{4\pi }\right) \right) = {{{\textbf {g}}}}_{N+1}(\gamma (t)), \end{aligned}$$

and is therefore a $8\pi ^2$-periodic orbit of $\textbf{x}' = {{\textbf {g}}}_{N+1}(\textbf{x})$. Define $\Gamma $ as the image of $\gamma (t)$. Notice that $\Gamma =\{1\}\times \mathbb {S}^1$.

In order to show that $\Gamma $ is indeed a hyperbolic limit cycle, we shall find the eigenvalues of the Poincaré map P associated to it. Observe that

$$\begin{aligned} \text {div} \, {{{\textbf {g}}}}_{N+1} (\textbf{x}) = -\mu \frac{5r^4}{4\pi } +\frac{r^2}{4\pi } \left( 2+3\mu -4u^2-4v^2\right) . \end{aligned}$$

By [23, Corollary 12.5], we know that the determinant of the derivative of P at a point $\textbf{x}_0$ in the periodic orbit $\Gamma $ is equal to the determinant of the monodromy matrix associated to $\Gamma $. Thus, by Liouville’s formula, we have

$$\begin{aligned} \det (DP(\textbf{x}_0)) = \exp \int _0^{8\pi ^2} \text {div} \, {{\textbf {g}}}_{N+1} (\gamma (s)) \, ds = e^{-4\pi (1+\mu )}. \end{aligned}$$

(44)

Since the surface given by $r=1$ is an invariant manifold for this system, we can also study $\Gamma $ as a periodic orbit of the system $\textbf{x}' = {{{\textbf {g}}}}_{N+1}(\textbf{x})$ restricted to such surface, which is the planar system $(u',v') = \bar{{{{\textbf {g}}}}}_{n+1}(u,v)$, given by

$$\begin{aligned} \begin{aligned}&u' = \frac{1}{4\pi }(-u^3-uv^2+u+v), \\&v' = -\frac{1}{4\pi } (u^2 v+u+v^3-v). \end{aligned} \end{aligned}$$

(45)

Let L be the intersection of the surface $r=1$ with the transversal section corresponding to the Poincaré map P. Then, once again by [23, Corollary 12.5], the determinant of derivative of the restriction $P|_L$ at $\textbf{x}_0$ is given by

$$\begin{aligned} \det (D(P|_L)(\textbf{x}_0)) = \exp \int _0^{8\pi ^2} \text {div} \, \bar{{{{\textbf {g}}}}}_{N+1} \left( \cos \left( \frac{s}{4\pi }\right) ,-\sin \left( \frac{s}{4\pi }\right) \right) ds = e^{-4\pi }. \end{aligned}$$

Since $D(P|_L)(\textbf{x}_0)$ acts on a one-dimensional space, it follows that its eigenvalue is equal to $e^{-4\pi }$.

We have thus found one of the eigenvalues of $DP(\textbf{x}_0)$, to wit, $e^{-4\pi }<1$. In order to find the other, it suffices to notice that the determinant of $DP(\textbf{x}_0)$ must be equal to the product of its two eigenvalues. Therefore, it follows from (44) that the other eigenvalue is $e^{-4\pi \mu }\ne 1$. Hence, it follows that $\Gamma $ is a hyperbolic limit cycle and that the eigenvalues of the derivative of the Poincaré map associated to it are $\lambda _1=e^{-4\pi }$ and $\lambda _2=e^{-4\pi \mu }$.

Thus, Theorem A ensures that there is $\varepsilon _0>0$ such that, for each $\varepsilon \in [0,\varepsilon _0]$, system

$$\begin{aligned} \theta ' = 1, \quad \textbf{x}' = \varepsilon ^N F_N(\theta ,\textbf{x}) + \varepsilon ^{N+1} F_{N+1}(\theta ,\textbf{x}) + \varepsilon ^{N+2} {\tilde{F}} (\theta ,\textbf{x},\varepsilon ) \end{aligned}$$

admits an invariant torus $M_\varepsilon $ of class $C^{r-3}$. Moreover, $M_\varepsilon $ converges to $\mathbb {S}^1 \times \Gamma $ as $\varepsilon \rightarrow 0$. The stability of $M_\varepsilon $ is controlled by the parameter $\mu $. If $\mu =1$, then $M_\varepsilon $ is asymptotically stable, since $\mathcal {S}^{V_s}_{M_\varepsilon }$ locally becomes a neighborhood of $M_\varepsilon $. If, on the other hand, $\mu =-1$, then $\mathcal {S}^{V_s}_{M_\varepsilon }$ is locally a 3-dimensional manifold embedded in $\mathbb {R}^4$.

Transforming back to the original coordinates, we obtain, for each $\varepsilon \in [0,\varepsilon _0]$, an invariant torus $\mathbb {T}_\varepsilon $ converging as $\varepsilon \rightarrow 0$ to the torus $\mathbb {T}=\mathbb {S}^1\times \mathbb {S}^1$ parameterized by $(\theta , t)\in [0,2\pi ] \times [0,2\pi ] \mapsto \left( \cos \theta , \sin \theta , \cos t, -\sin t\right) $.

Data availability

All data generated or analysed during this study are included in this published article

References

Bogoliubov, N.N., Mitropolsky, Y.A.: Asymptotic methods in the theory of non-linear oscillations. Translated from the second revised Russian edition. International Monographs on Advanced Mathematics and Physics. Hindustan Publishing Corp., Delhi, Gordon and Breach Science Publishers, New York, (1961)
Bogolyubov, N.: O Nekotoryh Statističeskih Metodah v Matematičeskoĭ Fizike (On Some Statistical Methods in Mathematical Physics). Akademiya Nauk Ukrainskoĭ SSR, Kiev (1945)
Google Scholar
Buică, A., Llibre, J.: Averaging methods for finding periodic orbits via Brouwer degree. Bull. Sci. Math. 128(1), 7–22 (2004)
Article MathSciNet Google Scholar
Cândido, M.R., Novaes, D.D.: On the torus bifurcation in averaging theory. J. Diff. Equ. 268(8), 4555–4576 (2020)
Article MathSciNet Google Scholar
Chicone, C.: Ordinary differential equations with applications, volume 34 of Texts in Applied Mathematics. Springer, New York, second edition edition, (2006)
Comtet, L.: Advanced combinatorics. D. Reidel Publishing Co., Dordrecht, enlarged edition, The art of finite and infinite expansions (1974)
Culver, W.J.: On the existence and uniqueness of the real logarithm of a matrix. Proc. Am. Math. Soc. 17, 1146–1151 (1966)
Article MathSciNet Google Scholar
Fatou, P.: Sur le mouvement d’un système soumis à des forces à courte période. Bull. Soc. Math. France 56, 98–139 (1928)
Article MathSciNet Google Scholar
Gantmacher, F.R.: The theory of matrices. Vols. 1, 2. Chelsea Publishing Co., New York, Translated by K. A. Hirsch (1959)
Hájek, P., Johanis, M.: Smooth analysis in Banach spaces. De Gruyter Series in Nonlinear Analysis and Applications, vol. 19. De Gruyter, Berlin (2014)
Hale, J.K.: Integral manifolds of perturbed differential systems. Ann. Math. 2(73), 496–531 (1961)
Article MathSciNet Google Scholar
Hale, J.K.: Oscillations in Nonlinear Systems. McGraw-Hill Book Co., Inc, New York-Toronto-London (1963)
Google Scholar
Hale, J.K.: Ordinary Differential Equations, 2nd edn. Robert E. Krieger Publishing Co., Inc, Huntington, N.Y. (1980)
Google Scholar
Hartman, P.: Ordinary Differential Equations. Society for Industrial and Applied Mathematics, second edition, (2002)
Herman, M.-R.: Mesure de Lebesgue et nombre de rotation. In Geometry and topology (Proc. III Latin Amer. School of Math., Inst. Mat. Pura Aplicada CNPq, Rio de Janeiro, 1976), Lecture Notes in Math., Vol. 597, pp. 271–293. Springer, Berlin (1977)
Krylov, N., Bogolyubov, N.: Prilozhenie metodov nelineinoi mekhaniki k teorii statsionarnykh kolebanii (The Application of Methods of Nonlinear Mechanics to the Theory of Stationary oscillations). Kiev: Akademiya Nauk Ukrainskoĭ SSR, Kiev], (1934)
Llibre, J., Novaes, D.D., Teixeira, M.A.: Higher order averaging theory for finding periodic solutions via Brouwer degree. Nonlinearity 27, 563–583 (2014)
Article MathSciNet Google Scholar
Novaes, D.D.: An equivalent formulation of the averaged functions via Bell polynomials. In Extended abstracts Spring 2016—nonsmooth dynamics, volume 8 of Trends Math. Res. Perspect. CRM Barc., pp. 141–145. Birkhäuser/Springer, Cham, (2017)
Novaes, D.D.: An averaging result for periodic solutions of Carathéodory differential equations. Proc. Am. Math. Soc. (2021). https://doi.org/10.1090/proc/15810
Article Google Scholar
Novaes, D.D.: Higher order stroboscopic averaged functions: a general relationship with Melnikov functions. Electron. J. Qual. Theory Differ. Equ. 2021(77), 1–9 (2021)
Article MathSciNet Google Scholar
Novaes, D.D., Silva, F.B.: Higher order analysis on the existence of periodic solutions in continuous differential equations via degree theory. SIAM J. Math. Anal. 53(2), 2476–2490 (2021)
Article MathSciNet Google Scholar
Sanders, J.A., Verhulst, F., Murdock, J.A.: Averaging methods in nonlinear dynamical systems, volume 59. Springer, (2007)
Teschl, G.: Ordinary differential equations and dynamical systems. Graduate Studies in Mathematics, vol. 140. American Mathematical Society, Providence, RI (2012)
Verhulst, F.: Nonlinear differential equations and dynamical systems. Universitext. Springer-Verlag, Berlin, second edition edition, 1996. Translated from the 1985 Dutch original

Download references

Author information

Authors and Affiliations

Departamento de Matemática-Instituto de Matemática, Estatística e Computação Científica (IMECC), Universidade Estadual de Campinas (UNICAMP), Rua Sérgio Buarque de Holanda, 651, Cidade Universitária Zeferino Vaz, Campinas, SP, 13083-859, Brazil
Douglas D. Novaes & Pedro C. C. R. Pereira

Authors

Douglas D. Novaes
View author publications
You can also search for this author in PubMed Google Scholar
Pedro C. C. R. Pereira
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Pedro C. C. R. Pereira.

Ethics declarations

Conflict of interest

On behalf of all authors, the corresponding author states that there is no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

DDN is partially supported by São Paulo Research Foundation (FAPESP) grants 2022/09633-5, 2019/10269-3, and 2018/13481-0, and by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq) grants 438975/2018-9 and 309110/2021-1. PCCRP is supported by São Paulo Research Foundation (FAPESP) grant 2020/14232-4.

Appendix

1.1 Proof of Proposition 3

Let $\varepsilon \in (0,\varepsilon _1]$ be fixed throughout all the proof. We shall prove that $M_\varepsilon $ can be parameterized by a $C^p$ function $\alpha _\varepsilon (t,\theta )$. In fact, let $\tau \mapsto \varphi _\varepsilon (\tau ,t_0,\theta _0,y_0,z_0)$ be the flow of system (11) satisfying $\varphi _\varepsilon (0,t_0,\theta _0,y_0,z_0)=(t_0,\theta _0,y_0,z_0)$. Then, results about smooth dependence on initial conditions (see, for instance, [14, Corollary 4.1 of Chapter V]) ensure that $\varphi _\varepsilon $ is of class $C^p$. Define $\alpha _\varepsilon :\mathbb {R}\times \mathbb {R}\rightarrow \mathbb {R} \times \mathbb {R}\times B_m(0,\sigma _0) \times B_n(0,\sigma _0)$ by

$$\begin{aligned} \alpha _\varepsilon (t,\theta ): = \varphi _\varepsilon (t,0,\theta ,f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )). \end{aligned}$$

Observe that statement (e) of Lemma 1 guarantees that $\alpha _\varepsilon $ is of class $C^p$. Let us prove that $\alpha _\varepsilon $ is injective and that its image is $M_\varepsilon $.

In order to prove that $\alpha _\varepsilon $ is injective, let $(t_1,\theta _1), (t_2,\theta _2) \in \mathbb {R}\times \mathbb {R}$ be such that $\alpha _\varepsilon (t_1,\theta _1) = \alpha _\varepsilon (t_2,\theta _2)$. Define the functions $t_\varepsilon (\tau ,t_0,\theta _0,y_0,z_0)$, $\theta _\varepsilon (\tau ,t_0,\theta _0,y_0,z_0)$, $y_\varepsilon (\tau ,t_0,\theta _0,y_0,z_0)$, and $z_\varepsilon (\tau ,t_0,\theta _0,y_0,z_0)$ as being the components of the flow $\varphi _\varepsilon (\tau ,t_0,\theta _0,y_0,z_0)$. Then, it is clear by (11) that $t_\varepsilon (\tau ,t_0,\theta _0,y_0) = t_0 +\tau $. Hence, $\alpha (t_1,\theta _1) = \alpha (t_2,\theta _2)$ implies at once that $t_1+t_0=t_2+t_0$, that is, $t_1=t_2$. Therefore, the uniqueness of the flow $\varphi _\varepsilon $ ensures that the points $(0,\theta _1,f_\varepsilon (0,\theta _1),g_\varepsilon (0,\theta _1))$ and $(0,\theta _2,f_\varepsilon (0,\theta _2),g_\varepsilon (0,\theta _2))$ must be the same. Thus, $\theta _1=\theta _2$, and $\alpha _\varepsilon $ is indeed injective.

To show that the image of $\alpha _\varepsilon $ is $M_\varepsilon $, we first observe that, since $(0,\theta ,f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )) \in M_\varepsilon $ for all $\theta \in \mathbb {R}$ and $M_\varepsilon $ is invariant, it follows that $\alpha _\varepsilon (t,\theta ) = \varphi _\varepsilon (t,0,\theta , f_\varepsilon (0,\theta ),g_\varepsilon (0,\theta )) \in M_\varepsilon $ for all $(t,\theta ) \in \mathbb {R}\times \mathbb {R}$, i.e., the image of $\alpha _\varepsilon $ is contained in $M_\varepsilon $. On the other hand, every point in $M_\varepsilon $ is, by definition, of the form $(t,\theta ,f_\varepsilon (t,\theta ),g_\varepsilon (t,\theta ))$ for some $(t,\theta ) \in \mathbb {R}\times \mathbb {R}$. By properties of the flow, defining $\tilde{\theta }_\varepsilon =\theta _\varepsilon (-t,t,\theta ,f_\varepsilon (t,\theta ),g_\varepsilon (t,\theta ))$, we have

$$\begin{aligned} (t,\theta ,f_\varepsilon (t,\theta ),g_\varepsilon (t,\theta )) = \varphi _\varepsilon \big (t,0,\tilde{\theta }_\varepsilon ,f_\varepsilon (0,\tilde{\theta }_\varepsilon ),g_\varepsilon (0,\tilde{\theta }_\varepsilon )\big ) = \alpha _\varepsilon (t,\tilde{\theta }_\varepsilon ), \end{aligned}$$

which implies that $M_\varepsilon $ is contained in the image of $\alpha _\varepsilon $. Thus, we have proved that $\alpha _\varepsilon $ is an injective function of class $C^p$ whose image is $M_\varepsilon $ and, therefore, is a $C^p$ parametrization of $M_\varepsilon $. This ensures that $M_\varepsilon $ is of class $C^p$.

We shall now prove that $f_\varepsilon $ and $g_\varepsilon $ are of class $C^p$. In order to do so, we remark that $\big (t,\theta ,f_\varepsilon (t,\theta ),g_\varepsilon (t,\theta )) \in M_\varepsilon $ for every $(t,\theta )\in \mathbb {R}\times \mathbb {R}$. Then, for each $(t,\theta )\in \mathbb {R}\times \mathbb {R}$, there is $({\tilde{t}},\tilde{\theta }) \in \mathbb {R}\times \mathbb {R}$ such that

$$\begin{aligned}&\big (t,\theta ,f_\varepsilon (t,\theta ),g_\varepsilon (t,\theta ))= \alpha _\varepsilon ({\tilde{t}},\tilde{\theta }\big ) = \big ({\tilde{t}},\theta _\varepsilon ({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }), g_\varepsilon (0,\tilde{\theta })),y_\varepsilon ({\tilde{t}},0,\tilde{\theta },\nonumber \\&\quad f_\varepsilon (0,\tilde{\theta }), g_\varepsilon (0,\tilde{\theta })), z_\varepsilon ({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))\big ). \end{aligned}$$

(46)

Define the function $h({\tilde{t}},\tilde{\theta }) = \big ({\tilde{t}},\theta _\varepsilon ({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }), g_\varepsilon (0,\tilde{\theta }))\big )$. It is then clear that the inverse of h exists and is given by

$$\begin{aligned} h^{-1}(t,\theta ) = \big (t,\theta _\varepsilon (-t,t,\theta ,f_\varepsilon (t,\theta ),g_\varepsilon (t,\theta ))\big ). \end{aligned}$$

Now, by taking $({\tilde{t}}(t,\theta ), \tilde{\theta }(t,\theta )) = h^{-1}(t,\theta )$, we get from (46) that

$$\begin{aligned} \begin{aligned} f_\varepsilon (t,\theta )=y_\varepsilon ({\tilde{t}}(t,\theta ),0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }(t,\theta )),g_\varepsilon (0,\tilde{\theta }(t,\theta )),\\ g_\varepsilon (t,\theta )=z_\varepsilon ({\tilde{t}}(t,\theta ),0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }(t,\theta )),g_\varepsilon (0,\tilde{\theta }(t,\theta )). \end{aligned} \end{aligned}$$

Thus, since $y_\varepsilon $, $z_\varepsilon $, and $\theta \mapsto g_\varepsilon (0,\theta )$ are of class $C^p$, in order to prove that $f_\varepsilon $ and $g_\varepsilon $ are of class $C^p$ it only remains to show that $h^{-1}$ is of class $C^p$. First, observe that h is clearly of class $C^p$, because $\tilde{\theta } \mapsto f_\varepsilon (0,\tilde{\theta })$ and $\tilde{\theta } \mapsto g_\varepsilon (0,\tilde{\theta })$ are of class $C^p$ by statement (e) of Lemma 1. From the Inverse Function Theorem, it suffices then to prove that the derivative of h is non-singular at every point $(t,\theta ) \in \mathbb {R}\times \mathbb {R}$. Observe that

$$\begin{aligned} Dh({\tilde{t}},\tilde{\theta }) = \left[ \begin{array}{cc} 1 &{} 0 \\ 1+\zeta _0({\tilde{t}},\theta _\varepsilon ,f_\varepsilon ({\tilde{t}},\theta _\varepsilon ),g_\varepsilon ({\tilde{t}},\theta _\varepsilon ),\varepsilon ) &{} \frac{\partial \theta _\varepsilon }{\partial \theta _0} + \frac{\partial \theta _\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta }(0,\tilde{\theta }) + \frac{\partial \theta _\varepsilon }{\partial z_0} \cdot \frac{\partial g_\varepsilon }{\partial \theta }(0,\tilde{\theta }) \end{array}\right] , \end{aligned}$$

where the argument of $\theta _\varepsilon $ and its partial derivatives is $({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))$ and has been omitted for conciseness. Thus, $Dh({\tilde{t}},\tilde{\theta })$ is non-singular if, and only if,

$$\begin{aligned} \begin{aligned}&\frac{\partial \theta _\varepsilon }{\partial \theta _0}({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta })) \\ {}&\quad + \frac{\partial \theta _\varepsilon }{\partial y_0}({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta })) \cdot \frac{\partial f_\varepsilon }{\partial \theta }(0,\tilde{\theta }) \\ {}&\quad + \frac{\partial \theta _\varepsilon }{\partial z_0}({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta })) \cdot \frac{\partial g_\varepsilon }{\partial \theta }(0,\tilde{\theta }) \ne 0. \end{aligned} \end{aligned}$$

(47)

The matrix

$$\begin{aligned} \mathcal {M}(\tau ,t_0,\theta _0,y_0,z_0): = \left[ \begin{array}{cccc} \frac{\partial t_\varepsilon }{\partial t_0}&{} \frac{\partial t_\varepsilon }{\partial \theta _0} &{} \frac{\partial t_\varepsilon }{\partial y_0} &{} \frac{\partial t_\varepsilon }{\partial z_0}\\ \frac{\partial \theta _\varepsilon }{\partial t_0}&{} \frac{\partial \theta _\varepsilon }{\partial \theta _0} &{} \frac{\partial \theta _\varepsilon }{\partial y_0} &{} \frac{\partial \theta _\varepsilon }{\partial z_0} \\ \frac{\partial y_\varepsilon }{\partial t_0}&{} \frac{\partial y_\varepsilon }{\partial \theta _0} &{} \frac{\partial y_\varepsilon }{\partial y_0} &{} \frac{\partial y_\varepsilon }{\partial z_0} \\ \frac{\partial z_\varepsilon }{\partial t_0}&{} \frac{\partial z_\varepsilon }{\partial \theta _0} &{} \frac{\partial z_\varepsilon }{\partial y_0} &{} \frac{\partial z_\varepsilon }{\partial z_0} \end{array}\right] , \end{aligned}$$

where the argument of each entry is given by $(\tau ,t_0,\theta _0,y_0,z_0)$ is a fundamental solution of the first variational equation associated to (11). Thus, $\mathcal {M}(\tau ,t_0,\theta _0,y_0,z_0)$ is invertible. Moreover, since $t_\varepsilon (\tau ,t_0,\theta _0,y_0) = \tau +t_0$, it follows that

$$\begin{aligned} \mathcal {M}(\tau ,t_0,\theta _0,y_0,z_0) = \left[ \begin{array}{cccc} 1 &{}0 &{}0 &{}0\\ \frac{\partial \theta _\varepsilon }{\partial t_0}&{} \frac{\partial \theta _\varepsilon }{\partial \theta _0} &{} \frac{\partial \theta _\varepsilon }{\partial y_0} &{} \frac{\partial \theta _\varepsilon }{\partial z_0} \\ \frac{\partial y_\varepsilon }{\partial t_0}&{} \frac{\partial y_\varepsilon }{\partial \theta _0} &{} \frac{\partial y_\varepsilon }{\partial y_0} &{} \frac{\partial y_\varepsilon }{\partial z_0} \\ \frac{\partial z_\varepsilon }{\partial t_0}&{} \frac{\partial z_\varepsilon }{\partial \theta _0} &{} \frac{\partial z_\varepsilon }{\partial y_0} &{} \frac{\partial z_\varepsilon }{\partial z_0} \end{array}\right] , \end{aligned}$$

Hence, we conclude that

$$\begin{aligned} \mathcal {N}(\tau ,t_0,\theta _0,y_0,z_0):=\left[ \begin{array}{ccc} \frac{\partial \theta _\varepsilon }{\partial \theta _0}(\tau ,t_0,\theta _0,y_0,z_0) &{} \frac{\partial \theta _\varepsilon }{\partial y_0}(\tau ,t_0,\theta _0,y_0,z_0) &{} \frac{\partial \theta _\varepsilon }{\partial z_0}(\tau ,t_0,\theta _0,y_0,z_0)\\ \frac{\partial y_\varepsilon }{\partial \theta _0}(\tau ,t_0,\theta _0,y_0,z_0) &{} \frac{\partial y_\varepsilon }{\partial y_0}(\tau ,t_0,\theta _0,y_0,z_0) &{} \frac{\partial y_\varepsilon }{\partial z_0}(\tau ,t_0,\theta _0,y_0,z_0)\\ \frac{\partial z_\varepsilon }{\partial \theta _0}(\tau ,t_0,\theta _0,y_0,z_0) &{} \frac{\partial z_\varepsilon }{\partial y_0}(\tau ,t_0,\theta _0,y_0,z_0) &{} \frac{\partial z_\varepsilon }{\partial z_0}(\tau ,t_0,\theta _0,y_0,z_0) \end{array}\right] \end{aligned}$$

is invertible for all $(\tau ,t_0,\theta _0,y_0,z_0) \in [-\Omega ,\Omega ] \times \mathbb {R}\times \mathbb {R} \times B_m(0,\rho ) \times B_m(0,\rho )$, where $[-\Omega ,\Omega ]$ is the maximal interval where the flow is defined. In particular, if $t_0=0$, $\theta _0=\tilde{\theta }$, $y_0 =f_\varepsilon (0,\tilde{\theta })$, and $z_0=g_\varepsilon (0,\tilde{\theta })$, then the flow is defined for all $\tau \in \mathbb {R}$, and it follows that $\mathcal {N}({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))$ is invertible for all $({\tilde{t}},\tilde{\theta }) \in \mathbb {R}\times \mathbb {R}$. Thus, the product

$$\begin{aligned} \begin{aligned} \mathcal {N}({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta })) \cdot \left[ \begin{array}{c} 1 \\ \frac{\partial f_\varepsilon }{\partial \theta }(0,\tilde{\theta }) \\ \frac{\partial g_\varepsilon }{\partial \theta }(0,\tilde{\theta }) \end{array}\right] = \left[ \begin{array}{c} \frac{\partial \theta _\varepsilon }{\partial \theta _0} + \frac{\partial \theta _\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial \theta _\varepsilon }{\partial z_0}\cdot \frac{\partial g_\varepsilon }{\partial \theta }\\ \frac{\partial y_\varepsilon }{\partial \theta _0} + \frac{\partial y_\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial y_\varepsilon }{\partial z_0}\cdot \frac{\partial g_\varepsilon }{\partial \theta } \\ \frac{\partial z_\varepsilon }{\partial \theta _0} + \frac{\partial z_\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial z_\varepsilon }{\partial z_0}\cdot \frac{\partial g_\varepsilon }{\partial \theta } \end{array}\right] \end{aligned} \end{aligned}$$

cannot vanish, where the arguments of the derivatives of $\theta _\varepsilon $, $y_\varepsilon $ and $z_\varepsilon $, as well as the arguments of the derivatives of $f_\varepsilon $ and $g_\varepsilon $, have been omitted, but should be read, respectively, as $({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))$ and $(0,\tilde{\theta })$. Suppose, by contradiction, that (47) does not hold at $({\tilde{t}}_*,\tilde{\theta }_*) \in \mathbb {R}^2$, so that the first line of product above vanishes. Let us show that this implies that the product vanishes altogether.

In fact, observe that the invariance of $M_\varepsilon $ ensures that, for all $({\tilde{t}},\tilde{\theta }) \in \mathbb {R}^2$,

$$\begin{aligned} \begin{aligned} y_\varepsilon ({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))= f_\varepsilon ({\tilde{t}},\theta _\varepsilon ({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))),\\ z_\varepsilon ({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))= g_\varepsilon ({\tilde{t}},\theta _\varepsilon ({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))). \end{aligned} \end{aligned}$$

Thus, differentiating with respect to $\tilde{\theta }$, it follows that

$$\begin{aligned} \begin{aligned} \frac{\partial y_\varepsilon }{\partial \theta _0} + \frac{\partial y_\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial y_\varepsilon }{\partial z_0} \cdot \frac{\partial g_\varepsilon }{\partial \theta } = \frac{\partial f_\varepsilon }{\partial \theta } \left[ \frac{\partial \theta _\varepsilon }{\partial \theta _0} + \frac{\partial \theta _\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial \theta _\varepsilon }{\partial z_0} \cdot \frac{\partial g_\varepsilon }{\partial \theta }\right] \end{aligned} \end{aligned}$$

and

$$\begin{aligned} \begin{aligned} \frac{\partial z_\varepsilon }{\partial \theta _0} + \frac{\partial z_\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial z_\varepsilon }{\partial z_0} \cdot \frac{\partial g_\varepsilon }{\partial \theta } = \frac{\partial g_\varepsilon }{\partial \theta } \left[ \frac{\partial \theta _\varepsilon }{\partial \theta _0} + \frac{\partial \theta _\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial \theta _\varepsilon }{\partial z_0} \cdot \frac{\partial g_\varepsilon }{\partial \theta }\right] \end{aligned} \end{aligned}$$

where, once again, the arguments of the derivatives of $\theta _\varepsilon $, $y_\varepsilon $ and $z_\varepsilon $, as well as the arguments of the derivatives of $f_\varepsilon $ and $g_\varepsilon $, have been omitted, but should be read, respectively, as $({\tilde{t}},0,\tilde{\theta },f_\varepsilon (0,\tilde{\theta }),g_\varepsilon (0,\tilde{\theta }))$ and $(0,\tilde{\theta })$.

Since we assumed that (47) does not hold at $({\tilde{t}}_*,\tilde{\theta }_*)$, it follows at once that

$$\begin{aligned} \left[ \begin{array}{c} \frac{\partial \theta _\varepsilon }{\partial \theta _0} + \frac{\partial \theta _\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial \theta _\varepsilon }{\partial z_0}\cdot \frac{\partial g_\varepsilon }{\partial \theta }\\ \frac{\partial y_\varepsilon }{\partial \theta _0} + \frac{\partial y_\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial y_\varepsilon }{\partial z_0}\cdot \frac{\partial g_\varepsilon }{\partial \theta } \\ \frac{\partial z_\varepsilon }{\partial \theta _0} + \frac{\partial z_\varepsilon }{\partial y_0} \cdot \frac{\partial f_\varepsilon }{\partial \theta } +\frac{\partial z_\varepsilon }{\partial z_0}\cdot \frac{\partial g_\varepsilon }{\partial \theta } \end{array}\right] = 0 \end{aligned}$$

when the arguments of the derivatives of $\theta _\varepsilon $, $y_\varepsilon $ and $z_\varepsilon $ are given by $({\tilde{t}}_*,0,\tilde{\theta }_*,f_\varepsilon (0,\tilde{\theta }_*),g_\varepsilon (0,\tilde{\theta }_*))$ and the arguments of the derivatives of $f_\varepsilon $ and $g_\varepsilon $ are given by $(0,\tilde{\theta }_*)$. Therefore,

$$\begin{aligned} \mathcal {N}({\tilde{t}}_*,0,\tilde{\theta }_*,f_\varepsilon (0,\tilde{\theta }_*),g_\varepsilon (0,\tilde{\theta }_*)) \cdot \left[ \begin{array}{c} 1 \\ \frac{\partial f_\varepsilon }{\partial \theta }(0,\tilde{\theta }_*) \\ \frac{\partial g_\varepsilon }{\partial \theta }(0,\tilde{\theta }_*) \end{array}\right] = 0. \end{aligned}$$

As remarked above, this would imply that $\mathcal {N}({\tilde{t}}_*,0,\tilde{\theta }_*,f_\varepsilon (0,\tilde{\theta }_*),g_\varepsilon (0,\tilde{\theta }_*))$, which we have proved to be invertible, is not invertible. Since we have reached a contradiction, it is proved that $h^{-1}$ is indeed of class $C^p$, which concludes the proof of the proposition.

1.2 Proof of Proposition 4

It is not difficult to see that we can assume without loss of generality that the function L appearing in hypothesis (ii) of the Lemma satisfies: $L(\varepsilon ,\sigma ,\mu )\ge \varepsilon $ for all $(\varepsilon ,\sigma ,\mu ) \in (0,\varepsilon _0] \times [0,\rho _1) \times [0,\rho _2)$.

Let $D(\varepsilon )$, $\Delta (\varepsilon )$, $J_1$, $J_2$ be given as in the proof of Lemma 1. It is clear that there are $K>0$ and $\alpha >0$ such that $\Vert J_i(t)\Vert \le K e^{-\alpha |t|}$ for all $i \in \{1,2\}.$ By choosing $\varepsilon _1$ to be sufficiently small, we can then ensure that the following inequalities hold for all $\varepsilon \in (0,\varepsilon _1]$:

$\Delta (\varepsilon )<\frac{1}{2}$ and $D(\varepsilon ) < \rho :=\min (\rho _1,\rho _2)$;
$32L(\varepsilon ,D(\varepsilon ),D(\varepsilon )) <\alpha ;$
$64 K L(\varepsilon ,D(\varepsilon ),D(\varepsilon ))<\alpha $.

Let $\mathcal {P}_\omega (D,\Delta )$, $\mathcal {A}_\omega (D,\Delta )$, and the operator $S^\varepsilon $ be given as in the proof of Lemma 1. For each $\varepsilon \in (0,\varepsilon _1]$, define the sequence $(P_k, A_k)_{k \in \mathbb {N}}$, where $ P_k: (0,\varepsilon _1) \rightarrow \mathcal {P}_\omega (D,\Delta )$ and $ A_k: (0,\varepsilon _1) \rightarrow \mathcal {A}_\omega (D,\Delta )$ are functions of class $C^1$ given by:

$(P_0(\varepsilon ),A_0(\varepsilon )) = (0,0)$ for all $\varepsilon \in (0,\varepsilon _1)$;
$(P_{k+1}(\varepsilon ),A_{k+1}(\varepsilon )) = S^\varepsilon (P_k(\varepsilon ),A_k(\varepsilon ))$ for all $k \in \mathbb {N}$ and all $\varepsilon \in (0,\varepsilon _1)$.

From Lemma 1, it is clear that this sequence satisfies

$$\begin{aligned} \lim _{k \rightarrow \infty } (P_k(\varepsilon ),A_k(\varepsilon )) = (f_\varepsilon ,g_\varepsilon ). \end{aligned}$$

(48)

Let $a,b \in (0,\varepsilon _1)$ be fixed. Effecting cumbersome calculations, which are very similar to those presented in the proofs of Propositions 5 and 6, and for this reason are omitted, we can show that the following hold for all $\varepsilon \in [a,b]$ and all $k \in \mathbb {N}$:

(1)
$\Vert P_{k+2}(\varepsilon ) - P_{k+1}(\varepsilon ) \Vert + \Vert A_{k+2}(\varepsilon ) - A_{k+1}(\varepsilon ) \Vert \le \frac{1}{4} \big [\Vert P_{k+1}(\varepsilon ) - P_k(\varepsilon ) \Vert + \Vert A_{k+1}(\varepsilon ) - A_k(\varepsilon ) \Vert \big ].$
(2)
There is $C_1>0$ such that
$$\begin{aligned} \begin{aligned}&\left\| \frac{\partial P_{k+2}(\varepsilon )}{\partial \theta } -\frac{\partial P_{k+1}(\varepsilon )}{\partial \theta }\right\| + \left\| \frac{\partial A_{k+2}(\varepsilon )}{\partial \theta } -\frac{\partial A_{k+1}(\varepsilon )}{\partial \theta }\right\| \\&\quad \le C_1 \left[ \Vert P_{k+1}(\varepsilon ) - P_k(\varepsilon ) \Vert + \Vert A_{k+1}(\varepsilon ) -A_k(\varepsilon ) \Vert \right] \\&\qquad +\frac{1}{4}\left[ \left\| \frac{\partial P_{k+1}(\varepsilon )}{\partial \theta } -\frac{\partial P_{k}(\varepsilon )}{\partial \theta }\right\| + \left\| \frac{\partial A_{k+1}(\varepsilon )}{\partial \theta } -\frac{\partial A_{k}(\varepsilon )}{\partial \theta }\right\| \right] . \end{aligned} \end{aligned}$$
(3)
There is $C_2>0$ such that
$$\begin{aligned} \begin{aligned}&\left\| P'_{k+2}(\varepsilon ) -P'_{k+1}(\varepsilon )\right\| + \left\| A'_{k+2}(\varepsilon ) -A'_{k+1}(\varepsilon )\right\| \\&\quad \le C_2 \Bigg [\Vert P_{k+1}(\varepsilon ) - P_k(\varepsilon ) \Vert + \Vert A_{k+1}(\varepsilon ) - A_k(\varepsilon ) \Vert \\&\qquad +\left. \left\| \frac{\partial P_{k+1}(\varepsilon )}{\partial \theta } -\frac{\partial P_{k}(\varepsilon )}{\partial \theta }\right\| + \left\| \frac{\partial A_{k+1}(\varepsilon )}{\partial \theta } -\frac{\partial A_{k}(\varepsilon )}{\partial \theta }\right\| \right] \\&\qquad + \frac{1}{4}\left[ \left\| P'_{k+1}(\varepsilon ) -P'_{k}(\varepsilon )\right\| + \left\| A'_{k+1}(\varepsilon ) -A'_{k}(\varepsilon )\right\| \right] . \end{aligned} \end{aligned}$$

From those inequalities, it follows easily that there is $C>0$ such that

$$\begin{aligned} \sup _{\varepsilon \in [a,b]}\left\| P'_{k+1}(\varepsilon ) -P'_{k}(\varepsilon )\right\| + \left\| A'_{k+1}(\varepsilon ) -A'_{k}(\varepsilon )\right\| \le \frac{C}{2^k} \end{aligned}$$

for all $k \in \mathbb {N}$. Hence, the sequence $(P'_k\,,A'_k)_{k \in \mathbb {N}}$ converges uniformly on [a, b]. Since a and b were arbitrary, this implies that $(P'_k,A'_k)_{k \in \mathbb {N}}$ converges uniformly on compact subsets of $(0,\varepsilon _1)$. Therefore, considering (48), it follows that the function c given in the statement of this Lemma is of class $C^1$ (see, for instance, [10, Theorem 85, Chapter 1]), concluding the proof.

1.3 Proof of Proposition 5

Let $D(\varepsilon )$, $\Delta (\varepsilon )$, $J_1$, $J_2$, and $T_{F,G}$ be given as in the proof of Lemma 1. It is clear that there are $K>0$ and $\alpha >0$ such that $\Vert J_i(t)\Vert \le K e^{-\alpha |t|}$ for all $i \in \{1,2\}.$ For convenience, we will denote $L(\varepsilon ,D(\varepsilon ),D(\varepsilon ))$ by $L(\varepsilon )$ throughout the proof. As in Proposition 4, we assume that the function L appearing in hypothesis (ii) of the Lemma satisfies: $L(\varepsilon ,\sigma ,\mu )\ge \varepsilon $ for all $(\varepsilon ,\sigma ,\mu ) \in (0,\varepsilon _0] \times [0,\rho _1) \times [0,\rho _2)$.

Let $(P_k,A_k)_{k \in \mathbb {N}}$ be the sequence defined in Proposition 4. For convenience, define

$T_{k}^{x,t,\varepsilon }(\theta ): = T_{P_k(\varepsilon ),A_k(\varepsilon )}(t+x,t,\theta ,\varepsilon )$;
$\Lambda _k^{x,t,\varepsilon }(\theta )=(t+x,\theta ,P_k(\varepsilon )(t+x,\theta ),A_k(\varepsilon )(t+x,\theta ),\varepsilon )$;
$\zeta _{i,k}^{x,t,\varepsilon }(\theta ):=\zeta _i(t+x,\theta ,P_k(\varepsilon )(t+x,\theta ),A_k(\varepsilon )(t+x,\theta ),\varepsilon ) = \zeta _i \circ \Lambda _k^{x,t,\varepsilon }(\theta )$.

We will prove by induction on q that, if $\varepsilon _1$ is sufficiently small, the following hold for each $q \in \{1,\ldots , p+1\}$:

(P1.)
There is $N_q \in \mathbb {N}$ and, for each $[a,b] \subset (0,\varepsilon _1]$, there is $C_{0,q}>0$ such that
$$\begin{aligned} \left| \left( T_k^{x,t,\varepsilon }\right) ^{(q)}(\theta ) \right| \le C_{0,q}\, e^{N_{q} L(\varepsilon )(1+2\Delta (\varepsilon )) |x|}, \end{aligned}$$
for all $k \in \mathbb {N}$ and all $(x,t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times [a,b]$.
(P2.)
For each $[a,b] \subset (0,\varepsilon _1]$, there is $C_{1,q}>0$ such that
$$\begin{aligned} \left\| \frac{\partial ^q P_k(\varepsilon )}{\partial \theta ^q} \right\| + \left\| \frac{\partial ^q A_k(\varepsilon )}{\partial \theta ^q} \right\| \le C_{1,q}, \end{aligned}$$
for all $k \in \mathbb {N}$ and all $\varepsilon \in [a,b]$.

Let us then consider the case $q=1$. Observe that, from the definition of $T_k^{x,t,\varepsilon }$, it follows that

$$\begin{aligned} \begin{aligned}&\frac{\partial }{\partial \theta } \left( \frac{\partial T_{P_k(\varepsilon ),A_k(\varepsilon )}}{\partial x}\right) (t+x,t,\theta ,\varepsilon ) \\&\quad = D\zeta _0 \left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \Lambda _k^{x,t,\varepsilon }\right) ' \left( T_k^{x,t,\varepsilon }(\theta )\right) \cdot \left( T_k^{x,t,\varepsilon }\right) '(\theta ). \end{aligned} \end{aligned}$$

(49)

Hence, we obtain by changing order of derivatives and integrating

$$\begin{aligned} \left| \left( T_k^{x,t,\varepsilon }\right) '(\theta ) - \left( T_k^{0,t,\varepsilon }\right) '(\theta ) \right| \le \int _{0}^x L(\varepsilon ) (1+2\Delta (\varepsilon )) \left| \left( T_k^{x,t,\varepsilon }\right) '(\theta ) \right| dx. \end{aligned}$$

Since

$$\begin{aligned} \left( T_k^{0,t,\varepsilon }\right) '(\theta ) =1, \end{aligned}$$

it follows by an application of Grönwall’s inequality that

$$\begin{aligned} \left| \left( T_k^{x,t,\varepsilon }\right) '(\theta ) \right| \le e^{L(\varepsilon )(1+2\Delta (\varepsilon ))|x|}. \end{aligned}$$

(50)

This proves property (P1.). Property (P2.) follows directly, with $C_{1,1} = \sup _{\varepsilon \in [a,b]} \Delta (\varepsilon )$, from the fact that $(P_k,A_k) \in \mathcal {P}_\omega (D(\varepsilon ),\Delta (\varepsilon )) \times \mathcal {A}_\omega (D(\varepsilon ),\Delta (\varepsilon ))$ for all $k \in \mathbb {N}$.

Let $N \in \{2,\ldots ,p+1\}$ be given and suppose that the Lemma is true for every $q \in \mathbb {N}$ such that $1\le q \le N-1$. We will show that the Lemma also holds for $q=N$.

Henceforth, we will employ the index i to denote any element of the set $\{0,1,2\}$, since the considerations done below are the same. By Faà di Bruno’s formula, since $\zeta _{i,k}^{\varepsilon ,x,t} =\zeta _i \circ \Lambda _k^{x,t,\varepsilon }$, it follows that

$$\begin{aligned} \begin{aligned} \left( \zeta _{i,k}^{x,t,\varepsilon }\right) ^{(q)} (\theta ) = \sum _{j=1}^{q} D^{(j)} \zeta _i(\Lambda _k^{x,t,\varepsilon }(\theta )) \cdot B_{q,j} \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '(\theta ),\ldots , \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(q-j+1)}(\theta )\right) , \end{aligned} \end{aligned}$$

for each $q \in \{1,\ldots ,p+1\}$, where $B_{q,j}$ is a Bell polynomial. Observe that $D^{(j)} \zeta _i(\Lambda _k^{x,t,\varepsilon }(\theta ))$ is a symmetric multilinear map that can be thought of as being applied to a “product" of vectors. Its application to a polynomial is simply a linear combination of different applications to such “products". In particular, for $q=N$, we can write

$$\begin{aligned} \begin{aligned} \left( \zeta _{i,k}^{x,t,\varepsilon }\right) ^{(N)} (\theta )&= D^{(N)} \zeta _i(\Lambda _k^{x,t,\varepsilon }(\theta )) \cdot \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '(\theta )\right) ^{N} \\&\quad +D\zeta _i(\Lambda _k^{x,t,\varepsilon }(\theta )) \cdot \left( \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(N)}(\theta )\right) \\&\quad + \sum _{j=2}^{N-1} D^{(j)} \zeta _i(\Lambda _k^{x,t,\varepsilon }(\theta )) \cdot B_{N,j} \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '(\theta ),\ldots , \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(N-j+1)}(\theta )\right) . \end{aligned} \end{aligned}$$

By the same formula, we also have:

$$\begin{aligned} \begin{aligned} \left( \zeta _{i,k}^{x,t,\varepsilon } \left( T_k^{x,t,\varepsilon } (\theta )\right) \right) ^{(N)}&= \left( \zeta _{i,k}^{x,t,\varepsilon }\right) ^{(N)} \left( T_k^{x,t,\varepsilon }(\theta )\right) \cdot \left( \left( T^{x,t,\varepsilon }_k\right) '(\theta )\right) ^{N} \\&\quad + \left( \zeta _{i,k}^{x,t,\varepsilon }\right) '\left( T_k^{x,t,\varepsilon }(\theta )\right) \cdot \left( \left( T_k^{x,t,\varepsilon }\right) ^{(N)}(\theta )\right) \\&\quad + \sum _{j=2}^{N-1} \left( \zeta _{i,k}^{x,t,\varepsilon }\right) ^{(j)} \left( T_k^{x,t,\varepsilon }(\theta )\right) \cdot \\&B_{N,j}\left( \left( T_k^{x,t,\varepsilon }\right) '(\theta ),\ldots ,\left( T_k^{x,t,\varepsilon }\right) ^{(N-j+1)}(\theta )\right) . \end{aligned} \end{aligned}$$

Thus, it follows that

$$\begin{aligned}{} & {} \left( \zeta _{i,k}^{x,t,\varepsilon } \left( T_k^{x,t,\varepsilon } (\theta )\right) \right) ^{(N)}\nonumber \\{} & {} \quad = D^{(N)} \zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) ^{N} \cdot \left( \left( T^{x,t,\varepsilon }_k\right) '(\theta )\right) ^{N} \nonumber \\{} & {} \qquad +D\zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(N)}\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( T^{x,t,\varepsilon }_k\right) '(\theta )\right) ^{N}\nonumber \\{} & {} \qquad + \sum _{j=2}^{N-1} D^{(j)} \zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \nonumber \\{} & {} \qquad \cdot B_{N,j} \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '\left( T_k^{x,t,\varepsilon }(\theta )\right) ,\ldots , \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(N-j+1)}\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \left( \left( T^{x,t,\varepsilon }_k\right) '(\theta )\right) ^{N}\nonumber \\{} & {} \qquad + D\zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( T_k^{x,t,\varepsilon }\right) ^{(N)}(\theta )\right) \nonumber \\{} & {} \qquad + \sum _{j=2}^{N-1} \sum _{l=1}^j \left[ D^{(l)}\zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \right. \nonumber \\{} & {} \left. \qquad \cdot B_{j,l} \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '\left( T_k^{x,t,\varepsilon }(\theta )\right) ,\ldots , \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(j-l+1)}\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \right] \nonumber \\{} & {} \qquad \times B_{N,j}\left( \left( T_k^{x,t,\varepsilon }\right) '(\theta ),\ldots ,\left( T_k^{x,t,\varepsilon }\right) ^{(N-j+1)}(\theta )\right) .\nonumber \\ \end{aligned}$$

(51)

For simplicity, we will denote the summands on the right-hand side of this equation by I, II, III, IV, and V, respectively.

By definition of $T_k^{x,t,\varepsilon } (\theta )$, it follows that

$$\begin{aligned} \begin{aligned} \frac{\partial ^{N}}{\partial \theta ^{N}} \left( \frac{\partial T_{P_k(\varepsilon ),A_k(\varepsilon )}}{\partial x}\right) (t+x,t,\theta ,\varepsilon ) = \left( \zeta _{0,k}^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) ^{(N)}. \end{aligned} \end{aligned}$$

(52)

Observe that, for $q \in \{1,\ldots ,p+1\}$,

$$\begin{aligned} \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(q)}(\theta ) = \left( 0,\delta _{1q},\frac{\partial ^q P_k(\varepsilon )}{\partial \theta ^q}(t+x,\theta ),\frac{\partial ^q A_k(\varepsilon )}{\partial \theta ^q}(t+x,\theta ),0\right) , \end{aligned}$$

where $\delta _{ij}$ is the Kronecker delta. Thus, since $N\ge 2$, it follows that

$$\begin{aligned}{} & {} \left\| D\zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda _k^{x,t,\varepsilon }\right) ^{(N)}\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \right\| \\{} & {} \quad \le L(\varepsilon ) \left[ \left\| \frac{\partial ^{N} P_k(\varepsilon )}{\partial \theta ^{N}} \right\| + \left\| \frac{\partial ^{N} A_k(\varepsilon )}{\partial \theta ^{N}} \right\| \right] , \end{aligned}$$

which, combined with (50), ensures that

$$\begin{aligned} \Vert II\Vert \le L(\varepsilon ) \left[ \left\| \frac{\partial ^{N} P_k(\varepsilon )}{\partial \theta ^{N}} \right\| + \left\| \frac{\partial ^{N} A_k(\varepsilon )}{\partial \theta ^{N}} \right\| \right] e^{NL(\varepsilon )(1+2\Delta (\varepsilon ))|x|}. \end{aligned}$$

(53)

Moreover, we also have

$$\begin{aligned}{} & {} \left\| D\zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda _k^{x,t,\varepsilon }\right) '\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \right\| \\{} & {} \quad \le L(\varepsilon ) \left[ 1 + \left\| \frac{\partial P_k(\varepsilon )}{\partial \theta } \right\| + \left\| \frac{\partial A_k(\varepsilon )}{\partial \theta } \right\| \right] , \end{aligned}$$

so that

$$\begin{aligned} \Vert IV\Vert \le L(\varepsilon ) (1+2\Delta (\varepsilon )) \left( T_k^{x,t,\varepsilon }\right) ^{(N)}(\theta ). \end{aligned}$$

(54)

Observe that, since $\zeta _i$ is of class $C^{p+1}$ and periodic in its first two entries, there is $C_\zeta >0$ such that

$$\begin{aligned} \left\| D^{(q)} \zeta _i\left( \Lambda _k^{x,t,\varepsilon }\left( T_k^{x,t,\varepsilon }(\theta )\right) \right) \right\| \le C_\zeta \end{aligned}$$

for all $q \in \{1,\ldots ,p+1\}$, all $k \in \mathbb {N}$, and all $(x,t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times [a,b]$. Thus, considering the hypothesis of induction, it follows that there are ${\tilde{C}}>0$ and ${\tilde{N}} \in \mathbb {N}$, where ${\tilde{C}}$ depends on the choice of the interval [a, b] but ${\tilde{N}}$ does not, such that

$$\begin{aligned} \Vert I\Vert +\Vert III\Vert +\Vert V\Vert \le {\tilde{C}} e^{{\tilde{N}}L(\varepsilon )(1+2\Delta (\varepsilon ))|x|}. \end{aligned}$$

(55)

Therefore, considering (53), (54), and (55), it follows by changing the order of derivatives of (52) and integrating with respect to x that

$$\begin{aligned} \begin{aligned} \left| \left( T_k^{x,t,\varepsilon }\right) ^{(N)}(\theta )\right|&\le \int _0^x L(\varepsilon ) (1+2\Delta (\varepsilon )) \left| \left( T_k^{\tau ,t,\varepsilon }\right) ^{(N)}(\theta )\right| d\tau + \frac{{\tilde{C}} \; e^{{\tilde{N}}L(\varepsilon )(1+2\Delta (\varepsilon ))|x|}}{{\tilde{N}}L(\varepsilon )(1+2\Delta (\varepsilon ))} \\ {}&\quad + \frac{1}{N(1+2\Delta (\varepsilon ))} \left[ \left\| \frac{\partial ^{N} P_k(\varepsilon )}{\partial \theta ^{N}} \right\| + \left\| \frac{\partial ^{N} A_k(\varepsilon )}{\partial \theta ^{N}} \right\| \right] e^{NL(\varepsilon )(1+2\Delta (\varepsilon ))|x|}. \end{aligned} \end{aligned}$$

Thus, by taking $N_q:= \max (N+1,{\tilde{N}}+1)$, an application of Grönwall’s inequality ensures that

$$\begin{aligned} \left| \left( T_k^{x,t,\varepsilon }\right) ^{(N)}(\theta )\right|\le & {} \left( \frac{{\tilde{C}} \; }{{\tilde{N}}L(\varepsilon )} + \frac{1}{1+2\Delta (\varepsilon )} \left[ \left\| \frac{\partial ^{N} P_k(\varepsilon )}{\partial \theta ^{N}} \right\| + \left\| \frac{\partial ^{N} A_k(\varepsilon )}{\partial \theta ^{N}} \right\| \right] \right) \nonumber \\{} & {} e^{N_q L(\varepsilon )(1+2\Delta (\varepsilon ))|x|}. \end{aligned}$$

(56)

Having proved (56), we proceed to showing that (P2.) holds for $q=N$. This will be done by induction on $k \in \mathbb {N}$. Define

$$\begin{aligned} C_{1,N}: =\frac{48K {\tilde{C}}}{\alpha }. \end{aligned}$$

Since $(P_0,A_0)=(0,0)$, property (P2.) is trivially true with this constant for $q=N$ and $k=0$. Suppose it holds for all non-negative integers up to a given $k \in \mathbb {N}$. Let us show that is must also hold for $k+1$. Observe that

$$\begin{aligned} \begin{aligned} \frac{\partial ^N P_{k+1}(\varepsilon )}{\partial \theta ^N} (t,\theta ) = \int _{-\infty }^{\infty } J_1(x) \left( \zeta _{1,k}^{x,t,\varepsilon } \left( T_k^{x,t,\varepsilon } (\theta )\right) \right) ^{(N)} dx. \end{aligned} \end{aligned}$$

Hence, considering (53), (54), (55), and (56), it follows that

$$\begin{aligned} \begin{aligned} \left\| \frac{\partial ^N P_{k+1}(\varepsilon )}{\partial \theta ^N}\right\|&\le \int _{-\infty }^\infty K e^{-\alpha |x|} \left( \frac{{\tilde{C}}(1+2\Delta (\varepsilon ))}{{\tilde{N}}} + {\tilde{C}}\right) e^{N_q L(\varepsilon ) (1+2\Delta (\varepsilon ))|x|} dx \\&\quad + \int _{-\infty }^\infty K e^{-\alpha |x|} 2L(\varepsilon ) \left[ \left\| \frac{\partial ^{N} P_k(\varepsilon )}{\partial \theta ^{N}} \right\| + \left\| \frac{\partial ^{N} A_k(\varepsilon )}{\partial \theta ^{N}} \right\| \right] \, \\ {}&e^{N_q L(\varepsilon ) (1+2\Delta (\varepsilon ))|x|} dx. \end{aligned} \end{aligned}$$

If $\varepsilon _1$ is sufficiently small as to ensure that $2N_qL(\varepsilon )(1+2\Delta (\varepsilon )) \le \alpha $ for all $\varepsilon \in (0,\varepsilon _1]$, and considering the hypothesis of induction, it follows that

$$\begin{aligned} \begin{aligned} \left\| \frac{\partial ^N P_{k+1}(\varepsilon )}{\partial \theta ^N}\right\| \le \frac{4K}{\alpha } \left( \frac{{\tilde{C}}(1+2\Delta (\varepsilon ))}{{\tilde{N}}} + {\tilde{C}}\right) + \frac{8KL(\varepsilon )}{\alpha } C_{1,N}. \end{aligned} \end{aligned}$$

(57)

We proceed identically for $A_{k+1}$ and obtain

$$\begin{aligned} \begin{aligned} \left\| \frac{\partial ^N A_{k+1}(\varepsilon )}{\partial \theta ^N}\right\| \le \frac{4K}{\alpha } \left( \frac{{\tilde{C}}(1+2\Delta (\varepsilon ))}{{\tilde{N}}} + {\tilde{C}}\right) + \frac{8KL(\varepsilon )}{\alpha } C_{1,N}. \end{aligned} \end{aligned}$$

(58)

If $\varepsilon _1$ is also chosen sufficiently small as to ensure that $32KL(\varepsilon )<\alpha $ and $2\Delta (\varepsilon )<1$ for all $\varepsilon \in (0,\varepsilon _1]$, then

$$\begin{aligned} \left\| \frac{\partial ^N P_{k+1}(\varepsilon )}{\partial \theta ^N}\right\| + \left\| \frac{\partial ^N A_{k+1}(\varepsilon )}{\partial \theta ^N}\right\| \le \frac{24K{\tilde{C}}}{\alpha } + \frac{C_{1,N}}{2} \le C_{1,N}, \end{aligned}$$

proving property (P2.).

Observe that the validity of property (P1.) for $q=N$ follows immediately from (56) and the fact that (P2.) holds for $q=N$. Therefore, by induction on q, it is proved that both properties hold for all $q \in \{1,\ldots ,p+1\}$. The Lemma then follows by defining

$$\begin{aligned} N_T:= \max _{q \in \{1,\ldots ,p+1\}}N_q, \end{aligned}$$

and, for each interval $[a,b] \subset (0,\varepsilon _1]$, the positive constants

$$\begin{aligned} C_{[a,b]}: = \max _{q \in \{1,\ldots ,p+1\}}C_{1,q}, \qquad M_{[a,b]}:= \max _{q \in \{1,\ldots ,p+1\}}C_{0,q}, \qquad \end{aligned}$$

and observing that $(f_\varepsilon ,g_\varepsilon )$ is the limit of the sequence $(P_k(\varepsilon ),A_k(\varepsilon ))_{k \in \mathbb {N}}$.

1.4 Proof of Proposition 6

Let $D(\varepsilon )$, $\Delta (\varepsilon )$, $J_1$, $J_2$, and $T_{F,G}$ be given as in the proof of Lemma 1. It is clear that there are $K>0$ and $\alpha >0$ such that $\Vert J_i(t)\Vert \le K e^{-\alpha |t|}$ for all $i \in \{1,2\}.$ Once again, we will denote $L(\varepsilon ,D(\varepsilon ),D(\varepsilon ))$ by $L(\varepsilon )$ throughout the proof. As in Proposition 4, we assume that the function L appearing in hypothesis (ii) of the Lemma satisfies: $L(\varepsilon ,\sigma ,\mu )\ge \varepsilon $ for all $(\varepsilon ,\sigma ,\mu ) \in (0,\varepsilon _0] \times [0,\rho _1) \times [0,\rho _2)$.

For convenience, define

$T^{x,t,\varepsilon }(\theta ): = T_{f_\varepsilon ,g_\varepsilon } (t+x,t,\theta )$;
$\Lambda ^{x,t,\varepsilon }(\theta )=(t+x,\theta ,f_\varepsilon (t+x,\theta ),g_\varepsilon (t+x,\theta ),\varepsilon )$;
$\zeta _i^{x,t,\varepsilon }(\theta ):=\zeta _i(t+x,\theta ,f_\varepsilon (t+x,\theta ),g_\varepsilon (t+x,\theta ),\varepsilon ) = \zeta _i \circ \Lambda ^{x,t,\varepsilon }(\theta )$.

First, let us consider $k=0$. Let us restrict the possible values of the parameter $\varepsilon $ to a compact interval $[a,b] \subset (0,\varepsilon _1]$, and let $\rho >0$ be such that $D(\varepsilon )< \rho $ for all $\varepsilon \in (0,\varepsilon _1]$. In this case, the functions $\zeta _0$, $\zeta _1$, $\zeta _2$ are Lipschitz continuous with Lipschitz constant R over $\mathbb {R}\times \mathbb {R}\times {\bar{B}}_m(0,\rho ) \times {\bar{B}}_n(0,\rho ) \times [a,b]$. Thus, it is clear that, if $\varepsilon ,\tilde{\varepsilon } \in [a,b] \subset (0,\varepsilon _1]$, then

$$\begin{aligned} \begin{aligned} | T^{x,t,\varepsilon }(\theta ) - T^{x,t,\tilde{\varepsilon }}(\theta )| \le&\int _{0}^x L(\varepsilon )(1+2\Delta (\varepsilon ))| T^{\tau ,t,\varepsilon }(\theta ) - T^{\tau ,t,\tilde{\varepsilon }}(\theta )| d\tau \\&+ \int _{0}^x L(\varepsilon ) \left[ \Vert f_\varepsilon -f_{\tilde{\varepsilon }}\Vert + \Vert g_\varepsilon -g_{\tilde{\varepsilon }}\Vert \right] dx \\ {}&+ \int _{0}^x R|\varepsilon -\tilde{\varepsilon }| dx. \end{aligned} \end{aligned}$$

Hence, from Grönwall’s inequality, it follows that

$$\begin{aligned} | T^{x,t,\varepsilon }(\theta ) - T^{x,t,\tilde{\varepsilon }}(\theta )|\le & {} \frac{e^{L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} - 1}{1+2\Delta (\varepsilon )} \left[ \Vert f_\varepsilon -f_{\tilde{\varepsilon }}\Vert + \Vert g_\varepsilon -g_{\tilde{\varepsilon }}\Vert \right] \nonumber \\{} & {} + \frac{R(e^{L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} - 1)}{L(\varepsilon ) (1+2\Delta (\varepsilon ))} |\varepsilon - \tilde{\varepsilon }|. \end{aligned}$$

(59)

Now, since $(f_\varepsilon ,g_\varepsilon )$ is a fixed point of the operator $S^\varepsilon $ given in the proof of Lemma 1, it follows by subtracting $S_1^{\tilde{\varepsilon }}(f_{\tilde{\varepsilon }},g_{\tilde{\varepsilon }})$ from $S_1^\varepsilon (f_\varepsilon ,g_\varepsilon )$ that

$$\begin{aligned} \begin{aligned} \Vert f_\varepsilon (t,\theta ) - f_{\tilde{\varepsilon }}(t,\theta )\Vert \le&\int _{-\infty }^{\infty } K e^{-\alpha |x|} L(\varepsilon )(1+2\Delta (\varepsilon )) | T^{x,t,\varepsilon }(\theta ) - T^{x,t,\tilde{\varepsilon }}(\theta )| dx \\&+ \int _{-\infty }^{\infty } K e^{-\alpha |x|}L(\varepsilon ) \left[ \Vert f_\varepsilon - f_{\tilde{\varepsilon }}\Vert + \Vert g_\varepsilon -g_{\tilde{\varepsilon }}\Vert \right] dx \\ {}&+\int _{-\infty }^{\infty } K e^{-\alpha |x|}R|\varepsilon -\tilde{\varepsilon }| dx. \end{aligned} \end{aligned}$$

Thus, considering (59), if $\varepsilon _1$ is chosen sufficiently small so that $2L(\varepsilon )(1+2\Delta (\varepsilon ))<\alpha $ for all $\varepsilon \in (0,\varepsilon _1]$, it follows that

$$\begin{aligned} \Vert f_\varepsilon (t,\theta ) - f_{\tilde{\varepsilon }}(t,\theta )\Vert \le \frac{4K L(\varepsilon )}{\alpha } \left[ \Vert f_\varepsilon - f_{\tilde{\varepsilon }}\Vert + \Vert g_\varepsilon -g_{\tilde{\varepsilon }}\Vert \right] + R |\varepsilon -\tilde{\varepsilon }|. \end{aligned}$$

A similar argument ensures that

$$\begin{aligned} \Vert g_\varepsilon (t,\theta ) - g_{\tilde{\varepsilon }}(t,\theta )\Vert \le \frac{4K L(\varepsilon )}{\alpha } \left[ \Vert f_\varepsilon - f_{\tilde{\varepsilon }}\Vert + \Vert g_\varepsilon -g_{\tilde{\varepsilon }}\Vert \right] + R |\varepsilon -\tilde{\varepsilon }|. \end{aligned}$$

Therefore, if $\varepsilon _1$ is also small enough to ensure that $16KL(\varepsilon )<\alpha $ for all $\varepsilon \in (0,\varepsilon _1]$, it follows that

$$\begin{aligned} \Vert f_\varepsilon - f_{\tilde{\varepsilon }}\Vert + \Vert g_\varepsilon -g_{\tilde{\varepsilon }}\Vert \le 4R |\varepsilon -\tilde{\varepsilon }| \end{aligned}$$

(60)

if $\varepsilon ,\tilde{\varepsilon } \in [a,b]$. The procedure can be repeated for any choice of interval [a, b] with the exact same conditions required for the choice of $\varepsilon _1$, yielding generally different constants R, but ensuring local Lipschitz continuity nonetheless.

Consider the following properties, where $q \in \{0,\ldots ,p\}$:

(Q.1)
There is $N_q \in \mathbb {N}$ and, for each $[a,b] \subset (0,\varepsilon _1]$, there is $C_{0,q}>0$ such that
$$\begin{aligned} \begin{aligned} \left| \left( T^{x,t,\varepsilon }\right) ^{(q)}(\theta )- \left( T^{x,t,\tilde{\varepsilon }}\right) ^{(q)}(\theta ) \right| \le C_{0,q} |\varepsilon -\tilde{\varepsilon }| e^{N_q L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} \end{aligned} \end{aligned}$$
for all $(x,t,\theta ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}$ and all $\varepsilon ,\tilde{\varepsilon } \in [a,b]$.
(Q.2)
For each $[a,b] \subset (0,\varepsilon _1]$, there is $C_{1,q}>0$ such that
$$\begin{aligned} \left\| \frac{\partial ^q f_\varepsilon }{\partial \theta ^q} - \frac{\partial ^q f_{\tilde{\varepsilon }}}{\partial \theta ^q}\right\| + \left\| \frac{\partial ^q g_\varepsilon }{\partial \theta ^q} - \frac{\partial ^q g_{\tilde{\varepsilon }}}{\partial \theta ^q}\right\| \le C_{1,q} |\varepsilon -\tilde{\varepsilon }| \end{aligned}$$
for all $\varepsilon ,\tilde{\varepsilon } \in [a,b]$.

We will prove by induction that those properties hold for all $q \in \{0,\ldots ,p\}$.

Before we proceed to the proof itself, we make some considerations. Once again, the index i will be used to denote any element of the set $\{0,1,2\}$, since the arguments are the same. Let the interval $[a,b] \subset (0,\varepsilon _1]$ be fixed. First, since $\zeta _i$ is of class $C^{p+1}$, periodic in its first two entries, and since $f_\varepsilon $ and $g_\varepsilon $ are bounded for $\varepsilon \in [a,b]$, it follows that there are constants $C_\zeta >0$ and $L_\zeta >0$ such that, for all $j \in \{0,1,\ldots ,p\}$, the function $D^{(j)} \zeta _i$ satisfies

$$\begin{aligned} \Vert D^{(j)} \zeta _i (\Lambda ^{x,t,\varepsilon }(\theta ))\Vert \le C_\zeta \end{aligned}$$

(61)

and

$$\begin{aligned} \Vert D^{(j)} \zeta _i (\Lambda ^{x,t,\varepsilon }(\theta )) - D^{(j)} \zeta _i (\Lambda ^{x,t,\tilde{\varepsilon }}(\tilde{\theta }))\Vert \le L_\zeta \Vert \Lambda ^{x,t,\varepsilon }(\theta ) - \Lambda ^{x,t,\tilde{\varepsilon }}(\tilde{\theta })\Vert \end{aligned}$$

(62)

for all $j \in \{0,\ldots ,p\}$, all $(x,t) \in \mathbb {R}\times \mathbb {R}$ and all $(\theta ,\varepsilon ),(\tilde{\theta },\tilde{\varepsilon }) \in \mathbb {R}\times [a,b]$.

Furthermore, considering the definition of $\Lambda ^{x,t,\varepsilon }$ and (60), it follows that, for each $[a,b] \subset (0,\varepsilon _1]$, there is $R>0$ such that

$$\begin{aligned} \Vert \Lambda ^{x,t,\varepsilon }(\theta ) - \Lambda ^{x,t,\tilde{\varepsilon }}(\tilde{\theta })\Vert \le 4R|\varepsilon - \tilde{\varepsilon }| + (1+2\Delta (\varepsilon )) |\theta - \tilde{\theta }| \end{aligned}$$

(63)

for all $(x,t) \in \mathbb {R}\times \mathbb {R}$ and all $(\theta ,\varepsilon ),(\tilde{\theta },\tilde{\varepsilon }) \in \mathbb {R}\times [a,b]$. Also, observe that for any $j \in \{1,\ldots ,p+1\}$,

$$\begin{aligned} \left( \Lambda ^{x,t,\varepsilon }\right) ^{(j)}(\theta ) = \left( 0,\delta _{1j},\frac{\partial ^j f_\varepsilon }{\partial \theta ^j}(t+x,\theta ),\frac{\partial ^j g_\varepsilon }{\partial \theta ^j}(t+x,\theta ),0\right) , \end{aligned}$$

(64)

where $\delta _{ij}$ is the Kronecker delta. Thus, it follows that

$$\begin{aligned} \left\| \left( \Lambda ^{x,t,\varepsilon }\right) '(\theta )\right\| \le 1+ 2\Delta (\varepsilon ), \end{aligned}$$

(65)

for all $(x,t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times (0,\varepsilon _1]$. Moreover, from Proposition 5, there is, for each $[a,b] \subset (0,e_1]$, a constant $C_\Lambda >0$ such that

$$\begin{aligned} \left\| \left( \Lambda ^{x,t,\varepsilon }\right) ^{(j)}(\theta )\right\| \le C_\Lambda \end{aligned}$$

(66)

for all $j \in \{2,\ldots ,p+1\}$ and all $(x,t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times [a,b]$. Also, from Corollary 2, it follows that

$$\begin{aligned} \left\| \left( \Lambda ^{x,t,\varepsilon }\right) ^{(j)}(\theta ) - \left( \Lambda ^{x,t,\tilde{\varepsilon }}\right) ^{(j)}(\tilde{\theta })\right\|\le & {} \left\| \frac{\partial ^j f_\varepsilon }{\partial \theta ^j} - \frac{\partial ^j f_{\tilde{\varepsilon }}}{\partial \theta ^j}\right\| + \left\| \frac{\partial ^j g_\varepsilon }{\partial \theta ^j} - \frac{\partial ^j g_{\tilde{\varepsilon }}}{\partial \theta ^j}\right\| \nonumber \\{} & {} + C_\Lambda |\theta - \tilde{\theta }| \end{aligned}$$

(67)

for all $j \in \{1,\ldots ,p\}$, all $(x,t) \in \mathbb {R}\times \mathbb {R}$ and all $(\theta ,\varepsilon ),(\tilde{\theta },\tilde{\varepsilon }) \in \mathbb {R}\times [a,b]$.

Finally, observe that, from Proposition 5, there are $N_T$ and, for each $[a,b] \subset (0,\varepsilon _1]$, a constant $C_T>0$ such that

$$\begin{aligned} \left| \left( T^{x,t,\varepsilon }\right) ^{(j)}(\theta )\right| \le C_T e^{N_T L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} \end{aligned}$$

(68)

for all $j \in \{1,\ldots ,p+1\}$ and all $(x,t,\theta ,\varepsilon ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}\times [a,b]$.

We start the discussion of the induction argument. Observe that the case $q=0$ follows directly from (59) and (60). Let $N \in \{1,\ldots ,p\}$ and assume that properties (Q.1) and (Q.2) are valid for $0\le q\le N-1$. We will show that this ensures that such properties also hold for $q=N$.

Proceeding exactly as in the proof of Proposition 5, we obtain the following from Faà di Bruno’s formula:

$$\begin{aligned} \begin{aligned}&\left( \zeta _{i}^{x,t,\varepsilon } \left( T^{x,t,\varepsilon } (\theta )\right) \right) ^{(N)} \\ {}&\quad = D^{(N)} \zeta _i\left( \Lambda ^{x,t,\varepsilon }\left( T^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda ^{x,t,\varepsilon }\right) '\left( T^{x,t,\varepsilon }(\theta )\right) \right) ^{N} \cdot \left( \left( T^{x,t,\varepsilon }\right) '(\theta )\right) ^{N} \\&\qquad +D\zeta _i\left( \Lambda ^{x,t,\varepsilon }\left( T^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda ^{x,t,\varepsilon }\right) ^{(N)}\left( T^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( T^{x,t,\varepsilon }\right) '(\theta )\right) ^{N} \\&\qquad + \sum _{j=2}^{N-1} D^{(j)} \zeta _i\left( \Lambda ^{x,t,\varepsilon }\left( T^{x,t,\varepsilon }(\theta )\right) \right) \\ {}&\qquad \cdot B_{N,j} \left( \left( \Lambda ^{x,t,\varepsilon }\right) '\left( T^{x,t,\varepsilon }(\theta )\right) ,\ldots , \left( \Lambda ^{x,t,\varepsilon }\right) ^{(N-j+1)}\left( T^{x,t,\varepsilon }(\theta )\right) \right) \left( \left( T^{x,t,\varepsilon }\right) '(\theta )\right) ^{N} \\&\qquad + D\zeta _i\left( \Lambda ^{x,t,\varepsilon }\left( T^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda ^{x,t,\varepsilon }\right) '\left( T^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( T^{x,t,\varepsilon }\right) ^{(N)}(\theta )\right) \\&\qquad + \sum _{j=2}^{N-1} \sum _{l=1}^j \left[ D^{(l)}\zeta _i\left( \Lambda ^{x,t,\varepsilon }\left( T^{x,t,\varepsilon }(\theta )\right) \right) \right. \\ {}&\left. \qquad \cdot B_{j,l} \left( \left( \Lambda ^{x,t,\varepsilon }\right) '\left( T^{x,t,\varepsilon }(\theta )\right) ,\ldots , \left( \Lambda ^{x,t,\varepsilon }\right) ^{(j-l+1)}\left( T^{x,t,\varepsilon }(\theta )\right) \right) \right] \\&\qquad \times B_{N,j}\left( \left( T^{x,t,\varepsilon }\right) '(\theta ),\ldots ,\left( T^{x,t,\varepsilon }\right) ^{(N-j+1)}(\theta )\right) . \end{aligned}\nonumber \\ \end{aligned}$$

(69)

For simplicity, we will denote the summands on the right-hand side of this equation by I, II, III, IV, and V, respectively. If $\varepsilon $ is replaced by $\tilde{\varepsilon }$, we will denote those terms by $I'$, $II'$, $III'$, $IV'$, and $V'$, respectively

Considering the hypothesis of induction combined with (61), (62), (65), (67) and (68), it follows that there is $N_I>0$ and, for each $[a,b] \subset (0,\varepsilon _1]$, $C_I>0$ such that

$$\begin{aligned} \Vert I-I'\Vert \le C_I e^{N_I L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} |\varepsilon -\tilde{\varepsilon }|. \end{aligned}$$

(70)

for all $(x,t,\theta ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}$ and all $\varepsilon , \tilde{\varepsilon } \in [a,b]$. Similarly, since $B_{N,j}$ and $B_{j,l}$ are polynomials, there are $N_{III}>0$ and $N_V>0$, and, for each $[a,b] \subset (0,\varepsilon _1]$, $C_{III}>0$ and $C_V>0$ such that

$$\begin{aligned} \Vert III-III'\Vert \le C_{III} e^{N_{III} L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} |\varepsilon -\tilde{\varepsilon }| \end{aligned}$$

(71)

and

$$\begin{aligned} \Vert V-V'\Vert \le C_{V} e^{N_V L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} |\varepsilon -\tilde{\varepsilon }|. \end{aligned}$$

(72)

for all $(x,t,\theta ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}$ and all $\varepsilon , \tilde{\varepsilon } \in [a,b]$.

Regarding II, observe that (64), (67), and the properties of Lipschitz continuity of $\zeta _i$ given in hypothesis (iii) guarantee that

$$\begin{aligned} \begin{aligned}&\left\| D\zeta _i\left( \Lambda ^{x,t,\varepsilon }\left( T^{x,t,\varepsilon }(\theta )\right) \right) \cdot \left( \left( \Lambda ^{x,t,\varepsilon }\right) ^{(N)}\left( T^{x,t,\varepsilon }(\theta )\right) - \left( \Lambda ^{x,t,\tilde{\varepsilon }}\right) ^{(N)}\left( T^{x,t,\tilde{\varepsilon }}(\theta )\right) \right) \right\| \\&\quad \le L(\varepsilon ) \left[ \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \right] \\&\qquad + L(\varepsilon ) C_\Lambda \left| T^{x,t,\varepsilon }(\theta ) - T^{x,t,\tilde{\varepsilon }}(\theta )\right| . \end{aligned} \end{aligned}$$

Thus, the hypothesis of induction, combined with (62), (67), (65), (67) and (68), ensures that there is $N_{II}>0$ and, for each $[a,b] \subset (0,\varepsilon _1]$, $C_{II}>0$ such that

$$\begin{aligned} \Vert II-II'\Vert{} & {} \le C_{II} e^{N_{II} L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} |\varepsilon -\tilde{\varepsilon }| \nonumber \\{} & {} \quad + L(\varepsilon ) \left[ \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \right] e^{N_{II} L(\varepsilon )(1+2\Delta (\varepsilon ))|x|}. \end{aligned}$$

(73)

for all $(x,t,\theta ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}$ and all $\varepsilon , \tilde{\varepsilon } \in [a,b]$.

Finally, a similar argument ensures that there is $N_{IV} \in \mathbb {N}$ and, for each $[a,b] \subset (0,\varepsilon _1]$, $C_{IV}>0$ such that

$$\begin{aligned} \Vert IV - IV'\Vert{} & {} \le C_{IV} e^{N_{IV} L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} |\varepsilon -\tilde{\varepsilon }| \nonumber \\{} & {} + L(\varepsilon )(1+2\Delta (\varepsilon )) \left| \left( T^{x,t,\varepsilon }\right) ^{(N)}(\theta ) - \left( T^{x,t,\tilde{\varepsilon }}\right) ^{(N)}(\theta )\right| \end{aligned}$$

(74)

for all $(x,t,\theta ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}$ and all $\varepsilon , \tilde{\varepsilon } \in [a,b]$.

By definition of $T^{x,t,\varepsilon }$, it follows that

$$\begin{aligned} \begin{aligned} \frac{\partial ^{N}}{\partial \theta ^{N}} \left( \frac{\partial T_{f_\varepsilon ,g_\varepsilon }}{\partial x}\right) (t+x,t,\theta ,\varepsilon ) = \left( \zeta _{0}^{x,t,\varepsilon }\left( T^{x,t,\varepsilon }(\theta )\right) \right) ^{(N)}. \end{aligned} \end{aligned}$$

Thus, considering inequalities (70)–(74), it follows that there is ${\tilde{N}}_T \in \mathbb {N}$ and, for each $[a,b] \subset (0,\varepsilon _1]$, ${\tilde{C}}_T>0$ such that

$$\begin{aligned} \begin{aligned}&\left| \left( T^{x,t,\varepsilon }\right) ^{(N)}(\theta ) - \left( T^{x,t,\tilde{\varepsilon }}\right) ^{(N)}(\theta )\right| \\&\quad \le \int _0^x L(\varepsilon ) (1+2\Delta (\varepsilon )) \left| \left( T^{\tau ,t,\varepsilon }\right) ^{(N)}(\theta ) - \left( T^{\tau ,t,\tilde{\varepsilon }}\right) ^{(N)}(\theta )\right| d\tau \\&\qquad +\frac{1}{1+2\Delta (\varepsilon )} \left[ \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \right] e^{{\tilde{N}}_T L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} \\&\qquad + {\tilde{C}}_T |\varepsilon - \tilde{\varepsilon }| e^{{\tilde{N}}_T L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} \end{aligned} \end{aligned}$$

for all $(x,t,\theta ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}$ and all $\varepsilon , \tilde{\varepsilon } \in [a,b]$. From Grönwall’s inequality, it follows that

$$\begin{aligned}{} & {} \left| \left( T^{x,t,\varepsilon }\right) ^{(N)}(\theta ) - \left( T^{x,t,\tilde{\varepsilon }}\right) ^{(N)}(\theta )\right| \nonumber \\{} & {} \quad \le \left[ {\tilde{C}}_T|\varepsilon -\tilde{\varepsilon }| + \frac{1}{1+2\Delta (\varepsilon )} \left[ \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \right] \right] \nonumber \\{} & {} \quad e^{({\tilde{N}}_T+1) L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} \end{aligned}$$

(75)

for all $(x,t,\theta ) \in \mathbb {R}\times \mathbb {R}\times \mathbb {R}$ and all $\varepsilon , \tilde{\varepsilon } \in [a,b]$.

Let us prove that property Q.2) holds for $q=N$. Observe that

$$\begin{aligned} \begin{aligned} \frac{\partial ^N f_\varepsilon }{\partial \theta ^N} (t,\theta ) = \int _{-\infty }^{\infty } J_1(x) \left( \zeta _{1}^{x,t,\varepsilon } \left( T^{x,t,\varepsilon } (\theta )\right) \right) ^{(N)} dx. \end{aligned} \end{aligned}$$

Thus, proceeding just as above, we obtain, for each $[a,b] \subset (0,\varepsilon _1]$, a constant ${\tilde{C}}>0$ such that

$$\begin{aligned} \begin{aligned} \left\| \frac{\partial ^N f_\varepsilon }{\partial \theta ^N} - \frac{\partial ^N f_{\tilde{\varepsilon }}}{\partial \theta ^N}\right\|&\le \int _{-\infty }^\infty K e^{-\alpha |x|} L(\varepsilon ) (1+2\Delta (\varepsilon )) \left| \left( T^{\tau ,t,\varepsilon }\right) ^{(N)}(\theta ) - \left( T^{\tau ,t,\tilde{\varepsilon }}\right) ^{(N)}(\theta )\right| d\tau \\&\quad +\int _{-\infty }^\infty K e^{-\alpha |x|} L(\varepsilon ) \left[ \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \right] \\ {}&\quad \times e^{N_{II} L(\varepsilon )(1+2\Delta (\varepsilon ))|x|} + \int _{-\infty }^\infty K e^{-\alpha |x|} {\tilde{C}} |\varepsilon - \tilde{\varepsilon }| e^{{\tilde{N}}_T L(\varepsilon )(1+2\Delta (\varepsilon ))|x|}. \end{aligned} \end{aligned}$$

Define

$$\begin{aligned} C_{1,N}:= \frac{16K {\tilde{C}}}{\alpha }, \qquad {\tilde{N}}:= \max \{{\tilde{N}}_T+1, N_{II}\}. \end{aligned}$$

If $\varepsilon _1$ is sufficiently small as to ensure that $2{\tilde{N}}L(\varepsilon )(1+2\Delta (\varepsilon )) \le \alpha $ for all $\varepsilon \in (0,\varepsilon _1]$, then it follows by integrating and considering (75) that

$$\begin{aligned} \begin{aligned} \left\| \frac{\partial ^N f_\varepsilon }{\partial \theta ^N} - \frac{\partial ^N f_{\tilde{\varepsilon }}}{\partial \theta ^N}\right\| \le \frac{8KL(\varepsilon )}{\alpha } \left[ \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \right] + \frac{4K{\tilde{C}}}{\alpha } |\varepsilon - \tilde{\varepsilon }|. \end{aligned} \end{aligned}$$

Proceeding similarly for $g_\varepsilon $, we obtain

$$\begin{aligned} \begin{aligned} \left\| \frac{\partial ^N g_\varepsilon }{\partial \theta ^N} - \frac{\partial ^N g_{\tilde{\varepsilon }}}{\partial \theta ^N}\right\| \le \frac{8KL(\varepsilon )}{\alpha } \left[ \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \right] + \frac{4K{\tilde{C}}}{\alpha } |\varepsilon - \tilde{\varepsilon }|. \end{aligned} \end{aligned}$$

Hence, if $\varepsilon _1$ is also chosen small enough to ensure that $32L(\varepsilon )(1+2\Delta (\varepsilon ))<\alpha $, it follows that

$$\begin{aligned} \left\| \frac{\partial ^{N} f_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} f_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| + \left\| \frac{\partial ^{N} g_\varepsilon }{\partial \theta ^{N}} - \frac{\partial ^{N} g_{\tilde{\varepsilon }}}{\partial \theta ^{N}}\right\| \le \frac{16K {\tilde{C}}}{\alpha } |\varepsilon - \tilde{\varepsilon }| = C_{1,N} |\varepsilon - \tilde{\varepsilon }|, \end{aligned}$$

proving that property (Q.2) holds for $q=N$. Thus, the validity of property (Q.1) for this value of q follows immediately considering (75).

Therefore, we have proved by induction that properties (Q.1) and (Q.2) are valid for $q \in \{0,\ldots ,p\}$. This concludes the proof of the Lemma, because property (Q.2) ensures local Lipschitz continuity of the functions considered.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Novaes, D.D., Pereira, P.C.C.R. Invariant tori via higher order averaging method: existence, regularity, convergence, stability, and dynamics. Math. Ann. 389, 543–590 (2024). https://doi.org/10.1007/s00208-023-02654-2

Download citation

Received: 20 December 2022
Revised: 19 May 2023
Accepted: 11 June 2023
Published: 24 June 2023
Issue Date: May 2024
DOI: https://doi.org/10.1007/s00208-023-02654-2

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Invariant tori via higher order averaging method: existence, regularity, convergence, stability, and dynamics

Abstract

Similar content being viewed by others

Invariant Manifold Theory

Computation of Quasiperiodic Normally Hyperbolic Invariant Tori: Rigorous Results

Asymptotic stability analysis of autonomous systems by applying the method of localization of compact invariant sets

1 Introduction and statement of the main result

1.1 Introduction to the averaging theory

Theorem 1

Proposition 1

1.2 Statement of the main theorem

Definition 1

Theorem A

Remark 1

Remark 2

1.3 Application: invariant tori in 4D vector fields

Proposition 2

Example 1

2 Fundamental Lemma

Lemma 1

Proof

Corollary 1

Proposition 3

Proposition 4

Proposition 5

Corollary 2

Proposition 6

3 Proof of Theorem A

3.1 Change of variables

3.2 Existence, regularity, and convergence

3.3 Stability

3.4 Dynamics

Theorem 2

4 Invariant torus in 4D vector fields

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendix

Appendix

1.1 Proof of Proposition 3

1.2 Proof of Proposition 4

1.3 Proof of Proposition 5

1.4 Proof of Proposition 6

Rights and permissions

About this article

Cite this article

Share this article

Mathematics Subject Classification

Search

Navigation