1 Introduction

We study the Kuramoto–Sivashinsky equation on a rectangular domain \([0,L_{1}]\times [0,L_{2}]\) under periodic boundary conditions:

$$\begin{aligned} \psi _{t}=-\Delta ^{2}\psi -\Delta \psi -|\nabla \psi |^{2}. \end{aligned}$$
(1)

The Kuramoto–Sivashinsky equation is a well-known model of flame front propagation and was first derived in Kuramoto and Tsuzuki (1976), Sivashinsky (1977). We will prove a global existence theorem for solutions with sufficiently small data in a suitable function space in the presence of one linearly growing mode in each direction. There are a number of global existence theorems for the Kuramoto–Sivashinsky equation in one spatial dimension (Goodman (1994); Nicolaenko et al. (1985); Tadmor (1986)) and detailed studies of the asymptotics of these solutions (Bronski and Gambill (2006); Giacomelli and Otto (2005); Goldman et al. (2015); Otto (2009)) (see also Kostianko et al. (2018) for the effect of adding dispersion). These one-dimensional results rely on a particular structure of the nonlinearity that is not present in two spatial dimensions, and thus, there are far fewer global results available in the two-dimensional case.

In the spatially periodic case, the dynamics of the Kuramoto–Sivashinsky equation are in part governed by the size of the domain, as this determines how many linearly growing Fourier modes are present. In two spatial dimensions, most global existence results in the literature are inherently anisotropic, that is, the length of one period is small compared to that of the other period and/or the size of the initial data. In thin domains, solutions are shown to remain close to one-dimensional solutions. Such studies were initiated by Sell and Taboada (1992). Other anisotropic global existence theorems are the works (Benachour et al. (2014); Kukavica and Massatt (2021); Molinet (2000b)). Then, there are global existence and singularity formation results for modified equations. The fourth-order nature of the parabolic evolution (1) implies the absence of a maximum principle for the linearized evolution. Some authors have shown that related systems with maximum principles do have global solutions (Larios and Yamazaki (2020); Molinet (2000a)), while others have modified the nonlinear term, showing that related equations have finite-time singularities Bellout et al. (2003) or global solutions (Campos et al. (2011); Pinto (1999)); see also Tomlin et al. (2018) for a numerical study of a modified equation. The second author and Feng have shown that a modification of (1) with additional advection also has global solutions (Feng and Mazzucato (2020)). Rather than modifying the equation or relying on anisotropy, the authors have previously given a global existence theorem for the two-dimensional Kuramoto–Sivashinsky equation, but under the requirement that the domain size be sufficiently small, a requirement that precludes growing modes (Ambrose and Mazzucato (2019)). Nonlinear stability of the zero solution and decay rates in the long time limit for a generalized Kuramoto–Sivashinsky equation with damping was obtained in Zhao and Tang (2000) under conditions on the coefficients that ensure the stability for the linearized operator and for small data in \(L^2\).

The linear operators \(-\Delta ^{2}\) and \(-\Delta \) on the right-hand side of (1) may be viewed as being in competition with each other; these represent a higher-order forward parabolic effect and a lower-order backward parabolic effect, which gives rise to large-scale instabilities. Since \(-\Delta ^2-\Delta \) is an elliptic operator, there are at most finitely many linearly growing Fourier modes forward in time. To be precise, if \(L_{1}\) and \(L_{2}\) are each in the interval \((0,2\pi ),\) then there are no linearly growing Fourier modes in (1), and this is the case studied in Ambrose and Mazzucato (2019). In the current study, by taking each of \(L_{1}\) and \(L_{2}\) slightly larger than \(2\pi ,\) we ensure that there is exactly one linearly growing mode in each of the x-direction and the y-direction. In all previous global existence results for the two-dimensional Kuramoto–Sivashinsky equation, either there were no linearly growing modes at all (Ambrose and Mazzucato (2019)), or (in the strongly anisotropic works Benachour et al. 2014; Molinet 2000b; Sell and Taboada 1992) the linearly growing modes were only in one direction. The current work is therefore the first global existence theorem for the two-dimensional Kuramoto–Sivashinsky equation to allow a growing mode in each spatial direction. We note that the interested reader might see Kalogirou et al. (2015) for a detailed numerical study of the dependence of the dynamics of solutions on the size of the spatial domain/the number of linearly growing Fourier modes present. Numerically, in one space dimension one observes that the \(L^2\)-norm of \(\nabla \psi \) remains bounded in time even when growing modes are present. Hence, the nonlinearity has a restoring effect on the large-scale unstable modes. This mechanism was rigorously investigated in Nicolaenko et al. (1985) in two cases: in one dimension for even solutions; in dimensions 2 and 3, under the assumption of a global bound on the \(H^1\) norm of the solution, which implies global existence. The authors of Nicolaenko et al. (1985) rely on Lyapunov function techniques, which are also at the core of our proof and are able to estimate the number of determining modes and the size of the attractor in terms of the assumed \(H^1\) bound and in terms of the period.

The method of proof of our main theorem primarily combines ideas from prior work of the authors (Ambrose and Mazzucato (2019)) and from the one-dimensional global existence theorem of Goodman (Goodman (1994)). We will now describe the formulation of the problem to be used and how these ideas come into play.

We immediately notice that, while the mean of \(\psi \), \({\bar{\psi }},\) is not preserved under the time evolution, its growth is governed by the \(L^{2}\)-norm of the gradient of \(\psi \), which does not depend on the mean itself. As a matter of fact, if we define \(\phi ={\mathbb {P}}_{0}\psi ,\) where \({\mathbb {P}}_{0}\) is the projection which removes the mean of a periodic function, the equation satisfied by \(\phi \) is:

$$\begin{aligned} \phi _{t}=-\Delta ^{2}\phi -\Delta \phi -{\mathbb {P}}_{0}|\nabla \phi |^{2}. \end{aligned}$$
(2)

The evolution equation for \({\bar{\psi }}\) is then

$$\begin{aligned} {\bar{\psi }}_{t}=-\frac{1}{L_{1}L_{2}}\int _{0}^{L_{1}}\int _{0}^{L_{2}}|\nabla \phi |^{2}\ \mathrm{d}x\mathrm{d}y. \end{aligned}$$
(3)

We therefore see that the mean of \(\psi \) exists and is finite at time T as long as \(\phi \in L^{1}([0,T];{\dot{H}}^{1})\), where \({\dot{H}}^1\) denotes the homogeneous \(L^2\)-Sobolev space of order 1. We concentrate on solving (2) from now on.

As in Nicolaenko et al. (1985), we consider symmetric solutions:

$$\begin{aligned} \phi (x,y,t)=\sum _{k,j\ge 1}a_{k,j}(t)\cos \left( \frac{2\pi kx}{L_{1}}\right) \cos \left( \frac{2\pi jy}{L_{2}}\right) . \end{aligned}$$
(4)

We introduce a decomposition of the Fourier modes into three categories. With our choice that \(L_{1}\) and \(L_{2}\) are each slightly larger than \(2\pi ,\) we have exactly two linearly growing Fourier modes, \(a_{1,0}\) and \(a_{0,1};\) these linearly growing modes are the first type that we treat specially. We next take two intermediate modes, which are the \(a_{2,0}\) and \(a_{0,2}\) modes; these are linearly decaying modes that we use to absorb energy from the lowest modes. Finally, our third category consists of all remaining Fourier modes; we consider these to be strongly decaying.

We let \({\mathbb {P}}_{5}\) be the projection onto the complement of the span of the 4 modes introduced above. We may then write \(\phi \) as:

$$\begin{aligned} \phi (x,y,t)= & {} a_{1,0}(t)\cos \left( \frac{2\pi x}{L_{1}}\right) +a_{2,0}(t)\cos \left( \frac{4\pi x}{L_{1}}\right) \nonumber \\&+a_{0,1}(t)\cos \left( \frac{2\pi y}{L_{2}}\right) +a_{0,2}(t)\cos \left( \frac{4\pi y}{L_{2}}\right) +w(x,y,t), \end{aligned}$$
(5)

where \(w={\mathbb {P}}_{5}\phi .\) The KSE is equivalent, at least formally, to a coupled system of 5 equations, 4 ODEs for the modes \(a_{1,0},\) \(a_{2,0},\) \(a_{0,1},\) \(a_{0,2},\) and a PDE for w. Our first goal is to derive this coupled system. Throughout, for ease of notation, we will denote derivatives as subscripts, so \(a_{1,0 t}=\frac{d}{dt} a_{1,0}\).

We introduce some notations for the coefficients of the linear terms in the modes that we treat specially:

$$\begin{aligned} \varepsilon _{i}&=-\left( \frac{2\pi }{L_{i}}\right) ^{4}+\left( \frac{2\pi }{L_{i}}\right) ^{2},\nonumber \\ B_{i}&=\left( \frac{4\pi }{L_{i}}\right) ^{4}-\left( \frac{4\pi }{L_{i}}\right) ^{2}, \end{aligned}$$
(6)

where \(i\in \{1,2\}.\) We will sometimes denote \(\varepsilon =\max \{\varepsilon _{1},\varepsilon _{2}\}\) as well. Of course, since \(2\pi<L_{i}<4\pi \), we have that all of these coefficients are positive. Furthermore, by taking \(L_i\) only slightly larger than \(2\pi \) we can make \(\varepsilon _i\) arbitrarily small. Using this notation, our equations for the first two modes in the x-direction become:

$$\begin{aligned} a_{1,0t}= & {} \varepsilon _{1}a_{1,0}+\frac{8\pi ^{2}}{L_{1}^{2}}a_{1,0}a_{2,0}+F_{1,0,x}+F_{1,0,y},\\ a_{2,0t}= & {} -B_{1}a_{2,0}-\frac{2\pi ^{2}}{L_{1}^{2}}a_{1,0}^{2}+F_{2,0,x}+F_{2,0,y}. \end{aligned}$$

Here, we have brought out the quadratic interactions between these two modes, and we consider the rest of the nonlinearity to be a smaller remainder. We will give formulas for the forcing functions \(F_{i,0,x}\) and \(F_{i,0,y}\), \(i=1,2\), in Sect. 1.1. Similarly, the equations for the first two modes in the y-direction are:

$$\begin{aligned} a_{0,1t}= & {} \varepsilon _{2}a_{0,1}+\frac{8\pi ^{2}}{L_{2}^{2}}a_{0,1}a_{0,2}+F_{0,1,x}+F_{0,1,y},\\ a_{0,2t}= & {} -B_{2}a_{0,2}-\frac{2\pi ^{2}}{L_{2}^{2}}a_{0,1}^{2}+F_{0,2,x}+F_{0,2,y}. \end{aligned}$$

Finally, we may write the evolution equation for w simply as

$$\begin{aligned} w_{t}=-\Delta ^{2}w-\Delta w+{\mathbb {P}}_{5}\left( (\partial _{x}\phi )^{2}+(\partial _{y}\phi )^{2}\right) , \end{aligned}$$
(7)

where \(\phi \) and w are related through (5).

As we have indicated already, the four special modes will be treated with a Lyapunov function, generalizing Goodman’s result for a toy model Goodman (1994). In Goodman’s case, energy was conserved by the nonlinear terms. The conservation of energy (i.e., a conserved \(L^{2}\) norm) does not hold in dimension greater than one, due to the form of the nonlinearity. We observe, however, that Goodman’s argument is more robust than this and can be modified to handle the presence of small forcing. This Lyapunov function argument will show that the first 4 modes remain of size \(\varepsilon ^{1/2}\), if initially of that size. This result is inherently a nonlinear effect, since the first two modes are linearly growing in fact. For the two modes \(a_{2,0}\) and \(a_{0,2}\), this bound can be improved. We will find that the size of each of \(a_{2,0}\) and \(a_{0,2}\) is then at most proportional to \(\varepsilon \), if initially of that size. The norm of w (in a function space related to the Wiener algebra) will be shown to be bounded by \(\varepsilon ^{3/2}\), if initially of that size. The method of employing function spaces based on the Wiener algebra was used previously by the authors in Ambrose and Mazzucato (2019) and is inspired by the work of Duchon and Robert on vortex sheets Duchon and Robert (1988) (cf. also Biswas et al. 2014). The first author and his collaborators have additionally developed and used the technique in Ambrose (2016), Ambrose (2018), Ambrose (2019), Ambrose et al. (2019), Milgrom and Ambrose (2013).

The following is the (non-technical version of) our main theorem:

Theorem 1

There exists \(\varepsilon _{*}>0\) such that for any \(\varepsilon \in (0,\varepsilon _{*}),\) if

$$\begin{aligned}&a_{1,0}(0)\sim \varepsilon ^{1/2},\qquad a_{2,0}(0)\sim \varepsilon ,\\&a_{0,1}(0)\sim \varepsilon ^{1/2},\qquad a_{0,2}(0)\sim \varepsilon ,\qquad w_{0}\sim \varepsilon ^{3/2}, \end{aligned}$$

then the 2D Kuramoto–Sivashinsky equation with these data has a solution on an arbitrary time interval [0, T].

We will give more precise bounds on the initial data and will state a technical version of the theorem later in Theorem 2 in Sect. 5.

While we have not carried out the proof of our main theorem in the absence of the even symmetry reflected in (4), we expect that this symmetry is not critical for achieving the result. Our proof relies on two main ingredients, neither of which require this symmetry; the proof, however, would certainly be more complicated in the general case. One of our main ingredients is operator estimates in function spaces related to the Wiener algebra, and for these estimates, the symmetry is completely immaterial. The other main ingredient is Goodman’s Lyapunov function argument. Goodman actually introduced two Lyapunov functions, one for a toy model and one for the full one-dimensional Kuramoto–Sivashinsky equation in the absence of symmetry. In the present work, we generalize the Goodman result on the toy model, so using the simpler of the two Lyapunov functions. In the general case, we expect that using a different Lyapunov function inspired by Goodman’s argument would provide the same result as we prove here.

We focus on the two-dimensional case, which is the most physically motivated case as compared to higher dimensions. Indeed, (1) may be obtained from a coordinate-free model for the evolution of a flame front, which is modeled as a parametric surface (Frankel and Sivashinsky 1987, 1988). It would nevertheless be very interesting to investigate whether our global existence result extends to higher dimensions. The Duchon–Robert argument to control the remainder is based on the Wiener algebra, and hence, it does not rely on dimension-dependent embeddings. The key step in such an extension is the analysis of the reduced system for the first modes in each direction, which should still only contain quadratic interactions up to the remainders. We do expect the argument to carry over to higher dimensions.

The plan for the rest of the paper is as follows: In Sect. 1.1, we complete the description of the evolution equations satisfied by the components in (5) by detailing formulas for the forcing functions. In Sect. 2, we set up an iterative scheme, and we prepare to make estimates, which will be uniform in the iteration parameter. We develop propositions that give these uniform estimates on \(a_{1,0},\) \(a_{2,0},\) \(a_{0,1},\) and \(a_{0,2}\) in Sect. 3. We then develop tools that will give uniform bounds on w in Sect. 4. The uniform bounds are established, and the limit of the iterates is taken, in Sect. 5. We then make some concluding remarks on future directions in Sect. 6.

1.1 Formulas for the forcing functions

We let \({\mathbb {P}}_{1,0}\) be the projection onto the (1, 0) Fourier mode, and we let \({\mathbb {P}}_{2,0}\) be the projection onto the (2, 0) Fourier mode. Similarly, we let \({\mathbb {P}}_{0,1}\) be the projection onto the (0, 1) Fourier mode, and we let \({\mathbb {P}}_{0,2}\) be the projection onto the (0, 2) Fourier mode.

We will determine \(F_{1,0,x}\) and \(F_{1,0,y}\) by projecting the nonlinear terms onto the (1, 0) mode and then separating out certain quadratic interactions:

$$\begin{aligned}&{\mathbb {P}}_{1,0}\left( \left( \phi _{x}\right) ^{2}\right) =\left[ -\frac{8\pi ^{2}}{L_{1}^{2}}a_{1,0}a_{2,0}+F_{1,0,x}\right] \cos \left( \frac{2\pi x}{L_{1}}\right) ,\nonumber \\&{\mathbb {P}}_{1,0}\left( \left( \phi _{y}\right) ^{2}\right) =F_{1,0,y}\cos \left( \frac{2\pi x}{L_{1}}\right) . \end{aligned}$$
(8)

We decompose \((\phi _{x})^{2}\) as follows:

$$\begin{aligned} (\phi _{x})^{2}=\sum _{i=1}^{6}\Psi _{i}^{x}, \end{aligned}$$

with the terms \(\Psi _{i}^{x}\) defined as

$$\begin{aligned} \Psi _{1}^{x}= & {} \frac{4\pi ^{2}(a_{1,0})^{2}}{L_{1}^{2}}\sin ^{2}\left( \frac{2\pi x}{L_{1}}\right) ,\\ \Psi _{2}^{x}= & {} \frac{16\pi ^{2}(a_{2,0})^{2}}{L_{1}^{2}}\sin ^{2}\left( \frac{4\pi x}{L_{1}}\right) , \\ \Psi _{3}^{x}= & {} (w_{x})^{2},\\ \Psi _{4}^{x}= & {} \frac{16\pi ^{2}a_{1,0}a_{2,0}}{L_{1}^{2}}\sin \left( \frac{2\pi x}{L_{1}}\right) \sin \left( \frac{4\pi x}{L_{1}}\right) ,\\ \Psi _{5}^{x}= & {} -\frac{4\pi a_{1,0}}{L_{1}}w_{x}\sin \left( \frac{2\pi x}{L_{1}}\right) ,\\ \Psi _{6}^{x}= & {} -\frac{8\pi a_{2,0}}{L_{1}}w_{x}\sin \left( \frac{4\pi x}{L_{1}}\right) . \end{aligned}$$

The following equation then defines \(F_{1,0,x},\), after making elementary calculations using trigonometric identities:

$$\begin{aligned} F_{1,0,x}\cos \left( \frac{2\pi x}{L_{1}}\right) ={\mathbb {P}}_{1,0}\left[ \Psi _{3}^{x}+\Psi _{5}^{x}+\Psi _{6}^{x}\right] . \end{aligned}$$

To compute \(F_{1,0,y},\) we need the corresponding decomposition of \(\phi _{y}^{2}:\)

$$\begin{aligned} \phi _{y}^{2}=\sum _{i=1}^{6}\Psi _{i}^{y}, \end{aligned}$$

with the terms \(\Psi _{i}^{y}\) defined as

$$\begin{aligned} \Psi _{1}^{y}= & {} \frac{4\pi ^{2}}{L_{2}^{2}}a_{0,1}^{2}\sin ^{2}\left( \frac{2\pi y}{L_{2}}\right) ,\\ \Psi _{2}^{y}= & {} \frac{16\pi ^{2}}{L_{2}^{2}}a_{0,2}^{2}\sin ^{2}\left( \frac{4\pi y}{L_{2}}\right) , \\ \Psi _{3}^{y}= & {} w_{y}^{2},\\ \Psi _{4}^{y}= & {} \frac{16\pi ^{2}}{L_{2}^{2}}a_{0,1}a_{0,2}\sin \left( \frac{2\pi y}{L_{2}}\right) \sin \left( \frac{4\pi y}{L_{2}}\right) ,\\ \Psi _{5}^{y}= & {} -\frac{4\pi a_{0,1}}{L_{2}}w_{y}\sin \left( \frac{2\pi y}{L_{2}}\right) , \\ \Psi _{6}^{y}= & {} -\frac{8\pi a_{0,2}}{L_{2}}w_{y}\sin \left( \frac{4\pi y}{L_{2}}\right) . \end{aligned}$$

Again using some trigonometric identities and making other calculations, the equation for \(F_{1,0,y}\) is then:

$$\begin{aligned} F_{1,0,y}\cos \left( \frac{2\pi x}{L_{1}}\right) ={\mathbb {P}}_{1,0}[\Psi _{3}^{y}+\Psi _{5}^{y}+\Psi _{6}^{y}]. \end{aligned}$$

We now calculate the remaining forcing functions in a similar way, in particular only the third, fifth, and sixth components of \(\Psi \) enter into the formulas:

$$\begin{aligned} F_{2,0,x}\cos \left( \frac{4\pi x}{L_{1}}\right)= & {} {\mathbb {P}}_{2,0}\left[ \Psi _{3}^{x}+\Psi _{5}^{x}+\Psi _{6}^{x}\right] . \\ F_{2,0,y}\cos \left( \frac{4\pi x}{L_{1}}\right)= & {} {\mathbb {P}}_{2,0}[\Psi _{3}^{y}+\Psi _{5}^{y}+\Psi _{6}^{y}]. \\ F_{0,1,x}\cos \left( \frac{2\pi y}{L_{2}}\right)= & {} {\mathbb {P}}_{0,1}[\Psi _{3}^{x}+\Psi _{5}^{x}+\Psi _{6}^{x}], \\ F_{0,1,y}\cos \left( \frac{2\pi y}{L_{2}}\right)= & {} {\mathbb {P}}_{0,1}[\Psi _{3}^{y}+\Psi _{5}^{y}+\Psi _{6}^{y}], \\ F_{0,2,x}\cos \left( \frac{4\pi y}{L_{2}}\right)= & {} {\mathbb {P}}_{0,2}[\Psi _{3}^{x}+\Psi _{5}^{x}+\Psi _{6}^{x}], \\ F_{0,2,y}\cos \left( \frac{4\pi y}{L_{2}}\right)= & {} {\mathbb {P}}_{0,2}[\Psi _{3}^{y}+\Psi _{5}^{y}+\Psi _{6}^{y}]. \end{aligned}$$

2 Iterative Scheme

We will solve the coupled system of ODEs for the 4 specialized modes and the PDE for the remainder w via an iterative scheme for \(\phi ^n\), where \(\phi ^n\) is defined by:

$$\begin{aligned} \phi ^{n}(x,y,t)= & {} a_{1,0}^{n}(t)\cos \left( \frac{2\pi x}{L_{1}}\right) +a_{2,0}^{n}(t)\cos \left( \frac{4\pi x}{L_{1}}\right) \nonumber \\&+a_{0,1}^{n}(t)\cos \left( \frac{2\pi y}{L_{2}}\right) +a_{0,2}^{n}(t)\cos \left( \frac{4\pi y}{L_{2}}\right) +w^{n}(x,y,t). \end{aligned}$$
(9)

In the scheme, the forcing terms are given by formulas corresponding to those in Sect. 1.1 in a straightforward way.

We start by giving the equations for the \(a^{n+1}\) coefficients:

$$\begin{aligned} a_{1,0t}^{n+1}= & {} \varepsilon _{1}a_{1,0}^{n+1}+\frac{8\pi ^{2}}{L_{1}^{2}}a_{1,0}^{n+1}a_{2,0}^{n+1} +F_{1,0,x}^{n}+F_{1,0,y}^{n}, \\ a_{2,0t}^{n+1}= & {} -B_{1}a_{2,0}^{n+1}-\frac{2\pi ^{2}}{L_{1}^{2}}(a_{1,0}^{n+1})^{2}+F_{2,0,x}^{n}+F_{2,0,y}^{n},\\ a_{0,1t}^{n+1}= & {} \varepsilon _{2}a_{0,1}^{n+1}+\frac{8\pi ^{2}}{L_{2}^{2}}a_{0,1}^{n+1}a_{0,2}^{n+1} +F_{0,1,x}^{n}+F_{0,1,y}^{n}, \\ a_{0,2t}^{n+1}= & {} -B_{2}a_{0,2}^{n+1}-\frac{2\pi ^{2}}{L_{2}^{2}}(a_{0,1}^{n+1})^{2}+F_{0,2,x}^{n}+F_{0,2,y}^{n}, \end{aligned}$$

To complete the scheme, we also give the iterated version of (7) for \(w^n\):

$$\begin{aligned} w^{n+1}_{t}=-\Delta ^{2}w^{n+1}-\Delta w^{n+1}+{\mathbb {P}}_{5}\left( (\partial _{x}\phi ^{n})^{2} +(\partial _{y}\phi ^{n})^{2}\right) . \end{aligned}$$
(10)

The iterated system is taken with initial data that do not depend on n,  namely,

$$\begin{aligned}&a_{1,0}^{n+1}(t)=a_{1,0}(0), \qquad a_{2,0}^{n+1}(t)=a_{2,0}(0), \\&a_{0,1}^{n+1}(t)=a_{0,1}(0), \qquad a_{0,2}^{n+1}(t)=a_{0,2}(0), \qquad w^{n+1}=w_{0}. \end{aligned}$$

2.1 List of Constants

For convenience, we label some combinations of constants that will appear in ensuing calculations. We first introduce \(M_{1,1}\) and \(M_{1,2},\) which will be used in the bounds for \(a_{1,0}^{n}\) and \(a_{0,1}^{n}:\)

$$\begin{aligned} M_{1,1}=\frac{12B_{1}L_{1}^{4}}{\pi ^{4}},\qquad M_{1,2}=\frac{12B_{2}L_{2}^{4}}{\pi ^{4}}. \end{aligned}$$

The following constants will be used in the bounds for \(a_{2,0}^{n}\) and \(a_{0,2}^{n}:\)

$$\begin{aligned} M_{2,1}=\frac{8\pi ^{2}M_{1,1}}{L_{1}^{2}}.\qquad M_{2,2}=\frac{8\pi ^{2}M_{1,2}}{L_{2}^{2}}. \end{aligned}$$

The constant \(M_3\) will be used in the bound for \(w^{n}:\)

$$\begin{aligned} M_{3}=\max \left\{ 6K_{1}\left( 2M_{1,1}^{1/2}M_{2,1}K_{2}\right) , 6K_{1}\left( 2M_{1,2}^{1/2}M_{2,2}K_{2}\right) \right\} . \end{aligned}$$

The formula above for \(M_{3}\) involves two other constants, \(K_{1}\) and \(K_{2}.\) Of these, \(K_{1}\) is a bound for the operator norm of an integral term in the mild formulation of the equation for \(w^{n}\); this formulation will be developed in Sect. 4. To specify the constant \(K_{1}\), we need to specify a set, A,  of special wavenumber pairs:

$$\begin{aligned} A=\{(0,0), (1,0), (2,0), (0,1), (0,2)\}. \end{aligned}$$

Then, \(K_{1}\) is given by:

$$\begin{aligned} K_{1}=\sup _{(k,j)\in {\mathbb {Z}}^{2}\setminus A}\frac{1+|k|+|j|}{-\sigma (k,j)}, \end{aligned}$$
(11)

where \(\sigma \) is the symbol of the linearized KSE operator \(-\Delta ^2-\Delta \),

$$\begin{aligned} \sigma (k,j)=-\left( \left( \frac{2\pi k}{L_{1}}\right) ^{2}+\left( \frac{2\pi j}{L_{2}}\right) ^{2}\right) ^{2} +\left( \frac{2\pi k}{L_{1}}\right) ^{2}+\left( \frac{2\pi j}{L_{2}}\right) ^{2}. \end{aligned}$$
(12)

We notice that the denominator in (11) is quartic with respect to k and j, while the numerator is linear. Also, the denominator is always positive, as the only pairs for which the denominator is nonpositive are \((k,j)=(0,0),\) \((k,j)=(1,0),\) and \((k,j)=(0,1),\) and these three pairs are excluded from the set A. Thus, the supremum in (11) is finite and positive.

We let \(K_{2}\) be an upper bound on the norm of some particular functions in a certain space, denoted \({\mathcal {B}}_{\rho }^{0}\) and defined in Sect. 4, which will be used for the analysis of the \(w^n\) equation:

$$\begin{aligned}&\left\| \frac{16\pi ^{2}}{L_{1}^{2}}\sin \left( \frac{2\pi x}{L_{1}}\right) \sin \left( \frac{4\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}} \le K_{2}, \qquad \left\| \frac{16\pi ^{2}}{L_{1}^{2}}\sin ^{2}\left( \frac{4\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}\le K_{2}, \nonumber \\&\quad \left\| \frac{4\pi }{L_{1}}\sin \left( \frac{2\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}\le K_{2}, \qquad \left\| \frac{8\pi }{L_{1}}\sin \left( \frac{4\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}\le K_{2}, \nonumber \\&\quad \left\| \frac{16\pi ^{2}}{L_{2}^{2}}\sin \left( \frac{2\pi y}{L_{2}}\right) \sin \left( \frac{4\pi y}{L_{2}} \right) \right\| _{{\mathcal {B}}_{\rho }^{0}} \le K_{2}, \qquad \left\| \frac{16\pi ^{2}}{L_{2}^{2}}\sin ^{2}\left( \frac{4\pi y}{L_{2}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}\le K_{2}, \nonumber \\&\quad \left\| \frac{4\pi }{L_{2}}\sin \left( \frac{2\pi y}{L_{2}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}\le K_{2}, \qquad \left\| \frac{8\pi }{L_{2}}\sin \left( \frac{4\pi y}{L_{2}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}\le K_{2}. \end{aligned}$$
(13)

Finally, we introduce a constant K that will be used in the bound on the forcing terms:

$$\begin{aligned} K= & {} \max \Bigg \{ \frac{3M_{1,1}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{2\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}, \frac{3M_{1,1}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{4\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}, \frac{3M_{1,2}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{2\pi y}{L_{2}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}, \frac{3M_{1,2}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{4\pi y}{L_{2}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}, \\&\frac{3M_{1,2}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{2\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}, \frac{3M_{1,2}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{4\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}, \frac{3M_{1,1}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{2\pi y}{L_{2}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}, \frac{3M_{1,1}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{4\pi y}{L_{2}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}} \Bigg \}. \end{aligned}$$

3 Goodman’s Toy Model with Added Forcing

In Goodman (1994), Goodman proved that small solutions of the one-dimensional Kuramoto–Sivashinsky equation exist and stay small for all time, using a Lyapunov function argument. In his proof, the domain can be of arbitrary size, and hence, there can be any number of linearly growing modes. First, however, he motivated the argument with a toy model, which was constructed by considering the case in which there was only one growing mode, and neglecting contributions to the evolution from Fourier modes other than the first and second modes. The toy model demonstrated how energy transfers between a growing mode and a decaying mode, achieving balance. We make two modifications to Goodman’s toy model: we have a small parameter in front of the exponential growth term in the evolution equation for the growing mode (this growth term was of unit size in Goodman (1994)), and we allow a given forcing as well. In this section, we develop bounds in Proposition 1 and Proposition 2 that will be utilized in the induction argument in Sect. 5.

We study the following system:

$$\begin{aligned} a_{t}= & {} \varepsilon _{i} a+\frac{8\pi ^{2}}{L_{i}^{2}}ab+Q_{1}, \end{aligned}$$
(14)
$$\begin{aligned} b_{t}= & {} -B_{i}b-\frac{2\pi ^{2}}{L_{i}^{2}}a^{2}+Q_{2}, \end{aligned}$$
(15)

for \(i\in \{1,2\}\), where \(Q_1\) and \(Q_2\) are given functions in time. For existence and uniqueness of solutions, at least for short time, it is enough to assume that \(Q_{i}\in L^1((0,t))\). We will need a bit more hypotheses on these functions. For the remainder of the section, we fix a choice for \(i\in \{1,2\}.\) We will assume the following bounds for \(Q_{1}\) and \(Q_{2}:\)

$$\begin{aligned} \sup _{t\in [0,\infty )}|Q_{1}|\le 2K\varepsilon ^{2}, \qquad \sup _{t\in [0,\infty )}|Q_{2}|\le 2K\varepsilon ^{2}. \end{aligned}$$
(16)

Proposition 1

Assume (16) holds and let a and b solve (14)–(15). There exists \(\varepsilon _{*}>0\) such that for any value of \(\varepsilon >0\) satisfying \(\varepsilon \in (0,\varepsilon _{*}),\) if \(a^{2}(0)+b^{2}(0)\le M_{1,i}\,\varepsilon /4,\) then \(a^{2}(t)+b^{2}(t)\le 4M_{1,i}\,\varepsilon \) for all \(t>0\).

Proof

We define a Lyapunov function

$$\begin{aligned} G(a,b)=\frac{1}{2}a^{2}+2b^{2}+\frac{L_{i}^{2}\varepsilon }{\pi ^{2}}b. \end{aligned}$$

Let us assume that \(G(a,b)\ge M_{1,i}\varepsilon .\) Then, we have that

$$\begin{aligned} \frac{1}{2}a^{2}+2b^{2} \ge M_{1,i}\,\varepsilon -\left| \frac{L_{i}^{2}\varepsilon b}{\pi ^{2}}\right| \ge M_{1,i}\,\varepsilon -b^{2}-\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}} \ge \frac{M_{1,i}}{2}\varepsilon - b^{2}. \end{aligned}$$
(17)

For the first inequality, we have used that, by Young’s inequality,

$$\begin{aligned} \left| \frac{L_{i}^{2}\varepsilon b}{\pi ^{2}}\right| \le b^{2}+\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}, \end{aligned}$$
(18)

while for the last inequality we have used that it is possible to choose \(\varepsilon \) small enough so that

$$\begin{aligned} \frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}\le \frac{M_{1,i}\,\varepsilon }{2}. \end{aligned}$$

It then follows from (17) that

$$\begin{aligned} \frac{1}{2}a^{2}+3b^{2}\ge \frac{M_{1,i}\,\varepsilon }{2}, \end{aligned}$$

from which we conclude that

$$\begin{aligned} a^{2}+b^{2}\ge \frac{M_{1,i}\,\varepsilon }{6}. \end{aligned}$$
(19)

We next take the derivative of G with respect to time and use (14)–(15):

$$\begin{aligned} G_{t}=(\varepsilon _{i}-2\varepsilon ) a^{2}-4B_{i}b^{2}-\frac{L_{i}^{2}\varepsilon B_{i}b}{\pi ^{2}}+aQ_{1}+4bQ_{2} +\frac{L_{i}^{2}\varepsilon }{\pi ^{2}}Q_{2}. \end{aligned}$$

We rewrite this expression as:

$$\begin{aligned} G_{t}=\Upsilon _{1}+\Upsilon _{2}, \end{aligned}$$

where \(\Upsilon _{1}\) and \(\Upsilon _{2}\) are given by:

$$\begin{aligned} \Upsilon _{1}= & {} \left( \frac{\varepsilon _{i}}{2}-\varepsilon \right) a^{2}-2B_{i}b^{2}-\frac{L_{i}^{2}\varepsilon B_{i}b}{\pi ^{2}},\\ \Upsilon _{2}= & {} \left( \frac{\varepsilon _{i}}{2}-\varepsilon \right) a^{2}-2B_{i}b^{2} +aQ_{1}+4bQ_{2}+\frac{L_{i}^{2}\varepsilon Q_{2}}{\pi ^{2}}. \end{aligned}$$

We will show that \(\Upsilon _{1}\) and \(\Upsilon _{2}\) are negative when a and b satisfy (19), at least for sufficiently small values of \(\varepsilon .\) For \(\Upsilon _{1},\) it is enough to consider the case \(b<0\), as \(\Upsilon _{1}<0\) if \(b\ge 0\). Next, we observe that if \(b<-\frac{L_{i}^{2}\varepsilon }{2\pi ^{2}},\) then

$$\begin{aligned} -2B_{i}b^{2}-\frac{L_{i}^{2}\varepsilon B_{i}b}{\pi ^{2}}<0, \end{aligned}$$

and thus \(\Upsilon _{1}<0.\) The remaining case to consider is

$$\begin{aligned} -\frac{L_{i}^{2}\varepsilon }{2\pi ^{2}}<b<0. \end{aligned}$$
(20)

For \(\varepsilon \) small enough, (19) and (20) together imply

$$\begin{aligned} a^{2}\ge \frac{M_{1,i}\varepsilon }{12}. \end{aligned}$$

Hence, if (20) holds, we may conclude the following bounds:

$$\begin{aligned}&\left( \frac{\varepsilon _{i}}{2}-\varepsilon \right) a^{2}\le -\frac{\varepsilon }{2}a^{2} \le -\frac{M_{1,i}\varepsilon ^{2}}{24}, \\&\left| \frac{L_{i}^{2}\varepsilon B_{i}b}{\pi ^{2}}\right| \le \frac{L_{i}^{4}\varepsilon ^{2}B_{i}}{2\pi ^{4}}. \end{aligned}$$

Using that \(M_{1,i}=\frac{12B_{i}L_{i}^{4}}{\pi ^{4}}\) by definition, we have

$$\begin{aligned} -\frac{\varepsilon }{2}a^{2}-\frac{L_{i}^{2}\varepsilon B_{i}b}{\pi ^{2}} \le -\frac{M_{1,i}\varepsilon ^{2}}{24}+\frac{L_{i}^{4}\varepsilon ^{2}B_{i}}{2\pi ^{4}} =0. \end{aligned}$$

We conclude that \(\Upsilon _{1}<0.\) We have shown then that \(\Upsilon _{1}<0\) in every case.

We now turn to \(\Upsilon _{2}.\) We estimate the terms containing \(Q_1\) and \(Q_2\) as follows. By Young’s inequality,

$$\begin{aligned} |aQ_{1}|\le \frac{\varepsilon a^{2}}{4}+\frac{Q_{1}^{2}}{\varepsilon }, \end{aligned}$$

which, combined with (16), gives

$$\begin{aligned} |aQ_{1}|\le \frac{\varepsilon a^{2}}{4}+\frac{4K^{2}\varepsilon ^{4}}{\varepsilon }=\frac{\varepsilon a^{2}}{4} +4K^{2}\varepsilon ^{3}. \end{aligned}$$

We similarly bound \(4bQ_{2}\) as

$$\begin{aligned} |4bQ_{2}|\le B_{i}b^{2}+\frac{4Q_{2}^{2}}{B_{i}}\le B_{i}b^{2}+\frac{16K^{2}\varepsilon ^{4}}{B_{i}}. \end{aligned}$$

Again using (16), we bound the last term in \(\Upsilon _2\) as

$$\begin{aligned} \left| \frac{L_{i}^{2}\varepsilon Q_{2}}{\pi ^{2}}\right| \le \frac{2L_{i}^{2}K\varepsilon ^{3}}{\pi ^{2}}. \end{aligned}$$

These estimates in turn give the following bound on \(\Upsilon _2\):

$$\begin{aligned} \Upsilon _{2}\le -\frac{\varepsilon a^{2}}{4}-B_{i}b^{2} +\left[ 4K^{2}\varepsilon ^{3}+\frac{16K^{2}\varepsilon ^{4}}{B_{i}} +\frac{2L_{i}^{2}K\varepsilon ^{3}}{\pi ^{2}}\right] . \end{aligned}$$
(21)

But we assumed that \(G\ge M_{1,i}\varepsilon \), which implies \(a^{2}+b^{2}\ge M_{1,i}\varepsilon /6\) as shown above, so that

$$\begin{aligned} -\frac{\varepsilon a^{2}}{4}-B_{i}b^{2} \le -\frac{\varepsilon a^{2}}{4}-\frac{\varepsilon b^{2}}{4}\le -\frac{M_{1,i}\varepsilon ^{2}}{24}. \end{aligned}$$

Therefore,

$$\begin{aligned} \Upsilon _{2}< -\frac{M_{1,i}\varepsilon ^{2}}{24} +\left[ 4K^{2}\varepsilon ^{3}+\frac{16K^{2}\varepsilon ^{4}}{B_{i}} +\frac{2L_{i}^{2}K\varepsilon ^{3}}{\pi ^{2}}\right] . \end{aligned}$$

We can take \(\varepsilon \) small enough so that

$$\begin{aligned} \left| 4K^{2}\varepsilon ^{3}+\frac{16K^{2}\varepsilon ^{4}}{B_{i}} +\frac{2L_{i}^{2}K\varepsilon ^{3}}{\pi ^{2}}\right| \le \frac{M_{1,i}\varepsilon ^{2}}{48}. \end{aligned}$$

For such values of \(\varepsilon \), we have \(\Upsilon _{2}<0.\)

We have concluded that \(G\ge M_{1,i}\varepsilon \) implies \(G_{t}<0.\) Hence, if G is initially less than \(M_{1,i}\,\varepsilon ,\) then necessarily \(G<M_{1,i}\,\varepsilon \) for all \(t>0\).

Next, we ask under which conditions \(G<M_{1,i}\varepsilon \) initially. We observe that, from the definition of G and (18),

$$\begin{aligned} G\le \frac{1}{2}a^{2}+2b^{2}+b^{2}+\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}\le 3(a^{2}+b^{2}) +\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}. \end{aligned}$$

Consequently, \(G(0)<M_{1,i}\varepsilon \) provided \(a^{2}(0)+b^{2}(0)\le \frac{M_{1,i}\varepsilon }{4}\) (which holds by hypothesis) and provided \(\varepsilon \) is taken small enough so that \(\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}<\frac{M_{1,i}\varepsilon }{4}\).

Assuming then \(G(t)<M_{1,i}\varepsilon \) for all \(t>0\), we ask what can we say about \(a^{2}(t)+b^{2}(t)\). We again use the definition of G together with (18), now finding that

$$\begin{aligned}&M_{1,i}\varepsilon >\frac{1}{2}a^{2}+2b^{2}+\frac{L_{i}^{2}\varepsilon b}{\pi ^{2}} \ge \frac{1}{2}a^{2}+2b^{2}-\left| \frac{L_{i}^{2}\varepsilon b}{\pi ^{2}}\right| \\&\quad \ge \frac{1}{2}a^{2}+2b^{2}-b^{2}-\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}} \ge \frac{1}{2}\left( a^{2}+b^{2}\right) -\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}. \end{aligned}$$

Rearranging the left-hand and right-hand sides of this expression gives:

$$\begin{aligned} \frac{1}{2}\left( a^{2}+b^{2}\right) < M_{1,i}\varepsilon +\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}. \end{aligned}$$

We then take \(\varepsilon \) small enough so that \(\frac{L_{i}^{4}\varepsilon ^{2}}{4\pi ^{4}}\le M_{1,i}\varepsilon .\) Finally, we conclude

$$\begin{aligned} a^{2}+b^{2} < 4M_{1,i}\varepsilon . \end{aligned}$$

This completes the proof. \(\square \)

Proposition 2

Under the hypotheses of Proposition 1, if also \(b(0)\le M_{2,i}\varepsilon /2,\) then there exists \(\varepsilon _{*}>0\) such that for any value of \(\varepsilon \in (0,\varepsilon _{*}),\) \(|b(t)|\le M_{2,i}\,\varepsilon \) for all \(t>0\).

Proof

From Proposition 1, we have \((a(t))^{2}\le 4M_{1,i}\,\varepsilon \) for all \(t>0\). From (16), we also have \(|Q_{2}(t)|\le 2K\varepsilon ^{2}\) for all \(t>0\). Using Duhamel’s formula, we rewrite the equation for b in integral form:

$$\begin{aligned} b(t)=e^{-B_{i}t}b(0)+e^{-B_{i}t}\int _{0}^{t}e^{B_{i}s}\left[ \frac{2\pi ^{2}}{L_{i}^{2}}a^{2}(s)+Q_{2}(s)\right] \ ds. \end{aligned}$$

We recall that \(M_{2,i}=\frac{8\pi ^{2}M_{1,i}}{L_{i}^{2}}\), so that

$$\begin{aligned} |b(t)|\le e^{-B_{i}t}|b(0)|+\left( M_{2,i}\varepsilon +2K\varepsilon ^{2}\right) e^{-B_{i}t}\int _{0}^{t}e^{B_{i}s}\ ds. \end{aligned}$$

We evaluate the integral and bound the result as

$$\begin{aligned} |b(t)|\le e^{-B_{i}t}|b(0)|+\frac{1}{B_{i}}\left( M_{2,i}\varepsilon +2K\varepsilon ^{2}\right) . \end{aligned}$$

Now \(B_{i}\) is approximately equal to 12, since we are taking \(L_{i}\) close to \(2\pi ;\) we may thus say \(B_{i}>10.\) Lastly, by again taking \(\varepsilon \) sufficiently small and from the hypothesis \(|b(0)|\le \frac{M_{2,i}}{2}\varepsilon \), it follows that

$$\begin{aligned} |b(t)|\le M_{2,i}\varepsilon , \end{aligned}$$

for all \(t>0.\) \(\square \)

4 The Duchon–Robert Framework

In this section, we develop the estimates we will use for the iterates \(w^{n}.\) We will assume that \(w^n\) belongs to suitable function spaces of analytic functions in time based on the Wiener algebra. These spaces are Banach algebras and are well adapted to the inductive argument carried out in Sect. 5. The bounds on \(w^n\) follow from estimates on the semigroup generated by the linearized operator and by estimating the integral in the mild formulation of the PDE, exploiting the algebra structure to control the nonlinearity. These spaces and similar bounds were used by Duchon and Robert (Duchon and Robert (1988)) to prove the global existence of vortex sheet solutions in incompressible two-dimensional fluid flow.

For \(m\in {\mathbb {N}}\) and \(\rho \ge 0,\) we define the space \(B^{m}_{\rho }\) to be the space of distributions on the torus for which the following weighted sum of their Fourier coefficients is finite:

$$\begin{aligned} f\in B^{m}_{\rho } \iff \Vert f\Vert _{B^{m}_{\rho }}=\sum _{(k,j)\in {\mathbb {Z}}^{2}}e^{\rho (|k|+|j|)} (1+|k|+|j|)^{m}|f_{k,j}|<\infty . \end{aligned}$$

We also have a space-time version of this space, which we call \({\mathcal {B}}^{m}_{\rho }\), defined as the space of distributions on \([0,\infty )\times {\mathbb {T}}^2\) such that

$$\begin{aligned} \Vert g\Vert _{{\mathcal {B}}^{m}_{\rho }} =\sum _{(k,j)\in {\mathbb {Z}}^{2}}e^{\rho (|k|+|j|)}(1+|k|+|j|)^{m} \sup _{t\in [0,\infty )}|g_{k,j}(t)|<\infty . \end{aligned}$$

We observe that elements of both \(B^m_\rho \) and \({\mathcal {B}}^m_\rho \) are actually functions that are analytic in space with radius of analyticity at least \(\rho >0\) and at least bounded in time.

The spaces \(B^{0}_{\rho }\) and \({\mathcal {B}}^{0}_{\rho }\) are Banach algebras; indeed, \(B_{0}^{0}\) is exactly the Wiener algebra. If f and g are both in \({{\mathcal {B}}}^{0}_{\rho },\) we have

$$\begin{aligned}&\!\!\!\Vert fg\Vert _{{{\mathcal {B}}}^{0}_{\rho }}\\&\!\!\!\quad \le \sum _{(k,j)\in {\mathbb {Z}}^{2}} \sum _{(\ell ,n)\in {\mathbb {Z}}}e^{\rho (|k-\ell |+|j-n|)}e^{\rho (|\ell |+|n|)} \left( \sup _{t\in [0,\infty )}|f_{k-\ell ,j-n}(t)|\right) \left( \sup _{t\in [0,\infty )}|g_{\ell ,n}(t)|\right) \\&\!\!\!\quad \le \Vert f\Vert _{{{\mathcal {B}}}^0_{\rho }}\Vert g\Vert _{{{\mathcal {B}}}^0_{\rho }}. \end{aligned}$$

The analogous estimate for \(B_{\rho }^{0}\) follows immediately by observing that \(B_{\rho }^{m}\) consists precisely of the elements of \({\mathcal {B}}_{\rho }^{m}\) that are constant in time. Then, we may conclude (simply by the product rule) that the spaces \(B^{1}_{\rho }\) and \({\mathcal {B}}^{1}_{\rho }\) are also Banach algebras. Indeed, a function f is in \({\mathcal {B}}^{1}_{\rho }\) if and only if f and its partial derivatives \(\partial _{x}f\) and \(\partial _{y}f\) are all in \({\mathcal {B}}_{\rho }^{0}.\)

We note that we will not use the spaces \({\mathcal {B}}^{m}_{\rho }\) or \(B^{m}_{\rho }\) for \(m>1,\) although these are Banach algebras as well (for the same reasons).

We define the operator \(I^{+}\) by

$$\begin{aligned} I^{+}h(\cdot ,t)={\mathbb {P}}_{5}\int _{0}^{t}e^{-(\Delta ^{2}+\Delta )(t-s)}h(\cdot ,s)\ ds, \end{aligned}$$

where the integral is intended in the Bochner sense and \(e^{-t (\Delta ^2+ \Delta )}\) denotes the \(C^0\) (unbounded) semigroup generated by the linearized KSE operator on \(B_{\rho }^{m}\). We will show that \(I^{+}\) is bounded from \({\mathcal {B}}^{0}_{\rho }\) to \({\mathcal {B}}^{1}_{\rho }.\) (This is the only fact needed for our purposes, but the integral is actually bounded from \({\mathcal {B}}^{0}_{\rho }\) to \({\mathcal {B}}^{4}_{\rho }.\)) Let \(h\in {\mathcal {B}}^{0}_{\rho }\) be given. Then, the norm of \(I^{+}h\) is given by:

$$\begin{aligned}&\Vert I^{+}h\Vert _{{\mathcal {B}}^{1}_{\rho }}\\&\quad =\sum _{(k,j)\notin A}e^{\rho (|k|+|j|)}(1+|k|+|j|)\sup _{t\in [0,\infty )}\left| \int _{0}^{t}\exp \left\{ \sigma (k,j)(t-s)\right\} h_{k,j}(s)\ ds \right| , \end{aligned}$$

where \(\sigma \) is defined in (12). The triangle inequality then implies:

$$\begin{aligned}&\Vert I^{+}h\Vert _{{\mathcal {B}}^{1}_{\rho }} \le \sum _{(k,j)\notin A} e^{\rho (|k|+|j|)}(1+|k|+|j|)\sup _{t\in [0,\infty )}\exp \left\{ \sigma (k,j)t\right\} \cdot \\&\quad \cdot \int _{0}^{t}\exp \left\{ -\sigma (k,j)s\right\} |h_{k,j}(s)|\ ds. \end{aligned}$$

We take the supremum of \(|h_{k,j}(s)|\) in s, which we can then pull out to obtain:

$$\begin{aligned}&\Vert I^{+}h\Vert _{{\mathcal {B}}^{1}_{\rho }}\le \left( \sum _{(k,j)\notin A}e^{\rho (|k|+|j|)}\sup _{t\in [0,\infty )}|h_{k,j}(t)|\right) \\&\quad \Bigg [\sup _{t\in [0,\infty )}\sup _{(k,j)\notin A}\Bigg ((1+|k|+|j|) \exp \left\{ \sigma (k,j)t\right\} \int _{0}^{t}\exp \left\{ -\sigma (k,j)s\right\} \ ds \Bigg )\Bigg ]. \end{aligned}$$

The first factor on the right-hand side can simply be bounded by \(\Vert h\Vert _{{\mathcal {B}}^{0}_{\rho }}.\) A bound on the second factor (i.e., the double supremum) can be found by directly computing the integral, which gives:

$$\begin{aligned} \Vert I^{+}h\Vert _{{\mathcal {B}}^{1}_{\rho }}\le \Vert h\Vert _{{\mathcal {B}}^{0}_{\rho }} \left[ \sup _{t\in [0,\infty )}\sup _{(k,j)\notin A} \frac{(1+|k|+|j|)(1-\exp \{\sigma (k,j)t\}}{-\sigma (k,j)}\right] . \end{aligned}$$

The negative term in the numerator can be neglected. Therefore, we have

$$\begin{aligned} \Vert I^{+}h\Vert _{{\mathcal {B}}^{1}_{\rho }}\le K_{1}\Vert h\Vert _{{\mathcal {B}}^{0}_{\rho }}, \end{aligned}$$

where

$$\begin{aligned} K_{1}=\sup _{(k,j)\notin A}\frac{1+|k|+|j|}{-\sigma (k,j)}. \end{aligned}$$

We now turn to proving estimates on the semigroup. We show that \(e^{(-\Delta ^{2}-\Delta )t}\) maps \({\mathbb {P}}_5(B^{1}_{\rho })\) into \({\mathcal {B}}^{1}_{\rho }\) boundedly. In fact, we first observe that

$$\begin{aligned}&\Vert e^{(-\Delta ^{2}-\Delta )t} f\Vert _{{\mathcal {B}}_{\rho }^{1}} \\&\quad \le \sum _{(k,j)\notin A}e^{\rho (|k|+|j|)}(1+|k|+|j|)\sup _{t\in [0,\infty )} \exp \left\{ \sigma (k,j)t\right\} |(f)_{k,j}|, \end{aligned}$$

if \(f\in {\mathbb {P}}_5(B^{1}_{\rho })\). The supremum is achieved at \(t=0\) for every \((k,j)\notin A.\) (Recall that f is supported in Fourier space only on wavenumbers in the complement of the set A.) We therefore have

$$\begin{aligned} \Vert e^{(-\Delta ^{2}-\Delta )t} f\Vert _{{\mathcal {B}}^{1}_{\rho }} \le \sum _{(k,j)\notin A}e^{\rho (|k|+|j|)}(1+|k|+|j|)|(f)_{k,j}| = \Vert f\Vert _{B^{1}_{\rho }}. \end{aligned}$$
(22)

We will apply these semigroup estimates to the remainder terms \(w^n\).

5 Inductive Argument and Convergence

We are now ready to complete the proof of Theorem 1. First, in Proposition 3 we obtain uniform bounds on the iterates by induction, using the bounds already established. Then in Theorem 2, we state a precise version of our main result, existence of a global mild solution \(\phi \), which follows by passing to the limit \(n\rightarrow \infty \) and using compactness arguments.

Proposition 3

Fix \(\rho >0.\) Let \(\varepsilon =\max \{\varepsilon _{1},\varepsilon _{2}\}\), where \(\varepsilon _i\), \(i=1,2\), is given in (6). Assume the initial data \(a_{1,0}(0),\) \(a_{2,0}(0),\) \(a_{0,1}(0),\) \(a_{0,2}(0),\) and w(0) satisfy

$$\begin{aligned}&(a_{1,0}(0))^{2}+(a_{2,0}(0))^{2}\le \frac{M_{1,1}\varepsilon }{4}, \\&\quad (a_{0,1}(0))^{2}+(a_{0,2}(0))^{2}\le \frac{M_{1,2}\varepsilon }{4}, \\&\quad |a_{2,0}(0)|\le \frac{M_{2,1}\varepsilon }{2}, \qquad |a_{0,2}(0)|\le \frac{M_{2,2}\varepsilon }{2}, \\&\quad \Vert w(0)\Vert _{B_{\rho }^{1}}\le \frac{M_{3}}{6}\varepsilon ^{3/2}. \end{aligned}$$

Then, there exists \(\varepsilon _{*}>0\) such that for \(i\in \{1,2\},\) for all \(\varepsilon _{i}\in (0,\varepsilon _{*}),\) and for all n,  the following bounds are satisfied:

$$\begin{aligned}&\sup _{t\in [0,\infty )}|a^{n}_{1,0}|\le 2M_{1,1}^{1/2}\varepsilon ^{1/2}, \qquad \sup _{t\in [0,\infty )}|a^{n}_{0,1}|\le 2M_{1,2}^{1/2}\varepsilon ^{1/2}, \end{aligned}$$
(23)
$$\begin{aligned}&\sup _{t\in [0,\infty )}|a^{n}_{2,0}|\le M_{2,1}\varepsilon , \qquad \sup _{t\in [0,\infty )}|a^{n}_{0,2}|\le M_{2,2}\varepsilon , \end{aligned}$$
(24)
$$\begin{aligned}&\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}}\le M_{3}\varepsilon ^{3/2}. \end{aligned}$$
(25)
$$\begin{aligned}&\sup _{t\in [0,\infty )}|F_{1,0,x}^{n}|\le K\varepsilon ^{2}, \qquad \sup _{t\in [0,\infty )}|F_{1,0,y}^{n}|\le K\varepsilon ^{2}, \end{aligned}$$
(26)
$$\begin{aligned}&\sup _{t\in [0,\infty )}|F_{2,0,x}^{n}|\le K\varepsilon ^{2}, \qquad \sup _{t\in [0,\infty )}|F_{2,0,y}^{n}|\le K\varepsilon ^{2}. \end{aligned}$$
(27)
$$\begin{aligned}&\sup _{t\in [0,\infty )}|F_{0,1,x}^{n}|\le K\varepsilon ^{2}, \qquad \sup _{t\in [0,\infty )}|F_{0,1,y}^{n}|\le K\varepsilon ^{2}, \end{aligned}$$
(28)
$$\begin{aligned}&\sup _{t\in [0,\infty )}|F_{0,2,x}^{n}|\le K\varepsilon ^{2}, \qquad \sup _{t\in [0,\infty )}|F_{0,2,y}^{n}|\le K\varepsilon ^{2}. \end{aligned}$$
(29)

Proof

We initialize our iterative scheme with \(a_{1,0}^{0}=a_{2,0}^{0}=0,\) \(a_{0,1}^{0}=a_{0,2}^{0}(0)=0,\) and \(w^{0}=0.\) Bounds (23), (24), (25), (26), (27), (28), and (29) are trivially satisfied by \(a_{1,0}^{0},\) \(a_{2,0}^{0},\) \(a_{0,1}^{0},\) \(a_{0,2}^{0},\) and \(w^{0}.\) We assume (23), (24), (25), (26), (27), (28), and (29), as our inductive hypothesis. We now prove the analogues of these for the next iterate. We recall the definition of \(\phi ^{n}\) from (9).

An appeal to Proposition 1 with \(i=1\) immediately proves the desired bound on \(a_{1,0}^{n+1},\) and another appeal to Proposition 1 with \(i=2\) immediately proves the desired bound on \(a_{0,1}^{n+1}.\) Then appealing twice to Proposition 2 again immediately proves the desired bounds on \(a_{2,0}^{n+1}\) and \(a_{0,2}^{n+1}.\)

Next, we write the mild formulation of the equation for \(w^{n+1}\) from (10). Since \({\mathbb {P}}_{5}\) is a projection, we may write \({\mathbb {P}}_{5}={\mathbb {P}}_{5}^{2}.\) Using the definition of \(I^{+}\) introduced in Sect. 4, we have

$$\begin{aligned} w^{n+1}=e^{(-\Delta ^{2}-\Delta )t}w_{0}+I^{+}({\mathbb {P}}_{5}((\phi _{x}^{n})^{2} +(\phi _{y}^{n})^{2})). \end{aligned}$$

Using the bounds developed in Sect. 4, we can then estimate \(w^{n+1}\) as follows:

$$\begin{aligned} \Vert w^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{1}} \le \Vert w_{0}\Vert _{B_{\rho }^{1}} +K_{1}\left\| {\mathbb {P}}_{5}((\phi _{x}^{n})^{2})\right\| _{{\mathcal {B}}_{\rho }^{0}} +K_{1}\left\| {\mathbb {P}}_{5}((\phi _{y}^{n})^{2})\right\| _{{\mathcal {B}}_{\rho }^{0}}. \end{aligned}$$
(30)

In order to close the induction argument, we need to express \(\phi _{x}^{n}\) and \(\phi _{y}^{n}\) in terms of the quantities we are estimating. To this end, we will use a different decomposition for \((\phi _{x}^{n})^{2}\) and \((\phi _{y}^{n})^{2}\) than the one used in Sect. 1.1. We decompose \((\phi _{x}^{n})^{2}\) in the following way:

$$\begin{aligned} (\phi _{x}^{n})^{2}=\Phi _{0}+\Phi _{1}+\Phi _{2}w_{x}^{n}+(w_{x}^{n})^{2}, \end{aligned}$$
(31)

where \(\Phi _{0},\) \(\Phi _{1},\) and \(\Phi _{2}\) are given by

$$\begin{aligned} \Phi _{0}= & {} \frac{4\pi ^{2}(a_{1,0}^{n})^{2}}{L_{1}^{2}}\sin ^{2}\left( \frac{2\pi x}{L_{1}}\right) ,\\ \Phi _{1}= & {} \frac{16\pi ^{2}a_{1,0}^{n}a_{2,0}^{n}}{L_{1}^{2}} \sin \left( \frac{2\pi x}{L_{1}}\right) \sin \left( \frac{4\pi x}{L_{1}}\right) \\&+\frac{16\pi ^{2}(a_{2,0}^{n})^{2}}{L_{1}^{2}}\sin ^{2}\left( \frac{4\pi x}{L_{1}}\right) , \\ \Phi _{2}= & {} -\frac{4\pi a_{1,0}^{n}}{L_{1}}\sin \left( \frac{2\pi x}{L_{1}}\right) \\&-\frac{8\pi a_{2,0}^{n}}{L_{1}}\sin \left( \frac{4\pi x}{L_{1}}\right) . \end{aligned}$$

One reason for this decomposition is that \({\mathbb {P}}_{5}\Phi _{0}=0.\) Another reason is that the term \(\Phi _{0}\) is larger than the remaining terms; \(\Phi _{1},\) \(\Phi _{2}w^{n}_{x},\) and \((w_{x}^{n})^{2}\) are all of order \(\varepsilon ^{3/2}\) or smaller, while the same is not true for \(\Phi _{0}.\)

We now estimate \(\left\| {\mathbb {P}}_{5}((\phi _{x}^{n})^{2})\right\| _{{\mathcal {B}}_{\rho }^{0}},\) using the fact that \({\mathbb {P}}_{5}\Phi _{0}=0\) and the fact \({\mathbb {P}}_{5}\) is bounded:

$$\begin{aligned} \left\| {\mathbb {P}}_{5}((\phi _{x}^{n})^{2})\right\| _{{\mathcal {B}}_{\rho }^{0}} \le \Vert \Phi _{1}^{n}\Vert _{{\mathcal {B}}_{\rho }^{0}} + \Vert \Phi _{2}^{n}\Vert _{{\mathcal {B}}_{\rho }^{0}}\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}} +\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}}^{2}. \end{aligned}$$
(32)

Above, we have also used the algebra property for \({\mathcal {B}}_{\rho }^{0}\) and that \(\Vert w_{x}^{n}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le \Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}},\) which is a direct consequence of the definition.

We can then make some straightforward estimates of \(\Phi _{1}\) and \(\Phi _{2}.\) For \(\Phi _{1}\), we have

$$\begin{aligned} \Vert \Phi _{1}\Vert _{{\mathcal {B}}_{\rho }^{0}} \le 2M_{1,1}^{1/2}M_{2,1}K_{2}\varepsilon ^{3/2}+M_{2,1}^{2}K_{2}\varepsilon ^{2}. \end{aligned}$$

Of course, to get this bound, we have employed the inductive hypothesis and the fact that \( \varepsilon _{1}\le \varepsilon .\) We recall that \(M_{3}\le 6K_{1}(2M_{1,1}^{1/2}M_{2,1}K_{2})\) to conclude that

$$\begin{aligned} \Vert \Phi _{1}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le \frac{M_{3}}{6K_{1}}\varepsilon ^{3/2} +M_{2,1}^{2}K_{2}\varepsilon ^{2}. \end{aligned}$$

We take \(\varepsilon \) small enough so that

$$\begin{aligned} M_{2,1}^{2}K_{2}\varepsilon ^{2}\le \frac{M_{3}}{6K_{1}}\varepsilon ^{3/2}. \end{aligned}$$

We thus have

$$\begin{aligned} \Vert \Phi _{1}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le \frac{M_{3}}{3K_{1}}\varepsilon ^{3/2}. \end{aligned}$$
(33)

We next turn to bounding \(\Phi _{2}.\) By using again the inductive hypothesis and the definition of the constant \(K_{2},\) it follows that

$$\begin{aligned} \Vert \Phi _{2}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le 2M_{1,1}^{1/2}K_{2}\varepsilon ^{1/2} +M_{2,1}K_{2}\varepsilon . \end{aligned}$$

Another application of the inductive hypothesis gives that

$$\begin{aligned} \Vert \Phi _{2}\Vert _{{\mathcal {B}}_{\rho }^{0}}\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}} \le 2M_{1,1}^{1/2}M_{3}K_{2}\varepsilon ^{2}+M_{2,1}M_{3}K_{2}\varepsilon ^{5/2}. \end{aligned}$$

We take \(\varepsilon \) small enough so that

$$\begin{aligned} 2M_{1,1}^{1/2}M_{3}K_{2}\varepsilon ^{2}+M_{2,1}M_{3}K_{2}\varepsilon ^{5/2} \le \frac{M_{3}}{24K_{1}}\varepsilon ^{3/2}. \end{aligned}$$

We then have

$$\begin{aligned} \Vert \Phi _{2}\Vert _{{\mathcal {B}}_{\rho }^{0}}\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}}\le \frac{M_{3}}{24K_{1}}\varepsilon ^{3/2}. \end{aligned}$$
(34)

We proceed in a similar fashion to bound the quadratic term:

$$\begin{aligned} \Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}}^{2}\le M_{3}^{2}\varepsilon ^{3}. \end{aligned}$$

We take \(\varepsilon \) small enough so that

$$\begin{aligned} M_{3}^{2}\varepsilon ^{3}\le \frac{M_{3}}{24K_{1}}\varepsilon ^{3/2}, \end{aligned}$$
(35)

which implies

$$\begin{aligned} \Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}}^{2}\le \frac{M_{3}}{24K_{1}}\varepsilon ^{3/2}. \end{aligned}$$
(36)

Having concluded our treatment of \((\phi _{x}^{n})^{2},\) we now consider \((\phi _{y}^{n})^{2}.\) We may treat \((\phi _{y}^{n})^{2}\) analogously to the way we treated \((\phi _{x}^{n})^{2}\) in (31), leading to the decomposition

$$\begin{aligned} (\phi _{y}^{n})^{2}=\Phi _{3}+\Phi _{4}+\Phi _{5}w_{y}^{n}+(w_{y}^{n})^{2}, \end{aligned}$$

with the formulas

$$\begin{aligned} \Phi _{3}= & {} \frac{4\pi ^{2}(a_{0,1}^{n})^{2}}{L_{2}^{2}}\sin ^{2}\left( \frac{2\pi y}{L_{2}}\right) , \\ \Phi _{4}= & {} \frac{16\pi ^{2}a_{0,1}^{n}a_{0,2}^{n}}{L_{2}^{2}} \sin \left( \frac{2\pi y}{L_{2}}\right) \sin \left( \frac{4\pi y}{L_{2}}\right) \\&+\frac{16\pi ^{2}(a_{0,2}^{n})^{2}}{L_{2}^{2}}\sin ^{2}\left( \frac{4\pi y}{L_{2}}\right) , \\ \Phi _{5}= & {} -\frac{4\pi a_{0,1}^{n}}{L_{2}}\sin \left( \frac{2\pi y}{L_{2}}\right) \\&-\frac{8\pi a_{0,2}^{n}}{L_{2}}\sin \left( \frac{4\pi y}{L_{2}}\right) . \end{aligned}$$

Similarly to the case for \(\Phi _{0},\) we have \({\mathbb {P}}_{5}\Phi _{3}=0.\) We may then bound \((\phi _{y}^{n})^{2}\) as

$$\begin{aligned} \Vert {\mathbb {P}}_{5}(\phi _{y}^{n})^{2}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le \Vert \Phi _{4}\Vert _{{\mathcal {B}}_{\rho }^{0}} +\Vert \Phi _{5}\Vert _{{\mathcal {B}}_{\rho }^{0}}\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}} +\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}}^{2}. \end{aligned}$$
(37)

We next proceed to estimating the remaining terms in a manner analogous to the previous case, omitting details:

$$\begin{aligned} \Vert \Phi _{4}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le \frac{M_{3}}{3K_{1}}\varepsilon ^{3/2}, \end{aligned}$$
(38)
$$\begin{aligned} \Vert \Phi _{5}\Vert _{{\mathcal {B}}_{\rho }^{0}}\Vert w^{n}\Vert _{{\mathcal {B}}_{\rho }^{1}} \le \frac{M_{3}}{24K_{1}}\varepsilon ^{3/2}, \end{aligned}$$
(39)

where \(\varepsilon \) is chosen sufficiently small.

We recall the condition on the initial data for w,  namely,

$$\begin{aligned} \Vert w_{0}\Vert _{B_{\rho }^{1}}\le \frac{M_{3}}{6}\varepsilon ^{3/2}. \end{aligned}$$
(40)

We then combine (30), (32), (33), (34), (36), (37), (38), (39), and (40) to conclude

$$\begin{aligned} \Vert w^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{1}}\le M_{3}\varepsilon ^{3/2}. \end{aligned}$$
(41)

It remains to demonstrate the estimates for the forcing terms. We include a proof only for \(F^{n+1}_{1,0,x}\), as the other cases are similar.

Recalling the formulas of Sect. 1.1, we have the decomposition for \(F^{n+1}_{1,0,x}\)

$$\begin{aligned} F_{1,0,x}^{n+1}\cos \left( \frac{2\pi x}{L_{1}}\right) = {\mathbb {P}}_{1,0}\left[ \Psi _{3}^{n+1}+\Psi _{5}^{n+1}+\Psi _{6}^{n+1}\right] , \end{aligned}$$

where

$$\begin{aligned} \Psi _{3}^{n+1}= & {} (w_{x}^{n+1})^{2}, \\ \Psi _{5}^{n+1}= & {} -\frac{4\pi a_{1,0}^{n+1}}{L_{1}}w_{x}^{n+1}\sin \left( \frac{2\pi x}{L_{1}}\right) , \\ \Psi _{6}^{n+1}= & {} -\frac{8\pi a_{2,0}^{n+1}}{L_{1}}w_{x}^{n+1}\sin \left( \frac{4\pi x}{L_{1}}\right) . \end{aligned}$$

We may then estimate \(F^{n+1}_{1,0,x}\) as

$$\begin{aligned} |F_{1,0,x}^{n+1}|\le \frac{\Vert \Psi _{3}^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{0}} +\Vert \Psi _{5}^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{0}} +\Vert \Psi _{6}^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{0}}}{\left\| \cos \left( \frac{2\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}. \end{aligned}$$
(42)

For \(\Psi _{3}^{n+1}\), we have the estimate

$$\begin{aligned} \Vert \Psi _{3}^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le \Vert w_{x}^{n+1} \Vert _{{\mathcal {B}}_{\rho }^{0}}^{2} \le \Vert w^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{1}}^{2}\le M_{3}^{2}\varepsilon ^{3}. \end{aligned}$$
(43)

For \(\Psi _{5}^{n+1}\), we use (13), as well as the inductive hypothesis, finding

$$\begin{aligned} \Vert \Psi _{5}^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le 2M_{1,1}^{1/2}M_{3}K_{2}\varepsilon ^{2}. \end{aligned}$$
(44)

The next term, \(\Psi _{6}^{n+1},\) is similar, and we bound it as

$$\begin{aligned} \Vert \Psi _{6}^{n+1}\Vert _{{\mathcal {B}}_{\rho }^{0}}\le M_{2,1}M_{3}K_{2}\varepsilon ^{5/2}. \end{aligned}$$
(45)

From the definition of K, we have the bound

$$\begin{aligned} \frac{3M_{1,1}^{1/2}M_{3}K_{2}}{\left\| \cos \left( \frac{2\pi x}{L_{1}}\right) \right\| _{{\mathcal {B}}_{\rho }^{0}}}\le K. \end{aligned}$$

By exploiting this bound, combining (42), (43), (44), and (45), and taking \(\varepsilon \) sufficiently small, we have

$$\begin{aligned} |F_{1,0,x}^{n+1}|\le K\varepsilon ^{2}. \end{aligned}$$

We omit the details of the bounds for \(F^{n+1}_{2,0,x},\) \(F^{n+1}_{1,0,y},\) \(F^{n+1}_{2,0,y},\) \(F^{n+1}_{0,1,x},\) \(F^{n+1}_{0,1,y},\) \(F^{n+1}_{0,2,x},\) and \(F^{n+1}_{0,2,y}.\) Therefore, this completes the proof. \(\square \)

We now state our main theorem.

Theorem 2

Let \({\bar{\psi }}_{0}\in {\mathbb {R}}\) be given. Fix \(0<T<\infty \). Let \(\varepsilon _{i}=-\left( \frac{2\pi }{L_{i}}\right) ^{4}+\left( \frac{2\pi }{L_{i}}\right) ^{2}\), and set \(\varepsilon =\max \{\varepsilon _{1}, \varepsilon _{2}\}.\) There exists \(\varepsilon _{*}>0\) (dependent only on \(\rho ,\) \(L_{1}\) and \(L_{2}\) and not on T) such that if \(\varepsilon \in (0,\varepsilon _{*})\), and if

$$\begin{aligned}&(a_{1,0}(0))^{2}+(a_{2,0}(0))^{2}\le M_{1,1}\varepsilon /4, \\&|a_{2,0}(0)|\le M_{2,1}\varepsilon /2, \\&(a_{0,1}(0))^{2}+(a_{0,2}(0))^{2}\le M_{1,2}\varepsilon /4, \\&|a_{0,2}(0)|\le M_{2,2}\varepsilon /2, \\&\Vert w(0)\Vert _{{\mathcal {B}}_{\rho }^{1}}\le M_{3}\varepsilon ^{3/2}/6, \end{aligned}$$

then the Kuramoto–Sivashinsky equation (1) on the torus \({\mathbb {T}}^2=[0,L_1]\times [0,L_2]\) with initial data

$$\begin{aligned} \psi (x,y,0)= & {} {\bar{\psi }}_{0}+a_{1,0}(0)\cos \left( \frac{2\pi x}{L_{1}}\right) +a_{2,0}(0)\cos \left( \frac{4\pi x}{L_{1}}\right) \\&+a_{0,1}(0)\cos \left( \frac{2\pi y}{L_{2}}\right) +a_{0,2}(0)\cos \left( \frac{4\pi y}{L_{2}}\right) +w_{0}(x,y) \end{aligned}$$

has a mild solution that is analytic in space on [0, T].

Proof

By Proposition 3, the family \(\{\phi ^{n}\}_{n\in {\mathbb {N}}}\), where \(\phi ^n\) is given in (9), is a uniformly bounded family of functions analytic in space, with radius of analyticity \(\rho \) independent of t, and continuous and bounded in \(t\in [0,\infty )\). Upon passing to a subsequence if necessary, not relabeled, we may thus find a limit as \(n\rightarrow \infty \) that is analytic in space by Montel’s theorem, continuous and bounded in time. This is enough regularity to pass to the limit in the mild formulation of the evolution equations. Thus, the limit of the iterates, \(\phi ,\) exists and solves (2) on \([0,\infty )\).

We next turn to show the existence of \(\psi \) solving the 2D KSE (1) on [0, T] for an arbitrary \(0<T<\infty \). We may write \(\psi =\phi +{\bar{\psi }}\), where \({\bar{\psi }}\) solves (3) with the initial condition \({\bar{\psi }}_{0}.\) As noted in the introduction, the initial value problem for \({\bar{\psi }}\) can be solved on the time interval [0, T] as long as \(\phi \in L^{2}([0,T];{\dot{H}}^{1})\), which is the case given the regularity established above on \(\phi \). \(\square \)

Notice that we state this theorem on the interval [0, T] for arbitrary T,  rather than using the interval \([0,\infty ).\) This is because we do not know that the mean, \({\bar{\psi }},\) has a well-defined limit as \(t\rightarrow \infty \) since the mild formulation for \(\phi \) does not imply that \(\phi \in L^2([0,\infty );{\dot{H}}^1)\). Nevertheless, we have achieved the stated goal, showing that the two-dimensional Kuramoto–Sivashinsky equation has small solutions for all time in the presence of two linearly growing Fourier modes, one in each direction.

Remark 1

  1. (1)

    Our goal in this work is to establish global existence; therefore, we do not treat in detail the uniqueness of the solution. However, uniqueness in \(C([0,T],L^2)\) for any \(0<T<\infty \) follows in a manner similar to that for viscous Hamilton–Jacobi equations and other semilinear parabolic equations (see, for instance, (Taylor 2011, Proposition 1.1, p. 315).

  2. (2)

    As we have stated, our prior work Ambrose and Mazzucato (2019) proved global existence when \(L_{1}\) and \(L_{2}\) are each in the interval \((0,2\pi ),\) while here we have shown global existence when they are each slightly larger than \(2\pi .\) The present results do also extend to the case when only one of these lengths is slightly larger than \(2\pi ,\) and the other is smaller. We also expect that arguments similar to those in this work and in Ambrose and Mazzucato (2019) will give the existence of a mild solution with initial data in \(L^2\) under hypotheses akin to those in Theorem 2. For brevity and clarity, we chose not to pursue this result here.

  3. (3)

    In Ambrose and Mazzucato (2019), we proved two analyticity results for (1) in the absence of linearly growing modes. One of these results is that solutions of (1) with data in \(H^1\) are analytic for \(t>0\), and the radius of analyticity can grow polynomially at first and then decay exponentially. The other is that with small data in the Wiener algebra, the radius of analyticity grows at least linearly in time. By contrast, in the present theorem, we have taken data in the spaces \(B^m_\rho \) with \(\rho >0;\) such data is analytic with radius of analyticity at least \(\rho .\) The proof of Theorem 2 gives that the radius of analyticity of the remainder term remains analytic with radius of analyticity at least \(\rho .\) Regularity of local-in-time solutions for (1) in Gevrey classes was studied in Biswas and Swanson (2007).

6 Conclusion

We have shown the existence of small solutions of the Kuramoto–Sivashinsky equation in two space dimensions for all time, when the size of the domain admits a linearly growing mode in each direction. To our knowledge, this is the first result of this kind. The method of proof is new, in combining a dynamical system approach for a finite number of modes with function space estimates for the remaining infinitely many modes. This approach raises the possibility that significant further progress could be possible, extending beyond the present case of a pair of slightly growing modes, by designing different Lyapunov functions or making different choices of function spaces. Another possible area of extension is extracting more detailed information about the solutions; while we showed that \(w\sim \varepsilon ^{3/2},\) a finer description of amplitudes could be made for the modes encompassed by w. The method of the present work could also be extended to other systems, including more fundamental systems in flame propagation. That is, the Kuramoto–Sivashinsky equation is a weakly nonlinear model and can be proved to be a valid approximation for coordinate-free models; global existence of solutions for these coordinate-free and other models is of interest (see Akers and Ambrose 2021; Ambrose et al. 2020; Frankel and Sivashinsky 1987).