Abstract
This paper is devoted to proving the strong averaging principle for slow–fast stochastic partial differential equations with locally monotone coefficients, where the slow component is a stochastic partial differential equations with locally monotone coefficients and the fast component is a stochastic partial differential equations with strongly monotone coefficients. The result is applicable to a large class of examples, such as the stochastic porous medium equation, the stochastic p-Laplace equation, the stochastic Burgers type equation and the stochastic 2D Navier–Stokes equation, which are the nonlinear stochastic partial differential equations. The main techniques are based on time discretization and the variational approach to stochastic partial differential equations.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
For \(i=1,2\), let \((H_i, \Vert \cdot \Vert _{H_i})\) be a separable Hilbert spaces with inner product \(\langle \cdot ,\cdot \rangle _{H_i}\) and \(H^{*}_i\) its dual. Let \((V_i, \Vert \cdot \Vert _{V_i})\) be a reflexive Banach space, such that \(V_i\subseteq H_i\) continuously and densely. Then for its dual space \(V^{*}_i\) it follows that \(H^{*}_i\subseteq V^{*}_i\) continuously and densely. Identifying \(H_i\) and \(H^{*}_i\) via the Riesz isomorphism we have that
is a Gelfand triple. Let \(_{V^{*}_i}\langle ~, ~\rangle _{V_i}\) be the dualization between \(V^{*}_i\) and \(V_i\). Then it follows that
For \(i=1,2\), let \(\{W^{i}_t\}_{t\geqslant 0}\) be a cylindrical \(\mathscr {F}_t\)-Wiener process in a separable Hilbert space \((U_i, \Vert \cdot \Vert _{U_i})\) on a probability space \((\Omega ,\mathscr {F},\mathbb {P})\) with natural filtration \(\mathscr {F}_{t}\). Let \(L_{2}(U_i,H_i)\) be the space of Hilbert-Schmidt operator from \(U_i\rightarrow H_i\). The norm on \(L_{2}(U_i,H_i)\) is defined by
where \(\{e_{i,k}\}_{k\in \mathbb {N}}\) is an orthonormal basis of \(U_i\). We also assume the processes \(\{W^{1}_t\}_{t\geqslant 0}\) and \(\{W^{2}_t\}_{t\geqslant 0}\) are independent.
In this paper, we consider the following abstract stochastic partial differential equations (SPDEs)
where \(\varepsilon >0\) is a small parameter describing the ratio of the time scale between the slow component \(X^{\varepsilon }_t\) and the fast component \(Y^{\varepsilon }_t\), and the coefficients
and
are measurable.
The averaging principle has a long and rich history in multiscale models, which has wide applications in material sciences, chemistry, fluid dynamics, biology, ecology and climate dynamics, see, e.g., [1, 10, 17, 22] and the references therein. Usually, a multiscale model can be described through coupled equations, which correspond to the "slow" and "fast" component, respectively. The averaging principle is essential to describe the asymptotic behavior of the slow component, i.e., the slow component will convergence to the so-called averaged equation. Bogoliubov and Mitropolsky [2] first studied the averaging principle for deterministic systems, which then was extended to stochastic differential equations by Khasminskii [18].
Since the averaging principle for a general class of stochastic reaction-diffusion systems with two time-scales were investigated by Cerrai and Freidlin in [6], the averaging principle for slow–fast SPDEs has initiated further studies in the past decade, including other types of SPDEs, various ways of convergence and rates of convergence. For instance, Bréhier obtained the strong and weak orders in averaging for stochastic evolution equation of parabolic type with slow and fast time scales in [3]. Fu, Wan and Liu proved the strong averaging principle for stochastic hyperbolic-parabolic equations with slow and fast time-scales in [13]. Cerrai and Lunardi studied the averaging principle for nonautonomous slow–fast systems of stochastic reaction-diffusion equations in [7]. For some further results on this topic, we refer to [4, 11, 12, 21, 24, 25] and the references therein.
However, the references we mentioned above always assume that the coefficients satisfy Lipschitz conditions, and there are few results on the average principle for SPDEs with nonlinear terms. For example, stochastic reaction-diffusion equations with polynomial coefficients [5], stochastic Burgers equation [9], stochastic two dimensional Navier–Stokes equations [19], stochastic Kuramoto-Sivashinsky equation [14], stochastic Schrödinger equation [15] and stochastic Klein-Gordon equation [16]. But all these papers consider semilinear SPDEs (i.e., for operators \(A=A_1+A_2\) with \(A_1\) a linear operator and \(A_2\) a nonlinear perturbation), and use the mild solution approach to SPDEs exploiting the smoothing properties of the \(C_0\)- semigroup \(e^{A_1 t}\) generated by the linear operator \(A_1\) in an essential way. To the best of our knowledge, the case of the operator A has no linear part hasn’t been studied yet, such as the porous medium operator and the p-Laplace operator.
Hence, the main purpose of this paper is to prove the strong averaging principle for slow–fast SPDEs within the (generalized) variational framework, i.e., locally monotone and strongly monotone coefficients for the slow and fast equations respectively. Our result covers a large class of examples (see [20, Sects. 4 and 5]), especially for the case that the slow equation is a quasilinear SPDEs, such as the stochastic porous medium equation or the stochastic p-Laplace equation. Our result is also applicable to the stochastic Burgers type equation and stochastic two dimensional Navier–Stokes equation, whose coefficients only satisfy the local monotonicity conditions.
The main difficulty here is how to avoid applying the techniques which only work in the case of the mild solution approach, and use the techniques from the variational approach. More precisely, we will use the variational approach to estimate the integral of the time increment of \(X_{t}^{\varepsilon }\) instead of studying the Hölder continuity of time, which is strong enough for our purpose. We will also use the variational approach to obtain some apriori estimates of the solution, which are crucial to construct a proper stopping time to deal with the nonlinear terms.
The rest of the paper is organized as follows. In Sect. 2, under some suitable assumptions, we formulate our main result. Section 3 is devoted to proving the main result. In Sect. 4, we will give some examples to illustrate the wide applicability of our result. In the Appendix, we give the detailed proof of the existence and uniqueness of solutions to system (1.1).
Throughout the paper, C, \(C_T\) and \(C_{p,T}\) denote positive constants which may change from line to line, where \(C_T\) and \(C_{p,T}\) are used to emphasize that the constants depend on T and p, T respectively.
2 Main Result
For the coefficients of the slow equation, we suppose that there exist constants \(\alpha \in (1,\infty )\), \(\beta \in [0, \infty )\), \(\theta \in (0,\infty )\) and \(C>0\) such that the following conditions hold for all \(u,v,w\in V_1\), \(u_1,v_1\in H_1\) and \(u_2,v_2\in H_2\):
- A1:
-
(Hemicontinuity) The map \(\lambda \rightarrow {_{V^{*}_1}}\langle A(u+\lambda v), w\rangle _{V_1}\) is continuous on \(\mathbb {R}\).
- A2:
-
(Local monotonicity)
$$\begin{aligned} 2_{V^{*}_1}\langle A(u)-A(v), u-v\rangle _{V_1}+\Vert G_1(u)-G_1(v)\Vert ^2_{L_2(U_1,H_1)}\leqslant \rho (v)\Vert u-v\Vert ^2_{H_1}. \end{aligned}$$where \(\rho : V_1\rightarrow [0,\infty )\) is a measurable hemicontinuous function satisfying
$$\begin{aligned} |\rho (v)|\leqslant C\left[ (1+\Vert v\Vert ^{\alpha }_{V_1})(1+\Vert v\Vert ^{\beta }_{H_1})\right] . \end{aligned}$$Furthermore,
$$\begin{aligned} \Vert F(u_1,u_2)-F(v_1,v_2)\Vert _{H_1}\leqslant C(\Vert u_1-v_1\Vert _{H_1}+\Vert u_2-v_2\Vert _{H_2}). \end{aligned}$$(2.1) - A3:
-
(Coercivity)
$$\begin{aligned} _{V^{*}_1}\langle A(v), v\rangle _{V_1}\leqslant C\Vert v\Vert ^2_{H_1}-\theta \Vert v\Vert ^{\alpha }_{V_1}+C. \end{aligned}$$ - A4:
-
(Growth)
$$\begin{aligned} \Vert A(v)\Vert ^{\frac{\alpha }{\alpha -1}}_{V^{*}_1}\leqslant C(1+\Vert v\Vert ^{\alpha }_{V_1})(1+\Vert v\Vert ^{\beta }_{H_1}) \end{aligned}$$and
$$\begin{aligned} \Vert G_1(v)\Vert _{L_2(U_1,H_1)}\leqslant C(1+\Vert v\Vert _{H_1}). \end{aligned}$$
For the coefficients of the fast equation, we suppose that there exist constants \(\kappa \in (1,\infty )\), \(\gamma ,\eta \in (0, \infty )\), \(\zeta \in (0,1)\) and \(C>0\) such that the following conditions hold for all \(u,v,w\in V_2\), \(u_1,v_1\in H_1\):
- B1:
-
(Hemicontinuity) The map \(\lambda \rightarrow _{V^{*}_2}\langle B(u_1+\lambda v_1,u+\lambda v), w\rangle _{V_2}\) is continuous on \(\mathbb {R}\).
- B2:
-
(Strong monotonicity)
$$\begin{aligned}{} & {} 2_{V^{*}_2}\langle B(u_1,u)-B(v_1,v), u-v\rangle _{V_2}+\Vert G_2(u_1,u)-G_2(v_1,v)\Vert ^2_{L_2(U_2,H_2)}\nonumber \\{} & {} \quad \leqslant -\gamma \Vert u-v\Vert ^2_{H_2}+C\Vert u_1-v_1\Vert ^2_{H_1}. \end{aligned}$$(2.2) - B3:
-
(Coercivity)
$$\begin{aligned} _{V^{*}_2}\langle B(u_1,v), v\rangle _{V_2}\leqslant C\Vert v\Vert ^2_{H_2}-\eta \Vert v\Vert ^{\kappa }_{V_2}+C(1+\Vert u_1\Vert ^2_{H_1}). \end{aligned}$$ - B4:
-
(Growth)
$$\begin{aligned} \Vert B(u_1,v)\Vert _{V^{*}_2}\leqslant C\left( 1+\Vert v\Vert ^{\kappa -1}_{V_2}+\Vert u_1\Vert ^{\frac{2(\kappa -1)}{\kappa }}_{H_1}\right) \end{aligned}$$and
$$\begin{aligned} \Vert G_2(u_1,v)\Vert _{L_2(U_2,H_2)}\leqslant C(1+\Vert u_1\Vert _{H_1}+\Vert v\Vert ^{\zeta }_{H_2}). \end{aligned}$$(2.3)
Remark 2.1
We here give some comments for the assumptions above.
-
Condition (2.2) is also called the dissipativity condition, which guarantees that there exists a unique invariant measure for the frozen equation and the exponential ergodicity holds.
-
The \(\zeta \in (0,1)\) in condition (2.3) is used to prove the p-th moments of the solution \((X^{{\varepsilon }}_t, Y^{{\varepsilon }}_t)\) are finite, when p is large enough, which could be removed if we assume the Lispchitz constant of \(G_2\) is sufficiently small.
Now, we recall the definition of a variational solution in [20].
Definition 2.2
For any given \({\varepsilon }>0\), a continuous \(H_1\times H_2\)-valued \(\mathscr {F}_{t}\)-adapted process \((X^{{\varepsilon }}_t, Y^{{\varepsilon }}_t)_{t\in [0, T]}\) is called a solution of system (1.1), if for its \(dt\otimes \mathbb {P}\)-equivalence class \((\check{X}^{{\varepsilon }}, \check{Y}^{{\varepsilon }})\) we have \(\check{X}^{{\varepsilon }}\in L^{\alpha }([0, T]\times \Omega , dt\otimes \mathbb {P}; V_1)\cap L^2([0, T]\times \Omega , dt\otimes \mathbb {P}; H_1)\) with \(\alpha \) as in A3, \(\check{Y}^{{\varepsilon }}\in L^{\kappa }([0, T]\times \Omega , dt\otimes \mathbb {P}; V_2)\cap L^2([0, T]\times \Omega , dt\otimes \mathbb {P}; H_2)\) with \(\kappa \) as in B3 and \(\mathbb {P}\)-a.s.
where \((\tilde{X}^{\varepsilon }, \tilde{Y}^{\varepsilon })\) is any \(V_1\times V_2\)-valued progressively measurable \(dt\otimes \mathbb {P}\)-version of \((\check{X}^{{\varepsilon }},\check{Y}^{{\varepsilon }})\).
Using the variational approach in infinite dimensional space, we have the following well-posedness result, whose proof will be presented in the Appendix.
Theorem 2.3
Assume the conditions A1–A4, B1–B4 hold. Then for any \({\varepsilon }>0\) and initial values \((x, y)\in H_1\times H_2\), the system (1.1) has a unique solution \((X^{\varepsilon },Y^{\varepsilon })\).
The following is the main result of this work.
Theorem 2.4
Assume the conditions A1–A4, B1–B4 hold. Then for any initial values \((x, y)\in H_1\times H_2\), \(p\geqslant 1\) and \(T>0\), we have
where \(\bar{X}_t\) is the solution of the corresponding averaged equation:
with the average \(\bar{F}(x)=\int _{H_2}F(x,y)\mu ^{x}(dy)\). \(\mu ^{x}\) is the unique invariant measure of the transition semigroup of the frozen equation
where \(\{\bar{W}^{2}_t\}_{t\geqslant 0}\) is a \(\tilde{\mathscr {F}}_t\)-cylindrical Wiener process in a separable Hilbert space \(U_2\) on another probability space, with natural filtration \(\tilde{\mathscr {F}}_t\).
Remark 2.5
The advantage of using the variational approach is that it can cover some nonlinear SPDEs for slow component, such as stochastic power law fluids, and some quasilinear SPDEs for slow component, such as the stochastic porous medium equation and the stochastic p-Laplace equation, which can not be handled by the mild solution approach and thus have not been studied yet. Furthermore, our result also generalizes some known results of the cases that the slow component is a semilinear stochastic partial differential equation, such as the stochastic Burgers equation (see [9]) and stochastic two dimensional Navier–Stokes equation (see [19]). Besides some known results, our result can also be applied to many other unstudied hydrodynamical models in [8], such as the stochastic magneto-hydrodynamic equations, the stochastic Boussinesq model for the Bénard convection, the stochastic 2D magnetic Bénard problem, the stochastic 3D Leray-\(\alpha \) model and some stochastic shell models of turbulence.
3 Proof of the Main Result
This section is devoted to proving Theorem 2.4. The proof consists of the following four subsections: In Sect. 3.1, we give some apriori estimates for the solution \((X^{\varepsilon }_t, Y^{\varepsilon }_t)\). Using the apriori estimates, we get an estimate for the time increments for \(X_{t}^{\varepsilon }\), which plays an important role in the proof of the main result. In Sect. 3.2, we will use the technique of time discretization to construct an auxiliary process \(\hat{Y}_{t}^{\varepsilon }\) and give an estimate of the difference process \(Y^{\varepsilon }_t-\hat{Y}_{t}^{\varepsilon }\). In Sect. 3.3, by constructing a stopping time \(\tau _R\), we prove that \(X^{\varepsilon }_t\) strongly converges to \(\bar{X}_{t}\) for \(t<\tau _R\). Finally, the apriori estimates for the solution control the difference \(X^{{\varepsilon }}_t-\bar{X}_t\) after the stopping time. Note that we always assume conditions A1–A4 and B1–B4 hold and from now on we fix an initial value \((x,y)\in H_1\times H_2\) in this section.
3.1 Some Apriori Estimates of \((X^{\varepsilon }_t, Y^{\varepsilon }_t)\)
At first, we prove uniform bounds with respect to \({\varepsilon }\in (0,1)\) for the moments of the solution \((X_{t}^{\varepsilon }, Y_{t}^{{\varepsilon }})\) to the system (1.1).
Lemma 3.1
For any \(T>0\) and \(p\geqslant 1\), there exists a constant \(C_{p,T}>0\) such that
and
Proof
Applying Itô’s formula (see e.g. [20, Theorem 6.1.1]), we have
Then applying Itô’s formula for \(f(z)=(z)^{p}\) with \(z_t=\Vert Y_{t}^{{\varepsilon }}\Vert ^{2}_{H_2}\), and taking expectation on both sides, we obtain
By conditions B2–B4 and a similar argument in the proof of [20, Lemma 4.3.8], there exists a constant \({\hat{\gamma }}\in (0,\gamma )\) such that for any \(u\in H_1, v\in V_2\),
Then applying Young’s inequality and estimate (3.3), we get
Multiplication by the integrating factor \(e^{\frac{C_p t}{\varepsilon }}\) yields that
Integrate this from 0 to t, we get
On the other hand, applying Itô’s formula, we also have
Note that
is a local martingale, then applying Burkholder-Davis-Gundy inequality (see e.g. [20, Theorem 6.1.2]), conditions A2–A4 and (3.4), we get
Hence, applying Gronwall’s inequality (see e.g. [23, Exercise 5.17]), we obtain
which also gives that
The proof is complete. \(\square \)
Because the method of time discretization is used in this paper, the following estimate about the integral of the time increment plays an important role in the proof of our main result, which has been proved in the case of the stochastic 2D Navier–Stokes equation in [19].
Lemma 3.2
For any \(T>0\), \({\varepsilon }\in (0,1)\) and \(\delta >0\) small enough, there exist constants \(C_{T}, m>0\) such that
where \(t(\delta ):=[\frac{t}{\delta }]\delta \) and [s] denotes the largest integer which is smaller than s.
Proof
Note that
Then applying Itô’s formula we have
For the term \(I_1(t)\), by condition A4 and applying Hölder’s inequality, there exist constants \(m, C_T>0\) such that
where we applied Fubini’s theorem and (3.1) in the third and fourth inequality respectively.
For terms \(I_{2}(t)\) and \(I_3(t)\), by condition (2.1), estimates (3.1) and (3.2), we get
and
For the term \(I_{4}(t)\), note that
is a local martingale, then applying Burkholder-Davies-Gundy inequality, it follows
Combining estimates (3.8)–(3.11), we obtain
By the same argument above, we also have
Hence, the result (3.5) holds by estimates (3.6), (3.12) and (3.13). The proof is complete. \(\square \)
3.2 Construction of the Auxiliary Process
Based on the method of time discretization, which is inspired by [18], we first construct an auxiliary process \(\hat{Y}_{t}^{\varepsilon }\in H_2\) satisfying the following equation
where \(\delta \) is a fixed positive number depending on \({\varepsilon }\) and will be chosen later. Then for its \(dt\otimes \mathbb {P}\)-equivalence class \(\check{\hat{Y}}^{{\varepsilon }}\) we have \(\check{\hat{Y}}^{{\varepsilon }}\in L^{\kappa }([0, T]\times \Omega , dt\otimes \mathbb {P}; V_2)\cap L^2([0, T]\times \Omega , dt\otimes \mathbb {P}; H_2)\) with \(\kappa \) as in B3, and for any \(k\in \mathbb {N}\) and \(t\in [k\delta ,\min ((k+1)\delta ,T)]\), \(\mathbb {P}\)-a.s.
where \(\tilde{\hat{Y}}^{\varepsilon }\) is any \(V_2\)-valued progressively measurable \(dt\otimes \mathbb {P}\)-version of \(\check{\hat{Y}}^{{\varepsilon }}\).
By the construction of \(\hat{Y}_{t}^{\varepsilon }\), we obtain the following estimates, which will be used later.
Lemma 3.3
For any \(T>0\) and \({\varepsilon }\in (0,1)\), there exist a constant \(C_{T}>0\) and \(m\in \mathbb {N}\) such that
and
Proof
Because the proof of estimate (3.15) follows almost the same steps as in the proof of Lemma 3.1, we omit its proof and only prove (3.16) here.
Note that
Applying Itô’s formula, we obtain
Then by condition B2, there exists \(\gamma >0\) such that
The the same argument used in (3.4), we have
Then applying Fubini’s theorem, for any \(T>0\),
Therefore, by Lemma 3.2, we obtain
The proof is complete. \(\square \)
3.3 The Ergodicity of the Frozen Equation
The frozen equation associated to the fast motion for fixed slow component \(x\in H_1\) is as follows,
where \(\{\bar{W}^{2}_t\}_{t\geqslant 0}\) is a cylindrical \(\tilde{\mathscr {F}}_{t}\)-Wiener process in a separable Hilbert space \(U_2\) on another probability space \(({\tilde{\Omega }},\tilde{\mathscr {F}},\tilde{\mathbb {P})}\) with natural filtration \(\tilde{\mathscr {F}}_{t}\).
Under the assumptions B1–B4, for any fixed \(x\in H_1\) and initial data \(y\in H_2\), equation (3.17) has a unique variational solution \(Y_{t}^{x,y}\) in the sense of Definition 2.2, i.e., for its \(dt\otimes {\tilde{\mathbb {P}}}\)-equivalence class \(\hat{Y}\) we have \(\hat{Y}^{x,y}\in L^{\kappa }([0, T]\times {\tilde{\Omega }}, dt\otimes {\tilde{\mathbb {P}}}; V_2)\cap L^2([0, T]\times {\tilde{\Omega }}, dt\otimes {\tilde{\mathbb {P}}}; H_2)\) with \(\kappa \) as in B3, we have \({\tilde{\mathbb {P}}}\)-a.s.
where \(\tilde{Y}^{x,y}\) is any \(V_2\)-valued progressively measurable \(dt\otimes {\tilde{\mathbb {P}}}\)-version of \(\hat{Y}^{x,y}\). By the same arguments as in the proof of Lemma 3.1, it is easy to prove that
Let \(\{P^{x}_t\}_{t\geqslant 0}\) be the transition semigroup of the Markov process \(\{Y_{t}^{x,y}\}_{t\geqslant 0}\), that is, for any bounded measurable function \(\varphi \) on \(H_2\),
where \({{\tilde{\mathbb {E}}}}\) is the expectation on \((\tilde{\Omega },\tilde{\mathscr {F}},\tilde{\mathbb {P})}\). Then we have the following asymptotic behavior of \(P^x_t\), whose proof can be founded in [20, Theorem 4.3.9].
Proposition 3.4
The transition semigroup \(\{P^{x}_t\}_{t\geqslant 0}\) has a unique invariant measure \(\mu ^x\). Moreover, there exists a constant \(C>0\) such that for any Lipschitz function \(\varphi :H_2\rightarrow R\),
where \(\Vert \varphi \Vert _{Lip}=\sup _{y_1\ne y_2\in H_2}\frac{|\varphi (y_1)-\varphi (y_2)|}{\Vert y_1-y_2\Vert _{H_2}}\).
3.4 The Averaged Equation
We consider the corresponding averaged equation, i.e.
with the averaged coefficient
where \(\mu ^{x}\) is the unique invariant measure for the transition semigroup \(\{P^{x}_t\}_{t\geqslant 0}\).
Since F is Lipschitz continuous, it is easy to check \(\bar{F}\) is also Lipschitz continuous, i.e.
Then equation (3.20) has a unique variational solution \({\bar{X}}\) in the sense of Definition 2.2, i.e.,for its \(dt\otimes \mathbb {P}\)-equivalence class \(\check{\bar{X}}\) we have \(\check{\bar{X}}\in L^{\alpha }([0, T]\times \Omega , dt\otimes \mathbb {P}; V_1)\cap L^2([0, T]\times \Omega , dt\otimes \mathbb {P}; H_1)\) with \(\alpha \) as in A3, we have \(\mathbb {P}\)-a.s.
where \(\tilde{\bar{X}}\) is any \(V_1\)-valued progressively measurable \(dt\otimes \mathbb {P}\)-version of \(\check{\bar{X}}\). Moreover, we also have the following estimates. Because their proofs follows almost the same steps in the proof of Lemmas 3.1 and 3.2, we omit them here.
Lemma 3.5
For any \(T>0\), \(p\geqslant 1\), there exist constants \(C_{p,T},C_T>0\) and \(m>0\) such that for any \(x\in H_1\),
and
Next, we intend to prove that \(X_{t}^{{\varepsilon }}\) strongly converges to \(\bar{X}_t\) for \(t<\tau _{R}\) firstly, then the proof of the main result will follow from the fact that the difference process \(X^{{\varepsilon }}_t-\bar{X}_t\) after the stopping time is sufficient small when R is large enough, whose proof is given in Sect. 3.5.
Proposition 3.6
For any \((x,y)\in H_1\times H_2\), \(T,R>0\) and \({\varepsilon }\in (0,1)\), then there exist constants \(C_{R,T}, m>0\) such that
where
Proof
We will divide the proof into three steps.
Step 1 We note that
Applying Itô’s formula, we have
Then conditions A2 and A3 imply \(\mathbb {P}\)-a.s.,
Applying Gronwall’s inequality and the definition of the stopping time \(\tau _{R}\) , we deduce that
Note that
is a local martingale and \(T\wedge \tau _R\) is a stopping time, thus applying Burkholder-Davis inequality (see e.g. [20, Proposition D.0.1]), estimates (3.5), (3.16) and (3.22), there exists \(m>0\) such that
which implies
Applying Gronwall’s inequality, we finally get
Hence, the proof will be completed by the following estimate:
whose proof will be given in Step 2.
Step 2 We note that
For the term \(J_2(t)\), it is easy to see
For the term \(J_1(t)\), we have
where for any \(0\leqslant r\leqslant s\leqslant \frac{\delta }{{\varepsilon }}\),
and
whose proof will be presented in Step 3. Hence, we get
which completes the proof of estimate (3.24).
Step 3 For any \(s>0\), and any \(\mathscr {F}_s\)-measurable \(H_1\)-valued random variable X and \(H_2\)-valued random variable Y, we consider the following equation:
which has a unique solution \(\tilde{Y}^{{\varepsilon },s,X,Y}_t\). Then by the construction of \(\hat{Y}_{t}^{{\varepsilon }}\), for any \(k\in \mathbb {N}_{*}\) and \(t\in [k\delta ,(k+1)\delta ]\) we have \(\mathbb {P}\)-a.s.,
which implies
where the last equality comes from the fact that \(X_{k\delta }^{{\varepsilon }}\) and \({\hat{Y}}_{k\delta }^{{\varepsilon }}\) are \(\mathscr {F}_{k\delta }\)-measurable, and for any fixed \((x,y)\in H_1\times H_2\), \(\{{\tilde{Y}}^{{\varepsilon }, k\delta ,x,y}_{s{\varepsilon }+k\delta }\}_{s\geqslant 0}\) is independent of \(\mathscr {F}_{k\delta }\).
By the definition of process \(\tilde{Y}^{{\varepsilon },k\delta ,x,y}_t\), for its \(dt\otimes \mathbb {P}\)-equivalence class \(\check{\tilde{Y}}^{{\varepsilon },k\delta ,x,y}\) we have \(\check{\tilde{Y}}^{{\varepsilon },k\delta ,x,y}\in L^{\kappa }([k\delta , T]\times \Omega , dt\otimes \mathbb {P}; V_2)\cap L^2([k\delta , T]\times \Omega , dt\otimes \mathbb {P}; H_2)\) with \(\kappa \) as in B3 and \(\mathbb {P}\)-a.s.
where \(\tilde{\tilde{Y}}^{{\varepsilon },k\delta ,x,y}\) is any \(V_2\)-valued progressively measurable \(dt\otimes \mathbb {P}\)-version of \(\check{\tilde{Y}}^{{\varepsilon },k\delta ,x,y}\), \(\{W^{2, k\delta }_r:=W^2_{r+k\delta }-W^2_{k\delta }\}_{r\geqslant 0}\) and \(\{{\hat{W}}^{2,k\delta }_t:=\frac{1}{\sqrt{{\varepsilon }}}W^{2,k\delta }_{t{\varepsilon }}\}_{t\geqslant 0}\).
The uniqueness of the solution of equations (3.28) and (3.18) implies that the distribution of \(({\tilde{Y}}^{{\varepsilon }, k\delta , x,y}_{s{\varepsilon }+k\delta })_{0\leqslant s\leqslant \delta /{\varepsilon }}\) coincides with the distribution of \((Y_{s}^{x, y})_{0\leqslant s\leqslant \delta /{\varepsilon }}\). Then by Proposition 3.4, estimates (3.1) and (3.15), we have
which gives estimate (3.27). The proof is complete. \(\square \)
3.5 Proof of Theorem 2.4:
Applying Chebyshev’s inequality, Lemmas 3.1 and 3.5, we have
where m is a positive constant. Then taking \(\delta ={\varepsilon }^{\frac{2}{3}}\), estimates (3.23) and (3.29) give
Now, letting \({\varepsilon }\rightarrow 0\) first, then \(R\rightarrow \infty \), we have
Note that for any \(p>1\), by Lemmas 3.1 and 3.5, it is easy to see that
Hence, by (3.30), we finally get
The proof is complete. \(\square \)
4 Application to Examples
In this section we will apply our main result to establish the averaging principle for stochastic porous medium equations, p-Laplace equations, Burgers equations and 2D Navier–Stokes equations with slow and fast time-scales. Note that we here mainly focus on the nonlinear operator A, so we take the stochastic porous medium equation, p-Laplace equations, Burgers equations, or 2D Navier–Stokes equations for the slow component and stochastic heat equation with Lipschitz drift for the fast component for the simplicity.
Let \(\Lambda \subset \mathbb {R}^d\) be an open bounded domain and \(\Delta \) be the Laplace operator on \(\Lambda \) with Dirichlet boundary conditions, and for \(p\in [1, \infty )\) we use \(L^p(\Lambda )\) and \(H^{n,p}_0\) to denote the space of p-Lebesgue integrable functions on \(\Lambda \) and the Sobolev space of order n in \(L^p(\Lambda )\) with Dirichlet boundary conditions. Recall that \(X^{*}\) denotes the dual space of a Banach space X.
4.1 Stochastic Porous Medium Equations
Let \(\Psi :\mathbb {R}\rightarrow \mathbb {R}\) be a function having the following properties :
- \((\Psi 1)\):
-
\(\Psi \) is continuous.
- \((\Psi 2)\):
-
For all \(s, t\in \mathbb {R}\)
$$\begin{aligned} (t-s)(\Psi (t)-\Psi (s))\geqslant 0. \end{aligned}$$ - \((\Psi 3)\):
-
There exist \(p\in [2,\infty ), c_1\in (0, \infty ), c_2\in [0, \infty )\) such that for all \(s\in \mathbb {R}\)
$$\begin{aligned} s\Psi (s)\geqslant c_1|s|^p-c_2. \end{aligned}$$ - \((\Psi 4)\):
-
There exist \(c_3,c_4\in (0, \infty )\) such that for all \(s\in \mathbb {R}\)
$$\begin{aligned} |\Psi (s)|\leqslant c_3|s|^{p-1}+c_4, \end{aligned}$$
where p is as in \((\Psi 3)\).
Considering the Gelfand triple for the slow equation
and the Gelfand triple for the fast equation
Then we introduce the porous medium operator \(A(u): V_1\rightarrow V_1^{*}\) (see [20, Sect. 4.1] for details) by
Now, we consider the slow–fast stochastic porous medium-heat equations
where
are measurable mappings and
are Lipschitz continuous. More precisely,
Moreover, there exists \(\zeta \in (0,1)\) such that
and for the smallest eigenvalue \(\lambda _1\) of \(-\Delta \) in \(H_2\), the Lipschitz constants \(L_{B_2}\), \(L_{G_2}\) satisfy
It is well known that the porous medium operator A satisfies the monotonicity and coercivity properties (see, e.g., [20, pp. 87–88]). So it is easy to check all the conditions A1–A4 hold. Furthermore, the condition B2 holds by condition (4.7) and the conditions B1,B3 and B4 hold obviously. Hence, by Theorem 2.4, we have
where \(\bar{X}_t\) is the solution of the corresponding averaged equation.
4.2 Stochastic p-Laplace Equation
Now we consider the stochastic p-Laplace equation \((p\geqslant 2)\). We choose the Gelfand triple for the slow equation
and the Gelfand triple for the fast equation
and let \(A: V_1\rightarrow V^{*}_1\) be
More precisely, given \(u\in V_1\), we define
Here A is called the p-Laplace operator, also denoted by \(\Delta _p\). Note that \(\Delta _2=\Delta \).
Now, we consider the slow–fast stochastic p-Laplace-heat equations
where the coefficients \(F, G_1, B_2\) and \(G_2\) satisfy conditions (4.2)–(4.7).
It is well known that the p-Laplace operator satisfies the monotonicity and coercivity properties (see, e.g., [20, Example 4.1.9]). So it is easy to check that all the conditions A1–A4 and B1–B4 hold. Hence, by Theorem 2.4, we have
where \(\bar{X}_t\) is the solution of the corresponding averaged equation.
Note that in the above two examples both the porous medium and the p-Laplace operators are globally monotone. But our result also applies to many merely locally monotone operators. For illustration, we will consider the stochastic Burgers and stochastic 2D Naiver-Stokes equation below (see [20, Sects. 4.1 and 5.1] for a number of other examples).
4.3 Stochastic Burgers Equation
Now we consider the stochastic Burgers equation. Taking \(\Lambda =(0,1)\subset \mathbb {R}\), we choose the Gelfand triple for the slow equation
and the Gelfand triple for the fast equation
and let \(A: V_1\rightarrow V^{*}_1\) be
Now, we consider the slow–fast stochastic Burgers-heat equations
where the coefficients \(F, G_1, B_2\) and \(G_2\) satisfy conditions (4.2)–(4.7).
It is well known that the operator A satisfies the monotonicity and coercivity properties (see, e.g., [20, Lemma 5.1.6]). So it is easy to check that all the conditions A1–A4 and B1–B4 hold. Hence, by Theorem 2.4, we have
where \(\bar{X}_t\) is the solution of the corresponding averaged equation.
4.4 Stochastic 2D Navier–Stokes Equation
Now we consider the stochastic 2D Navier–Stokes equation. Let \(\Lambda \) be a bounded domain in \(\mathbb {R}^2\) with smooth boundary, denote
and let \(H_1\) be the closure of \(V_1\) in \(L^2(\Lambda ;\mathbb {R}^2)\). We choose the Gelfand triple for the slow equation
and the Gelfand triple for the fast equation
and let \(A: V_1\rightarrow V^{*}_1\) be
where \(P_H\) is the Helmholtz–Leray projection and \(u\cdot \nabla =\sum ^2_{i=1}u^i\partial _i\) with \(u=(u^1,u^2)\).
Now, we consider the slow–fast stochastic 2D Navier–Stokes-heat equation
where the coefficients \(F, G_1, B_2\) and \(G_2\) satisfy conditions (4.2)–(4.7).
It is well known that the operator A satisfies the monotonicity and coercivity properties (see, e.g., [20, Example 5.1.10]). So it is easy to check that all the conditions A1–A4 and B1–B4 hold. Hence, by Theorem 2.4, we have
where \(\bar{X}_t\) is the solution of the corresponding averaged equation.
References
Bertram, R., Rubin, J.E.: Multi-timescale systems and fast-slow analysis. Math. Biosci. 287, 105–121 (2017)
Bogoliubov, N.N., Mitropolsky, Y.A.: Asymptotic Methods in the Theory of Non-linear Oscillations. Gordon and Breach Science Publishers, New York (1961)
Bréhier, C.E.: Strong and weak orders in averaging for SPDEs. Stoch. Process. Appl. 122, 2553–2593 (2012)
Cerrai, S.: A Khasminskii type averaging principle for stochastic reaction-diffusion equations. Ann. Appl. Probab. 19, 899–948 (2009)
Cerrai, S.: Averaging principle for systems of reaction-diffusion equations with polynomial nonlinearities perturbed by multiplicative noise. SIAM J. Math. Anal. 43, 2482–2518 (2011)
Cerrai, S., Freidlin, M.: Averaging principle for stochastic reaction-diffusion equations. Probab. Theory Relat. Fields 144, 137–177 (2009)
Cerrai, S., Lunardi, A.: Averaging principle for nonautonomous slow-fast systems of stochastic reaction-diffusion equations: the almost periodic case. SIAM J. Math. Anal. 49, 2843–2884 (2017)
Chueshov, I., Millet, A.: Stochastic 2D hydrodynamical type systems: well posedness and large deviations. Appl. Math. Optim. 61, 379–420 (2010)
Dong, Z., Sun, X., Xiao, H., Zhai, J.: Averaging principle for one dimensional stochastic Burgers equation. J. Differ. Equ. 265, 4749–4797 (2018)
E, W., Engquist, B.: Multiscale modeling and computations. Not. AMS 50, 1062–1070 (2003)
E, W., Liu, D., Vanden-Eijnden, E.: Analysis of multiscale methods for stochastic differential equations. Commun. Pure Appl. Math 58, 1544–1585 (2005)
Fu, H., Liu, J.: Strong convergence in stochastic averaging principle for two time-scales stochastic partial differential equations. J. Math. Anal. Appl. 384, 70–86 (2011)
Fu, H., Wan, L., Liu, J.: Strong convergence in averaging principle for stochastic hyperbolic-parabolic equations with two time-scales. Stoch. Process. Appl. 125, 3255–3279 (2015)
Gao, P.: Averaging principle for stochastic Kuramoto-Sivashinsky equation with a fast oscillation. Discret. Contin. Dyn. Syst. A 38, 5649–5684 (2018)
Gao, P.: Averaging principle for the higher order nonlinear Schrödinger equation with a random fast oscillation. J. Stat. Phys. 171, 897–926 (2018)
Gao, P.: Averaging principle for multiscale stochastic Klein-Gordon-heat system. J. Nonlinear Sci. 29(4), 1701–1759 (2019)
Harvey, E., Kirk, V., Wechselberger, M., Sneyd, J.: Multiple timescales, mixed mode oscillations and canards in models of intracellular calcium dynamics. J. Nonlinear Sci. 21, 639–683 (2011)
Khasminskii, R.Z.: On the principle of averaging the Itô stochastic differential equations. Kibernetica 4, 260–279 (1968)
Li, S., Sun, X., Xie, Y., Zhao, Y.: Averaging principle for two dimensional stochatsic Navier–Stokes equations. arXiv:1810.02282
Liu, W., Röckner, M.: Stochastic Partial Differential Equations: An Introduction. Springer, Berlin (2015)
Liu, W., Röckner, M., Sun, X., Xie, Y.: Averaging principle for slow-fast stochastic differential equations with time dependent locally Lipschitz coefficients. J. Differ. Equ. 268(6), 2910–2948 (2020)
Mastny, E.A., Haseltine, E.L., Rawlings, J.B.: Two classes of quasi-steady-state model reductions for stochastic kinetics. J. Chem. Phys. 127, 094106 (2007)
Øksendal, B.: Stochastic Differential Equations. An Introduction with Application. Springer, Berlin (1995)
Wang, W., Roberts, A.J.: Average and deviation for slow-fast stochastic partial differential equations. J. Differ. Equ. 253, 1265–1286 (2012)
Wang, W., Roberts, A.J., Duan, J.: Large deviations and approximations for slow-fast stochastic reaction-diffusion equations. J. Differ. Equ. 253, 3501–3522 (2012)
Funding
The authors would like to thank the referees for some valuable comments. This work is supported by the National Natural Science Foundation of China (Nos. 11931004, 12171208, 11831014, 12271219, 12090011), the QingLan Project of Jiangsu Provience and the PAPD Project of Jiangsu Higher Education Institutions. Financial support of the DFG through CRC 1283 is also gratefully acknowledged.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have not disclosed any competing interests.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
At the end of this section, we give the proof of Theorem 2.3 based on the techniques used in [20, Theorem 5.1.3].
Proof of Theorem 2.3
Let \(\mathcal {H}:={H_1}\times H_2\) be the product Hilbert space. For any \(\phi =(\phi _1,\phi _2),\varphi =(\varphi _1,\varphi _2)\in \mathcal {H}\), we denote the scalar product and the induced norm by
Similarly, we also define \(\mathcal {V}:=V_1\times V_2\). Then \(\mathcal {V}\) is a reflexive Banach space with the following norm:
Now we rewrite the system (1.1) for \(Z^{\varepsilon }_t=(X^{\varepsilon }_t,Y^{\varepsilon }_t)\) as
where \(W_t:=(W_t^{1},W_t^{2})\), which is a \(U_1\times U_2\)-valued cylindrical-Wiener process and
Moreover, G is an operator from \(\mathcal {V}\) to \(L_{2}(\mathcal {U},\mathcal {H})\), where \(\mathcal {U}:=U_1\times U_2\) and \(L_{2}(\mathcal {U},\mathcal {H})\) is the space of Hilbert-Schmidt operators from \(\mathcal {U}\) to \(\mathcal {H}\). The norm in \(L_{2}(\mathcal {U},\mathcal {H})\) is defined by
Let \(\mathcal {V}^{*}\) be the dual space of \(\mathcal {V}\) and we consider the following Gelfand triple \(\mathcal {V}\subset \mathcal {H}\equiv \mathcal {H}^{*}\subset \mathcal {V}^{*}\). It is easy to see that the following mappings
are well defined. To complete the proof, we only check whether the new coefficients in equation (5.1) satisfy the local monotonicity, coercivity and growth properties by [20, Theorem 5.1.3].
Indeed, for any \(w_1=(u_1,v_1),w_2=(u_2,v_2)\in \mathcal {V}\), by conditions A2 and B2, we have
which implies that the local monotonicity condition holds.
For any \(w=(u,v)\in \mathcal {V}\), there exist constants \(C_{{\varepsilon }}>0\) and \(C>0\) such that
and
for some \(\tilde{\beta }> 0\), which implies that the coercivity and growth conditions hold. \(\square \)
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Liu, W., Röckner, M., Sun, X. et al. Strong Averaging Principle for Slow–Fast Stochastic Partial Differential Equations with Locally Monotone Coefficients. Appl Math Optim 87, 39 (2023). https://doi.org/10.1007/s00245-022-09956-y
Accepted:
Published:
DOI: https://doi.org/10.1007/s00245-022-09956-y