Abstract
We consider linear elliptic equations in divergence form with stationary random coefficients of integrable correlations. We characterize the fluctuations of a macroscopic observable of a solution to relative order \(\frac{d}{2}\), where d is the spatial dimension; the fluctuations turn out to be Gaussian. As for previous work on the leading order, this higher-order characterization relies on a pathwise proximity of the macroscopic fluctuations of a general solution to those of the (higher-order) correctors, via a (higher-order) two-scale expansion injected into the “homogenization commutator”, thus confirming the scope of this notion. This higher-order generalization sheds a clearer light on the algebraic structure of the higher-order versions of correctors, flux correctors, two-scale expansions, and homogenization commutators. It reveals that in the same way as this algebra provides a higher-order theory for microscopic spatial oscillations, it also provides a higher-order theory for macroscopic random fluctuations, although both phenomena are not directly related. We focus on the model framework of an underlying Gaussian ensemble, which allows for an efficient use of (second-order) Malliavin calculus for stochastic estimates. On the technical side, we introduce annealed Calderón–Zygmund estimates for the elliptic operator with random coefficients, which conveniently upgrade the known quenched large-scale estimates.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
1.1 General overview
Let \({\varvec{a}}\) be a random coefficient field on \({\mathbb {R}}^d\) that is stationary and ergodic and satisfies the boundedness and ellipticity properties
for some \(\lambda >0\), and denote by \((\Omega ,{\mathcal {F}},\mathbb {P})\) the underlying probability space. Given a deterministic vector field \(f\in C^\infty _c({\mathbb {R}}^d)^d\), we consider the random family \((\nabla u_\varepsilon )_{\varepsilon >0}\) of unique Lax–Milgram solutions to the rescaled problems
where the rescaled coefficients \({\varvec{a}}(\tfrac{\cdot }{\varepsilon })\) vary on the “microscopic” scale \(\varepsilon \), and we study “macroscopic” observables of the form \(\int _{{\mathbb {R}}^d}g\cdot \nabla u_\varepsilon \) with \(g\in C_c^\infty ({\mathbb {R}}^d)^d\) deterministic. Qualitative homogenization theory [34, 45] states that almost surely \(\int _{{\mathbb {R}}^d}g\cdot \nabla u_\varepsilon \rightarrow \int _{{\mathbb {R}}^d}g\cdot \nabla {\bar{u}}\) as \(\varepsilon \downarrow 0\), where \(\nabla {\bar{u}}\) solves the (deterministic) homogenized equation
and the homogenized coefficients \({\bar{{\varvec{a}}}}\in {\mathbb {R}}^{d\times d}\) are given by \({\bar{{\varvec{a}}}}e_i=\mathbb {E}\left[ {\varvec{a}}(\nabla \varphi _i+e_i) \right] \) for \(1\le i\le d\), in terms of the corrector \(\varphi _i\), that is, the unique (up to a random additive constant) almost sure solution of the corrector equation,
in the class of functions the gradient of which is stationary, centered (that is, \(\mathbb {E}\left[ \nabla \varphi _i \right] =0\)), and has finite second moment. In other words, the field-flux constitutive relation \(\nabla w_\varepsilon \mapsto {\varvec{a}}(\tfrac{\cdot }{\varepsilon })\nabla w_\varepsilon \) is replaced on the macroscopic scale by the effective relation \(\nabla {\bar{w}}\mapsto {\bar{{\varvec{a}}}}\nabla {\bar{w}}\).
A well-travelled question in homogenization is to further characterize the oscillations of the solution \(u_\varepsilon \). As expected from formal asymptotics, the so-called two-scale expansionFootnote 1\((1+\varepsilon \varphi _i(\tfrac{\cdot }{\varepsilon })\nabla _i){\bar{u}}\) precisely captures the oscillations of \(u_\varepsilon \) to order \(O(\varepsilon )\) (in dimension \(d>2\)), in the sense of
This expansion has a natural geometric interpretation: by definition (1.3), the corrector \(\varphi _i\) defines the correction of Euclidean coordinates \(x\mapsto x_i\) into \({\varvec{a}}\)-harmonic coordinates \(x\mapsto x_i+\varphi _i(x)\),
so that the expansion \((1+\varepsilon \varphi _i(\tfrac{\cdot }{\varepsilon })\nabla _i){\bar{u}}\) amounts to locally correcting the homogenized solution \({\bar{u}}\) in terms of oscillating \({\varvec{a}}\)-harmonic coordinates. Such expansions are naturally pursued to higher order: while the (centered) first-order corrector \(\varphi =\varphi ^1\) is characterized by the property that \((1+\varphi _i\nabla _i){\bar{\ell }}\) is \({\varvec{a}}\)-harmonic for all affine functions \({\bar{\ell }}\), the (centered) second-order corrector \(\varphi ^2\) is characterized by the property that for all quadratic polynomials \({\bar{q}}\) the expansion \((1+\varphi _i\nabla _i+\varphi _{ij}^2\nabla _{ij}^2){\bar{q}}\) fully captures the oscillations imposed by the heterogeneous elliptic operator \(-\nabla \cdot {\varvec{a}}\nabla \) in the sense of \(-\nabla \cdot {\varvec{a}}\nabla (1+\varphi _i\nabla _i+\varphi _{ij}^2\nabla _{ij}^2){\bar{q}}\) being deterministic (see Sect. 2 for a precise statement). The second-order two-scale expansion \((1+\varepsilon \varphi _i(\tfrac{\cdot }{\varepsilon })\nabla _i+\varepsilon ^2\varphi ^2_{ij}(\tfrac{\cdot }{\varepsilon })\nabla ^2_{ij}){\bar{u}}^2_\varepsilon \) then captures the oscillations of \(u_\varepsilon \) to order \(O(\varepsilon ^2)\) (in dimension \(d>4\)), where \({\bar{u}}^2_\varepsilon \) is a proxy for a solution to the generically ill-posed higher-order homogenized equation
in terms of the second-order homogenized coefficients \({\bar{{\varvec{a}}}}^2_ie_j:=\mathbb {E}\big [ {\varvec{a}}(\nabla \varphi ^2_{ij}+\varphi _ie_j) \big ]\). In other words, the effective field-flux constitutive relation \(\nabla {\bar{w}}\mapsto {\bar{{\varvec{a}}}}\nabla {\bar{w}}\) is refined to its second-order version \(\nabla {\bar{w}}\mapsto ({\bar{{\varvec{a}}}}+\varepsilon {\bar{{\varvec{a}}}}^2_i\nabla _i)\nabla {\bar{w}}\). The proxy \({\bar{u}}_\varepsilon ^2\) for a second-order homogenized solution can e.g. be chosen in form of \({\bar{u}}^2_\varepsilon :={\bar{u}}+\varepsilon {\tilde{u}}^2\) with \({\tilde{u}}^2\) solving
Such a description of oscillations of \(u_\varepsilon \) via higher-order two-scale expansions is classical in the periodic setting [11]. It also holds in the random setting for large enough dimension and for decaying enough correlations [9, 24] (cf. [3, 20, 22] for the first-order level): under appropriate mixing conditions, higher-order expansions allow to capture oscillations up to the critical order \(O(\varepsilon ^{d/2})\) (with a logarithmic correction in even dimensions). The limitation is due to the fact that, in the random setting, not all higher-order correctors are well-behaved: \(\varphi ^n\) can be chosen stationary with finite moments only for \(n<\frac{d}{2}\). Precise statements and short proofs are included in Sect. 2 below in the Gaussian setting.
While periodic homogenization boils down to the description of oscillations of \(u_\varepsilon \), stochastic homogenization in addition means studying the random fluctuations of the macroscopic observables \(\int _{{\mathbb {R}}^d}g\cdot \nabla u_\varepsilon \). It was recently shown [25] that the centered rescaled observables \(\varepsilon ^{-d/2}\int _{{\mathbb {R}}^d}g\cdot (\nabla u_\varepsilon -\mathbb {E}\left[ \nabla u_\varepsilon \right] )\) converge in law to a Gaussian. Note that this does not enter the above theory of oscillations since accuracy there is precisely limited to order \(O(\varepsilon ^{d/2})\). We may naturally look for a finer description of this convergence by means of a two-scale expansion. As observed in [25], however, the limiting variance of \(\varepsilon ^{-d/2}\int _{{\mathbb {R}}^d}g\cdot \nabla u_\varepsilon \) generically differs from that of \(\varepsilon ^{-d/2}\int _{{\mathbb {R}}^d}g\cdot \nabla (1+\varepsilon \varphi _i(\tfrac{\cdot }{\varepsilon })\nabla _i){\bar{u}}\), which shows that two-scale expansion cannot be applied naïvely when it comes to fluctuations. In [19], we unravelled the full mechanism behind this observation by means of the so-called homogenization commutator, which led to a new pathwise theory of fluctuations in stochastic homogenization (in agreement with the pathwise heuristics in [25]). More precisely, fluctuations of \(\nabla u_\varepsilon \) were shown to coincide with fluctuations of a suitable deterministic (Helmholtz) projection of the corresponding homogenization commutator, which behaves like a Gaussian white noise on large scales and for which the two-scale expansion is accurate at leading order. In the present article we explain how this result naturally extends to higher order in parallel to the known theory of oscillations: we obtain a full characterization of fluctuations of \(\varepsilon ^{-d/2}\int _{{\mathbb {R}}^d}g\cdot \nabla u_\varepsilon \) to order \(O(\varepsilon ^{d/2})\) (with again a logarithmic correction in even dimensions). The main emphasis is on unravelling the suitable higher-order commutator structure.
The homogenization commutator can be viewed as a commutator between large-scale averaging and the field-flux relation; on first-order level it takes the form
This expression first appeared in [5] (see also [7]) and is particularly natural since H-convergence for (1.2) (cf. [39]) is precisely equivalent to weak convergence of \(\Xi _\varepsilon [\nabla u_\varepsilon ]\) to 0 (cf. [19]), thus summarizing the whole qualitative homogenization theory. This convergence property is the mathematical formulation of the Hill–Mandel relation in mechanics [27, 28] and can further be made quantitative [19]. For a higher-order theory, we need a higher-order extension of the commutator. In view of the second-order effective field-flux relation \(\nabla {\bar{w}}\mapsto ({\bar{{\varvec{a}}}}+\varepsilon {\bar{{\varvec{a}}}}^2_i\nabla _i)\nabla {\bar{w}}\), the second-order commutator is naturally defined as
and a higher-order Hill–Mandel relation can indeed be formulated in these terms (cf. Lemma 3.3 below). Note that \(w_\varepsilon \mapsto \Xi _\varepsilon ^1[\nabla w_\varepsilon ]\) and \(\Xi ^2_\varepsilon [\nabla w_\varepsilon ]\) are viewed as first- and second-order differential operators, respectively. Next, we define suitable two-scale expansions of these homogenization commutators. For the first order, we inject the first-order two-scale expansion of \(w_\varepsilon \) into \(\Xi _\varepsilon ^1[\nabla \cdot ]\) and truncate the obtained differential operator at first order, thus defining the following first-order standard commutator,
Similarly, injecting the second-order two-scale expansion of \(w_\varepsilon \) into \(\Xi _\varepsilon ^2[\nabla \cdot ]\) and truncating the obtained differential operator at second order, we define the following second-order standard commutator,
The standard commutators \({\bar{w}}\mapsto \Xi _\varepsilon ^{\circ ,1}[\nabla {\bar{w}}]\) and \(\Xi _\varepsilon ^{\circ ,2}[\nabla {\bar{w}}]\) are first- and second-order differential operators, respectively, with \(\varepsilon \)-rescaled (distributional) stationary random coefficients. These stationary differential operators are viewed as intrinsic quantities, with coefficients given by suitable combinations of correctors and of their gradients. The above definitions are efficiently summarized in the form
where \(T_x^1{\bar{w}}\) and \(T_x^2{\bar{w}}\) denote the first- and second-order Taylor polynomials of \({\bar{w}}\) at the basepoint x, respectively. Note that these quantities are centered: \(\mathbb {E}\big [ \Xi _\varepsilon ^{\circ ,n}[\nabla {\bar{w}}] \big ]=0\) for \(n=1,2\) and any smooth deterministic field \({\bar{w}}\).
Pursuing these constructions to higher order, our main result is threefold and is summarized as follows, under appropriate mixing conditions:
-
(i)
Up to order \(O(\varepsilon ^{d/2})\) in the fluctuation scaling, the fluctuations of \(\nabla u_\varepsilon \) are determined by those of the corresponding higher-order commutators; cf. Theorem 1(i) below.
-
(ii)
Up to order \(O(\varepsilon ^{d/2})\) in the fluctuation scaling (with a logarithmic correction in even dimensions), the two-scale expansions of the commutators are accurate: the fluctuations of the commutators are equivalent to those of the corresponding standard commutators; cf. Theorem 1(ii). This fully extends the pathwise theory of [19] to higher order.
-
(iii)
The standard commutators are approximately local functions of the coefficients \({\varvec{a}}\). This allows to infer a quantitative characterization of their limiting covariance structure, as well as a quantitative central limit theorem (CLT); cf. Theorem 1(iii)–(iv). Similarly as for the coefficient field \({\varvec{a}}\) itself, while at first order the scaling limit is given by a Gaussian white noise, corrections converge to derivatives of white noise.
1.2 Main results
For simplicity, we focus on the model setting of a Gaussian coefficient field, in which case a powerful Malliavin calculus is available on the probability space, substantially simplifying the analysis. More precisely, we set
where G is some \({\mathbb {R}}^\kappa \)-valued centered stationary Gaussian random field on \({\mathbb {R}}^d\) constructed on a probability space \((\Omega ,{\mathcal {F}},\mathbb {P})\) and where \(a_0\in C^2_b({\mathbb {R}}^\kappa )^{d\times d}\) is such that the ellipticity and boundedness assumptions (1.1) are satisfied. We use scalar notation \(a_0',a_0''\) for derivatives of \(a_0\). In addition, we assume that the Gaussian field G has integrable correlations: more precisely, it has the form \(G=c_0*\xi \), where \(\xi \) denotes \(\kappa \)-dimensional white noise on \({\mathbb {R}}^d\) and where the kernel \(c_0{:}\,{\mathbb {R}}^d\rightarrow {\mathbb {R}}^{\kappa \times \kappa }\) is chosen even (that is, \(c_0(x)=c_0(-x)\) for all x) and satisfies the integrability condition
where B(x) is the unit ball centered at x. The covariance function of G is then given by \(c:=c_0*c_0\) and satisfies \(\int _{{\mathbb {R}}^d}(\sup _{B(x)}|c|)\,dx\,\le \,1\). The case of non-integrable correlations could be treated similarly, but it would lead to different scalings, cf. [16]. Our main result is as follows.
Theorem 1
Consider the above Gaussian setting with integrable correlations (1.5) and set \(\ell :=\lceil \frac{d}{2}\rceil \), that is, the smallest integer \(\ge \frac{d}{2}\). For all \(1\le n\le \ell \), let \({\bar{u}}_\varepsilon ^{n}\) and \({\bar{v}}_\varepsilon ^{n}\) be the nth-order homogenized solutions of the primal and dual equations, respectively, as in Definition 2.7 below, and let the nth-order homogenization commutator \(\Xi _\varepsilon ^n\) and its standard version \(\Xi _\varepsilon ^{\circ ,n}\) be as in Definitions 3.1 and 3.4. Then the following hold for all \(0<\varepsilon \le \frac{1}{2}\).
-
(i)
Reduction to commutators: For all \(f,g\in C^\infty _c({\mathbb {R}}^d)^d\), \(1\le n\le \ell \), and \(p<\infty \),
$$\begin{aligned} N_p\bigg (\varepsilon ^{-\frac{d}{2}}\int _{{\mathbb {R}}^d}g\cdot \nabla u_\varepsilon -\varepsilon ^{-\frac{d}{2}}\int _{{\mathbb {R}}^d}\nabla {\bar{v}}_\varepsilon ^{n}\cdot \Xi _\varepsilon ^n[\nabla u_\varepsilon ]\bigg ) \,\lesssim _{p,f,g}\,\varepsilon ^{n}, \end{aligned}$$where \(N_p(X):=\mathbb {E}\left[ |X-\mathbb {E}\left[ X \right] \!|^p \right] ^\frac{1}{p}\).
-
(ii)
Two-scale expansion of commutators: For all \(f,g\in C^\infty _c({\mathbb {R}}^d)^d\), \(1\le n\le \ell \), and \(p<\infty \),
$$\begin{aligned} \quad N_p\bigg (\varepsilon ^{-\frac{d}{2}}\int _{{\mathbb {R}}^d} g\cdot \big (\Xi _\varepsilon ^{n}[\nabla u_\varepsilon ]-\Xi _\varepsilon ^{\circ ,n}[\nabla {\bar{u}}_\varepsilon ^{n}]\big )\bigg ) \,\lesssim _{p,f,g}\,\varepsilon ^n\mu _{d,n}(\tfrac{1}{\varepsilon }), \end{aligned}$$where
$$\begin{aligned} \quad \varepsilon ^n\mu _{d,n}(\tfrac{1}{\varepsilon })\,:=\,\left\{ \begin{array}{lll} \varepsilon ^n&{}:&{}n<\ell ,\\ \varepsilon ^{\frac{d}{2}}|\!\log \varepsilon |^\frac{1}{2}&{}:&{}n=\ell \text { and }d\text { even},\\ \varepsilon ^{\frac{d}{2}}&{}:&{}n=\ell \text { and }d\text { odd}. \end{array}\right. \end{aligned}$$(1.6) -
(iii)
Local covariance structure: Further assume that \(\int _{{\mathbb {R}}^d}|y|^\ell |c(y)|dy\le 1\). There exists an \((m+4)\)th-order tensor \(\mathcal {Q}_{l'k'lk}^m\) for all \(l',k',l,k\ge 0\) with \(l'+k'+l+k=m\le \ell -1\), such that for all \(g,\nabla {\bar{w}}\in C^\infty _c({\mathbb {R}}^d)^d\) and \(1\le n\le \ell \), setting
$$\begin{aligned} \mathcal {Q}_\varepsilon ^n[g,\nabla {\bar{w}}]:=\sum _{m=0}^{n-1}\varepsilon ^{m}\sum _{\begin{array}{c} l',k',l,k\ge 0\\ l'+k'+l+k=m \end{array}}\mathcal {Q}_{l'k'lk}^m\odot \int _{{\mathbb {R}}^d}\nabla ^{l'}g\otimes \nabla ^{k'}\nabla {\bar{w}}\otimes \nabla ^{l}g\otimes \nabla ^{k}\nabla {\bar{w}}, \end{aligned}$$where \(\odot \) denotes the complete contraction of tensors of the same order, there holds
$$\begin{aligned} \bigg |{\mathrm{Var}}\left[ \varepsilon ^{-\frac{d}{2}}\int _{{\mathbb {R}}^d}g\cdot \Xi _{\varepsilon }^{\circ ,n}[\nabla {\bar{w}}]\right] -\mathcal {Q}_\varepsilon ^n[g,\nabla {\bar{w}}]\bigg | \lesssim _{g,{\bar{w}}} \varepsilon ^n\mu _{d,n}(\tfrac{1}{\varepsilon }). \end{aligned}$$ -
(iv)
Normal approximation: For all \(g,\nabla {\bar{w}}\in C^\infty _c({\mathbb {R}}^d)^d\) and \(1\le n\le \ell \),
$$\begin{aligned} {\text {d}}_{{\mathcal {N}}}\left( {\varepsilon ^{-\frac{d}{2}}\int _{{\mathbb {R}}^d}g\cdot \Xi _\varepsilon ^{\circ ,n}[\nabla {\bar{w}}]}\right) \,\lesssim _{g,{\bar{w}}}\,\frac{\varepsilon ^{\frac{d}{2}}|\!\log \varepsilon |\big (|\!\log \varepsilon |\log ^2\!|\!\log \varepsilon |\big )^{\frac{1}{2d}n(n-1)}}{{\mathrm{Var}}\big [\varepsilon ^{-\frac{d}{2}}\int _{{\mathbb {R}}^d}g\cdot \Xi _\varepsilon ^{\circ ,n}[\nabla {\bar{w}}]\big ]}, \end{aligned}$$where we have set
$$\begin{aligned} {\text {d}}_{{\mathcal {N}}}(X)\,:=\,{\text {W}}_2\left( {\frac{X-\mathbb {E}\left[ X \right] }{{\mathrm{Var}}\left[ X\right] ^{\frac{1}{2}}}};{{\mathcal {N}}}\right) +{\text {d}}_{{\text {TV}}}\left( {\frac{X-\mathbb {E}\left[ X \right] }{{\mathrm{Var}}\left[ X\right] ^{\frac{1}{2}}}};{{\mathcal {N}}}\right) , \end{aligned}$$where \({\text {W}}_2\left( {\cdot };{{\mathcal {N}}}\right) \) and \({\text {d}}_{{\text {TV}}}\left( {\cdot };{{\mathcal {N}}}\right) \) denote the 2-Wasserstein and the total variation distances to a standard Gaussian law, respectively.
In the estimates above, the notation \(\lesssim _\gamma \) (with parameters \(\gamma )\) stands for \(\le \) up to a multiplicative constant \(C_\gamma \ge 1\) that only depends on d, \(\lambda \), \(\Vert a_0\Vert _{W^{2,\infty }}\), and on the parameters \(\gamma \), through an upper bound on suitable (weighted) Sobolev norms of \(\gamma \) if \(\gamma \) is a function. \(\diamondsuit \)
Choosing \(n=\ell \), we are thus led to an intrinsic description of the fluctuations of macroscopic observables \(\int _{{\mathbb {R}}^d} g\cdot (\nabla u_\varepsilon -\mathbb {E}\left[ \nabla u_\varepsilon \right] )\) with accuracy \(O(\varepsilon ^d)\) (with a logarithmic correction in even dimensions), that is, up to the square of the CLT order. In particular, non-Gaussian corrections are only expected beyond that order.
Corollary 1
With the assumptions and notation of Theorem 1, for all \(f,g\in C^\infty _c({\mathbb {R}}^d)^d\), in terms of the intrinsic quantity
there holds for all \(p<\infty \),
and, under the non-degeneracy condition \(\mathcal {Q}_{0000}\odot \int _{{\mathbb {R}}^d}(\nabla {\bar{v}}\otimes \nabla {\bar{u}})^{\otimes 2}\ne 0\),
where \({\text {d}}\left( {\cdot };{\cdot }\right) \) stands for the sum of the 2-Wasserstein and the total variation distances, where \({\mathcal {N}}\) denotes a standard Gaussian random variable, and where
\(\diamondsuit \)
Another question that one may formulate concerns the description of fluctuations of higher-order correctors—although this is somewhat artificial since only combinations of correctors in form of two-scale expansions are intrinsically relevant. As briefly described in Remark 3.6, the proof of the above main result can be adapted in a straightforward way to also describe the fluctuations of \(\varepsilon ^{n-\frac{d}{2}}\varphi ^{n}(\tfrac{\cdot }{\varepsilon })\) and \(\varepsilon ^{n-\frac{d}{2}}\nabla \varphi ^{n+1}(\tfrac{\cdot }{\varepsilon })\) with accuracy \(O(\varepsilon ^{\frac{d}{2}})\) for all \(n\le \ell -1\) (with a logarithmic correction in even dimensions).
1.3 Outline of the article
In Sect. 2, we start by recalling the theory of oscillations in stochastic homogenization in terms of higher-order correctors and two-scale expansions. Section 3 is devoted to the definition and elementary properties of higher-order homogenization commutators, including the (easy) proof of item (i) of Theorem 1 as well as the statement of quantitative higher-order Hill–Mandel relations. Next, we turn to the rest of the proof of Theorem 1, which is based on the following three main ingredients.
-
Malliavin calculus: In the present Gaussian setting, given a random variable X, a Poincaré inequality holds in the form \({\mathrm{Var}}\left[ X\right] \le C\,\mathbb {E}\left[ \Vert DX\Vert _{\mathfrak {H}}^2 \right] \) (cf. [15, 29]), where the Malliavin derivative DX encodes the infinitesimal variation of X with respect to changes in the underlying Gaussian field G and where \(\Vert \cdot \Vert _{\mathfrak {H}}\) is the Hilbert norm associated with the covariance structure of G. Similarly, the approximate normality of X can be estimated by the size of \(D^2X\) with help of a so-called second-order Poincaré inequality [14, 43]. With these functional analytic tools at hand, we are thus reduced to estimating infinitesimal variations of the quantities of interest with respect to G, thus somehow linearizing the dependence on G, which is then particularly amenable to PDE methods. On a more technical level, rather than using ad hoc derivatives with respect to the coefficient field as in previous works in stochastic homogenization, we use here the full power of the Gaussian Malliavin calculus. Relevant definitions and results are recalled in Sect. 4.
-
Representation formulas: In view of applying the above tools from Malliavin calculus, we consider the infinitesimal variations of the quantities of interest, for which we establish suitable representation formulas. This requires a good understanding of the algebra behind higher-order correctors and commutators, and is performed in Sect. 5.
-
PDE ingredients: It remains to estimate the infinitesimal variations of interest from their representation formulas. Based on the large-scale Lipschitz regularity theory for \({\varvec{a}}\)-harmonic functions as developed in [5,6,7, 21], we introduce in Sect. 6 a new annealed Calderón–Zygmund theory for linear elliptic equations in divergence form with random coefficients. This constitutes an upgrade of the quenched large-scale Calderón–Zygmund theory of [3, 4, 21] (see also [31] for a direct approach); it happens to be particularly well suited for our purposes in this article and is of independent interest.
Finally, with these three ingredients at hand, we turn to the proof of items (ii), (iii), and (iv) of Theorem 1 in Sects. 7–9, respectively.
1.4 Notation
-
We denote by \(C\ge 1\) any constant that only depends on d, \(\lambda \), and \(\Vert a_0\Vert _{W^{2,\infty }}\). We use the notation \(\lesssim \) (resp. \(\gtrsim \)) for \(\le C\times \) (resp. \(\ge \frac{1}{C}\times \)) up to such a multiplicative constant C. We write \(\simeq \) when both \(\lesssim \) and \(\gtrsim \) hold. We add subscripts to C, \(\lesssim \), \(\gtrsim \), \(\simeq \) in order to indicate dependence on other parameters. If the subscript is a function, then it is understood as dependence on an upper bound on suitable (weighted) Sobolev norms of the function.
-
The ball centered at x of radius r in \({\mathbb {R}}^d\) is denoted by \(B_r(x)\), and we simply write B(x) if \(r=1\).
-
For a function f and \(1\le p<\infty \), we write \([f]_p(x):=(\fint _{B(x)}|f|^p)^{1/p}\) for the local moving \({\text {L}}^p\) average, and similarly \([f]_\infty (x):=\sup _{B(x)}|f|\).
-
For a kth-order tensor T we define for any fixed indices \(i_1,\ldots ,i_k\),
$$\begin{aligned} {\text {Sym}}_{i_1\ldots i_k}T_{i_1\ldots i_k}\,:=\,\frac{1}{k!}\sum _{\sigma \in {\mathcal {S}}_k}T_{i_{\sigma (1)}\ldots i_{\sigma (k)}}, \end{aligned}$$with \({\mathcal {S}}_k\) denoting the set of permutations of the set \(\{1,\ldots ,k\}\). Although this notation may seem redundant at this point, it will prove very useful in the sequel.
-
For \(r\in {\mathbb {R}}\) we denote by \(\lceil r\rceil \) the smallest integer \(\ge r\) and by \(\lfloor r\rfloor \) the largest integer \(\le r\). For \(r,s\in {\mathbb {R}}\) we write \(r\vee s:=\max \{r,s\}\) and \(r\wedge s:=\min \{r,s\}\). We set \(\langle x\rangle :=(1+|x|^2)^{1/2}\) for \(x\in {\mathbb {R}}^d\), and \(\langle \nabla \rangle \) denotes the corresponding pseudo-differential operator.
2 Higher-order theory of oscillations
In this section, as a prelude to the study of higher-order fluctuations, we describe the classical higher-order theory of oscillations in stochastic homogenization in terms of two-scale expansions, including some new results and shorter arguments. We start by recalling the definition of higher-order correctors, homogenized coefficients, fluxes, and flux correctors as motivated by formal two-scale expansions (e.g. [11, 30]). A simple iterative way to view this definition is as follows: for \(n\ge 1\), assuming that the previous correctors \(\varphi ^1,\ldots ,\varphi ^{n-1}\) are well-defined as centered stationary objects, the next corrector gradient \(\nabla \varphi ^n\) is the unique centered stationary object that is characterized by the property that for all nth-order polynomials \({\bar{q}}\) the corrected polynomial \(F^n[{\bar{q}}]:={\bar{q}}+\sum _{k=1}^n\varphi ^k_{i_1\ldots i_k}\nabla _{i_1\ldots i_k}^k{\bar{q}}\) captures oscillations of the heterogeneous operator in the sense that \(\nabla \cdot {\varvec{a}}\nabla F^n[{\bar{q}}]\) is deterministic. In that case, there must hold [cf. (2.9)]
which in addition characterizes the higher-order homogenized coefficients \({\bar{{\varvec{a}}}}^2,\ldots ,{\bar{{\varvec{a}}}}^{n-1}\) next to the first-order one \({\bar{{\varvec{a}}}}^1:={\bar{{\varvec{a}}}}\). In other words, the higher-order corrector expansion \(F^n[\cdot ]\) precisely intertwines the heterogeneous elliptic operator with its higher-order homogenized version. Note that the above only characterizes the symmetric part \({\text {Sym}}_{i_1\ldots i_n}\varphi _{i_1\ldots i_n}\) of correctors, which is indeed the only relevant quantity in view of two-scale expansions. It is however convenient to define correctors with a suitable non-symmetric part in such a way that the corresponding fluxes can be written in terms of skew-symmetric flux correctors, which is a well-known crucial tool in view of error estimates (e.g. [30, p. 27]). As stated below, in contrast with the periodic case, only \(\ell :=\lceil \frac{d}{2}\rceil \) correctors can be defined, with the \(\ell -1\) first of them being stationary; this number would be further reduced in the case of coefficient fields with stronger correlations (e.g. [9, 20]).
Definition 2.1
(Higher-order correctors) Given \(\ell :=\lceil \frac{d}{2}\rceil \), we inductively define the correctors \((\varphi ^n)_{0\le n\le \ell }\), the homogenized coefficients \(({\bar{{\varvec{a}}}}^n)_{1\le n\le \ell }\), the fluxes \((q^n)_{1\le n\le \ell }\), and the flux correctors \((\sigma ^n)_{0\le n\le \ell }\) as follows:
-
\(\varphi ^0:=1\) and for all \(1\le n\le \ell \) we define \(\varphi ^n:=(\varphi ^n_{i_1\ldots i_n})_{1\le i_1,\ldots ,i_n\le d}\) with \(\varphi ^n_{i_1\ldots i_n}\) a scalar field that satisfies
$$\begin{aligned}&-\,\nabla \cdot {\varvec{a}}\nabla \varphi ^n_{i_1\ldots i_n}=\nabla \cdot \big (({\varvec{a}}\varphi ^{n-1}_{i_1\ldots i_{n-1}}-\sigma ^{n-1}_{i_1\ldots i_{n-1}})\,e_{i_n}\big ),\\&\quad \nabla \varphi ^n_{i_1\ldots i_n}~\text {stationary},\quad \mathbb {E}\left[ |\nabla \varphi ^n_{i_1\ldots i_n}|^2 \right] <\infty ,\quad \mathbb {E}\left[ \nabla \varphi ^n_{i_1\ldots i_n} \right] =0. \end{aligned}$$For \(n<\ell \) we can choose \(\varphi ^n\) itself stationary with \(\mathbb {E}\left[ \varphi ^n \right] =0\), while for \(n=\ell \) we choose the anchoring \(\varphi ^\ell (0)=0\).
-
For all \(1\le n\le \ell \) we define \({\bar{{\varvec{a}}}}^n:=({\bar{{\varvec{a}}}}^n_{i_1\ldots i_{n-1}})_{1\le i_1,\ldots ,i_{n-1}\le d}\) with \({\bar{{\varvec{a}}}}^n_{i_1\ldots i_{n-1}}\) the matrix given by
$$\begin{aligned} {\bar{{\varvec{a}}}}^n_{i_1\ldots i_{n-1}}e_{i_n}:=\mathbb {E}\big [ {\varvec{a}}\big (\nabla \varphi ^{n}_{i_1\ldots i_n}+\varphi ^{n-1}_{i_1\ldots i_{n-1}}e_{i_n}\big ) \big ]. \end{aligned}$$(2.1) -
For all \(1\le n\le \ell \) we define \(q^n:=(q^n_{i_1\ldots i_n})_{1\le i_1,\ldots ,i_n\le d}\) with \(q^n_{i_1\ldots i_n}\) the vector field given by
$$\begin{aligned} q^n_{i_1\ldots i_n}:={\varvec{a}}\nabla \varphi ^{n}_{i_1\ldots i_n}+({\varvec{a}}\varphi ^{n-1}_{i_1\ldots i_{n-1}}-\sigma ^{n-1}_{i_1\ldots i_{n-1}})\,e_{i_n}-{\bar{{\varvec{a}}}}_{i_1\ldots i_{n-1}}^{n}e_{i_n}, \end{aligned}$$where the definition of \({\bar{{\varvec{a}}}}^n\) and the normalization of \(\sigma ^{n-1}\) below ensure \(\mathbb {E}\left[ q^n \right] =0\).
-
\(\sigma ^0:=0\) and for all \(1\le n\le \ell \) we define \(\sigma ^n:=(\sigma ^n_{i_1\ldots i_n})_{1\le i_1,\ldots ,i_n\le d}\) with \(\sigma ^n_{i_1\ldots i_n}\) a skew-symmetric matrix field that satisfies
$$\begin{aligned}&-\,\triangle \sigma ^n_{i_1\ldots i_n}=\nabla \times q^n_{i_1\ldots i_n},\quad \nabla \cdot \sigma ^n_{i_1\ldots i_n}=q^n_{i_1\ldots i_n},\\&\quad \nabla \sigma ^n_{i_1\ldots i_n}~\text {stationary},\quad \mathbb {E}\left[ |\nabla \sigma ^n_{i_1\ldots i_n}|^2 \right] <\infty ,\quad \mathbb {E}\left[ \nabla \sigma ^n_{i_1\ldots i_n} \right] =0, \end{aligned}$$with the notation \((\nabla \times X)_{ij}:=\nabla _iX_j-\nabla _jX_i\) for a vector field X and with the notation \((\nabla \cdot Y)_i:=\nabla _jY_{ij}\) for a matrix field Y. For \(n<\ell \) we can choose \(\sigma ^n\) itself stationary with \(\mathbb {E}\left[ \sigma ^n \right] =0\), while for \(n=\ell \) we choose the anchoring \(\sigma ^\ell (0)=0\).\(\diamondsuit \)
We first state that the above definition of higher-order correctors indeed makes sense. This easily follows from [9, proof of Proposition 9 and Lemma 12] together with Lemma 7.1 below in the considered Gaussian setting (see also [3, 20, 21]), and the proof is omitted. Note that the optimal stochastic integrability (that is, the optimal constants \(c_n\)’s below) is not required in our analysis, hence is not addressed here.
Proposition 2.2
(Higher-order correctors [3, 9, 20, 21]) All the quantities in Definition 2.1 exist and are uniquely defined. In addition, there exists a sequence \((c_n)_{0\le n\le \ell }\) of positive numbers with \(c_0=\frac{1}{2}\) such that for all \(0\le n\le \ell \) and \(1\le p<\infty \),
where in line with (1.6) we have set
\(\diamondsuit \)
For later reference we also state the following result on the fluctuation scaling of correctors, showing that higher-order correctors have stronger correlations hence worse fluctuation scalings, as first emphasized in [24].
Lemma 2.3
(Fluctuation scaling of higher-order correctors) For all \(g\in C^\infty _c({\mathbb {R}}^d)\), \(0\le n\le \ell -1\), and \(1\le p<\infty \),
\(\diamondsuit \)
If \({\varvec{a}}\) is replaced by the pointwise transpose field \({\varvec{a}}^*\), we write \((\varphi ^{*,n})_{0\le n\le \ell }\), \(({\bar{{\varvec{a}}}}^{*,n})_{1\le n\le \ell }\), \((q^{*,n})_{1\le n\le \ell }\), and \((\sigma ^{*,n})_{0\le n\le \ell }\) for the corresponding objects. While it is well-known that \({\bar{{\varvec{a}}}}^{*,1}=({\bar{{\varvec{a}}}}^1)^*\), the following lemma extends this relation to higher order.
Lemma 2.4
(Symmetries of homogenized coefficients) For all \(1\le n\le \ell \),Footnote 2
In particular, if \({\varvec{a}}\) is symmetric and if n is even, there holds
\(\diamondsuit \)
Remark 2.5
The proof of Lemma 2.4 proceeds by letting the successive inverse operators \((\nabla \cdot {\varvec{a}}\nabla )^{-1}\nabla \cdot \) defining \((\nabla \varphi ^n,\varphi ^{n-1})\) “migrate” to the test function \({\varvec{a}}\) in the definition (2.1) of \({\bar{{\varvec{a}}}}^n\). Using the algebra of correctors, a full migration precisely leads to the dual expression \({\bar{{\varvec{a}}}}^{*,n}\) and proves the relation (2.2). Rather stopping this migration argument at an intermediate step, we obtain for all \(1\le m< n\le \ell \),
Note that this turns into (2.2) for \(m=n-1\). We deduce for \(n=2m+1\) odd,
and for \(n=2m\) even, further using the algebra of correctors to symmetrize the expression,
This shows that \({\bar{{\varvec{a}}}}^n\) can be defined in terms of correctors \((\nabla \varphi ^{s+1},\varphi ^s)\) with \(s\le \lfloor \frac{n}{2}\rfloor \) only. This simple observation was independently made in [1, Theorem 3.5] (see also [47, Lemma 5.2.6]) and has the following striking consequence: Choosing a periodization in law \({\varvec{a}}_L\) of the coefficient field \({\varvec{a}}\) and recalling that the periodic Poincaré inequality ensures that all the correctors \(\varphi _L^n\) are defined as stationary objects so that all the corresponding homogenized coefficients \({\bar{{\varvec{a}}}}_L^n\) are well-defined, the above computation entails that \({\text {Sym}}_{i_1\ldots i_n}\big ({\bar{{\varvec{a}}}}^n_{L;i_1\ldots i_{n-1}}e_{i_n}\big )\) has a limit as \(L\uparrow \infty \) for all \(n\le d\) if d is odd and for all \(n< d\) if d is even. This partially solves a weak version of the Bourgain–Spencer conjecture [12, 33] as stated in [17, Section 4.2]. Refinements in this direction are postponed to a future work. \(\diamondsuit \)
We turn to the accuracy of two-scale expansions in the homogenization regime. Given \(f\in {\text {L}}^2({\mathbb {R}}^d)^d\), we consider the unique Lax–Milgram solution \(\nabla u_\varepsilon \) of the rescaled elliptic PDE (1.2),
Standard two-scale expansion techniques [11] formally suggest
where \({\bar{U}}_{\varepsilon }^n\) satisfies the nth-order homogenized equation
The effective field-flux constitutive relation \(\nabla {\bar{w}}\mapsto {\bar{{\varvec{a}}}}\nabla {\bar{w}}\) is thus refined into the higher-order relation
which includes dispersive corrections. However, Eq. (2.3) is ill-posed in general for n even and a suitable proxy needs to be devised (cf. also [32]). We start by introducing a notation for two-scale expansions.
Definition 2.6
(Higher-order two-scale expansions) For \(0\le n\le \ell \), given a deterministic smooth function \({\bar{w}}\), its nth-order two-scale expansion \(F^n_\varepsilon [{\bar{w}}]\) is defined as
At the level of gradients, we similarly define for \(0\le n\le \ell \),
\(\diamondsuit \)
In particular, note that the quantities \(\nabla F^n_\varepsilon [{\bar{w}}]\) and \(E^n_\varepsilon [\nabla {\bar{w}}]\) are related as follows: for all \(0\le n\le \ell \),
where all coefficients of \(E^n_\varepsilon [\nabla {\bar{w}}]\) are stationary. We turn to the definition of a suitable proxy for the ill-posed higher-order homogenized equation (2.3). For later reference we simultaneously consider the dual equations.
Definition 2.7
(Higher-order homogenized equations) Given \(f,g\in C^\infty _c({\mathbb {R}}^d)^d\), let \(\nabla u_\varepsilon \) and \(\nabla v_\varepsilon \) be the unique Lax–Milgram solutions of the rescaled elliptic PDEs
For \(1\le n\le \ell \), we define the corresponding nth-order homogenized solutions
where \(\nabla {\tilde{u}}^1=\nabla {\bar{u}}\) and \(\nabla {\tilde{v}}^1=\nabla {\bar{v}}\) are the unique Lax–Milgram solutions in \({\mathbb {R}}^d\) of the first-order homogenized equations,
and where for \(2\le n\le \ell \) we inductively define the nth-order corrections \(\nabla {\tilde{u}}^n\) and \(\nabla {\tilde{v}}^n\) as the unique Lax–Milgram solutions in \({\mathbb {R}}^d\) of
\(\diamondsuit \)
With the above notation, the main result from the higher-order corrector theory in stochastic homogenization takes the following guise. For \(q=2\), it is a straightforward consequence of the definition of higher-order correctors and of the corrector estimates of Proposition 2.2. It extends [3, 9, 20, 22] to higher order and completes [24, Theorem 1.6] up to the optimal order with finer norms. For \(n=\ell \), the obtained maximal accuracy is \(O(\varepsilon ^{d/2})\) (with a logarithmic correction in even dimensions), indicating that the study of fluctuations goes beyond the theory of oscillations.
Proposition 2.8
(Accuracy of higher-order two-scale expansions) For all \(f\in C^\infty _c({\mathbb {R}}^d)^d\), \(1\le n\le \ell \), and \(1< q\le p<\infty \),
\(\diamondsuit \)
2.1 Proof of Lemma 2.3
The result is easily obtained based on Malliavin calculus and on the annealed Calderón–Zygmund theory of Sect. 6. More precisely, for \(0\le n\le \ell -1\), we start from Proposition 4.1(iii) together with (4.2) in the form
and the conclusion follows after \(\varepsilon \)-rescaling from the estimates of Lemma 7.1 on the Malliavin derivatives of higher-order correctors. \(\square \)
2.2 Proof of Lemma 2.4
As explained in Remark 2.5, the argument consists in letting the successive inverse operators \((-\nabla \cdot {\varvec{a}}\nabla )^{-1}\nabla \cdot \) defining the correctors \((\nabla \varphi ^n,\varphi ^{n-1})\) migrate to the test function \({\varvec{a}}\) in the definition of the homogenized coefficient
Taking into account the particular algebraic structure of the correctors, a full migration precisely leads to the dual expression \({\bar{{\varvec{a}}}}^{*,n}\) up to additional terms involving higher-order flux correctors, which disappear when taking the symmetric part. We split the proof into two steps.
Step 1. Migration process: proof that for all \(1\le m<n\le \ell \),
We abundantly use the properties of higher-order correctors (cf. Definition 2.1). The equation for \(\varphi ^n\), in conjunction with the stationarity of all involved objects, and the skew-symmetry of \(\sigma ^{n-1}_{i_1\ldots i_{n-1}}\) yield
hence, using the equation \(\nabla \cdot \sigma ^{n-1}=q^{n-1}\) and the choice \(\mathbb {E}\left[ \varphi ^{*,m} \right] =0\),
We focus on rewriting the second right-hand side term. Using the equation for \(\varphi ^{*,m+1}\) and the skew-symmetry of \(\sigma ^{*,m}_{j_1\ldots j_m}\), we find
hence, using the equation for \(\nabla \cdot \sigma ^{*,m}\) and the choice \(\mathbb {E}\left[ \varphi ^{n-1} \right] =0\),
and the claim (2.8) follows.
Step 2. Conclusion.
For \(1\le n\le \ell \), the definition of \({\bar{{\varvec{a}}}}^n\) and the equation for \(\varphi ^{*,1}\) yield
Iterating identity (2.8) then leads to
By skew-symmetry of \(\sigma ^{*,l}_{ji_n\ldots i_{n-l+2}}\) and \(\sigma ^{n-l-2}_{i_1\ldots i_{n-l-2}}\) for all l, we deduce
hence, using the equation for \(\varphi ^{1}\) and the definition of \({\bar{{\varvec{a}}}}^{*,n}\),
and the conclusion follows. \(\square \)
2.3 Proof of Proposition 2.8
We focus on the case \(\varepsilon =1\) and drop it from all subscripts in the notation, while the final result is obtained after \(\varepsilon \)-rescaling. We split the proof into two steps.
Step 1. Equation for \(F_n[{\bar{w}}]\): proof that for all \({\bar{w}}\in C^\infty _c({\mathbb {R}}^d)\) and \(n\ge 0\),
We argue by induction. The claim is obvious for \(n=0\). Now, if it holds for some \(n\ge 0\), we deduce
The definition of \(\sigma _{i_1\ldots i_{n+1}}^{n+1}\) (cf. Definition 2.1) yields
Hence, using the skew-symmetry of \(\sigma _{i_1\ldots i_{n+1}}^{n+1}\) and decomposing
we obtain
Injecting this into (2.10) leads to the claim (2.9) at level \(n+1\).
Step 2. Conclusion.
Let \(n\ge 1\) be fixed. Applying (2.9) with \({\bar{w}}:={\bar{u}}^{n}\) and subtracting the Eq. (2.6) for u, we obtain
We now examine the equation satisfied by \({\bar{u}}^n=\sum _{k=1}^n{\tilde{u}}^k\): summing the defining equations for \(({\tilde{u}}^k)_{1\le k\le n}\) (cf. Definition 2.7), we find
or equivalently,
(This is to be compared with the formal ill-posed Eq. (2.3).) Inserting this into (2.11) yields
An energy estimate and the corrector estimates of Proposition 2.2 then imply for all \(p<\infty \),
where the last estimate follows from the weighted Calderón–Zygmund theory for the constant-coefficient equations defining \(({\tilde{u}}^k)_{1\le k\le n}\). The conclusion (2.7) in \({\text {L}}^2({\mathbb {R}}^d)\) follows after \(\varepsilon \)-rescaling. (Note that for the sake of shortness in the statement the norms of \(\nabla ^kf\) for \(n\le k\le 2n-1\) are replaced by the norm of \(\langle \nabla \rangle ^{2n-1}f\), which is no longer scale invariant.) The corresponding result in \({\text {L}}^q({\mathbb {R}}^d)\) for general \(1<q<\infty \) requires to replace the energy estimate for (2.12) by some \({\text {L}}^q\) theory, as provided e.g. by Theorem 6.1 below; details are omitted. \(\square \)
3 Higher-order homogenization commutators
As discovered in [5, 19], the homogenization commutator \(\Xi _\varepsilon ^1[\nabla u_\varepsilon ]:=({\varvec{a}}(\tfrac{\cdot }{\varepsilon })-{\bar{{\varvec{a}}}}^1)\nabla u_\varepsilon \), which takes the form of a commutator between large-scale averaging and the field-flux constitutive relation, plays a key role in the study of fluctuations in stochastic homogenization. Indeed, while the two-scale expansion of the solution \(\nabla u_\varepsilon \) is not accurate in the fluctuation scaling [25], the expansion of its commutator \(\Xi _\varepsilon ^1[\nabla u_\varepsilon ]\) is accurate [19]. The leading-order fluctuations are then governed by the standard homogenization commutator, that is, the commutator of the corrector field, \(\Xi _\varepsilon ^{\circ ,1}[\nabla {\bar{u}}^1]:=({\varvec{a}}(\tfrac{\cdot }{\varepsilon })-{\bar{{\varvec{a}}}}^1)(\nabla \varphi _i^1(\tfrac{\cdot }{\varepsilon })+e_i)\nabla _i{\bar{u}}^1\). For higher-order fluctuations, a suitable higher-order correction of the homogenization commutator \(\Xi _\varepsilon ^1[\nabla u_\varepsilon ]\) is defined as follows.
Definition 3.1
(Higher-order homogenization commutators) For \(0\le n\le \ell \), given a random function w, its nth-order homogenization commutator \(\Xi ^n_\varepsilon [\nabla w]\) is the (distributional) random vector field
\(\diamondsuit \)
This definition is natural in view of the higher-order effective field-flux constitutive relation (2.4). Note that the symmetries of the homogenized coefficients (cf. Lemma 2.4) and an integration by parts lead to the following duality relation for all smooth and compactly supported random functions \(w,w'\),
Similarly as for the first-order commutator [19, (1.8)], we state that the higher-order fluctuations of the field \(\nabla u_\varepsilon \) are determined by those of the higher-order commutators. While at first order we have the exact relation \(\int _{{\mathbb {R}}^d}g\cdot (\nabla u_\varepsilon -\nabla {\bar{u}})=\int _{{\mathbb {R}}^d}\nabla {\bar{v}}\cdot \Xi _\varepsilon ^1[\nabla u_\varepsilon ]\), this does no longer hold at higher orders due to the choice of a proxy for solutions of the generically ill-posed higher-order homogenized equations (2.3) (cf. (3.4) below). A corresponding formula can be deduced for the flux \({\varvec{a}}(\tfrac{\cdot }{\varepsilon })\nabla u_\varepsilon \). This key principle corresponds to item (i) in Theorem 1.
Proposition 3.2
(Reduction to commutators) For all \(1\le n\le \ell \) and \(p<\infty \),
where we recall the notation \(N_p(X)=\mathbb {E}\left[ |X-\mathbb {E}\left[ X \right] |^p \right] ^\frac{1}{p}\). \(\diamondsuit \)
Next, we wish to emphasize that the above definition of the higher-order commutator \(\Xi _\varepsilon ^n\) leads to a higher-order Hill–Mandel relation (e.g. [51] in the mechanics literature). First recall that the first-order commutator \(\Xi ^1_{\varepsilon }[\nabla u_\varepsilon ]:=({\varvec{a}}-{\bar{{\varvec{a}}}})(\tfrac{\cdot }{\varepsilon })\nabla u_\varepsilon \) was a natural quantity to consider in [5, 19] since the very definition of H-convergence [39] is equivalent to the weak convergence of \(\Xi ^1_\varepsilon [\nabla u_\varepsilon ]\) to 0, which is the mathematical formulation of the classical Hill–Mandel relation [27, 28]. Interestingly, this can be made quantitative: a direct computation for the solution of (2.6) yields
where the right-hand side is checked to be of order \(O(\varepsilon )\) in a weak norm if \(d>2\). This property extends to higher order as follows.
Lemma 3.3
(Higher-order Hill–Mandel relation) For all \(f{\in } C^\infty _c({\mathbb {R}}^d)^d\) and \(1{\le }\, n{\le }\,\ell \),
hence, for all \(g\in C^\infty _c({\mathbb {R}}^d)^d\) and \(p<\infty \),
\(\diamondsuit \)
Finally, the relevant higher-order two-scale expansion of the higher-order commutator \(\Xi ^n_\varepsilon [\nabla u_\varepsilon ]\) takes the form \(\Xi ^{\circ ,n}_\varepsilon [\nabla {\bar{u}}^n_\varepsilon ]\), where the so-called standard commutator \(\Xi ^{\circ ,n}_\varepsilon [\nabla \cdot ]\) is the nth-order differential operator with \(\varepsilon \)-rescaled (distributional) stationary random coefficients that is obtained by inserting the nth-order two-scale expansion \(F_\varepsilon ^n[\cdot ]\) into the commutator \(\Xi ^n_\varepsilon [\nabla \cdot ]\) and by truncating the obtained differential operator at order n. In other words, the standard commutator \(\Xi ^{\circ ,n}_\varepsilon [\nabla \cdot ]\) is the nth-order linear differential operator characterized by \(\Xi ^{\circ ,n}_\varepsilon [\nabla {\bar{q}}]=\Xi ^{n}_\varepsilon [\nabla F_\varepsilon ^n[{\bar{q}}]]\) for all nth-order polynomials \({\bar{q}}\). This is efficiently expressed as follows.
Definition 3.4
(Standard higher-order homogenization commutators) For \(0\le n\le \ell \), given a smooth deterministic function \({\bar{w}}\), we consider its nth-order Taylor polynomial with basepoint x,
where we use standard multi-index notation, and the nth-order standard homogenization commutator \(\Xi ^{\circ ,n}_\varepsilon [\nabla {\bar{w}}]\) is then defined as the (distributional) random vector field
(Henceforth by a slight abuse of notation we similarly define \(\Xi ^n[H]\) as in Definition 3.1 even when H is not a gradient field.) \(\diamondsuit \)
This definition ensures \(\mathbb {E}\left[ \Xi _\varepsilon ^{\circ ,n}[\nabla {\bar{w}}] \right] =0\) for all \(0\le n\le \ell \) and \({\bar{w}}\in C^\infty _c({\mathbb {R}}^d)\). Note that \(\Xi _{\varepsilon }^{\circ ,n}[\nabla {\bar{w}}]\) in general differs from \(\Xi _{\varepsilon }^{n}[E_\varepsilon ^{n}[\nabla {\bar{w}}]]\) whenever \(n>1\), even though \(E^n_\varepsilon [\nabla T^n_x{\bar{w}}](x)=E^n_\varepsilon [\nabla {\bar{w}}](x)\). An explicit formula is as follows.
Lemma 3.5
(Explicit formula for \(\Xi ^{\circ ,n}_\varepsilon \)) For all \({\bar{w}}\in C^\infty _c({\mathbb {R}}^d)\) and \(1\le n\le \ell \),
\(\diamondsuit \)
Remark 3.6
While Proposition 3.2 reduces higher-order fluctuations of the field \(\nabla u_\varepsilon \) to corresponding fluctuations of its higher-order homogenization commutator, we briefly argue that fluctuations of higher-order correctors can similarly be reduced to fluctuations of higher-order standard commutators. A simple adaptation of the proof of Proposition 3.2 yields for \(0\le n\le \ell -1\),
Inserting the formula of Lemma 3.5 and the definition of \(E_\varepsilon ^n\) (cf. Definition 2.6), the expression on the left-hand side can, after straightforward computations, be rewritten as follows,
in terms of
If we wish to describe only first-order fluctuations, we may replace \(T_\varepsilon ^{n,s}(f,g)\) by \(g\nabla ^{s+1}{\bar{w}}\) and \(\nabla {\bar{v}}_\varepsilon ^n\) by \(\nabla {\bar{v}}\), and use Lemma 2.3 to estimate the remainder, to the effect of
hence, recalling Definition 2.6 and taking differences, for all \(0\le n\le \ell -1\),
This characterizes the first-order fluctuations of \({\text {Sym}}_{j_1\ldots j_n}(\nabla \varphi ^{n}_{j_1\ldots j_n}+\varphi ^{n-1}_{j_1\ldots j_{n-1}}e_{j_n})\) in terms of (differences of) standard commutators. Note that fluctuations of the latter are easily jointly characterized by repeating the analysis of Sects. 8 and 9, and are seen to converge to suitable linear combinations of derivatives of Gaussian white noise. From here we may inductively infer first-order fluctuations of \({\text {Sym}}_{j_1\ldots j_n}\varphi ^{n}_{j_1\ldots j_n}\) for all \(n\le \ell -1\), while corresponding higher-order fluctuations are similarly extracted from (3.3). \(\diamondsuit \)
3.1 Proof of Proposition 3.2
We focus on the case \(\varepsilon =1\) and drop it from all subscripts in the notation, while the final result is obtained after \(\varepsilon \)-rescaling. We split the proof into three steps.
Step 1. Proof of
Testing equation \(-\,\nabla \cdot {\bar{{\varvec{a}}}}^{*,1}\nabla {\bar{v}}^1=\nabla \cdot g\) with u and testing equation \(-\nabla \cdot {\varvec{a}}\nabla u=\nabla \cdot f\) with \({\bar{v}}^{n}\), we find
Noting that the definition of \(\Xi ^n[\nabla u]\) (cf. Definition 3.1) together with (3.1) yields after integration by parts
we deduce
Decomposing \({\bar{v}}^{n}:=\sum _{j=1}^{n}{\tilde{v}}^j\) and using the defining equations for \(({\tilde{v}}^j)_{2\le j\le n}\) (cf. Definition 2.7) in the resumed form
the claim follows.
Step 2. CLT scaling: for all \(g\in C^\infty _c({\mathbb {R}}^d)^d\),
This result is classical [18, 20, 37]. A short proof based on Malliavin calculus and on the annealed Calderón–Zygmund theory of Sect. 6 is easily obtained as in (7.9); details are omitted.
Step 3. Conclusion.
Since \(\int _{{\mathbb {R}}^d}\nabla {\bar{v}}^n\cdot f\) is deterministic, the identity of Step 1 yields
Applying the CLT scaling result of Step 2 and the Calderón–Zygmund theory for the constant-coefficient equations defining \(({\tilde{v}}^k)_{1\le k\le n}\), this expression is estimated by
and the conclusion follows after \(\varepsilon \)-rescaling. \(\square \)
3.2 Proof of Lemma 3.3
We focus on the case \(\varepsilon =1\) and drop it from all subscripts in the notation, while the final result is obtained after \(\varepsilon \)-rescaling. We split the proof into two steps.
Step 1. Proof that for all \(1\le n\le \ell \),
which coincides with (3.2) after \(\varepsilon \)-rescaling.
First, by Lemma 2.4, the nth-order commutator can alternatively be written as
It remains to show that for all \(0\le n\le \ell \),
while the result (3.5) indeed follows after injecting the equation \(-\nabla \cdot {\varvec{a}}\nabla u=\nabla \cdot f\) and writing \(\varphi ^{*,k}_{ji_1\ldots i_{k-1}}\nabla \cdot f=\nabla \cdot (\varphi ^{*,k}_{ji_1\ldots i_{k-1}}f)-\nabla \varphi ^{*,k}_{ji_1\ldots i_{k-1}}\cdot f\). We prove (3.7) by induction. It is obvious for \(n=0\) (recall that for \(n=0\) the notation \({\varvec{a}}^*\varphi _{ji_1\ldots i_{n-1}}^{*,n}e_{i_n}\) stands for \({\varvec{a}}^*e_j\)). Assume that it holds for some \(n\ge 0\). In order to deduce it at level \(n+1\), it suffices to prove that
The equation for \(\nabla \cdot \sigma ^{*,n+1}\) (cf. Definition 2.1) yields
and the claim (3.8) follows from the skew-symmetry of \(\sigma ^{*,n+1}_{ji_1\ldots i_n}\).
Step 2. Conclusion.
For \(g\in C^\infty _c({\mathbb {R}}^d)^d\), combining (3.5) with the energy estimate \(\Vert \nabla u\Vert _{{\text {L}}^2({\mathbb {R}}^d)}\lesssim \Vert f\Vert _{{\text {L}}^2({\mathbb {R}}^d)}\) and with the corrector estimates of Proposition 2.2, we find for all \(p<\infty \),
In order to optimally estimate the last contribution, we must exploit stochastic cancellations in form of Lemma 2.3, which yields after \(\varepsilon \)-rescaling, for all \(1\le n\le \ell \),
Since \(\ell -1\le \frac{1}{2}(d-1)\), using Hölder’s and Sobolev’s inequalities in the form
the conclusion follows. \(\square \)
3.3 Proof of Lemma 3.5
First note that, given a linear differential operator O of order m, say \(O[{\bar{w}}]=\sum _{k=0}^mP^k_{j_1\ldots j_k}(x)\nabla ^k_{j_1\ldots j_k}{\bar{w}}\), the characterizing property of the Taylor polynomial yields
which amounts to truncating the differential operator O at order n. Now observe that \(\Xi _{\varepsilon }^n[E_\varepsilon ^n[\nabla \cdot ]]\) is a linear differential operator of order \(2n-1\) with (distributional) stationary random coefficients. Composing the explicit definitions of \(E_\varepsilon ^n\) and \(\Xi _{\varepsilon }^n\) (cf. Definitions 2.6 and 3.1), we obtain
Hence, relabelling \(k-1\) as k in the second right-hand side contribution, using the symmetry in \(i_1,\ldots ,i_{k}\), and expanding the multiple derivative \(\nabla ^{k}_{i_1\ldots i_{k}}\) by using the general Leibniz rule,
Now applying (3.9) to the definition of \(\Xi _{\varepsilon }^{\circ ,n}[\nabla {\bar{w}}]\) (cf. Definition 3.4) together with the above formula for \(\Xi _{\varepsilon }^n[E_\varepsilon ^n[\nabla {\bar{w}}]]\), we are led to
Comparing this with (3.10) yields the conclusion. \(\square \)
4 Reminder on Malliavin calculus
In this section, we recall some basic definitions of the Malliavin calculus with respect to the Gaussian field G (e.g. [36, 42, 44] for details). For simplicity, we focus on the explicit description of the Malliavin derivative on a simple dense subspace of random variables (cf. \({\mathcal {R}}\) hereafter), where it is seen to coincide with the formal \({\text {L}}^2\)-gradient with respect to G. Next, we give a precise statement of the different tools from Malliavin calculus that will be used in the sequel. We emphasize that the reader should not be intimidated by the somehow abstract appearance of the theory: for our application to stochastic homogenization all (averaged) quantities of interest happen to be Malliavin smooth, and their Malliavin derivatives are systematically computed by duality arguments so that in the end everything is expressed in terms of the (explicit) Malliavin derivative of the coefficient field \({\varvec{a}}\) itself.
The Gaussian random field G can be seen as a random Schwartz distribution, that is, as a random element in \({\mathcal {S}}'({\mathbb {R}}^d)^\kappa \). Indeed, for all \(\zeta _1,\zeta _2\in C^\infty _c({\mathbb {R}}^d)^\kappa \), we define \(G(\zeta _1)\), \(G(\zeta _2)\) (or formally \(\int _{{\mathbb {R}}^d} G\zeta _1\), \(\int _{{\mathbb {R}}^d} G\zeta _2\)) as centered Gaussian random variables with covariance
We define \({\mathfrak {H}}\) as the closure of \(C^\infty _c({\mathbb {R}}^d)^\kappa \) for the (semi)norm
The space \({\mathfrak {H}}\) (up to taking the quotient with respect to the kernel of \(\Vert \cdot \Vert _{\mathfrak {H}}\)) is a separable Hilbert space. Under the integrability condition (1.5), it obviously contains \({\text {L}}^2({\mathbb {R}}^d)^\kappa \) and there holds
In view of the isometry relation \({\text {Cov}}\left[ {G(\zeta _1)};{G(\zeta _2)}\right] =\langle \zeta _1,\zeta _2\rangle _{\mathfrak {H}}\), the random field G is said to be an isonormal Gaussian process over \({\mathfrak {H}}\).
Without loss of generality, we work under the minimality assumption \({\mathcal {F}}=\sigma (G)\), which implies that the linear subspace
is dense in \({\text {L}}^2(\Omega )\). This allows to define operators and prove properties on the simpler subspace \({\mathcal {R}}\) before extending them to \({\text {L}}^2(\Omega )\) by density. For \(r\ge 1\) we similarly define
which is dense in \({\text {L}}^2(\Omega ;{\mathfrak {H}}^{\otimes r})\). For a random variable \(X\in {\mathcal {R}}\), say \(X=g(G(\zeta _1),\ldots ,G(\zeta _n))\), we define its Malliavin derivative \(DX\in {\text {L}}^2(\Omega ;{\mathfrak {H}})\) as
which coincides with the formal \({\text {L}}^2\) gradient of X with respect to G. For an element \(X\in {\mathcal {R}}({\mathfrak {H}}^{\otimes r})\) with \(r\ge 1\), say \(X=\sum _{i=1}^n\psi _iX_i\), the Malliavin derivative \(DX\in {\text {L}}^2(\Omega ;{\mathfrak {H}}^{\otimes (r+1)})\) is defined as \(DX=\sum _{i=1}^n\psi _i\otimes DX_i\). For \(j\ge 1\), we iteratively define the jth-order Malliavin derivative \(D^j{:}\,{\mathcal {R}}({\mathfrak {H}}^{\otimes r})\rightarrow {\text {L}}^2(\Omega ;{\mathfrak {H}}^{\otimes (r+j)})\) for all \(r\ge 0\). For all \(r,m\ge 0\), we then set
we define the Malliavin–Sobolev space \(\mathbb {D}^{m,2}({\mathfrak {H}}^{\otimes r})\) as the closure of \({\mathcal {R}}({\mathfrak {H}}^{\otimes r})\) for this norm, and we extend the Malliavin derivatives \(D^j\) by density to these spaces.
Next, we define a divergence operator \(D^*\) as the adjoint of the Malliavin derivative D and we start with its domain: for \(r\ge 1\), we set
and for all \(X\in {{\text {dom}}}\,(D^*|_{{\text {L}}^2(\Omega ;{\mathfrak {H}}^{\otimes r})})\) we define the divergence \(D^*X\in {\text {L}}^2(\Omega ;{\mathfrak {H}}^{\otimes (r-1)})\) by duality via the relation
One can show that \({{\text {dom}}}\,(D^*|_{{\text {L}}^2(\Omega ;{\mathfrak {H}}^{\otimes r})})\supset \mathbb {D}^{1,2}({\mathfrak {H}}^{\otimes r})\) (e.g. [44, Proposition 1.3.1]). Combining the Malliavin derivative and the associated divergence, we construct the so-called Ornstein–Uhlenbeck operator (or infinite-dimensional Laplacian)
which is an essentially self-adjoint positive operator. The explicit actions of \(D^*\) and \(\mathcal {L}\) on \({\mathcal {R}}\) are easily computed (e.g. [42, p. 34]): for \(X\in {\mathcal {R}}({\mathfrak {H}}^{\otimes r})\), say \(X=\sum _{i=1}^n\psi _iX_i\), we have \(\mathcal {L}X=\sum _{i=1}^n\psi _i\,\mathcal {L}X_i\), while for \(X_i=g(G(\zeta _1),\ldots ,G(\zeta _n))\),
In particular, a direct computation (e.g. [42, p. 35]) leads to the crucial commutator relation
In addition, one checks that \(\mathcal {L}\) is shift-invariant. While the positivity of \(\mathcal {L}\) ensures that the inverse operator \((1+\mathcal {L})^{-1}\) is contractive on \({\text {L}}^2(\Omega )\), it is easily checked to be also contractive on \({\text {L}}^p(\Omega )\) for all \(p>1\) (cf. e.g. [38, Proposition 3.2]). Although we focus here on the explicit description of operators on their core \({\mathcal {R}}\), the action of \(D,D^*,\mathcal {L}\) on general \({\text {L}}^2\) random variables are naturally described in terms of their Wiener chaos expansion (e.g. [42, 44]).
Based on the above definitions, we state the following proposition collecting various useful results for the fine analysis of functionals of the Gaussian field G. Item (i) is classical [15, 29, 40], as well as item (iii) (e.g. [8] and references therein). Item (ii) is well-known in the discrete Gaussian setting [26] (cf. also [38, 50]). Item (iv) in total variation distance (and similarly in 1-Wasserstein distance) is a consequence of Stein’s method: it was first obtained in the discrete setting by Chatterjee [14], while the present Malliavin analogue is due to [41, 43] (of which we give a slightly different formulation here). The corresponding result in 2-Wasserstein distance is different and is due to [35]. Note the particular role of the so-called Stein kernel \(\langle DX,(1+\mathcal {L})^{-1}DX\rangle _{{\mathfrak {H}}}\) in items (ii) and (iv). A short proof is included in “Appendix A” for the reader’s convenience.
Proposition 4.1
-
(i)
First-order Poincaré inequality For all \(X\in {\text {L}}^2(\Omega )\),
$$\begin{aligned} {\mathrm{Var}}\left[ X\right] \,\le \,\mathbb {E}\left[ \Vert DX\Vert _{{\mathfrak {H}}}^2 \right] . \end{aligned}$$ -
(ii)
Helffer–Sjöstrand identity For all \(X,Y\in \mathbb {D}^{1,2}(\Omega )\),
$$\begin{aligned} {\text {Cov}}\left[ {X};{Y}\right] \,=\,\mathbb {E}\left[ \langle DX,(1+\mathcal {L})^{-1}DY\rangle _{{\mathfrak {H}}} \right] . \end{aligned}$$(4.6) -
(iii)
Logarithmic Sobolev inequality For all \(X\in {\text {L}}^2(\Omega )\),
$$\begin{aligned} {\mathrm{Ent}}\!\left[ X^2\right] \,:=\,\mathbb {E}\left[ X^2\log \frac{X^2}{\mathbb {E}\left[ X^2 \right] } \right] \,\le \,2\,\mathbb {E}\left[ \Vert DX\Vert _{\mathfrak {H}}^2 \right] , \end{aligned}$$hence in particular, for all \(p\ge 1\),
$$\begin{aligned} \mathbb {E}\left[ |X-\mathbb {E}\left[ X \right] |^{2p} \right] \,\le \,(2p+1)^p\,\mathbb {E}\big [ \Vert DX\Vert _{\mathfrak {H}}^{2p} \big ]. \end{aligned}$$ -
(iv)
Second-order Poincaré inequality For all \(X\in {\text {L}}^2(\Omega )\) with \(\mathbb {E}\left[ X \right] =0\) and \({\mathrm{Var}}\left[ X\right] =1\),
$$\begin{aligned} {\text {W}}_2\left( {X};{{\mathcal {N}}}\right) \vee {\text {d}}_{{\text {TV}}}\left( {X};{{\mathcal {N}}}\right)\le & {} 2\,{\mathrm{Var}}\left[ \langle DX,(1+\mathcal {L})^{-1}DX\rangle _{{\mathfrak {H}}}\right] ^\frac{1}{2}\\\le & {} 3\,\mathbb {E}\left[ \Vert D^2X\Vert _{{{\text {op}}}}^4 \right] ^\frac{1}{4}\mathbb {E}\left[ \Vert DX\Vert _{\mathfrak {H}}^4 \right] ^\frac{1}{4}, \end{aligned}$$where \({\text {W}}_2\left( {\cdot };{{\mathcal {N}}}\right) \) and \({\text {d}}_{{\text {TV}}}\left( {\cdot };{{\mathcal {N}}}\right) \) denote the 2-Wasserstein and the total variation distance to a standard Gaussian law, respectively, and where the operator norm of \(D^2X\) is defined by
$$\begin{aligned} \Vert D^2X\Vert _{{{\text {op}}}}\,:=\,\sup _{\begin{array}{c} \zeta ,\zeta '\in {\mathfrak {H}}\\ \Vert \zeta \Vert _{\mathfrak {H}}=\Vert \zeta '\Vert _{\mathfrak {H}}=1 \end{array}}\langle D^2X,\zeta \otimes \zeta '\rangle _{{\mathfrak {H}}^{\otimes 2}}. \end{aligned}$$(4.7)\(\diamondsuit \)
Remark 4.2
Malliavin-type calculus has also been successfully developed in some non-Gaussian settings, to which most of our conclusions could thus be extended. For instance, in the case of an i.i.d. discrete random field \(G:=(G_x)_{x\in {\mathbb {Z}}^d}\), a corresponding calculus is based on the so-called Glauber derivative \(D_xX={\mathbb {E}}\left[ \left. X \,\right| \,G_x\right] -X\) (e.g. [19]). Another important example is that of a Poisson point process \(\xi \) on \({\mathbb {R}}^d\), in which case the Malliavin derivative is chosen as the difference operator \(D_xX(\xi )=X(\xi \cup \{x\})-X(\xi {\setminus }\{x\})\) (e.g. [46]). \(\diamondsuit \)
5 Representation formulas
As recalled in Proposition 4.1 above, tools from Malliavin calculus allow to linearize the dependence on the underlying Gaussian field G and reduce various subtle questions about quantities of interest to the estimation of their Malliavin derivative, that is, of their local dependence with respect to G. In this spirit, the key ingredient for items (ii) and (iii) of Theorem 1 consists in suitable representation formulas for the Malliavin derivative of higher-order commutators.
-
Locality of higher-order commutators
The key property of homogenization commutators for fluctuations is their improved locality with respect to the coefficient field. While this was first discovered at first order in [19] (see also [5, 23, 25]), formula (5.1) below indicates that the higher-order commutator \(\Xi ^n_\varepsilon [\nabla u_\varepsilon ]\) behaves like a local quantity at the level of its Malliavin derivative up to a higher-order error \(O(\varepsilon ^n)\). More precisely, the formula takes the form
$$\begin{aligned} e_j\cdot D\Xi ^n_\varepsilon [\nabla u_\varepsilon ]= & {} \sum _{k=0}^{n-1}(-1)^k\varepsilon ^k\nabla ^k_{i_1\ldots i_k}\Big (\big (\nabla \varphi _{ji_1\ldots i_k}^{*,k+1}\\&+\,\varphi ^{*,k}_{ji_1\ldots i_{k-1}}e_{i_k}\big )(\tfrac{\cdot }{\varepsilon })\cdot D{\varvec{a}}(\tfrac{\cdot }{\varepsilon })\nabla u_\varepsilon \Big )\,+\,O(\varepsilon ^n), \end{aligned}$$where the main right-hand side terms are indeed local in view of \(D_z{\varvec{a}}=a_0'(G(z))\,\delta (\cdot -z)\), and where the error can be checked to be of order \(O(\varepsilon ^n)\) in \({\text {L}}^2(\Omega ;{\mathfrak {H}})\) when integrated with a smooth test function. While the full formula (5.1) below is given for simplicity for \(\varepsilon =1\), the \(\varepsilon \)-scaling is transparent by counting the number of derivatives that fall on test functions. The proof consists in exploiting the algebraic structure of higher-order correctors and commutators in order to make this clean local structure appear.
Since the standard commutator \(\Xi ^{\circ ,n}_\varepsilon [\nabla {\bar{w}}]\) is obtained by applying \(\Xi _\varepsilon ^n[\nabla \cdot ]\) to the two-scale expansion \(F_\varepsilon ^n[\nabla {\bar{w}}]\) whenever \({\bar{w}}\) is an nth-order polynomial, we may deduce a corresponding formula (5.2) for its Malliavin derivative,
$$\begin{aligned}&e_j\cdot D\Xi ^{\circ ,n}_\varepsilon [\nabla {\bar{w}}]\\&\quad \,=\,\sum _{k=0}^{n-1}(-1)^k\varepsilon ^k\nabla ^k_{i_1\ldots i_k}\Big (\big (\nabla \varphi _{ji_1\ldots i_k}^{*,k+1}+\varphi _{ji_1\ldots i_{k-1}}^{*,k}e_{i_k}\big )(\tfrac{\cdot }{\varepsilon })\cdot D{\varvec{a}}(\tfrac{\cdot }{\varepsilon }) E^n_\varepsilon [\nabla {\bar{w}}]\Big )+O(\varepsilon ^n). \end{aligned}$$In view of Proposition 4.1(ii), this improved locality property provides a simple access to the higher-order characterization of the asymptotic covariance structure of large-scale averages of commutators, cf. Sect. 8.
-
Accuracy of the two-scale expansion of higher-order commutators:
Comparing the formulas for Malliavin derivatives of the commutator \(\Xi ^n_\varepsilon [\nabla u_\varepsilon ]\) and of its standard version \(\Xi ^{\circ ,n}_\varepsilon [\nabla {\bar{w}}]\), and choosing \({\bar{w}}={\bar{u}}_\varepsilon ^n\) the homogenized solution, we find for the difference,
$$\begin{aligned}&e_j\cdot D\big (\Xi ^n_\varepsilon [\nabla u_\varepsilon ]-\Xi ^{\circ ,n}_\varepsilon [\nabla {\bar{u}}_\varepsilon ^n]\big ) =O(\varepsilon ^n)\\&\quad +\,\sum _{k=0}^{n-1}(-1)^k\varepsilon ^k\nabla ^k_{i_1\ldots i_k}\Big (\big (\nabla \varphi _{ji_1\ldots i_k}^{*,k+1}+\varphi ^{*,k}_{ji_1\ldots i_{k-1}}e_{i_k}\big )(\tfrac{\cdot }{\varepsilon }) \cdot D{\varvec{a}}(\tfrac{\cdot }{\varepsilon })\big (\nabla u_\varepsilon -E_\varepsilon ^n[\nabla {\bar{u}}_\varepsilon ^n]\big )\Big ), \end{aligned}$$where we exactly recognize the higher-order two-scale expansion error \(\nabla u_\varepsilon -E_\varepsilon ^n[\nabla {\bar{u}}_\varepsilon ^n]\), which is known to be of small order \(O(\varepsilon ^n)\) in view of Proposition 2.8. This constitutes the core of the proof of the higher-order pathwise result, cf. Sect. 7.
We turn to the precise statement of the representation formulas. These are understood in a weak sense: when integrated in space with a smooth test function, the quantities in consideration are indeed Malliavin smooth and the formulas make sense after integrations by parts (see also Lemmas 7.1 and 9.1).
Proposition 5.1
(Formulas for first Malliavin derivatives) Let \(\varepsilon =1\) and drop it from all subscripts in the notation. For all \(1\le n\le \ell \) there holds for any random function w with \(\nabla \cdot {\varvec{a}}\nabla w\) deterministic,
and also, for any smooth deterministic function \({\bar{w}}\),
In particular,
\(\diamondsuit \)
In view of the approximate normality result of Theorem 1(iv), the second-order Poincaré inequality of Proposition 4.1(iv) requires to further estimate the second Malliavin derivative of higher-order standard commutators, and we establish the following formula. Although some of the terms are proportional to \(D^2{\varvec{a}}\), hence display an exact locality, some others unavoidably take the form \(D\AA D\nabla \varphi ^1\), hence are only approximately local. This is due to the nonlinear dependence of the solution operator on the coefficient field, and is the reason why some logarithms are lost in the convergence rate to normality in Theorem 1(iv), cf. Sect. 9.
Proposition 5.2
(Formulas for second Malliavin derivatives) Let \(\varepsilon =1\) and drop it from all subscripts in the notation. For all \(1\le n\le \ell \), there holds for any smooth deterministic function \({\bar{w}}\),
\(\diamondsuit \)
5.1 Proof of Proposition 5.1
We split the proof into three steps.
Step 1. Proof that for any random function w with \(\nabla \cdot {\varvec{a}}\nabla w\) deterministic there holds for all \(0\le n\le \ell \),
We argue by induction. The result is obvious for \(n=0\) (recall \(\Xi ^0[\nabla w]={\varvec{a}}\nabla w\), cf. Definition 3.1). Assume that it holds for some \(n\ge 0\). In order to deduce it at level \(n+1\), we appeal to the alternative definition (3.6) of the commutator, so that it suffices to prove
The definition of \(\sigma ^{*,n+1}\) (cf. Definition 2.1) yields
Then using the skew-symmetry of \(\sigma ^{*,n+1}_{ji_1\ldots i_n}\) and the equation \(D(\nabla \cdot {\varvec{a}}\nabla w)=0\) in the form
the claim (5.5) follows.
Step 2. Proof that for any random function w with \(\nabla \cdot {\varvec{a}}\nabla w\) deterministic and for any smooth deterministic function \({\bar{w}}\) there holds for all \(0\le n,m\le \ell \),
where we recall that by a slight abuse of notation we similarly define \(\Xi ^n[H]\) as in Definition 3.1 even if H is not a gradient field.
Let \(n\ge 0\) be fixed. We argue by induction on m. For \(m=0\), the result (5.6) coincides with (5.4) (recall \(E^0[\nabla {\bar{w}}]=0\), cf. Definition 2.6). Now assume that (5.6) holds for some \(m\ge 0\) and let us deduce it at the level \(m+1\). Given a polynomial \({\bar{q}}\) of order \(m+1\), noting that \(\nabla ^{m+2}{\bar{q}}\) vanishes, identity (2.9) ensures that \(\nabla \cdot {\varvec{a}}\nabla F^{m+1}[{\bar{q}}]\) is deterministic. We then apply (5.6) with \(w=F^{m+1}[{\bar{q}}]\) and \({\bar{w}}={\bar{q}}\). Abundantly using that \(\nabla ^{m+1}{\bar{q}}\) is constant and noting that identity (2.5) together with the definition of \(E^m\) yields
this leads to
Note that both sides of this identity depend linearly on the (constant and deterministic) symmetric tensor \(\nabla ^{m+1}{\bar{q}}\). Hence, we may replace it by the (deterministic) symmetric tensor \(\nabla ^{m+1}{\bar{w}}\), to the effect of
By definition of \(E^m\) and \(\Xi ^{n}\) (cf. Definitions 2.6 and 3.1) together with Lemma 2.4, using the general Leibniz rule, we find
Injecting identities (5.6) and (5.7) into this equality, using again the definition of \(E^m\) (cf. Definition 2.6) and the general Leibniz rule in the forms
and
and recombining the terms, the result (5.6) follows at level \(m+1\). For instance, the increment of the first triple sum over (k, s, l) splits into three contributions,
Step 3. Conclusion.
Applying (5.4) yields (5.1). We now turn to (5.3) and first note that by definition of \(\Xi ^{\circ ,n}\) (cf. Definition 3.4) and by (2.5) we have
so that by definition of \(E^n\) (cf. Definition 2.6) and by (3.6) we obtain
We now apply D to this identity, add (5.6) with \(m=n\), with \((w,{\bar{w}})\) replaced by \((u,{\bar{u}}^n)\), and with s replaced by \(s+1\), and so obtain (5.3). Finally, (5.2) follows by subtracting (5.3) (with \((u,{\bar{u}}^n)=(w,{\bar{w}})\)) from (5.1) and inserting the formula for \(DE^n[\nabla {\bar{u}}^n]\) already used in (5.8) for \(m=n\). \(\square \)
5.2 Proof of Proposition 5.2
The conclusion follows by repeating the proof of Proposition 5.1, taking advantage of the same algebraic identities, but now further keeping track of the mixed term in \(D^2({\varvec{a}}\nabla w)={\varvec{a}}\nabla D^2w+2D{\varvec{a}}\nabla D w+D^2{\varvec{a}}\,\nabla w\). \(\square \)
6 Annealed Calderón–Zygmund theory
In this section, we establish the following new annealed Calderón–Zygmund estimate for linear elliptic equations in divergence form with random coefficients. This constitutes a useful upgrade of the quenched large-scale Calderón–Zygmund estimates of [3, 4, 21]. The merit of (6.2)–(6.4) below is that the stochastic \({\text {L}}^q\) norm is inside the spatial \({\text {L}}^p\) norm; while there is a tiny loss in stochastic integrability, the spatial integrability is as in the constant-coefficient case.
Theorem 6.1
Consider the Helmholtz projection \(T:=\nabla (\nabla \cdot {\varvec{a}}\nabla )^{-1}\nabla \cdot {\varvec{a}}\). There exists a \(\frac{1}{8}\)-Lipschitz stationary field \(r_*\ge 1\) on \({\mathbb {R}}^d\) such that for all \(h\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))^d\) and \(1<q\le p<\infty \) there holds
where \(B_*(x):=B_{r_*(x)}(x)\). In addition, in the Gaussian setting with integrable correlations (1.5), the stationary random field \(r_*\) satisfies \(\mathbb {E}\left[ \exp (\frac{1}{C}r_*^d) \right] \le 2\) for some \(C\simeq 1\). In particular, for all \(1<p,q<\infty \) and \(0<\delta \le \frac{1}{2}\),
\(\diamondsuit \)
We shall also make use of the following consequence of this annealed estimate for the corresponding Riesz potentials. As opposed to the above, this can alternatively be deduced from the annealed Green’s function estimates in [37].
Corollary 6.2
Consider the Riesz potentials \(U_{e,\circ }:=\nabla \triangle ^{-1}e\cdot \) and \(U_e:=\nabla (\nabla \cdot {\varvec{a}}\nabla )^{-1}e\cdot {\varvec{a}}\) with \(|e|=1\). For all \(h\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))^d\) and \(1<p,q<\infty \) with \(\frac{dp}{d+p}>1\),
and for all \(0<\delta \le \frac{1}{2}\),
\(\diamondsuit \)
6.1 Proof of Theorem 6.1
Our argument is based on a multiple use of the following version of the Calderón–Zygmund lemma due to Shen [48, Theorem 2.1] (cf. also [49, Theorem 2.4]), based on ideas by Caffarelli and Peral [13]. For a ball B, we set \(B=B(x_B,r_B)\) and abusively write \(\alpha B:=B(x_B,\alpha r_B)\) for \(\alpha >0\).
Lemma 6.3
[13, 48] Let \(1\le p_0<p_1\le \infty \), \(C_0>0\), and \(f,g\in {\text {L}}^{p_0}\cap {\text {L}}^{p_1}({\mathbb {R}}^d)\). Assume that for all balls \(B\subset {\mathbb {R}}^d\) there exist measurable functions \(f_{B,0}\) and \(f_{B,1}\) such that there hold \(f= f_{B,0}+f_{B,1}\) on B and
Then, for all \(p_0<q<p_1\),
\(\diamondsuit \)
Combining this lemma with the quenched large-scale Lipschitz regularity theory due to [6, 7, 21] leads to the following quenched large-scale Calderón–Zygmund estimate. This slightly shortens the proof of [4] and [21, Corollary 4] (cf. also [3, Section 7]).
Proposition 6.4
(Quenched large-scale Calderón–Zygmund estimate) There exists a stationary random field \(r_*\) as in the statement of Theorem 6.1 such that for all \(h\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))^d\) and \(1<p<\infty \),
where we recall the notation \(B_*(x):=B_{r_*(x)}(x)\). \(\diamondsuit \)
Before proceeding to the proof, we state the following useful properties of averages on the family of balls \(\{B_*(x)\}_{x\in {\mathbb {R}}^d}\). The point is that these balls have a radius that varies with their center x.
Lemma 6.5
For any measurable function f on \({\mathbb {R}}^d\), the following estimates hold.
-
(i)
For all balls \(B\subset {\mathbb {R}}^d\) with \(r_B\ge \frac{1}{4}r_*(x_B)\), we have
$$\begin{aligned} \fint _{B}|f|\,\lesssim \,\fint _{2B}\Big (\fint _{B_*(x)}|f|\Big )\,dx, \end{aligned}$$(6.5)hence, in particular,
$$\begin{aligned} \int _{{\mathbb {R}}^d}|f|\,\simeq \,\int _{{\mathbb {R}}^d}\Big (\fint _{B_*(x)}|f|\Big )\,dx. \end{aligned}$$(6.6) -
(ii)
For all balls \(B\subset {\mathbb {R}}^d\) and \(x_0\in B\) with \(r_B\le \frac{1}{4}r_*(x_0)\), we have
$$\begin{aligned} \Big (\fint _{B_*(x_0)}|f|\Big )^\frac{1}{2}\,\lesssim \,\fint _{5B}\Big (\fint _{B_*(x)}|f|\Big )^\frac{1}{2}\,dx, \end{aligned}$$(6.7)hence, in particular,
$$\begin{aligned} \fint _{B_*(x_0)}|f|\,\lesssim \,\fint _{5B}\Big (\fint _{B_*(x)}|f|\Big )\,dx. \end{aligned}$$(6.8)
\(\diamondsuit \)
We may now proceed to the proof of Proposition 6.4, which amounts to combining the quenched large-scale Lipschitz regularity theory of [6, 7, 21] together with Lemma 6.3.
Proof of Proposition 6.4
As in [21, Step 1 of the proof of Corollary 4], the random field \(r_*\) is chosen as the largest \(\frac{1}{8}\)-Lipschitz lower bound on the so-called minimal radius defined in [21] (what is denoted here by \(r_*\) thus corresponds to the notation \(\underline{r_*}\) in [21]). We split the proof into two steps.
Step 1. For all balls \(B\subset {\mathbb {R}}^d\), decomposing \(Th=\nabla w_{B,0}+\nabla w_{B,1}\) with
we prove for all \(2\le p\le \infty \),
We start with the proof of (6.10), and we first consider the case \(r_B\ge \frac{1}{4}r_*(x_B)\). An energy estimate on the equation for \(w_{B,0}\) yields
hence, combined with (6.5) and (6.6),
which proves (6.10). Next, we consider the case \(r_B\le \frac{1}{4}r_*(x_B)\). For \(x_0\in B\), using the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) in form of \(|B_*(x_0)|\simeq |B_*(x_B)|\), using the energy estimate (6.12), and applying (6.8) (with \(x_0\) and B replaced by \(x_B\) and \(\frac{1}{10}B\)), we find
(The choice of \(\frac{1}{2}B\) in the right-hand side does not matter here and is only made for later reference.) Integrating this estimate over \(x_0\in B\), the conclusion (6.10) follows.
We turn to the proof of (6.11), which is a consequence of quenched large-scale Lipschitz regularity theory, noting that \(w_{B,1}\) is indeed \({\varvec{a}}\)-harmonic in B. It is obviously sufficient to prove for all \(x_0\in \frac{1}{12}B\),
If \(r_*(x_0)\ge \frac{1}{3}r_B\), this follows from (6.8) (with B replaced by \(\frac{1}{12}B\)). Let now \(x_0\in \frac{1}{12}B\) be fixed with \(r_*(x_0)\le \frac{1}{3}r_B\). Since the ball \(B(x_0,r_*(x_0)+\frac{1}{12}r_B)\) is contained in \(\frac{1}{2}B\), where \(w_{B,1}\) is \({\varvec{a}}\)-harmonic, the quenched large-scale Lipschitz regularity theory (e.g. [21, Theorem 1]) implies
Since for \(r_*(x_0)\le \frac{1}{3}r_B\) the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) yields \(r_*(x_B)\le r_*(x_0)+\frac{1}{96}r_B\le 2r_B\), the conclusion (6.14) follows from (6.5) (with B replaced by \(\frac{1}{2}B\)).
Step 2. Conclusion.
Combining (6.10) and (6.11), and applying Lemma 6.3 with
we conclude for all \(2\le p<\infty \),
A standard duality argument allows to deduce the corresponding result for \(1<p\le 2\) (e.g. [21, Step 7 of the proof of Corollary 4]). \(\square \)
We need two additional ingredients for the proof of Theorem 6.1. The first is a deterministic regularity result for \({\varvec{a}}\)-harmonic functions. It follows e.g. from [10, proof of Lemma 4], but a short proof is included below for the reader’s convenience.
Lemma 6.6
If w is \({\varvec{a}}\)-harmonic on a ball B, there holds
\(\diamondsuit \)
The second ingredient is needed to pass from statements on averages at the scale \(r_*\) to corresponding statements on unit scale. This can be done at no loss in the spatial, but a small controlled loss in the stochastic integrability.
Lemma 6.7
Let the stationary random field \(r_*\) satisfy \(\mathbb {E}\left[ \exp (\frac{1}{C}r_*^d) \right] \le 2\) for some \(C\simeq 1\). For all \(f\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))\) and \(1\le q\le p<\infty \), there hold
-
(i)
for all \(r>q\),
$$\begin{aligned}&\bigg (\int _{{\mathbb {R}}^d}\mathbb {E}\bigg [\Big (\fint _{B_*(x)}|f|^2\Big )^\frac{q}{2}\bigg ]^\frac{p}{q}dx\bigg )^\frac{1}{p}\\&\quad \,\lesssim _{p,q}\,\Big (\frac{1}{q}-\frac{1}{r}\Big )^{-(\frac{1}{q}-\frac{1}{2})_+}L\Big (\frac{1}{q}-\frac{1}{r}\Big )^{\frac{1}{q}-\frac{1}{p}}\Big (\int _{{\mathbb {R}}^d}\mathbb {E}\left[ [f]_2^{r} \right] ^\frac{p}{r}\Big )^\frac{1}{p}; \end{aligned}$$ -
(ii)
for all \(r<q\),
$$\begin{aligned}&\bigg (\int _{{\mathbb {R}}^d}\mathbb {E}\bigg [\Big (\fint _{B_*(x)}|f|^2\Big )^\frac{q}{2}\bigg ]^\frac{p}{q}dx\bigg )^\frac{1}{p}\\&\quad \,\gtrsim _{p,q}\,\Big (\frac{1}{r}-\frac{1}{q}\Big )^{(\frac{1}{2}-\frac{1}{p})_+}L\Big (\frac{1}{r}-\frac{1}{q}\Big )^{-(\frac{1}{q}-\frac{1}{p})}\Big (\int _{{\mathbb {R}}^d}\mathbb {E}\left[ [f]_2^r \right] ^\frac{p}{r}\Big )^\frac{1}{p}; \end{aligned}$$
where we have set \(L(t):=\log (2+\frac{1}{t})\). \(\diamondsuit \)
With these ingredients at hand, we turn the proof of Theorem 6.1. The argument is based on a second application of Lemma 6.3 starting with the quenched large-scale Calderón–Zygmund estimate of Proposition 6.4.
Proof of Theorem 6.1
Let \(r_*\) be chosen as in the proof of Proposition 6.4 above. We split the proof into two steps.
Step 1. For all balls \(B\subset {\mathbb {R}}^d\), decomposing \(Th=\nabla w_{B,0}+\nabla w_{B,1}\) as in (6.9), we prove for all \(1< q<\infty \) and \(1\le p_1\le \infty \),
We start with the proof of (6.15). It suffices to show
since the conclusion (6.15) then follows by taking the expectation. First consider the case \(r_B\le \frac{1}{4}r_*(x_B)\). Integrating (6.13) over \(x_0\in B\) yields
The claim (6.17) follows from this estimate together with Jensen’s inequality if \(q\ge 2\), but a different argument is needed if \(q<2\). For \(x_0\in \frac{1}{2}B\), since the condition \(r_B\le \frac{1}{4}r_*(x_B)\) and the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) imply \(r_B\le \frac{1}{4}r_*(x_0)+\frac{1}{64}r_B\), hence \(r_B\le \frac{1}{2} r_*(x_0)\), it follows from (6.7) (with B replaced by \(\frac{1}{2}B\)) that
Combined with (6.18), this leads to (6.17) by Jensen’s inequality. It remains to consider the case \(r_B\ge \frac{1}{4}r_*(x_B)\). Applying Proposition 6.4 (with q playing the role of p) to the equation for \(w_{B,0}\) yields
For \(x\in B\), the balls \(B_*(x)\) and B are disjoint whenever \(|x-x_B|\ge r_*(x)+r_B\). Since the condition \(r_B\ge \frac{1}{4}r_*(x_B)\) and the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) imply
we deduce that \(B_*(x)\) and B are disjoint whenever \(|x-x_B|\ge 5r_B+\frac{1}{8}|x-x_B|\), that is, whenever \(|x-x_B|\ge 6r_B\). This implies
so that the claim (6.17) follows from (6.19).
We turn to the proof of (6.16). For \(2\le q\le \infty \), taking the \(\frac{q}{2}\)th power of (6.14), using Jensen’s inequality, and taking the expectation, we easily obtain (6.16). In order to conclude for all values \(1\le q\le \infty \), it suffices to establish the following improved version of (6.14): for all \(x_0\in \frac{1}{24}B\),
If \(r_*(x_0)\ge \frac{1}{6}r_B\), this already follows from (6.7) (with B replaced by \(\frac{1}{24}B\)). Let now \(x_0\in \frac{1}{24}B\) be fixed with \(r_*(x_0)\le \frac{1}{6}r_B\). Since \(w_{B,1}\) is \({\varvec{a}}\)-harmonic on B and since the ball \(B(x_0,r_*(x_0)+\frac{1}{24}r_B)\) is contained in \(\frac{1}{4}B\), the quenched large-scale Lipschitz regularity theory (e.g. [21, Theorem 1]) implies
and Lemma 6.6 (with B replaced by \(\frac{1}{2}B\)) then leads to
Since for \(r_*(x_0)\le \frac{1}{6}r_B\) the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) yields \(r_*(x_B)\le r_*(x_0)+\frac{1}{192}r_B\le 2r_B\), the conclusion (6.20) follows from (6.5) (with B replaced by \(\frac{1}{2}B\)) and from Jensen’s inequality.
Step 2. Conclusion.
Combining (6.15) and (6.16), and applying Lemma 6.3 with
we deduce for all \(1<q\le p<\infty \),
which is the main stated result (6.1). It remains to post-process this result by rewriting both sides of the inequality: combining (6.1) with the different bounds stated in Lemma , and noting that for \(q\le p\) there holds \((\frac{1}{q}-\frac{1}{2})_++(\frac{1}{2}-\frac{1}{p})_+=\frac{1}{q\wedge 2}-\frac{1}{p\vee 2}\), the conclusion (6.2) follows for \(q\le p\), while the corresponding result for \(p\le q\) is deduced by duality. \(\square \)
6.2 Proof of Corollary 6.2
For \(d>1\), appealing to the integral representation
the triangle inequality yields
and the result (6.3) follows from the Hardy–Littlewood–Sobolev inequality. Next, writing \(U_e=T{\varvec{a}}^{-1}U_{e,\circ }{\varvec{a}}\), the estimate (6.4) follows from the combination of (6.2) and (6.3). \(\square \)
6.3 Proof of Lemma 6.5
We start with the proof of (i), and thus let \(r_B\ge \frac{1}{4}r_*(x_B)\). For \(|x-z|\le \frac{1}{2} r_*(z)\), the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) implies \(|x-z|\le \frac{1}{2}r_*(x)+\frac{1}{16}|x-z|\), hence \(|x-z|\le r_*(x)\), and similarly \(|B_*(x)|\simeq |B_*(z)|\). These observations lead to the lower bound
Since \(r_*(x_B)\le 4r_B\), we note that for all \(z\in B\) the ball \(\frac{1}{5}B_*(z)\) is included in
and (6.5) follows. The upper bound in (6.6) is a consequence (6.5) with \(r_B\uparrow \infty \), so that it remains to establish the lower bound. For that purpose, we write again
Since for \(|x-z|\le r_*(x)\) the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) implies \(r_*(x)\simeq r_*(z)\), the last integral in bracket is \(\lesssim 1\), and the desired lower bound follows.
We turn to the proof of (ii). By Jensen’s inequality, it suffices to establish (6.7). Let \(x_0\in B\) with \(r_B\le \frac{1}{4} r_*(x_0)\). Noting that for all \(x\in B\) the ball \(B_*(x_0)\) is contained in \(B(x,r_*(x_0)+2r_B)\), we find
The ball \(B_{4r_B}\) at the origin can be covered by \(N\simeq 1\) balls of radius \(r_B\). Denote by \((z_i)_{i=1}^N\subset B_{4r_B}\) their centers. The collection \((B(z_i,r_*(x_0)-r_B))_{i=1}^N\) then covers \(B_{r_*(x_0)+2r_B}\). By subadditivity of the square root, we may decompose
hence, noting that \(z_i+B\subset 5B\) for all i,
For \(x\in 5B\), the \(\frac{1}{8}\)-Lipschitz property of \(r_*\) yields
and (iii) follows. \(\square \)
6.4 Proof of Lemma 6.6
By scaling and translation invariance, we may assume that B is the unit ball at the origin and that \(\fint _Bw=0\). Choose a smooth cut-off function \(\chi \) with \(\chi =1\) in \(\frac{1}{2}B\) and \(\chi =0\) outside B. Caccioppoli’s inequality yields
Sobolev’s inequality combined with Poincaré’s inequality (with vanishing boundary condition) then implies for some suitable \(p=p(d)>2\),
Choosing \(q:=2\frac{p-1}{p-2}\) and writing \(\chi =\xi ^q\), we deduce by Hölder’s inequality,
hence, by Young’s inequality,
Injecting this into (6.22) and using Poincaré’s inequality (with vanishing mean-value), we conclude
as desired. \(\square \)
6.5 Proof of Lemma
Step 1. Proof of the upper bound (i).
Let \(1\le q\le p<\infty \). We start with the following deterministic version of the claimed upper bound: for all \(R\ge 1\) we have
If \(q\ge 2\), Jensen’s inequality on \(t\mapsto t^{\frac{q}{2}}\) yields
If \(q\le 2\), covering the ball \(B_R\) by \(N\simeq R^d\) balls of radius \(\frac{1}{2}\) with centers \((z_i)_{i=1}^N\subset B_{R-\frac{1}{4}}\) such that \(\min _{i\ne j}|z_i-z_j|\ge \frac{1}{2}\), it follows from the discrete \(\ell ^{2}\)–\(\ell ^{q}\) inequality that
and hence, since the balls \((B_{\frac{1}{4}}(x+z_i))_{i=1}^N\) are disjoint and included in \(B_{R}(x)\),
These two estimates (6.25) and (6.26) lead to
and the claim (6.24) follows from Jensen’s inequality on \(t\mapsto t^{\frac{p}{q}}\).
We turn to the proof of (i). Conditioning with respect to the value of \(r_*(x)\) on dyadic scale and using Jensen’s inequality on \(t\mapsto t^{\frac{p}{q}}\) with \(\sum _{n=0}^\infty 2^{-nd\delta }\simeq \delta ^{-1}\), we find for \(0<\delta \le 1\),
Taking advantage of the \(\frac{1}{8}\)-Lipschitz property of \(r_*\), we can rewrite
and applying (6.24) with \(R=2^{n+1}\) then leads to
Hence, for \(r>q\), using Hölder’s inequality and the moment bounds on the stationary \(r_*\),
where in the last estimate we used \(\sum _n(2^{nd})^\alpha e^{-2^{nd}\theta }\lesssim _{\alpha ,d}\theta ^{-\alpha }\). Optimizing \(\frac{1}{\delta }\theta ^{-\delta }\) in \(0<\delta \le 1\) through \(\delta =(\log \frac{1}{\theta })^{-1}\) yielding \(\sim \log \frac{1}{\theta }\), the desired upper bound follows.
Step 2. Proof of the lower bound (ii).
Let \(1\le r\le p\le \infty \). We start with the following deterministic version of the claimed lower bound: for all \(R\ge 1\) we have
If \(2\le r\le p\), using the discrete \(\ell ^{r}\)–\(\ell ^2\) inequality and the discrete \(\ell ^p\)–\(\ell ^r\) inequality, we indeed find
If \(r\le 2\le p\), the triangle inequality in \({\text {L}}^{\frac{2}{r}}(B_R(x))\) and the discrete \(\ell ^p\)–\(\ell ^2\) inequality lead to
Finally, if \(r\le p\le 2\), using again the triangle inequality in \({\text {L}}^{\frac{2}{r}}(B_R(x))\) and Jensen’s inequality on \(t\mapsto t^\frac{2}{p}\), we obtain
This proves the claim (6.27) in all cases.
We turn to the proof of (ii). Conditioning with respect to the value of \(r_*(x)\) on dyadic scale and using Jensen’s inequality on \(t\mapsto t^\frac{p}{r}\) with \(\sum _{n=0}^\infty 2^{-nd\delta }\simeq \delta ^{-1}\), we find for \(0<\delta \le 1\),
Applying (6.27) with \(R=(2^{n}-2)\vee 1\) and taking advantage of the \(\frac{1}{8}\)-Lipschitz property of \(r_*\), we deduce
Hence, for \(q>r\), using Hölder’s inequality and the moment bounds on \(r_*\), and proceeding as for (i),
Optimizing in \(0<\delta \le 1\) yields the desired lower bound. \(\square \)
7 Proof of the higher-order pathwise result
In order to establish the accuracy of the two-scale expansion of higher-order commutators (hence, the higher-order pathwise structure), we appeal to the representation formula (5.3) established in Proposition 5.1 and the conclusion easily follows from the following estimates on the Malliavin derivative of higher-order correctors.
Lemma 7.1
Let T, \(U_{e}\), and \(U_{e,\circ }\) denote the operators defined in Theorem 6.1 and Corollary 6.2 with \({\varvec{a}}\) replaced by its adjoint \({\varvec{a}}^*\). For all \(g\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))^d\) and \(0\le s\le \ell -1\),
where \(U_\cdot ^k\) denotes any product of the form \(U_1\ldots U_k\) with \(U_j\in \{U_{e_i},U_{e_i,\circ }\}_{1\le i\le d}\) for all j, and where we implicitly sum over all possible such products. Furthermore, for all \(\delta >0\) and \(2\le p<\infty \),
\(\diamondsuit \)
Proof
Denote by \(J_{s,1}(g)^2\), \(J_{s,2}(g)^2\), and \(J_{s,3}(g)^2\) the three left-hand side terms in (7.1), hence \(J_s(g)^2=J_{s,1}(g)^2+J_{s,2}(g)^2+J_{s,3}(g)^2\). We split the proof into two steps.
Step 1. Proof that for all \(g\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))^d\) and \(0\le s\le \ell -1\),
We start with (7.3). Taking the Malliavin derivative of the equation for \(\varphi ^{s+1}\) (cf. Definition 2.1) and testing it with \((\nabla \cdot {\varvec{a}}^*\nabla )^{-1}\nabla \cdot ({\varvec{a}}^*g)\), we obtain by definition of \(Tg=\nabla (\nabla \cdot {\varvec{a}}^*\nabla )^{-1}\nabla \cdot ({\varvec{a}}^*g)\),
where for the first right-hand side term we used that by definition (1.4) the Malliavin derivative of \({\varvec{a}}\) takes the form
The claim (7.3) follows by applying \((\int _{{\mathbb {R}}^d}(\int _{B(x)}\cdot \,dz)^2dx)^\frac{1}{2}\) to both sides of the estimate. Next, by definition of \(U_{e_{i_{s+1}}}g=\nabla (\nabla \cdot {\varvec{a}}^*\nabla )^{-1}(e_{i_{s+1}}\cdot {\varvec{a}}^*g)\), we find
so that (7.4) follows directly. We turn to (7.5). Writing \(g=\triangle \triangle ^{-1}g\), integrating by parts, and using the Malliavin derivative of the equation for \(\triangle \sigma ^s\) (cf. Definition 2.1), we find
and the claim (7.5) follows from taking the Malliavin derivative of the definition of \(q^s\) and writing \(\nabla \triangle ^{-1}g_i=U_{e_i,\circ }g\).
Step 2. Conclusion.
Using the notation introduced in the statement, Step 1 yields for all \(g\in C_c^\infty ({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))^d\),
Since by definition \(TU=U\) and \(TU_\circ =U_\circ \), the decomposition (7.1) follows by induction. Next, combining (7.1) with the corrector estimates of Proposition 2.2, we find for all \(g\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))^d\), \(0\le s\le \ell -1\), \(2\le p<\infty \), and \(\delta >0\),
where the anchoring in \(s=0\) follows from (7.3) because of \(J_{0,2}= J_{0,3}=0\) in view of \(\varphi ^0=1\) and \(\sigma ^0=0\). Here we have used that for a stationary \(\psi \) we have by the triangle inequality in \({\text {L}}^\frac{p}{2}(\Omega )\) and Hölder’s inequality,
Now, for \(0\le s\le \ell -1\), \(2\le p<\infty \), \(0\le k\le s\), \(\beta \in \{0,1\}\), and \(\delta >0\), we apply the bound of Corollary 6.2k times and the one of Theorem 6.1 once, increasing at every application the stochastic integrability by \(\frac{\delta }{k+1}\), to the effect of
given that the restriction \(k\le s\le \ell -1\) indeed ensures \(\frac{2d}{d+2k}>1\), and the conclusion follows. \(\square \)
With these estimates at hand, we may now turn to the proof of Theorem 1(ii), that is, of the higher-order pathwise result.
Proof of Theorem 1(ii)
Let \(1\le p<\infty \). The starting point is the \({\text {L}}^p\) inequality of Proposition 4.1(iii) together with (4.2), in the form
Inserting identity (5.3) of Proposition 5.1, integrating by parts in the z-integral to put derivatives on g and \({\bar{u}}^n\), using (7.6), and using the corrector estimates of Proposition 2.2, we deduce for all \(1\le n\le \ell \),
where we have set
We separately treat the different right-hand side terms in (7.7). First, from equation (2.12), formula (2.5), the annealed Calderón–Zygmund estimate of Theorem 6.1, and the corrector estimates in Proposition 2.2, we obtain
Second, recalling the notation \(T=\nabla (\nabla \cdot {\varvec{a}}^*\nabla )^{-1}\nabla \cdot {\varvec{a}}^*\) from Theorem 6.1 and using the equation for u in the form \(-\nabla \cdot \AA \nabla D_zu=\nabla \cdot D_z{\varvec{a}}\nabla u\), we find
hence, using Theorem 6.1 to bound the operator T and to estimate \(\nabla u\), and applying the corrector estimates of Proposition 2.2, we deduce
Third, decomposing
applying (7.2), and using the corrector estimates of Proposition 2.2,
Fourth, applying (7.2) and using the corrector estimates of Proposition 2.2,
Collecting the above estimates on the right-hand side terms of (7.7), we see that all of them are bilinear in g and f. Crucial for us is their scaling in the underlying lengthscale, which has three origins: the (total) number of derivatives (to be counted negatively), the spatial integrability exponent p (to be counted in form of \(\frac{d}{p}\)), and the weights \(\mu _{d,n}\) (cf. Proposition 2.2). For instance, in view of (7.8), the first right-hand side term in (7.7) has scaling \(\le L^{-n+\frac{d}{4}+\frac{d}{4}}\mu _{d,n}(L)=L^{-n+\frac{d}{2}}\mu _{d,n}(L)\), where L stands for the lengthscale, because \(\nabla {\tilde{u}}^l\) has the same scaling as \(\nabla ^{l-1}f\) (cf. Definition 2.7). Let us take the last term as a second example: the scaling of its (s, l)-contribution is \(L^{-(n+s)+\frac{d+2s}{2}}\mu _{d,n}(L)=L^{-n+\frac{d}{2}}\mu _{d,n}(L)\). In fact, all terms have a scaling at most \(L^{-n+\frac{d}{2}}\mu _{d,n}(L)\), which after \(\varepsilon \)-rescaling (\(L=\varepsilon ^{-1}\)) turns into the desired estimate. \(\square \)
8 Proof of the convergence of the covariance structure
Combining the representation formula (5.2) of Proposition 5.1 with the Helffer–Sjöstrand identity of Proposition 4.1(ii), we establish Theorem 1(iii), that is, the convergence of the covariance structure of the higher-order standard homogenization commutator.
Proof of Theorem 1(iii)
Combining the Helffer–Sjöstrand identity of Proposition 4.1(ii) with the representation formula (5.2) tested with \(g\in C^\infty _c({\mathbb {R}}^d)^d\), we obtain, using (7.6) and recalling that \((1+\mathcal {L})^{-1}\) has operator norm \(\le 1\),
where we have set
and
Appealing to the definition of \(E^n[\nabla {\bar{w}}]\) (cf. Definition 2.6), to (4.2), and to the corrector estimates of Proposition 2.2, we find
and similarly,
in terms of
Then applying (7.2) and again using the corrector estimates of Proposition 2.2, we deduce for all \(\delta >0\),
As in the proof of Theorem 1(ii) (cf. Sect. 7), it is easily checked that these expressions in \((g,\nabla {\bar{w}})\) have the desired length scaling, hence lead to the claimed error estimate after \(\varepsilon \)-rescaling.
Now that the right-hand side of (8.1) is properly estimated, we turn to a suitable decomposition of \(\mathbb {E}\left[ \langle S,(1+\mathcal {L})^{-1}S\rangle _{\mathfrak {H}} \right] \). Inserting the definition of \(E^n[\nabla {\bar{w}}]\) (cf. Definition 2.6) into S, using the definition (4.1) of the scalar product in \({\mathfrak {H}}\), expanding the product, and focusing on terms of order \(\varepsilon ^k\) with \(k\le n-1\), we write
where we have set
Combining this with (8.1) and with the above estimates on \(\mathbb {E}\left[ \Vert S\Vert _{\mathfrak {H}}^2 \right] ^\frac{1}{2}\) and \(\mathbb {E}\left[ \Vert R\Vert _{\mathfrak {H}}^2 \right] ^\frac{1}{2}\) and appealing to the stationarity of \(T^{l,k}_{ji_1\ldots i_lj_1\ldots j_{k+1}}\) (recalling that \(\mathcal {L}\) commutes with shifts), we deduce after \(\varepsilon \)-rescaling,
Appealing to the boundedness of \((1+\mathcal {L})^{-1}\) and to the corrector estimates of Proposition 2.2 in the form
the conclusion follows after Taylor-expanding \(\nabla ^{l'}g_{j'}(z+\varepsilon y)\nabla ^{k'+1}{\bar{w}}(z+\varepsilon y)\) to order \(\varepsilon ^{n-m}\) with the integrability assumption \(\int _{{\mathbb {R}}^d}|y|^{n}|c(y)|dy\le 1\). \(\square \)
9 Proof of the normal approximation result
In order to establish the asymptotic normality of the higher-order standard homogenization commutators, we combine the representation formula of Proposition 5.2 with the second-order Poincaré inequality of Proposition 4.1(iv). We then need the following version of Lemma 7.1 for the estimation of second Malliavin derivatives of higher-order correctors.
Lemma 9.1
For all \(\zeta \in C^\infty _c({\mathbb {R}}^d)\), \(g\in C^\infty _c({\mathbb {R}}^d)^d\), \(0\le s\le \ell -1\), and \(p\ge 1\),
where the notation is taken from Lemma 7.1. \(\diamondsuit \)
Proof
Denote by \(K_{s,1}(g,\zeta )\), \(K_{s,2}(g,\zeta )\), and \(K_{s,3}(g,\zeta )\) the three left-hand side terms in the claimed estimate, hence \(K_s(g,\zeta )=K_{s,1}(g,\zeta )+K_{s,2}(g,\zeta )+K_{s,3}(g,\zeta )\). We prove that for all \(g\in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty ({\mathbb {R}}^d))^d\), \(\zeta \in C^\infty _c({\mathbb {R}}^d)\), \(0\le s\le \ell -1\), and \(p\ge 1\),
These estimates combine to
Noting as in the proof of Lemma 7.1 that \(TU=U\) and \(TU_\circ =U_\circ \), the conclusion follows by induction.
Let \(0\le s\le \ell -1\). We start with the proof of (9.1). Taking the second Malliavin derivative of the equation for \(\varphi ^{s+1}\) (cf. Definition 2.1), we find
Testing with \((\nabla \cdot {\varvec{a}}^*\nabla )^{-1}\nabla \cdot ({\varvec{a}}^*g)\), recalling the definition of T, and using (7.6) and
we find after integration against \(\zeta \otimes \zeta \),
and the claim (9.1) follows from Hölder’s inequality. Next, (9.2) follows from the definition of U. We turn to (9.3). Writing \(g=\triangle \triangle ^{-1}g\), integrating by parts, and using the equation for \(\triangle \sigma ^s\) (cf. Definition 2.1), we find
Hence, by the definition of \(q^s\) to which we apply \(D^2_{xy}\), we obtain an analogous structure as in (9.5), and the claim (9.3) follows. \(\square \)
With these estimates at hand, we now turn to the proof of Theorem 1(iv), that is, of the normal approximation result for the higher-order standard homogenization commutators.
Proof of Theorem 1(iv)
We split the proof into two steps.
Step 1. Proof that for all \(8\le p,q<\infty \) and \(1\le n\le \ell \),
In order to not overestimate the logarithmic correction in the final result, we have to be specific here on the p-dependence; however, this is only necessary in the leading-order terms (that is, in the first two terms). Let \(1\le n\le \ell \). Applying the representation formula of Proposition 5.2, using (7.6) and (9.4), inserting the definition of \(E_\varepsilon ^n[\nabla {\bar{w}}]\) (cf. Definition 2.6), and ordering terms by the number of Malliavin derivatives on \({\varvec{a}}\), we find
in terms of
where we identify the operators \(U_i\) with their kernels and we have set
with the implicit understanding that the contribution of \(\sum _{l=0}^{k+s-n}\) vanishes if \(k<n-s\). We analyze the three right-hand side terms in (9.6) separately and we start with \(U_1\). Decomposing the covariance function as \(c=c_0*c_0\) so that \(\Vert \zeta \Vert _{\mathfrak {H}}=\Vert c_0*\zeta \Vert _{{\text {L}}^2({\mathbb {R}}^d)}\), the definition (4.7) of the norm \(\Vert \cdot \Vert _{{\text {op}}}\) is equivalent to
For \(p\ge 4\), inserting the special form of \(U_1(x,y)=\delta (x-y)\,S^1(x)\) and noting that the discrete \(\ell ^{\frac{2p}{p-2}}\)–\(\ell ^2\) inequality and the assumption (1.5) imply
we get
hence,
By definition of \(S^1\), appealing to the corrector estimates of Proposition 2.2, we deduce
We turn to \(U_2\). By symmetry, it suffices to estimate the norm of \({\tilde{U}}^2\). From (9.7) and \(\Vert [c_0*\zeta ]_\infty \Vert _{{\text {L}}^2({\mathbb {R}}^d)}\le \Vert \zeta \Vert _{{\text {L}}^2({\mathbb {R}}^d)}\), we see
Taking the expectation and arguing by duality, we find
where
Let \(\zeta \in C^\infty _c({\mathbb {R}}^d;{\text {L}}^\infty (\Omega ))\) be fixed with \(\Vert [\zeta ]_\infty \Vert _{{\text {L}}^4(\Omega ;{\text {L}}^2({\mathbb {R}}^d))}=1\). Noting that Jensen’s inequality yields
and that the discrete \(\ell ^4\)–\(\ell ^2\) inequality gives
we deduce by interpolation, for all \(2\le r\le 4\),
By definition of \({\tilde{U}}_2\) and by Lemma 7.1 in the form (7.1), we get
and the corrector estimates of Proposition 2.2 yield for all \(p\ge 1\),
For \(p\ge 2\), applying m times the bound of Corollary 6.2 and once the one of Theorem 6.1 with \(\delta =\frac{1}{m+1}\frac{2p}{(p-1)(p-2)}\sim \frac{1}{p}\) in order to pass from the stochastic integrability \(\frac{2p}{p-1}\) to \(\frac{2p}{p-2}\), we obtain the following,
where the exponent \(\frac{m(m+1)}{d}\) of \(p\log ^2p\) comes from the large prefactors in Corollary 6.2 and Theorem 6.1 and is due to the successive deviations of spatial integrability from 2. Hölder’s inequality together with (9.10) yields for \(p\ge 8\),
By definition of \(S^{2;s}\), using the corrector estimates of Proposition 2.2, we conclude
It remains to analyze \(U_3\). For \(p\ge 2\), we first appeal to the definition of \(U_3\) and to Lemma 9.1 (with p replaced by \(\frac{p}{2}\)) in the form
From (9.7) and (9.8), we deduce
Taking the \({\text {L}}^4(\Omega )\) norm and using the corrector estimates of Proposition 2.2 in the first right-hand side term, and arguing by duality as in (9.9) and repeating the proof of (9.12) for the second right-hand side term, we obtain for all \(8\le p<\infty \),
A multiple use of the annealed bounds of Theorem 6.1 and Corollary 6.2 then leads to
By definition of \(S^{3;s}\), using the corrector estimates of Proposition 2.2, we conclude
Gathering the above estimates and replacing p by q in the estimate for \(U^3\), the claim follows.
Step 2. Conclusion.
By scaling, the conclusion of Step 1 yields for all \(8\le p,q<\infty \) and \(1\le n\le \ell \),
hence, choosing \(p=|\!\log \varepsilon |\) and \(q=8\vee 4d\), and recalling that \(c_0=\frac{1}{2}\),
In terms of \(X_\varepsilon ^n:=\varepsilon ^{-\frac{d}{2}}\int _{{\mathbb {R}}^d}g\cdot \Xi _\varepsilon ^{\circ ,n}[\nabla {\bar{w}}]\), we now apply Proposition 4.1(iv) in the form
Inserting the above estimate on \(\Vert D^2X_\varepsilon ^n\Vert _{{{\text {op}}}}\) and noting that the proof of Theorem 1(iii) yields \(\mathbb {E}\left[ \Vert DX_\varepsilon ^n\Vert _{\mathfrak {H}}^4 \right] \,\lesssim _{f,g}\,1\), the conclusion follows. \(\square \)
Notes
We systematically use Einstein’s summation rule on repeated indices.
For \(n=1\) the notation \({\bar{{\varvec{a}}}}^{*,n}_{ji_1\ldots i_{n-2}}e_{i_{n-1}}\) stands for \({\bar{{\varvec{a}}}}^{*,1}e_{j}\), and we use a similar unifying notation throughout in the sequel.
References
Abdulle, A., Pouchon, T.: Effective models and numerical homogenization for wave propagation in heterogeneous media on arbitrary timescales. Preprint. arXiv:1905.09062
Aida, S., Stroock, D.: Moment estimates derived from Poincaré and logarithmic Sobolev inequalities. Math. Res. Lett. 1(1), 75–86 (1994)
Armstrong, S., Kuusi, T., Mourrat, J.-C.: Quantitative Stochastic Homogenization and Large-Scale Regularity, Volume 352 of Grundlehren der Mathematischen Wissenschaften. Springer, Cham (2019)
Armstrong, S.N., Daniel, J.-P.: Calderón–Zygmund estimates for stochastic homogenization. J. Funct. Anal. 270(1), 312–329 (2016)
Armstrong, S.N., Kuusi, T., Mourrat, J.-C.: The additive structure of elliptic homogenization. Invent. Math. 208, 999–1154 (2017)
Armstrong, S.N., Mourrat, J.-C.: Lipschitz regularity for elliptic equations with random coefficients. Arch. Ration. Mech. Anal. 219(1), 255–348 (2016)
Armstrong, S.N., Smart, C.K.: Quantitative stochastic homogenization of convex integral functionals. Ann. Sci. Éc. Norm. Supér. (4) 49(2), 423–481 (2016)
Bakry, D.: L’hypercontractivité et son utilisation en théorie des semigroupes. In: Lectures on Probability Theory (Saint-Flour., 1992), Volume 1581 of Lecture Notes in Mathematics, pp. 1–114. Springer, Berlin (1994)
Bella, P., Fehrman, B., Fischer, J., Otto, F.: Stochastic homogenization of linear elliptic equations: higher-order error estimates in weak norms via second-order correctors. SIAM J. Math. Anal. 49(6), 4658–4703 (2017)
Bella, P., Giunti, A., Otto, F.: Quantitative stochastic homogenization: local control of homogenization error through corrector. In: Mathematics and Materials, Volume 23 of IAS/Park City Mathematics Series, pp. 301–327. American Mathematical Society, Providence (2017)
Bensoussan, A., Lions, J.-L., Papanicolaou, G.: Asymptotic Analysis for Periodic Structures, Volume 5 of Studies in Mathematics and its Applications. North-Holland Publishing Co., Amsterdam (1978)
Bourgain, J.: On a homogenization problem. J. Stat. Phys. 172(2), 314–320 (2018)
Caffarelli, L.A., Peral, I.: On \(W^{1, p}\) estimates for elliptic equations in divergence form. Commun. Pure Appl. Math. 51(1), 1–21 (1998)
Chatterjee, S.: Fluctuations of eigenvalues and second order Poincaré inequalities. Probab. Theory Relat. Fields 143(1–2), 1–40 (2009)
Chernoff, H.: A note on an inequality involving the normal distribution. Ann. Probab. 9(3), 533–535 (1981)
Duerinckx, M., Fischer, J., Gloria, A.: Scaling limit of the homogenization commutator for Gaussian coefficient fields. Preprint. arXiv:1910.04088
Duerinckx, M., Gloria, A., Lemm, M.: A remark on a surprising result by Bourgain in homogenization. Commun. Partial Differ. Equ. 44(12), 1345–1357 (2019)
Duerinckx, M., Gloria, A., Otto, F.: Robustness of the pathwise structure of fluctuations in stochastic homogenization. Preprint. arXiv:1807.11781
Duerinckx, M., Gloria, A., Otto, F.: The structure of fluctuations in stochastic homogenization. Preprint. arXiv:1602.01717
Gloria, A., Neukamm, S., Otto, F.: Quantitative estimates in stochastic homogenization for correlated coefficient fields. Preprint. arXiv:1910.05530
Gloria, A., Neukamm, S., Otto, F.: A regularity theory for random elliptic operators. Preprint. arXiv:1409.2678
Gloria, A., Neukamm, S., Otto, F.: An optimal quantitative two-scale expansion in stochastic homogenization of discrete elliptic equations. M2AN Math. Model. Numer. Anal. 48(2), 325–346 (2014)
Gloria, A., Otto, F.: The corrector in stochastic homogenization: optimal rates, stochastic integrability, and fluctuations. Preprint. arXiv:1510.08290
Gu, Y.: High order correctors and two-scale expansions in stochastic homogenization. Probab. Theory Relat. Fields 169(3–4), 1221–1259 (2017)
Gu, Y., Mourrat, J.-C.: Scaling limit of fluctuations in stochastic homogenization. Multiscale Model. Simul. 14(1), 452–481 (2016)
Helffer, B., Sjöstrand, J.: On the correlation for Kac-like models in the convex case. J. Stat. Phys. 74(1–2), 349–409 (1994)
Hill, R.: Elastic properties of reinforced solids: some theoretical principles. J. Mech. Phys. Solids 11, 357–372 (1963)
Hill, R.: On constitutive macro-variables for heterogeneous solids at finite strain. Proc. R. Soc. Lond. A 326, 131–146 (1972)
Houdré, C., Pérez-Abreu, V.: Covariance identities and inequalities for functionals on Wiener and Poisson spaces. Ann. Probab. 23(1), 400–419 (1995)
Jikov, V.V., Kozlov, S.M., Oleĭnik, O.A.: Homogenization of Differential Operators and Integral Functionals. Springer, Berlin (1994). Translated from Russian by G. A. Iosif’yan
Josien, M., Otto, F.: Introduction to stochastic homogenization: oscillations and fluctuations. In preparation (2019)
Kamotski, V., Matthies, K., Smyshlyaev, V.P.: Exponential homogenization of linear second order elliptic PDEs with periodic coefficients. SIAM J. Math. Anal. 38(5), 1565–1587 (2006/07)
Kim, J., Lemm, M.: On the averaged Green’s function of an elliptic equation with random coefficients. Preprint. arXiv:1804.10260
Kozlov, S.M.: The averaging of random operators. Mat. Sb. (N.S.) 109(151)(2):188–202, 327 (1979)
Ledoux, M., Nourdin, I., Peccati, G.: Stein’s method, logarithmic Sobolev and transport inequalities. Geom. Funct. Anal. 25(1), 256–306 (2015)
Malliavin, P.: Stochastic Analysis, Volume 313 of Grundlehren der Mathematischen Wissenschaften [Fundamental Principles of Mathematical Sciences]. Springer, Berlin (1997)
Marahrens, D., Otto, F.: Annealed estimates on the Green’s function. Probab. Theory Relat. Fields 163(3–4), 527–573 (2015)
Mourrat, J.-C., Otto, F.: Correlation structure of the corrector in stochastic homogenization. Ann. Probab. 44(5), 3207–3233 (2016)
Murat, F., Tartar, L.: \(H\)-convergence. In: Topics in the Mathematical Modelling of Composite Materials, Volume 31 of Progress Nonlinear Differential Equations Applications, pp. 21–43. Birkhäuser, Boston (1997)
Nash, J.: Continuity of solutions of parabolic and elliptic equations. Am. J. Math. 80, 931–954 (1958)
Nourdin, I., Peccati, G.: Stein’s method on Wiener chaos. Probab. Theory Relat. Fields 145(1–2), 75–118 (2009)
Nourdin, I., Peccati, G.: Normal Approximations with Malliavin Calculus. From Stein’s Method to Universality, Volume 192 of Cambridge Tracts in Mathematics. Cambridge University Press, Cambridge (2012)
Nourdin, I., Peccati, G., Reinert, G.: Second order Poincaré inequalities and CLTs on Wiener space. J. Funct. Anal. 257(2), 593–609 (2009)
Nualart, D.: The Malliavin Calculus and Related Topics, 2nd edn. Springer, Berlin (2006)
Papanicolaou, G.C., Varadhan, S.R.S.: Boundary value problems with rapidly oscillating random coefficients. In: Random Fields, Vol. I, II (Esztergom, 1979), Volume 27 of Colloq. Math. Soc. János Bolyai, pp. 835–873. North-Holland, Amsterdam (1981)
Peccati, G., Reitzner, M. (eds.): Stochastic Analysis for Poisson Point Processes, Volume 7 of Bocconi & Springer Series. Bocconi University Press, Springer, Berlin (2016)
Pouchon, T.: Effective models and numerical homogenization methods for long time wave propagation in heterogeneous media. Ph.D. thesis, École Polytechnique Fédérale de Lausanne (2017)
Shen, Z.: The \(L^p\) boundary value problems on Lipschitz domains. Adv. Math. 216, 212–254 (2007)
Shen, Z.: The Calderón-Zygmund lemma revisited. In: Lectures on the Analysis of Nonlinear Partial Differential Equations. Part 2, Volume 2 of Morningside Lecture Notes in Mathematics, pp. 203–224. International Press, Somerville (2012)
Sjöstrand, J.: Correlation asymptotics and Witten Laplacians. Algebra Anal. 8(1), 160–191 (1996)
Tran, T.-H., Monchiet, V., Bonnet, G.: A micromechanics-based approach for the derivation of constitutive elastic coefficients of strain-gradient media. Int. J. Solids Struct. 49(5), 783–792 (2012)
Acknowledgements
The work of MD was supported by F.R.S.-FNRS and by the CNRS-Momentum program.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix A. More Malliavin calculus
Appendix A. More Malliavin calculus
In this appendix, we include for completeness a short, essentially self-contained proof of Proposition 4.1.
Proof of Proposition 4.1
We split the proof into four steps.
Step 1. Proof of (i).
In terms of the Ornstein–Uhlenbeck semigroup \(e^{-t\mathcal {L}}\) (e.g. [44, Section 1.4]), we may write
where Mehler’s formula (e.g. [44, (1.67)]) indeed ensures that \(\mathbb {E}\left[ (e^{-t\mathcal {L}}X)^2 \right] \rightarrow \mathbb {E}\left[ X \right] ^2\) as \(t\uparrow \infty \). Computing the derivative in t yields
hence, appealing to the commutation relation (4.5) in the form \(De^{-t\mathcal {L}}=e^{-t}e^{-t\mathcal {L}}D\),
and the positivity of \(\mathcal {L}\) leads to the conclusion,
Step 2. Proof of (ii).
Let \(X,Y\in \mathbb {D}^{1,2}\) with \(\mathbb {E}\left[ X \right] =\mathbb {E}\left[ Y \right] =0\). First note that the Poincaré inequality (i) implies that the restriction of \(\mathcal {L}\) to \({\text {L}}^2(\Omega )/{\mathbb {C}}:=\{U\in {\text {L}}^2(\Omega ):\mathbb {E}\left[ U \right] =0\}\) is invertible. In particular, there exists \(Z\in {\text {L}}^2(\Omega )\) such that \(Y=\mathcal {L}Z\). By definition of the adjoint \(D^*\) (cf. (4.4)), we then find
Appealing to the commutator relation (4.5) in the form \(DY=D\mathcal {L}Z=(1+\mathcal {L})DZ\), we conclude
Step 3. Proof of (iii).
Similarly as in Step 1, in terms of the Ornstein–Uhlenbeck semigroup \(e^{-t\mathcal {L}}\), we may write
hence, by the commutation relation (4.5) in the form \(De^{-t\mathcal {L}}=e^{-t}e^{-t\mathcal {L}}D\),
Appealing to Mehler’s formula for the Ornstein–Uhlenbeck semigroup \(e^{-t\mathcal {L}}\) (e.g. [44, (1.67)]), the Cauchy–Schwarz inequality leads to
so that the above becomes
which proves the logarithmic Sobolev inequality. Next, higher integrability is deduced by integration as e.g. in [2, Theorem 3.4]: we write
so that the logarithmic Sobolev inequality implies
and the conclusion follows from the Poincaré inequality (i).
Step 4. Proof of (iv).
Let \(X\in {\text {L}}^2(\Omega )\) with \(\mathbb {E}\left[ X \right] =0\) and \({\mathrm{Var}}\left[ X\right] =1\). For \(h\in {\text {L}}^\infty ({\mathbb {R}})\), we define its Stein transform \(S_h\) as the solution of Stein’s equation
As in Step 2, there exists \(Z\in {\text {L}}^2(\Omega )\) such that \(Y=\mathcal {L}Z\). We then compute
and hence, integrating by parts and using (A.1) in the form \(\mathbb {E}\left[ \langle DZ,DX\rangle _{\mathfrak {H}} \right] ={\mathrm{Var}}\left[ X\right] =1\),
Noting that \(\Vert S_h'\Vert _{{\text {L}}^\infty }\le 2\Vert h\Vert _{{\text {L}}^\infty }\) (e.g. [42, Theorem 3.3.1]) and taking the supremum over all \(h\in {\text {L}}^\infty ({\mathbb {R}})\), we deduce
Noting that for h Lipschitz-continuous there holds \(\Vert S_h'\Vert _{{\text {L}}^\infty }\le \sqrt{\frac{2}{\pi }}\Vert h'\Vert _{{\text {L}}^\infty }\) (e.g. [42, Proposition 3.5.1]), we can deduce a similar bound on the 1-Wasserstein distance. The corresponding bound on the 2-Wasserstein distance takes the form
its proof is of a different nature and is based on an optimal transport argument in density space (cf. [35, Proposition 3.1]).
It remains to estimate the variance \({\mathrm{Var}}\left[ \langle DZ,DX\rangle _{\mathfrak {H}}\right] \). For that purpose, we apply the first-order Poincaré inequality (i) in the form
Noting that the commutation relation (4.5) leads to \(DX=(1+\mathcal {L})DZ\) and \(D^2X=(2+\mathcal {L})D^2Z\), we deduce
Noting as in [38, Proposition 3.2] that \((1+\mathcal {L})^{-1}\) and \((2+\mathcal {L})^{-1}\) have operator norms on \({\text {L}}^4(\Omega )\) bounded by 1 and \(\frac{1}{2}\), respectively, the conclusion follows. \(\square \)
Rights and permissions
About this article
Cite this article
Duerinckx, M., Otto, F. Higher-order pathwise theory of fluctuations in stochastic homogenization. Stoch PDE: Anal Comp 8, 625–692 (2020). https://doi.org/10.1007/s40072-019-00156-4
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40072-019-00156-4