1 Introduction

Many problems in science and engineering lead to boundary value problems for an unknown function. In general, the numerical simulation is well understood provided that the input parameters are given exactly. Often, however, the input parameters are not known exactly. In particular, the treatment of uncertainties in the computational domain has attracted growing interest, see e.g. [5, 18, 33, 36]. In this article, we consider the elliptic diffusion equation

$$\begin{aligned} -\mathrm{div}\big (\alpha \nabla u(\omega )\big ) = f\ \text {in} \quad D(\omega ), \quad u(\omega ) = 0\ \text {on} \ \partial D(\omega ), \end{aligned}$$
(1)

as a model problem where the underlying domain \(D\subset \mathbb {R}^d\), or respectively its boundary \(\partial D\), is random. For example, one might think of tolerances in the shape of products fabricated by line production or of shapes which stem from inverse problems, e.g. from tomography. Besides the fictitious domain approach considered in [5], one might essentially distinguish two approaches: the perturbation method and the domain mapping method.

The perturbation method starts with a prescribed perturbation field

$$\begin{aligned} \mathbf{V}(\omega ):\partial D_{\mathrm{ref}}\rightarrow \mathbb {R}^d \end{aligned}$$

at the boundary \(\partial D_{\mathrm{ref}}\) of a reference configuration and uses a shape Taylor expansion with respect to this perturbation field to represent the solution to (1), cf. [14, 18]. In contrast, the domain mapping method requires that the perturbation field is also known in the interior of the domain \(D_{\mathrm{ref}}\), i.e.

$$\begin{aligned} \mathbf{V}(\omega ):\overline{D_{\mathrm{ref}}}\rightarrow \mathbb {R}^d. \end{aligned}$$

Then, the problem may be transformed to the fixed reference domain \(D_{\mathrm{ref}}\). This yields a partial differential equation with correlated random diffusion matrix and right hand side, cf. [6, 26, 33, 36].

The major drawback of the perturbation method is that it is only feasible for relatively small perturbations. Thus, in order to treat larger perturbations, the domain mapping method is the method of choice. Nevertheless, in practice it might be much easier to obtain measurements for estimating the perturbation field \(\mathbf{V}(\omega )\) from the outside of a workpiece rather than from its interior. If no information on the vector field inside the domain is available, it has to be extended appropriately, e.g. by the Laplacian, as proposed in [26, 36].

The perturbation method relies on a description in spatial or Eulerian coordinates. To that end, a compactum inside the domain is fixed and the domain deformation is considered relative to this compactum. The compactum has to be chosen in such a way that it is not intersected by the realizations of the domain’s boundary, cf. [18]. This particularly limits the magnitude of the boundary variation. The domain mapping method is based on a description in material or Lagrangian coordinates. Here, starting from the reference configuration \(D_{\mathrm{ref}}\), the trajectory of each particular point is tracked. In the domain mapping method, the notions of Eulerian and Lagrangian coordinates coincide on compacta where the deformation is zero. Thus, in this sense, the domain mapping method provides the more general framework. The correspondence between the perturbation method and the domain mapping method can be expressed in terms of the local shape derivative \(\delta u[\mathbf{V}(\omega )]\) and the material derivative \(\dot{u}[\mathbf{V}(\omega )]\) of a given function u, which differ by a transport term, cf. [32]:

$$\begin{aligned} \dot{u}[\mathbf{V}(\omega )]=\delta u[\mathbf{V}(\omega )]+\langle \nabla u, \mathbf{V}(\omega )\rangle . \end{aligned}$$

In this article, we focus on the domain mapping method. In [6], it is shown for a specific class of variation fields that the solution to (1) provides analytic regularity with respect to the random parameter. We will generalize the result from [6] to arbitrary domain perturbation fields which are described by their mean \(\mathbb {E}[\mathbf{V}]:D_{\mathrm{ref}}\rightarrow \mathbb {R}^d\), \(\mathbb {E}[\mathbf{V}](\mathbf{x})=\big [\mathbb {E}[{v_1}](\mathbf{x}),\ldots ,\mathbb {E}[{v_d}](\mathbf{x})\big ]^\intercal \), and their (matrix-valued) covariance function

$$\begin{aligned} \mathrm{Cov}[\mathbf{V}]:D_{\mathrm{ref}}\times D_{\mathrm{ref}}\rightarrow \mathbb {R}^{d\times d},\quad \mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{x}')=\big [\mathrm{Cov}_{i,j}(\mathbf{x},\mathbf{x}')\big ]_{i,j=1}^d. \end{aligned}$$

Note that the covariance function describes the covariance between any pair \((\mathbf{x},\mathbf{x}')\) of points in \(D_{\mathrm{ref}}\) and thus facilitates a modeling in terms of Lagrangian coordinates. Taking the Karhunen-Loève expansion of \(\mathbf{V}(\omega )\) as the starting point, we show decay rates for the derivatives of the solution to (1) with respect to the random parameter. Given that the Karhunen-Loève expansion decays fast enough, our results imply the dimension independent convergence of the quasi-Monte Carlo method based on the Halton sequence, cf. [13, 16, 34]. Moreover, our results are also suitable for the convergence theory of the anisotropic sparse collocation, cf. [28], and for best N-term approximations, cf. [8]. Although the presented results allow for a broad variety of methods for the stochastic approximation, we employ the quasi-Monte Carlo method in our numerical examples for the sake of simplicity.
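
To make the quasi-Monte Carlo ingredient concrete, the following minimal Python sketch (ours, not from the paper) generates Halton points and maps them to the parameter cube \([-1,1]^M\); the helper names and the dimension \(M\) are our own choices, and the radical-inverse construction is the standard one, cf. [16].

```python
import numpy as np

def radical_inverse(n, base):
    """Van der Corput radical inverse of the integer n >= 1 in the given base."""
    inv, f = 0.0, 1.0 / base
    while n > 0:
        inv += f * (n % base)
        n //= base
        f /= base
    return inv

def halton(n_points, dim):
    """First n_points of the dim-dimensional Halton sequence in [0, 1)^dim."""
    primes, candidate = [], 2
    while len(primes) < dim:          # the k-th coordinate uses the k-th prime
        if all(candidate % p for p in primes):
            primes.append(candidate)
        candidate += 1
    return np.array([[radical_inverse(i, p) for p in primes]
                     for i in range(1, n_points + 1)])

M = 10                                # parameter dimension (illustrative)
y = 2.0 * halton(100, M) - 1.0        # 100 QMC points in [-1, 1]^M
```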

For the spatial approximation, we propose to use parametric finite elements. Then, we are able to approximate the mean and the variance of the solution to (1), or of a related quantity of interest, by computing each sample on the particular realization \(D(\omega _i)=\mathbf{V}(D_{\mathrm{ref}},\omega _i)\) of the random domain rather than on the reference domain \(D_{\mathrm{ref}}\). This yields a non-intrusive approach to the problem at hand: any available finite element solver can be employed to compute the particular samples. Following this approach, rather than always mapping the diffusion problem to the reference domain, we can also easily treat stochastic interface problems, cf. [14].
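
The non-intrusive sampling loop then takes the following shape. This is a minimal sketch under the assumption of a generic black-box solver: `solve_on_realization` is a hypothetical placeholder which, in practice, would deform the reference mesh by \(\mathbf{V}(\cdot ,\mathbf{y}_i)\) and call any available finite element code; it is stubbed here so that the script runs.

```python
import numpy as np

def solve_on_realization(y):
    # hypothetical placeholder (assumption): deform the reference mesh by
    # V(., y), solve the Poisson problem on it with any FEM code, and return
    # a quantity of interest; stubbed by a smooth function of y here
    return np.exp(-np.sum(y**2))

M, N = 10, 256
rng = np.random.default_rng(0)
ys = 2.0 * rng.random((N, M)) - 1.0       # replace by Halton points (see above)
qoi = np.array([solve_on_realization(y) for y in ys])
mean, var = qoi.mean(), qoi.var(ddof=1)   # sample mean and variance of the QoI
```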

The rest of this article is organized as follows. In Sect. 2, we introduce some basic definitions and notation. Section 3 is dedicated to the Karhunen-Loève expansion of vector fields. Although this is a straightforward adaptation of the state of the art literature [29], we think that it is sensible to explicitly introduce the related spaces, norms and operators. In Sect. 4, we present the essential contribution of this article: the regularity of the solution to the model problem defined in Sect. 2 with respect to the Karhunen-Loève expansion of the perturbation field. Section 5 introduces parametric finite elements, which are the basic ingredient for the numerical realization of our approach. In Sect. 6, we extend our approach to stochastic interface problems. Finally, Sect. 7 provides numerical examples to validate and quantify the theoretical findings.

In the following, in order to avoid the repeated use of generic but unspecified constants, by \(C\lesssim D\) we mean that C can be bounded by a multiple of D, independently of parameters which C and D may depend on. Obviously, \(C\gtrsim D\) is defined as \(D\lesssim C\) and we write \(C\eqsim D\) if \(C\lesssim D\) and \(C\gtrsim D\).

2 Problem formulation

Let \(D_{\mathrm{ref}}\subset \mathbb {R}^d\) for \(d\in \mathbb {N}\) (of special interest are the cases \(d=2,3\)) denote a domain with Lipschitz continuous boundary \(\partial D_{\mathrm{ref}}\) and let \((\Omega ,\mathcal {F},\mathbb {P})\) be a complete probability space with \(\sigma \)-field \(\mathcal {F}\subset 2^\Omega \) and probability measure \(\mathbb {P}\). In order to guarantee that \(L^2_\mathbb {P}(\Omega )\) exhibits an orthonormal basis, we further assume that \(\Omega \) is a separable set. Let \(\mathbf{V}:\overline{D_{\mathrm{ref}}}\times \Omega \rightarrow \mathbb {R}^d\) be an invertible vector field of class \(C^2\), i.e. \(\mathbf{V}\) is twice continuously differentiable with respect to \(\mathbf{x}\) for almost every \(\omega \in \Omega \). Moreover, we impose the uniformity condition

$$\begin{aligned} \Vert \mathbf{V}(\omega )\Vert _{C^2(\overline{D_{\mathrm{ref}}};\mathbb {R}^d)},\quad \Vert \mathbf{V}^{-1}(\omega )\Vert _{C^2(\overline{D_{\mathrm{ref}}};\mathbb {R}^d)}\le C \end{aligned}$$

for some \(C\in (0,\infty )\) and almost every \(\omega \in \Omega \). Thus, \(\mathbf{V}\) defines a family of domains

$$\begin{aligned} D(\omega )\mathrel {\mathrel {\mathop :}=}\mathbf{V}(D_{\mathrm{ref}},\omega ). \end{aligned}$$

For the subsequent analysis, we restrict ourselves to the case of the Poisson equation, i.e. \(\alpha \equiv 1\),

$$\begin{aligned} -\Delta u(\mathbf{x},\omega ) =f(\mathbf{x})\text { in }D(\omega ), \quad u(\mathbf{x},\omega )=0\text { on }\Gamma (\omega )\mathrel {\mathrel {\mathop :}=}\partial D(\omega ). \end{aligned}$$
(2)

This considerably simplifies the analysis and the extension to non-constant diffusion coefficients is straightforward, cf. Remark 2. In order to guarantee solvability for almost every \(\omega \in \Omega \), we consider the right hand side to be defined on the hold-all domain

$$\begin{aligned} \mathcal {D}\mathrel {\mathrel {\mathop :}=}\bigcup _{\omega \in \Omega }D(\omega ). \end{aligned}$$
(3)

From the uniformity condition, we infer for almost every \(\omega \in \Omega \) and every \(\mathbf{x}\in D_{\mathrm{ref}}\) that the singular values of the Jacobian \(\mathbf{J}(\mathbf{x},\omega )\) of the vector field \(\mathbf{V}\) satisfy

$$\begin{aligned} 0<\underline{\sigma }\le \min \big \{\sigma \big (\mathbf{J}(\mathbf{x},\omega )\big )\big \} \le \max \big \{\sigma \big (\mathbf{J}(\mathbf{x},\omega )\big )\big \}\le \overline{\sigma }<\infty . \end{aligned}$$
(4)

In particular, we assume without loss of generality that \(\underline{\sigma }\le 1\) and \(\overline{\sigma }\ge 1\).
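
For a concrete realization of \(\mathbf{V}\), the condition (4) can be verified numerically. The following minimal sketch (the perturbation field is an illustrative assumption of ours) approximates the Jacobian by central differences and samples its singular values over the reference domain:

```python
import numpy as np

def V(x):
    # illustrative realization (assumption): identity plus a smooth perturbation
    return x + 0.1 * np.sin(np.pi * x[::-1])

def jacobian_fd(V, x, h=1e-6):
    """Central finite-difference approximation of the Jacobian J(x) = V'(x)."""
    d = x.size
    J = np.empty((d, d))
    for j in range(d):
        e = np.zeros(d)
        e[j] = h
        J[:, j] = (V(x + e) - V(x - e)) / (2.0 * h)
    return J

rng = np.random.default_rng(0)
pts = rng.random((1000, 2))            # sample points; D_ref = (0, 1)^2 here
svals = np.array([np.linalg.svd(jacobian_fd(V, x), compute_uv=False)
                  for x in pts])
print("sigma_min =", svals.min(), "sigma_max =", svals.max())  # cf. (4)
```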

2.1 Reformulation on the reference domain

In the sequel, we consider the spaces \(H^1_0\big (D(\omega )\big )\) and \(H^1_0(D_{\mathrm{ref}})\) to be equipped with the norms \( \Vert \cdot \Vert _{H^1(D(\omega ))}\mathrel {\mathrel {\mathop :}=}\Vert \nabla \cdot \Vert _{L^2(D(\omega );\mathbb {R}^d)} \text { and } \Vert \cdot \Vert _{H^1(D_{\mathrm{ref}})}\mathrel {\mathrel {\mathop :}=}\Vert \nabla \cdot \Vert _{L^2(D_{\mathrm{ref}};\mathbb {R}^d)}, \) respectively. Furthermore, we assume that the related dual spaces \(H^{-1}\big (D(\omega )\big )\) and \(H^{-1}(D_{\mathrm{ref}})\) are defined with respect to these norms. The main tool we use in the convergence analysis for the model problem (2) is the one-to-one correspondence between the problem which is pulled back to the reference domain \(D_{\mathrm{ref}}\) and the problem on the actual realization \(D(\omega )\). The equivalence between those two problems is described by the vector field \(\mathbf{V}(\mathbf{x},\omega )\). For an arbitrary function v on \(D(\omega )\), we denote the transported function by \(\hat{v}(\mathbf{x},\omega )\mathrel {\mathrel {\mathop :}=}(v\circ \mathbf{V})(\mathbf{x},\omega )\). According to the chain rule, we have for \(v\in C^1\big (D(\omega )\big )\) that

$$\begin{aligned} (\nabla v)\big (\mathbf{V}(\mathbf{x},\omega )\big )=\mathbf{J}(\mathbf{x},\omega )^{-\intercal } \nabla \hat{v}(\mathbf{x},\omega ). \end{aligned}$$
(5)

For given \(\omega \in \Omega \), the variational formulation for the model problem (2) reads as follows: Find \(u({\omega })\in H^1_0\big (D(\omega )\big )\) such that

$$\begin{aligned} \int _{D(\omega )}\langle \nabla u,\nabla v\rangle \mathrm{d}\mathbf{x} =\int _{D(\omega )} fv\mathrm{d}\mathbf{x}\quad \text {for all }v\in H^1_0\big (D(\omega )\big ). \end{aligned}$$
(6)

Thus, with

$$\begin{aligned} \mathbf{A}(\mathbf{x},\omega ) \mathrel {\mathrel {\mathop :}=}\big (\mathbf{J}(\mathbf{x},\omega )^\intercal \mathbf{J}(\mathbf{x},\omega )\big )^{-1}\! \det \mathbf{J}(\mathbf{x},\omega ) \end{aligned}$$
(7)

and

$$\begin{aligned} f_{\mathrm{ref}}(\mathbf{x},\omega )\mathrel {\mathrel {\mathop :}=}\hat{f}(\mathbf{x},\omega )\det \mathbf{J}(\mathbf{x},\omega ), \end{aligned}$$
(8)

we obtain the following variational formulation with respect to the reference domain: Find \(\hat{u}({\omega }) \in H^1_0(D_{\mathrm{ref}})\) such that

$$\begin{aligned} \int _{D_{\mathrm{ref}}}\langle \mathbf{A}(\omega )\nabla \hat{u}(\omega ),\nabla \hat{v}(\omega ) \rangle \mathrm{d}\mathbf{x} =\int _{D_{\mathrm{ref}}}{f}_{\mathrm{ref}}(\omega )\hat{v}(\omega )\mathrm{d}\mathbf{x}\quad \text {for all }\hat{v}(\omega ) \in H^1_0(D_{\mathrm{ref}}). \end{aligned}$$
(9)

Here and afterwards, \(\langle \cdot ,\cdot \rangle \) denotes the canonical inner product for \(\mathbb {R}^d\).
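
Evaluating the transported data (7) and (8) pointwise is straightforward once the Jacobian is available; a minimal sketch, with an illustrative Jacobian and right-hand side value of our own choosing:

```python
import numpy as np

def transported_data(J, f_hat):
    """Return A = (J^T J)^{-1} det J and f_ref = f_hat det J, cf. (7) and (8)."""
    detJ = np.linalg.det(J)
    A = np.linalg.inv(J.T @ J) * detJ
    return A, f_hat * detJ

J = np.array([[1.10, 0.05],
              [0.00, 0.95]])           # illustrative Jacobian at some point x
A, f_ref = transported_data(J, f_hat=1.0)
```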

Remark 1

Since \(\mathbf{V}\) is assumed to be a \(C^2\)-diffeomorphism, we have for almost every \(\omega \in \Omega \) that

$$\begin{aligned} \mathbf{V}^{-1}\circ \mathbf{V}=\mathrm{Id}\quad \Rightarrow \quad \mathbf{J}^{-1}\mathbf{J}=\mathbf{I} \quad \Rightarrow \quad \det \mathbf{J}^{-1}\det \mathbf{J}=1\quad \text { for all }\mathbf{x}. \end{aligned}$$

Herein, \(\mathbf{I}\in \mathbb {R}^{d\times d}\) denotes the identity matrix. Especially, we infer \(\det \mathbf{J}^{-1},\det \mathbf{J}\ne 0\). The continuity of \(\mathbf{J},\mathbf{J}^{-1}\) and of the determinant function now implies that either \(\det \mathbf{J}^{-1},\det \mathbf{J}>0\) or \(\det \mathbf{J}^{-1},\det \mathbf{J}<0\) for all \(\mathbf{x}\). Therefore, without loss of generality, we will assume the positivity of the determinants.

Notice that, for fixed \(v\in H^1_0\big (D(\omega )\big )\), Eq. (9) contains the related transported test function \(\hat{v}(\omega )\).

The connection between the spaces \(H^1_0(D_{\mathrm{ref}})\) and \(H^1_0\big (D(\omega )\big )\) is given by the following

Lemma 1

The spaces \(H^1_0(D_{\mathrm{ref}})\) and \(H^1_0\big (D(\omega )\big )\) are isomorphic by the isomorphism

$$\begin{aligned} \mathcal {E}:H^1_0(D_{\mathrm{ref}})\rightarrow H^1_0\big (D(\omega )\big ),\quad v\mapsto v\circ \mathbf{V}(\omega )^{-1}. \end{aligned}$$

The inverse mapping is given by

$$\begin{aligned} \mathcal {E}^{-1}:H^1_0\big (D(\omega )\big )\rightarrow H^1_0(D_{\mathrm{ref}}),\quad v\mapsto v\circ \mathbf{V}(\omega ). \end{aligned}$$

Proof

The proof of this lemma is a consequence of the chain rule (5) and the ellipticity Assumption (4). \(\Box \)

This lemma implies that the space of test functions is not dependent on \(\omega \in \Omega \) at all: Obviously, we have \(H^1_0\big (D(\omega )\big ) =\left\{ \mathcal {E}(v):v\in H^1_0(D_{\mathrm{ref}})\right\} \). Thus, for an arbitrary function \(\mathcal {E}(v)\in H^1_0 \big (D(\omega )\big )\) it holds \(\widehat{\mathcal {E}(v)} = \mathcal {E}(v)\circ \mathbf{V}=v\circ \mathbf{V}^{-1}\circ \mathbf{V} = v\in H^1_0(D_{\mathrm{ref}})\) independent of \(\omega \in \Omega \). In particular, the solutions u to (6) and \(\hat{u}\) to (9) satisfy

$$\begin{aligned} \hat{u}(\omega )=u\circ \mathbf{V}(\omega )\quad \text {and}\quad {u}(\omega )=\hat{u}\circ \mathbf{V}(\omega )^{-1}. \end{aligned}$$
(10)

3 Karhunen-Loève expansion

In order to make the random vector field \(\mathbf{V}(\mathbf{x},\omega )\) feasible for computations, we consider here its Karhunen-Loève expansion, cf. [25]. This section shall give a brief overview of the relevant facts concerning the Karhunen-Loève expansion of vector valued random fields. Especially, we introduce here the related function spaces which are used in the rest of this article. For further details on the Karhunen-Loève expansion in general and also on computational aspects, we refer to [10, 11, 17, 29].

Let \(D\subset \mathbb {R}^d\) always denote a domain. Then, we define \(L^2(D;\mathbb {R}^d)\) to be the Hilbert space which consists of all equivalence classes of square integrable functions \(\mathbf{v}:D\rightarrow \mathbb {R}^d\) equipped with the inner product

$$\begin{aligned} (\mathbf{u},\mathbf{v})_{L^2(D;\mathbb {R}^d)}\mathrel {\mathrel {\mathop :}=}\int _D\langle \mathbf{u},\mathbf{v}\rangle \mathrm{d}\mathbf{x}\quad \text {for all }\quad \mathbf{u},\mathbf{v}\in L^2(D;\mathbb {R}^d). \end{aligned}$$

We assume that the vector field \(\mathbf{V}\) satisfies

$$\begin{aligned} \mathbf{V}(\mathbf{x},\omega )=[{v}_1(\mathbf{x}, \omega ),\ldots , {v}_d(\mathbf{x},\omega )]^\intercal \in L^2_{\mathbb {P}}\big (\Omega ;L^2(D;\mathbb {R}^d)\big ). \end{aligned}$$

Here and in the sequel, given a Banach space B and \(1\le p\le \infty \), the Lebesgue-Bochner space \(L_{\mathbb {P}}^p(\Omega ;B)\) consists of all equivalence classes of strongly measurable functions \(v:\Omega \rightarrow B\) whose norm

$$\begin{aligned} \Vert v\Vert _{L_{\mathbb {P}}^p(\Omega ;B)}\mathrel {\mathrel {\mathop :}=}{\left\{ \begin{array}{ll} \displaystyle {\left( \int _{\Omega } \Vert v(\cdot ,\omega )\Vert _B^p \mathrm{d}\mathbb {P}(\omega )\right) ^{1/p}},\quad &{} p<\infty \\ \displaystyle {\mathrm{ess}\mathop {\sup }\limits _{\omega \in \Omega } \Vert v(\cdot ,\omega )\Vert _B},\quad &{} p=\infty \end{array}\right. } \end{aligned}$$

is finite. If \(B=H\) is a separable Hilbert space and \(p=2\), then the Lebesgue-Bochner space is isomorphic to the tensor product space \(L_{\mathbb {P}}^2(\Omega )\otimes H\) equipped with the inner product

$$\begin{aligned} (u,v)_{L^2_{\mathbb {P}}(\Omega ;H)}\mathrel {\mathrel {\mathop :}=}\int _\Omega \big (u(\cdot ,\omega ),v(\cdot ,\omega )\big )_H \mathrm{d}\mathbb {P}(\omega ), \end{aligned}$$

cf. [2, 24].

The mean of \(\mathbf{V}\) is given by \( \mathbb {E}[\mathbf{V}](\mathbf{x}) = \big [\mathbb {E}[{v}_1](\mathbf{x}),\ldots , \mathbb {E}[{v}_d](\mathbf{x})\big ]^\intercal \) with

$$\begin{aligned} \mathbb {E}[{v}_i](\mathbf{x})\mathrel {\mathrel {\mathop :}=}\int _\Omega {v}_i(\mathbf{x},\omega ) \mathrm{d}\mathbb {P}(\omega ),\quad i=1,2,\ldots ,d. \end{aligned}$$

From the theory of Bochner integrals, see e.g. [24], it follows that \(\mathbb {E}[{v}_i](\mathbf{x})\in L^2(D)\) and thus \(\mathbb {E}[\mathbf{V}](\mathbf{x})\in L^2(D;\mathbb {R}^d)\). Furthermore, the (matrix-valued) covariance function of \(\mathbf{V}\) is given by \( \mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})=[\mathrm{Cov}_{i,j}(\mathbf{x},\mathbf{y})]_{i,j=1}^d \) with

$$\begin{aligned} \mathrm{Cov}_{i,j}(\mathbf{x},\mathbf{y}) =\mathbb {E}\big [\big ({v}_i(\mathbf{x},\omega )-\mathbb {E}[{v}_i](\mathbf{x})\big ) \big ({v}_j(\mathbf{y},\omega )-\mathbb {E}[{v}_j](\mathbf{y})\big )\big ]. \end{aligned}$$

We have \(\mathrm{Cov}_{i,j}(\mathbf{x},\mathbf{y})\in L^2(D\times D)\) which also follows from the properties of the Bochner integral and the application of the Cauchy-Schwarz inequality. We therefore conclude \(\mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})\in L^2(D\times D;\mathbb {R}^{d\times d})\) where we equip the space \(\mathbb {R}^{d\times d}\) with the inner product

$$\begin{aligned} \mathbf{A}:\mathbf{B}\mathrel {\mathrel {\mathop :}=}\sum _{i,j=1}^d a_{i,j}b_{i,j}\quad \text {for} \ \mathbf{A},\mathbf{B}\in \mathbb {R}^{d\times d}\quad \text {with }\mathbf{A}=[a_{i,j}]_{i,j=1}^d,\, \mathbf{B}=[b_{i,j}]_{i,j=1}^d. \end{aligned}$$

This particularly induces the inner product on \(L^2(D\times D;\mathbb {R}^{d\times d})\) given by

$$\begin{aligned} (\mathbf{A},\mathbf{B})_{L^2(D\times D;\mathbb {R}^{d\times d})}\mathrel {\mathrel {\mathop :}=}\int _D\int _D (\mathbf{A}:\mathbf{B})\mathrm{d}\mathbf{x}\mathrm{d}\mathbf{y}\quad \text {for } \mathbf{A},\mathbf{B}\in {L^2(D\times D;\mathbb {R}^{d\times d})}. \end{aligned}$$
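
If only samples of \(\mathbf{V}\) are at hand, the mean and the matrix-valued covariance function can be estimated empirically on a point grid. A minimal sketch with synthetic data, where the array layout is our own convention:

```python
import numpy as np

# samples[m, i, :] holds V(x_i, omega_m); synthetic data for illustration
rng = np.random.default_rng(0)
samples = rng.standard_normal((500, 50, 2))   # (n_samples, n_points, d)

mean = samples.mean(axis=0)                   # estimates E[V](x_i)
centered = samples - mean
# cov[i, j] is the d x d matrix estimating Cov[V](x_i, x_j)
cov = np.einsum('mia,mjb->ijab', centered, centered) / (samples.shape[0] - 1)
```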

Now, we shall introduce the operator

$$\begin{aligned} \mathcal {S}:L^2_\mathbb {P}(\Omega )\rightarrow L^2(D;\mathbb {R}^d),\quad (\mathcal {S}X)(\mathbf{x})\mathrel {\mathrel {\mathop :}=}\int _\Omega \big (\mathbf{V}(\mathbf{x},\omega )-\mathbb {E}[\mathbf{V}](\mathbf{x})\big ) X(\omega )\mathrm{d}\mathbb {P}(\omega ) \end{aligned}$$
(11)

and its adjoint

$$\begin{aligned} \mathcal {S}^\star :L^2(D;\mathbb {R}^d)\rightarrow L^2_\mathbb {P}(\Omega ),\quad (\mathcal {S}^\star \mathbf{u})(\omega )\mathrel {\mathrel {\mathop :}=}\int _D\big (\mathbf{V}(\mathbf{x},\omega ) -\mathbb {E}[\mathbf{V}](\mathbf{x})\big )^\intercal \mathbf{u}(\mathbf{x})\mathrm{d}\mathbf{x}. \end{aligned}$$
(12)

Then, there holds the following

Lemma 2

The operators \(\mathcal {S}\) and \(\mathcal {S}^\star \) given by (11) and (12), respectively, are bounded with Hilbert-Schmidt norms \(\Vert \mathcal {S}\Vert _{\mathrm{HS}}=\Vert \mathcal {S}^\star \Vert _{\mathrm{HS}} =\Vert \mathbf{V}-\mathbb {E}[\mathbf{V}]\Vert _{L^2_{\mathbb {P}}(\Omega ;L^2(D;\mathbb {R}^d))}\). Moreover, the covariance operator

$$\begin{aligned} \mathcal {C}:L^2(D;\mathbb {R}^d)\rightarrow L^2(D;\mathbb {R}^d),\, (\mathcal {C}\mathbf{v})(\mathbf{x})\mathrel {\mathrel {\mathop :}=}\int _D\mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})\mathbf{v}(\mathbf{y}) \mathrm{d}\mathbf{y}=(\mathcal {SS}^\star \mathbf{v})(\mathbf{x}) \end{aligned}$$

is a non-negative, symmetric, trace class operator with trace \(\Vert \mathbf{V}-\mathbb {E}[\mathbf{V}]\Vert _{L^2_{\mathbb {P}}(\Omega ;L^2(D;\mathbb {R}^d))}^2\).

Proof

The statement on the norms of \(\mathcal {S}\) and \(\mathcal {S}^\star \) follows by the application of Parseval’s identity, see the last part of the proof. Moreover, we have for all \(\mathbf{u}\in L^2(D;\mathbb {R}^d)\) that

$$\begin{aligned} (\mathcal {S}\mathcal {S}^\star \mathbf{u})(\mathbf{x})&=\int _\Omega \big (\mathbf{V}(\mathbf{x},\omega )-\mathbb {E}[\mathbf{V}](\mathbf{x})\big ) \int _D\big (\mathbf{V}(\mathbf{y},\omega ) -\mathbb {E}[\mathbf{V}](\mathbf{y})\big )^\intercal \mathbf{u}(\mathbf{y})\mathrm{d}\mathbf{y} \mathrm{d}\mathbb {P}(\omega )\\&=\int _D\bigg (\int _\Omega \big (\mathbf{V}(\mathbf{x},\omega )-\mathbb {E}[\mathbf{V}](\mathbf{x})\big ) \big (\mathbf{V}(\mathbf{y},\omega )-\mathbb {E}[\mathbf{V}](\mathbf{y})\big )^\intercal \mathrm{d}\mathbb {P}(\omega )\bigg )\mathbf{u}(\mathbf{y})\mathrm{d}\mathbf{y}\\&=\int _D\mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})\mathbf{u}(\mathbf{y})\mathrm{d}\mathbf{y} =(\mathcal {C}\mathbf{u})(\mathbf{x}). \end{aligned}$$

In particular, \(\mathcal {C}\) is non-negative and symmetric according to

$$\begin{aligned} (\mathcal {C}\mathbf{u},\mathbf{u})_{L^2(D;\mathbb {R}^d)} =(\mathcal {S}^\star \mathbf{u}, \mathcal {S}^\star \mathbf{u})_{L^2_\mathbb {P}(\Omega )} =\Vert \mathcal {S}^\star \mathbf{u}\Vert _{L^2_\mathbb {P}(\Omega )}^2\ge 0. \end{aligned}$$

Finally, to show that \(\mathcal {C}\) is of trace class, let \(\{{\varvec{\varphi }}_k\}_k\) be an arbitrary orthonormal basis in \(L^2(D;\mathbb {R}^d)\). We thus have

$$\begin{aligned} \sum _{k}(\mathcal {C}{\varvec{\varphi }}_k, {\varvec{\varphi }}_k)_{L^2(D;\mathbb {R}^d)}&=\sum _k \Vert \mathcal {S}^\star {\varvec{\varphi }}_k\Vert _{L^2_\mathbb {P}(\Omega )}^2= \int _\Omega \sum _k(\mathcal {S}^\star {\varvec{\varphi }}_k)^2 \mathrm{d}\mathbb {P}(\omega )\\&=\int _\Omega \sum _k\bigg (\int _D\big (\mathbf{V}(\mathbf{x},\omega ) -\mathbb {E}[\mathbf{V}](\mathbf{x})\big )^\intercal {\varvec{\varphi }}_k\mathrm{d}\mathbf{x}\bigg )^2 \mathrm{d}\mathbb {P}(\omega )\\&=\int _\Omega \int _D\langle \mathbf{V}(\mathbf{x},\omega ) -\mathbb {E}[\mathbf{V}](\mathbf{x}),\mathbf{V}(\mathbf{x},\omega ) -\mathbb {E}[\mathbf{V}](\mathbf{x})\rangle \mathrm{d}\mathbf{x} \mathrm{d}\mathbb {P}(\omega )\\&=\Vert \mathbf{V}-\mathbb {E}[\mathbf{V}]\Vert _{L^2_{\mathbb {P}}(\Omega ;L^2(D;\mathbb {R}^d))}^2, \end{aligned}$$

where we employed Parseval’s identity in the second to last step. \(\Box \)

Trace class operators are in particular compact, see e.g. [20, 30], and hence exhibit a spectral decomposition.

Theorem 1

Let \(\mathcal {C}:L^2(D;\mathbb {R}^d)\rightarrow L^2(D;\mathbb {R}^d)\) be the covariance operator related to \(\mathbf{V}(\mathbf{x},\omega )\in L^2_\mathbb {P}\big (\Omega ;L^2(D;\mathbb {R}^d)\big )\). Then, there exists an orthonormal set \(\{{\varvec{\varphi }}_k\}_k\) and a sequence \(\lambda _1\ge \lambda _2\ge \cdots \ge 0\) such that \(\mathcal {C}{\varvec{\varphi }}_k=\lambda _k{\varvec{\varphi }}_k\)   for all \(k=1,2,\ldots .\) Furthermore, it holds

$$\begin{aligned} \mathcal {C}\mathbf{u}=\sum _k\lambda _k (\mathbf{u},{\varvec{\varphi }}_k)_{L^2(D;\mathbb {R}^d)}{\varvec{\varphi }}_k \quad \text {for all }\quad \mathbf{u}\in L^2(D;\mathbb {R}^d). \end{aligned}$$

Proof

For a proof of this theorem, we refer to [2]. \(\Box \)

We have now all prerequisites at hand to define the Karhunen-Loève expansion of the vector field \(\mathbf{V}(\mathbf{x},\omega )\in L^2_{\mathbb {P}}\big (\Omega ;L^2(D;\mathbb {R}^d)\big )\).

Definition 1

Let \(\mathbf{V}(\mathbf{x},\omega )\) be a vector field in \(L^2_{\mathbb {P}}\big (\Omega ;L^2(D;\mathbb {R}^d)\big )\). The expansion

$$\begin{aligned} \mathbf{V}(\mathbf{x},\omega )=\mathbb {E}[\mathbf{V}](\mathbf{x})+ \sum _k\sigma _k{\varvec{\varphi }}_k(\mathbf{x})X_k(\omega ) \end{aligned}$$
(13)

with \(\sigma _k=\sqrt{\lambda _k}\) and \(X_k=\mathcal {S}^\star {\varvec{\varphi }}_k/\sigma _k\), where \(\{(\lambda _k,{\varvec{\varphi }}_k)\}_k\) is the sequence of eigenpairs of the underlying covariance operator \(\mathcal {C}=\mathcal {S}\mathcal {S}^\star \), is called the Karhunen-Loève expansion of \(\mathbf{V}(\mathbf{x},\omega )\).
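
Numerically, the eigenpairs in (13) are typically computed from a collocated covariance matrix. A minimal sketch for a scalar field on \(D=(0,1)\) with an assumed exponential covariance (for vector fields, the grid values of all \(d\) components are stacked into one vector):

```python
import numpy as np

x = np.linspace(0.0, 1.0, 200)                # collocation points in D
h = x[1] - x[0]                               # uniform quadrature weight
C = np.exp(-np.abs(x[:, None] - x[None, :]))  # assumed Cov(x, y) = exp(-|x-y|)
lam, phi = np.linalg.eigh(C * h)              # discrete eigenpairs, ascending
lam, phi = lam[::-1], phi[:, ::-1]            # reorder: lambda_1 >= lambda_2 >= ...
sigma = np.sqrt(np.clip(lam, 0.0, None))      # sigma_k = sqrt(lambda_k)

def V_truncated(y):
    """Rank-M expansion E[V] + sum_k sigma_k phi_k y_k with E[V] = x, cf. (13).

    The columns of phi are normalized in the discrete l2 sense; continuous
    L2-normalization would require a rescaling by 1/sqrt(h)."""
    m = y.size
    return x + (phi[:, :m] * sigma[:m]) @ y
```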

The space \(L^2(D;\mathbb {R}^d)\) served as pivot space for our considerations in the preceding derivation of the Karhunen-Loève expansion. In order to control the error of truncating the expansion after \(M\in \mathbb {N}\) terms, i.e. 

$$\begin{aligned} \left\| \mathbf{V}(\mathbf{x},\omega )-\mathbb {E}[\mathbf{V}](\mathbf{x})- \sum _{k=1}^M\sigma _k{\varvec{\varphi }}_k(\mathbf{x})X_k(\omega )\right\| _{ L^2_{\mathbb {P}}(\Omega ;L^2(D;\mathbb {R}^d))}= \left( \sum _{k=M+1}^\infty \lambda _k\right) ^{\frac{1}{2}}, \end{aligned}$$
(14)

one has to study the decay of the singular values \(\sigma _k\) in the representation (13). The particular rate of decay is known to depend on the spatial regularity of \(\mathbf{V}(\mathbf{x},{\omega })\). To that end, we consider the Sobolev space \(H^p(D;\mathbb {R}^d)\) for \(p>0\). The related inner product is given by

$$\begin{aligned} (\mathbf{u},\mathbf{w})_{H^p(D;\mathbb {R}^d)}\mathrel {\mathrel {\mathop :}=}\sum _{|{\varvec{\alpha }}|\le p} \int _D\langle \partial ^{\varvec{\alpha }}\mathbf{u}, \partial ^{\varvec{\alpha }}\mathbf{w}\rangle \mathrm{d}\mathbf{x} \end{aligned}$$

for \(p\in \mathbb {N}\) and

$$\begin{aligned} (\mathbf{u},\mathbf{w})_{H^p(D;\mathbb {R}^d)}\mathrel {\mathrel {\mathop :}=}(\mathbf{u},\mathbf{w})_{H^{\lfloor p\rfloor }(D;\mathbb {R}^d)} +\sum _{|{\varvec{\alpha }}|=\lfloor p\rfloor }\int _D\int _D \frac{\big \langle \partial ^{{\varvec{\alpha }}}\mathbf{u}(\mathbf{x})-\partial ^{{\varvec{\alpha }}}\mathbf{u}(\mathbf{y}), \partial ^{{\varvec{\alpha }}}\mathbf{w}(\mathbf{x})-\partial ^{{\varvec{\alpha }}}\mathbf{w}(\mathbf{y})\big \rangle }{\Vert \mathbf{x}-\mathbf{y}\Vert _2^{d+2s}}\mathrm{d}\mathbf{x}\mathrm{d}\mathbf{y} \end{aligned}$$

for \(p=\lfloor p\rfloor +s\) with \(s\in (0,1)\). Its dual space with respect to the \(L^2\)-duality pairing is denoted by \(\tilde{H}^{-p}(D;\mathbb {R}^d)\).

For given \(\mathbf{V}(\mathbf{x},{\omega }) \in L^2_\mathbb {P}\big (\Omega ;H^p(D;\mathbb {R}^d)\big )\), it obviously holds

$$\begin{aligned} \mathrm{Cov}_{i,j}(\mathbf{x},\mathbf{y})\in H^{p}(D)\otimes H^p(D)\quad \text {for }i,j=1,\ldots ,d, \end{aligned}$$

cf. [11]. Therefore, the following theorem is a straightforward modification of [11, Theorem 3.3] for the vector valued case.

Theorem 2

Let \(\mathbf{V}(\mathbf{x},{\omega }) \in L^2_\mathbb {P}\big (\Omega ;H^p(D;\mathbb {R}^d)\big )\). Then, the eigenvalues of the covariance operator \(\mathcal {C}:\tilde{H}^{-p}(D;\mathbb {R}^d)\rightarrow H^p(D;\mathbb {R}^d)\) decay like \(\lambda _k\lesssim (k/d)^{-2p/d}\) as \(k\rightarrow \infty \).
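
Combined with (14), such a decay estimate suggests a simple rule for choosing the truncation rank: take the smallest \(M\) whose eigenvalue tail sum stays below the squared target tolerance. A minimal sketch, fed here with a model sequence \(\lambda _k\sim k^{-3}\) of our own choosing:

```python
import numpy as np

def truncation_rank(lam, tol):
    """Smallest M with sum_{k > M} lam_k <= tol**2, cf. (14); lam descending."""
    tail = np.cumsum(lam[::-1])[::-1]   # tail[m] = sum_{k >= m} lam_k (0-based)
    ok = np.nonzero(tail <= tol**2)[0]
    return int(ok[0]) if ok.size else lam.size

lam_model = np.arange(1, 1000, dtype=float) ** -3.0   # model decay lam_k ~ k^{-3}
M = truncation_rank(lam_model, tol=1e-2)
```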

We may summarize the results of this section as follows. If the mean \(\mathbb {E}[\mathbf{V}](\mathbf{x})\) and the covariance function \(\mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})\) as well as the distribution of \(\mathbf{V}(\mathbf{x},{\omega })\) are known or appropriately estimated, cf. [29], we are able to reconstruct the vector field \(\mathbf{V}(\mathbf{x},{\omega })\) from its Karhunen-Loève expansion. In the following, in order to make the Karhunen-Loève expansion feasible for computations, we make some common assumptions.

Assumption 1

  (1)

    The random variables \(\{X_k\}_k\) are centered and take values in \([-1,1]\), i.e. \(X_k(\omega )\in [-1,1]\) for all k and almost every \(\omega \in \Omega \).

  (2)

    The random variables \(\{X_k\}_k\) are independent and identically distributed.

  (3)

    The sequence

    $$\begin{aligned} \{\gamma _k\}_k \mathrel {\mathrel {\mathop :}=}\big \{\Vert \sigma _k{\varvec{\varphi }}_k\Vert _{ W^{1,\infty }(D;\mathbb {R}^{d})}\big \}_k \end{aligned}$$
    (15)

    is at least in \(\ell ^1(\mathbb {N})\). We denote its \(\ell ^1\)-norm by \(c_{\varvec{\gamma }}\mathrel {\mathrel {\mathop :}=}\sum _{k=1}^\infty \gamma _k\).

Here and hereafter, we shall equip the space \(W^{1,\infty }(D;\mathbb {R}^d)\) with the equivalent norm \( \Vert \mathbf{v}\Vert _{W^{1,\infty }(D;\mathbb {R}^d)}=\max \big \{\Vert \mathbf{v}\Vert _{L^\infty (D;\mathbb {R}^d)}, \Vert \mathbf{v}'\Vert _{L^\infty (D;\mathbb {R}^{d\times d})}\big \}, \) where \(\mathbf{v}'\) denotes the Jacobian of \(\mathbf{v}\) and \( \Vert \mathbf{v}'\Vert _{L^\infty (D;\mathbb {R}^{d\times d})}\mathrel {\mathrel {\mathop :}=}\mathrm{ess\,sup}_{\mathbf{x}\in D}\Vert \mathbf{v}'(\mathbf{x})\Vert _2. \) Herein, \(\Vert \cdot \Vert _2\) is the usual 2-norm of matrices, i.e. the largest singular value.

4 Regularity of the solution

In this section, we assume that the vector field \(\mathbf{V}(\mathbf{x},\mathbf{y})\) is given by a finite rank Karhunen-Loève expansion, i.e. 

$$\begin{aligned} \mathbf{V}(\mathbf{x},\mathbf{y})=\mathbb {E}[\mathbf{V}](\mathbf{x})+ \sum _{k=1}^M\sigma _k{\varvec{\varphi }}_k(\mathbf{x}){y}_k, \end{aligned}$$

otherwise it has to be truncated appropriately. Nevertheless, we provide in this section estimates which are independent of \(M\in \mathbb {N}\). Thus, we explicitly allow M to become arbitrarily large.

For the rest of this article, we will refer to the randomness only via the coordinates \(\mathbf{y}\in \Box \mathrel {\mathrel {\mathop :}=}[-1,1]^M\), where \(\mathbf{y}=[y_1,\ldots ,y_M]\). Notice that, due to the independence of the random variables, the related push-forward measure \(\mathbb {P}_\mathbf{X}\mathrel {\mathrel {\mathop :}=}\mathbb {P}\circ \mathbf{X}^{-1}\), where \(\mathbf{X}(\omega )\mathrel {\mathrel {\mathop :}=}[X_1(\omega ),\ldots ,X_M(\omega )]\), is of product structure. Furthermore, we always think of the spaces \(L^p(\Box )\) for \(p\in [1,\infty ]\) as being equipped with the measure \(\mathbb {P}_\mathbf{X}\). Moreover, we set \({\varvec{\gamma }}=[\gamma _k]_{k=1}^M\), cf. (15).

Without loss of generality, we may assume that \(\mathbb {E}[\mathbf{V}](\mathbf{x})=\mathbf{x}\) is the identity mapping. Otherwise, we replace \(D_{\mathrm{ref}}\) by

$$\begin{aligned} \widetilde{D}_{\mathrm{ref}}\mathrel {\mathrel {\mathop :}=}\mathbb {E}[\mathbf{V}](D_{\mathrm{ref}})\quad \text {and}\quad \widetilde{\varvec{\varphi }}_k \mathrel {\mathrel {\mathop :}=}\sqrt{\det (\mathbb {E}[\mathbf{V}]^{-1})'}{\varvec{\varphi }}_k\circ \mathbb {E}[\mathbf{V}]^{-1}. \end{aligned}$$

Therefore, we obtain

$$\begin{aligned} \mathbf{V}(\mathbf{x},\mathbf{y})=\mathbf{x}+ \sum _{k=1}^M\sigma _k{\varvec{\varphi }}_k(\mathbf{x}){y}_k\quad \text {and}\quad \mathbf{J}(\mathbf{x},\mathbf{y})=\mathbf{I}+\sum _{k=1}^M\sigma _k\varvec{\varphi }_k'(\mathbf{x})y_k. \end{aligned}$$
(16)
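
A minimal sketch evaluating \(\mathbf{V}(\mathbf{x},\mathbf{y})\) and \(\mathbf{J}(\mathbf{x},\mathbf{y})\) according to (16); the two basis fields \({\varvec{\varphi }}_k\), their Jacobians and the values \(\sigma _k\) are illustrative assumptions:

```python
import numpy as np

sigma = np.array([0.10, 0.05])                       # assumed sigma_k
phis = [lambda x: np.sin(np.pi * x),                 # assumed phi_1
        lambda x: np.cos(np.pi * x)]                 # assumed phi_2
phi_jacs = [lambda x: np.pi * np.diag(np.cos(np.pi * x)),
            lambda x: -np.pi * np.diag(np.sin(np.pi * x))]

def V(x, y):
    """V(x, y) = x + sum_k sigma_k phi_k(x) y_k, cf. (16)."""
    return x + sum(s * phi(x) * yk for s, phi, yk in zip(sigma, phis, y))

def J(x, y):
    """J(x, y) = I + sum_k sigma_k phi_k'(x) y_k, cf. (16)."""
    return np.eye(x.size) + sum(s * dphi(x) * yk
                                for s, dphi, yk in zip(sigma, phi_jacs, y))

x, y = np.array([0.3, 0.7]), np.array([0.5, -0.25])  # a point and a parameter
print(V(x, y), np.linalg.det(J(x, y)))
```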

In the subsequent regularity results, we shall refer to the following Lebesgue-Bochner spaces. We define the space \(L^\infty \big (\Box ;L^\infty (D_{\mathrm{ref}};\mathbb {R}^d)\big )\) as the set of all equivalence classes of strongly measurable functions \(\mathbf{V}:\Box \rightarrow L^\infty (D_{\mathrm{ref}}; \mathbb {R}^d)\) with finite norm

$$\begin{aligned} {\vert \vert \vert \mathbf V \vert \vert \vert }_d\mathrel {\mathrel {\mathop :}=}\mathrm{ess}\sup \limits _{\mathbf{y}\in \Box }\Vert \mathbf{V}(\mathbf{y})\Vert _{L^\infty (D_{\mathrm{ref}};\mathbb {R}^d)}. \end{aligned}$$

Furthermore, the space \(L^\infty \big (\Box ;L^\infty (D_{\mathrm{ref}};\mathbb {R}^{d\times d})\big )\) consists of all equivalence classes of strongly measurable functions \(\mathbf{M}:\Box \rightarrow L^\infty (D_{\mathrm{ref}};\mathbb {R}^{d\times d})\) with finite norm

$$\begin{aligned} {\vert \vert \vert \mathbf M \vert \vert \vert }_{d\times d}\mathrel {\mathrel {\mathop :}=}\mathrm{ess}\sup \limits _{\mathbf{y}\in \Box } \Vert \mathbf{M}(\mathbf{y})\Vert _{L^\infty (D_{\mathrm{ref}};\mathbb {R}^{d\times d})}. \end{aligned}$$

We start by providing bounds on the derivatives of \(\big (\mathbf{J}(\mathbf{x},\mathbf{y})^\intercal \mathbf{J}(\mathbf{x},\mathbf{y})\big )^{-1}\).

Lemma 3

Let \(\mathbf{J}:D_{\mathrm{ref}}\times \Box \rightarrow \mathbb {R}^{d\times d}\) be defined as in (16). Then, it holds for the derivatives of

$$\begin{aligned} \big (\mathbf{J}(\mathbf{x},\mathbf{y})^\intercal \mathbf{J}(\mathbf{x},\mathbf{y})\big )^{-1} \end{aligned}$$

under the conditions of Assumption 1.3 that

$$\begin{aligned} {\big \vert \big \vert \big \vert \partial ^{{\varvec{\alpha }}}_\mathbf{y}(\mathbf{J}^\intercal \mathbf{J})^{-1} \big \vert \big \vert \big \vert }_{d\times d} \le |{\varvec{\alpha }}|! \frac{{\varvec{\gamma }}^{\varvec{\alpha }}}{\underline{\sigma }^2}\bigg (\frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2\log 2} \bigg )^{|{\varvec{\alpha }}|}. \end{aligned}$$

Proof

We define \( \mathbf{B}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathbf{J}(\mathbf{x},\mathbf{y})^\intercal \mathbf{J}(\mathbf{x},\mathbf{y})\) and \(\tilde{\mathbf{A}}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\big (\mathbf{B}(\mathbf{x},\mathbf{y})\big )^{-1}. \) Expanding the expression for \(\mathbf{B}(\mathbf{x},\mathbf{y})\) yields

$$\begin{aligned} \mathbf{B}(\mathbf{x},\mathbf{y})=\mathbf{I}+ \sum _{k=1}^M\sigma _k\big ({\varvec{\varphi }}_k'(\mathbf{x})^\intercal +{\varvec{\varphi }}_k'(\mathbf{x})\big )y_k +\sum _{k,k'=1}^M\sigma _k\sigma _{k'}{\varvec{\varphi }}_k'(\mathbf{x})^\intercal {\varvec{\varphi }}_{k'}'(\mathbf{x})y_ky_{k'}. \end{aligned}$$

Thus, the first order derivatives of \(\mathbf{B}(\mathbf{x},\mathbf{y})\) are given by

$$\begin{aligned} {\partial _{y_i}}\mathbf{B}(\mathbf{x},\mathbf{y}) =\sigma _i\big ({\varvec{\varphi }}_i'(\mathbf{x})^\intercal +{\varvec{\varphi }}_i'(\mathbf{x})\big ) +\sum _{k=1}^M\sigma _i\sigma _k\big ({\varvec{\varphi }}_i'(\mathbf{x})^\intercal {\varvec{\varphi }}_{k}'(\mathbf{x})+{\varvec{\varphi }}_k'(\mathbf{x})^\intercal {\varvec{\varphi }}_{i}'(\mathbf{x})\big )y_{k} \end{aligned}$$
(17)

and the second order derivatives according to

$$\begin{aligned} {\partial _{y_j}\partial _{y_i}}\mathbf{B}(\mathbf{x},\mathbf{y}) =\sigma _i\sigma _j\big ( {\varvec{\varphi }}_i'(\mathbf{x})^\intercal {\varvec{\varphi }}_{j}'(\mathbf{x})+ {\varvec{\varphi }}_j'(\mathbf{x})^\intercal {\varvec{\varphi }}_{i}'(\mathbf{x})\big ). \end{aligned}$$
(18)

Obviously, all higher order derivatives with respect to \(\mathbf{y}\) vanish.

The ellipticity Assumption (4) now yields the following bounds:

$$\begin{aligned} \underline{\sigma }^2 \le {\vert \vert \vert \mathbf B \vert \vert \vert }_{d\times d}\le \overline{\sigma }^2 \quad \text {and}\quad \frac{1}{\overline{\sigma }^2} \le {\big \vert \big \vert \big \vert \tilde{\mathbf{A}} \big \vert \big \vert \big \vert }_{d\times d} \le \frac{1}{\underline{\sigma }^2}, \end{aligned}$$

respectively. Furthermore, we derive from (17) that

$$\begin{aligned} {\big \vert \big \vert \big \vert {\partial _{y_i}}{} \mathbf{B} \big \vert \big \vert \big \vert }_{d\times d} \le 2\gamma _i+2\gamma _i\sum _{k=1}^M\gamma _k\le 2(1+c_{\varvec{\gamma }})\gamma _i \end{aligned}$$

and from (18) that \( {\big \vert \big \vert \big \vert {\partial _{y_j}\partial _{y_i}}{} \mathbf{B} \big \vert \big \vert \big \vert }_{d\times d}\le 2\gamma _i\gamma _j. \) Thus, we have

$$\begin{aligned} {\big \vert \big \vert \big \vert \partial _\mathbf{y}^{{\varvec{\alpha }}}{} \mathbf{B} \big \vert \big \vert \big \vert }_{d\times d}\le {\left\{ \begin{array}{ll} 2(1+c_{\varvec{\gamma }}){\varvec{\gamma }}^{\varvec{\alpha }},&{}\text { if }|{\varvec{\alpha }}|=1,2\\ 0,&{}\text { if }|{\varvec{\alpha }}|>2. \end{array}\right. } \end{aligned}$$
(19)

Since \(\tilde{\mathbf{A}}=v\circ \mathbf{B}\) is a composite function with \(v(x)=x^{-1}\), we may employ Faà di Bruno’s formula, cf. [9], which is a generalization of the chain rule, to compute its derivatives. For \(n=|{\varvec{\alpha }}|\), Faà di Bruno’s formula formally yields

$$\begin{aligned} \partial ^{{\varvec{\alpha }}}_\mathbf{y}\tilde{\mathbf{A}}(\mathbf{x},\mathbf{y})= \sum _{r=1}^n (-1)^rr!\tilde{\mathbf{A}}(\mathbf{x},\mathbf{y})^{r+1}\sum _{P({\varvec{\alpha }},r)}{\varvec{\alpha }}!\prod _{j=1}^n \frac{\left( \partial ^{{\varvec{\beta }}_j}_\mathbf{y}{} \mathbf{B}(\mathbf{x},\mathbf{y})\right) ^{k_j}}{k_j!({\varvec{\beta }}_j!)^{k_j}}. \end{aligned}$$
(20)

Here, the set \(P({\varvec{\alpha }},r)\) contains restricted integer partitions of a multiindex \({\varvec{\alpha }}\) into r non-vanishing multiindices, i.e. 

$$\begin{aligned} P({\varvec{\alpha }},r)\mathrel {\mathrel {\mathop :}=}\bigg \{&\big ((k_1,{\varvec{\beta }}_1),\ldots ,(k_n,{\varvec{\beta }}_n)\big )\in \left( \mathbb {N}_0\times \mathbb {N}_0^M\right) ^n: \sum _{i=1}^n k_i{\varvec{\beta }}_i={\varvec{\alpha }},\quad \ \sum _{i=1}^n k_i=r,\\&\ \text {and}\ \exists \,1\le s\le n: k_i=0\ \text {and}\ {\varvec{\beta }}_{i} = \mathbf{0}\quad \ \text {for all}\ 1\le i\le n-s,\\&\quad k_i>0\quad \ \text {for all}\ n-s+1\le i\le n\text { and } \mathbf{0}\prec {\varvec{\beta }}_{n-s+1}\prec \dots \prec {\varvec{\beta }}_n\bigg \}. \end{aligned}$$

Herein, for multiindices \({\varvec{\beta }},{\varvec{\beta }}'\in \mathbb {N}_0^M\), the relation \({\varvec{\beta }}\prec {\varvec{\beta }}'\) means either \(|{\varvec{\beta }}|<|{\varvec{\beta }}'|\) or, if \(|{\varvec{\beta }}|=|{\varvec{\beta }}'|\), it denotes the lexicographical order which means that it holds that \(\beta _1=\beta _1',\ldots ,\beta _k=\beta _k'\) and \(\beta _{k+1}<\beta _{k+1}'\) for some \(0\le k< M\).

Taking the norm in (20), we derive the estimate

$$\begin{aligned} {\big \vert \big \vert \big \vert \partial ^{{\varvec{\alpha }}}_\mathbf{y}\tilde{\mathbf{A}} \big \vert \big \vert \big \vert }_{d\times d}&\le \sum _{r=1}^n r!{\big \vert \big \vert \big \vert \tilde{\mathbf{A}} \big \vert \big \vert \big \vert }^{r+1}_{d\times d}\sum _{P({\varvec{\alpha }},r)}{\varvec{\alpha }}!\prod _{j=1}^n \frac{{\big \vert \big \vert \big \vert \partial ^{{\varvec{\beta }}_j}_\mathbf{y}{} \mathbf{B} \big \vert \big \vert \big \vert }_{d\times d}^{k_j}}{k_j!({\varvec{\beta }}_j!)^{k_j}}\\&\le \sum _{r=1}^n r!\bigg (\frac{1}{\underline{\sigma }^2}\bigg )^{r+1} \sum _{P({\varvec{\alpha }},r)}{\varvec{\alpha }}!\prod _{j=1}^n \frac{\big (2(1+c_{\varvec{\gamma }}){\varvec{\gamma }}^{{\varvec{\beta }}_j}\big )^{k_j}}{k_j!({\varvec{\beta }}_j!)^{k_j}}\\&={\varvec{\gamma }}^{\varvec{\alpha }}\sum _{r=1}^n r!\bigg (\frac{1}{\underline{\sigma }^2}\bigg )^{r+1} \big (2(1+c_{\varvec{\gamma }})\big )^{r} \sum _{P({\varvec{\alpha }},r)}{\varvec{\alpha }}!\prod _{j=1}^n \frac{1}{k_j!({\varvec{\beta }}_j!)^{k_j}}. \end{aligned}$$

From [9] we know that

$$\begin{aligned} \sum _{P({\varvec{\alpha }},r)}{\varvec{\alpha }}!\prod _{j=1}^n \frac{1}{k_j!({\varvec{\beta }}_j!)^{k_j}}=S_{n,r}, \end{aligned}$$

where \(S_{n,r}\) are the Stirling numbers of the second kind, cf. [1]. Thus, we obtain

$$\begin{aligned} {\big \vert \big \vert \big \vert \partial ^{{\varvec{\alpha }}}_\mathbf{y}\tilde{\mathbf{A}} \big \vert \big \vert \big \vert }_{d\times d} \le \frac{{\varvec{\gamma }}^{\varvec{\alpha }}}{\underline{\sigma }^2} \sum _{r=1}^n r!\bigg (\frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2}\bigg )^{r} S_{n,r} \le \frac{{\varvec{\gamma }}^{\varvec{\alpha }}}{\underline{\sigma }^2} \bigg (\frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2}\bigg )^{|{\varvec{\alpha }}|} \sum _{r=1}^n r!S_{n,r}. \end{aligned}$$

The term \( \tilde{b}(n)\mathrel {\mathrel {\mathop :}=}\sum _{r=0}^n r!S_{n,r} \) coincides with the n-th ordered Bell number. The ordered Bell numbers satisfy the recurrence relation

$$\begin{aligned} \tilde{b}(n)=\sum _{r=0}^{n-1} \begin{pmatrix} n\\ r \end{pmatrix} \tilde{b}(r)\quad \text {with }\quad \tilde{b}(0)=1, \end{aligned}$$
(21)

see [12], and may be estimated as follows, cf. [3],

$$\begin{aligned} \tilde{b}(n)\le \frac{n!}{(\log 2)^{n}}. \end{aligned}$$
(22)

This finally proves the assertion. \(\Box \)
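
The recurrence (21) and the estimate (22) are easily checked computationally; a minimal sketch:

```python
import math

def ordered_bell(n):
    """n-th ordered Bell number via the recurrence (21)."""
    b = [1] + [0] * n
    for m in range(1, n + 1):
        b[m] = sum(math.comb(m, r) * b[r] for r in range(m))
    return b[n]

# verify the estimate (22), b(n) <= n! / (log 2)^n, for the first few n
for n in range(1, 10):
    assert ordered_bell(n) <= math.factorial(n) / math.log(2) ** n
```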

The next lemma bounds the derivatives of \(\det \mathbf{J}(\mathbf{x},\mathbf{y})\).

Lemma 4

Let \(\mathbf{J}:\Box \rightarrow L^\infty (D_{\mathrm{ref}};\mathbb {R}^{d\times d})\) be defined as in (16). Then, it holds for the derivatives of \(\det \mathbf{J}(\mathbf{x},\mathbf{y})\) that

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\det \mathbf{J}(\mathbf{x},\mathbf{y})\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))} \le d!(1+\overline{\sigma })^{d}|{\varvec{\alpha }}|!{\varvec{\gamma }}^{\varvec{\alpha }}. \end{aligned}$$
(23)

Proof

The proof is by induction on the minors of \(\mathbf{J}(\mathbf{x},\mathbf{y})=[j_{k,\ell }(\mathbf{x},\mathbf{y})]_{k,\ell =1}^d\in \mathbb {R}^{d\times d}\). For the \((1\times 1)\)-minors, we obviously obtain

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\det j_{k,\ell }\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))} =\big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}j_{k,\ell }\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\le {\left\{ \begin{array}{ll} \overline{\sigma },&{}\!\!\!\text {if }|{\varvec{\alpha }}| = 0\\ \gamma _i,&{}\!\!\!\text {if }|{\varvec{\alpha }}|=\alpha _i=1\\ 0,&{}\!\!\!\text {if }|{\varvec{\alpha }}|>1. \end{array}\right. } \end{aligned}$$
(24)

For \(m\le d\), we set \(\mathbf{J}^{\mathbf{k},{\varvec{\ell }}}\mathrel {\mathrel {\mathop :}=}[j_{k,\ell }]_{k\in \mathbf{k},\ell \in {\varvec{\ell }}}\in \mathbb {R}^{m\times m}\), where \(\mathbf{k}=\{k_1,\ldots , k_m\}\) and \({\varvec{\ell }}=\{\ell _1,\ldots , \ell _m\}\) with \(1\le k_1<\cdots <k_m\le d\) and \(1\le \ell _1<\cdots < \ell _m\le d\). Now, let the assertion (23) hold for some \(m-1<d\). Then, Laplace’s rule for determinants yields

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\det \mathbf{J}^{\mathbf{k},{\varvec{\ell }}}\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}= \left\| \partial ^{\varvec{\alpha }}_\mathbf{y}\sum _{k'=1}^m (-1)^{k'+\ell '}j_{k_{k'},\ell _{\ell '}}\det \mathbf{J}^{\mathbf{k}',{\varvec{\ell }}'}\right\| _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}, \end{aligned}$$

where \(\ell '\in \{1,\ldots ,m\}\) is arbitrary but fixed, \(\mathbf{k}'\mathrel {\mathrel {\mathop :}=}\mathbf{k}{\setminus }\{k_{k'}\}\) and \({\varvec{\ell }}'\mathrel {\mathrel {\mathop :}=}{\varvec{\ell }}{\setminus }\{\ell _{\ell '}\}\). The triangle inequality and the Leibniz rule for differentiation give us

$$\begin{aligned}&\bigg \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\sum _{k'=1}^m(-1)^{k'+\ell '}j_{k_{k'},\ell _{\ell '}}\det \mathbf{J}^{\mathbf{k}',{\varvec{\ell }}'}\bigg \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\\&\quad \le \sum _{k'=1}^m\Big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\big (j_{k_{k'},\ell _{\ell '}}\det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}\big )\Big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\\&\quad =\sum _{k'=1}^m\bigg \Vert \sum _{{\varvec{\alpha }}'\le {\varvec{\alpha }}}{{\varvec{\alpha }}\atopwithdelims (){\varvec{\alpha }}'}\partial ^{{\varvec{\alpha }}' }_\mathbf{y}j_{k_{k'},\ell _{\ell '}}\partial ^{{\varvec{\alpha }}-{\varvec{\alpha }}'}_\mathbf{y}\det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}\bigg \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\\&\quad =\sum _{k'=1}^m\bigg \Vert \sum _{r=1}^M\alpha _r\partial ^{\mathbf{e}_r }_\mathbf{y}j_{k_{k'},\ell _{\ell '}}\partial ^{{\varvec{\alpha }}-\mathbf{e}_r}_\mathbf{y}\det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}+j_{k_{k'},\ell _{\ell '}} \partial ^{{\varvec{\alpha }}}_\mathbf{y}\det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}\bigg \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}, \end{aligned}$$

since \(j_{k_{k'},\ell _{\ell '}}\) is an affine function with respect to \(\mathbf{y}\) and all higher order derivatives, i.e. \(|{\varvec{\alpha }}'|>1\), vanish, see (24). A reapplication of the triangle inequality together with the induction hypothesis and the sub-multiplicativity of the \(L^\infty \)-norm hence provides

$$\begin{aligned}&\sum _{k'=1}^m\left\| \sum _{r=1}^M\alpha _r\partial ^{\mathbf{e}_r }_\mathbf{y}j_{k_{k'},\ell _{\ell '}}\partial ^{{\varvec{\alpha }}-\mathbf{e}_r}_\mathbf{y}\det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}+j_{k_{k'},\ell _{\ell '}}\partial ^{{\varvec{\alpha }}}_\mathbf{y} \det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}\right\| _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\\&\quad \le \sum _{k'=1}^m\bigg (\sum _{r=1}^M\alpha _r\big \Vert \partial ^{\mathbf{e}_r }_\mathbf{y}j_{k_{k'},\ell _{\ell '}}\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))} \big \Vert \partial ^{{\varvec{\alpha }}-\mathbf{e}_r}_\mathbf{y}\det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\\&\quad \quad + \big \Vert j_{k_{k'},\ell _{\ell '}}\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))} \big \Vert \partial ^{{\varvec{\alpha }}}_\mathbf{y}\det \mathbf{J}^{\mathbf{k}',{{\varvec{\ell }}}'}\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\bigg )\\&\quad \le \sum _{k'=1}^m\bigg (\sum _{r=1}^M\alpha _r\gamma _r(m-1)!(1+\overline{\sigma })^{m-1}|{\varvec{\alpha }}-\mathbf{e}_r|!{\varvec{\gamma }}^{{\varvec{\alpha }}-\mathbf{e}_r}\\&\quad \quad +\,\overline{\sigma }(m-1)!(1+\overline{\sigma })^{m-1}|{\varvec{\alpha }}|!{\varvec{\gamma }}^{\varvec{\alpha }}\bigg )\\&\quad \le \sum _{k'=1}^m\Big ((m-1)!(1+\overline{\sigma })^{m-1}|{\varvec{\alpha }}|!{\varvec{\gamma }}^{\varvec{\alpha }}+\overline{\sigma }(m-1)!(1+\overline{\sigma })^{m-1}|{\varvec{\alpha }}|!{\varvec{\gamma }}^{\varvec{\alpha }}\Big )\\&\quad =m!(1+\overline{\sigma })^{m}|{\varvec{\alpha }}|!{\varvec{\gamma }}^{\varvec{\alpha }}, \end{aligned}$$

where we exploited that

$$\begin{aligned} \sum _{r=1}^M\alpha _r|{\varvec{\alpha }}-\mathbf{e}_r|!=(|{\varvec{\alpha }}|-1)!\sum _{r=1}^M\alpha _r=(|{\varvec{\alpha }}|-1)!|{\varvec{\alpha }}|=|{\varvec{\alpha }}|!. \end{aligned}$$

\(\Box \)

The application of the Leibniz rule yields now a regularity estimate for the diffusion matrix \(\mathbf{A}(\mathbf{x},\mathbf{y})\).

Theorem 3

The derivatives of the diffusion matrix \(\mathbf{A}(\mathbf{x},\mathbf{y})\) defined in (7) satisfy under the conditions of Assumption 1.3 that

$$\begin{aligned} {\big \vert \big \vert \big \vert \partial ^{\varvec{\alpha }}_\mathbf{y}{} \mathbf{A} \big \vert \big \vert \big \vert }_{d\times d}\le (|{\varvec{\alpha }}|+1)! \frac{C_{\det }}{\underline{\sigma }^2} \bigg (\frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2\log 2}\bigg )^{|{\varvec{\alpha }}|} {{\varvec{\gamma }}}^{{\varvec{\alpha }}}. \end{aligned}$$

Proof

The Leibniz rule for \( \partial ^{\varvec{\alpha }}_\mathbf{y}{} \mathbf{A}(\mathbf{x},\mathbf{y}) \) reads as

$$\begin{aligned} \partial ^{\varvec{\alpha }}_\mathbf{y}{} \mathbf{A}(\mathbf{x},\mathbf{y}) =\sum _{{\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} \partial ^{{\varvec{\alpha }}'}_\mathbf{y} \big (\mathbf{J}(\mathbf{x},\mathbf{y})^\intercal \mathbf{J}(\mathbf{x},\mathbf{y})\big )^{-1} \partial _\mathbf{y}^{{\varvec{\alpha }}-{\varvec{\alpha }}'}\det \mathbf{J}(\mathbf{x},\mathbf{y}). \end{aligned}$$

Inserting the results of Lemmas 3 and 4 yields with \(C_{\det }\mathrel {\mathrel {\mathop :}=}d!(1+\overline{\sigma })^{d}\) that

$$\begin{aligned} {\big \vert \big \vert \big \vert \partial ^{\varvec{\alpha }}_\mathbf{y}\mathbf{A} \big \vert \big \vert \big \vert }_{d\times d}&\le \sum _{{\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} |{\varvec{\alpha }}'|! \frac{{\varvec{\gamma }}^{{\varvec{\alpha }}'}}{\underline{\sigma }^2}\bigg (\frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2\log 2} \bigg )^{|{\varvec{\alpha }}'|} C_{\det }|{\varvec{\alpha }}-{\varvec{\alpha }}'|!{\varvec{\gamma }}^{{\varvec{\alpha }}-{\varvec{\alpha }}'}\\&\le \frac{C_{\det }}{\underline{\sigma }^2} \bigg (\frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2\log 2}\bigg )^{|{\varvec{\alpha }}|} {{\varvec{\gamma }}}^{{\varvec{\alpha }}}\sum _{{\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} |{\varvec{\alpha }}'|!|{\varvec{\alpha }}-{\varvec{\alpha }}'|!. \end{aligned}$$

Now, we employ the combinatorial identity

$$\begin{aligned} \sum _{{\genfrac{}{}{0.0pt}{}{{\varvec{\alpha }}'\le {\varvec{\alpha }}}{|{\varvec{\alpha }}'|=j}}} \begin{pmatrix} {{\varvec{\alpha }}}\\ {{\varvec{\alpha }}'} \end{pmatrix} =\begin{pmatrix} |{\varvec{\alpha }}|\\ j \end{pmatrix} \end{aligned}$$
(25)

and obtain

$$\begin{aligned} \sum _{{\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} |{\varvec{\alpha }}'|!|{\varvec{\alpha }}-{\varvec{\alpha }}'|!&=\sum _{j=0}^{|{\varvec{\alpha }}|}j!(|{\varvec{\alpha }}|-j)!\sum _{{\genfrac{}{}{0.0pt}{}{{\varvec{\alpha }}'\le {\varvec{\alpha }}}{|{\varvec{\alpha }}'|=j}}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix}\\&=\sum _{j=0}^{|{\varvec{\alpha }}|}j!(|{\varvec{\alpha }}|-j)! \begin{pmatrix} |{\varvec{\alpha }}|\\ j \end{pmatrix} =|{\varvec{\alpha }}|!\sum _{j=0}^{|{\varvec{\alpha }}|}1=(|{\varvec{\alpha }}|+1)!. \end{aligned}$$

\(\Box \)
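
The combinatorial identity employed in the last step can be verified by brute force for a small multiindex; a minimal sketch:

```python
import math
from itertools import product

alpha = (2, 1, 3)                      # a small test multiindex
total = sum(
    math.prod(math.comb(a, b) for a, b in zip(alpha, beta))   # binom(alpha, beta)
    * math.factorial(sum(beta))                               # |beta|!
    * math.factorial(sum(alpha) - sum(beta))                  # |alpha - beta|!
    for beta in product(*(range(a + 1) for a in alpha))       # all beta <= alpha
)
assert total == math.factorial(sum(alpha) + 1)                # (|alpha| + 1)!
```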

In order to prove regularity results for the right hand side \(f_{\mathrm{ref}}\) in (9), we have to assume that f is a smooth function.

Lemma 5

Let \(f\in C^\infty (\mathcal {D})\) be analytic, i.e. \(\Vert \partial _\mathbf{x}^{{\varvec{\alpha }}}f\Vert _{L^\infty (\mathcal {D})}\le {\varvec{\alpha }}!\rho ^{-|{\varvec{\alpha }}|} c_f\) for all \({\varvec{\alpha }}\in \mathbb {N}_0^d\) and some \(\rho \in (0,1]\). Then, the derivatives of \(\hat{f}=f\circ \mathbf{V}\) are bounded by

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\hat{f}\big \Vert _{ L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))} \le |{\varvec{\alpha }}|!c_f\bigg (\frac{d}{\rho \log 2}\bigg )^{|{\varvec{\alpha }}|}{\varvec{\gamma }}^{\varvec{\alpha }}. \end{aligned}$$

Proof

In view of (16), differentiation of \(\mathbf{V}(\mathbf{x},\mathbf{y})\) yields \( {\partial _{y_i}}{} \mathbf{V}(\mathbf{x},\mathbf{y}) =\sigma _i{\varvec{\varphi }}_i(\mathbf{x}). \) Thus, all higher order derivatives with respect to an arbitrary direction \(y_j\) vanish. The norm of the first order derivatives is bounded by \( {\big \vert \big \vert \big \vert {\partial _{y_i}}{} \mathbf{V} \big \vert \big \vert \big \vert }_{d}\le \gamma _i. \)

The rest of the proof is also based on the application of Faà di Bruno’s formula. Nevertheless, we have this time to consider the multivariate case. To that end, we define the set \(P({\varvec{\alpha }},{\varvec{\alpha }}^\prime )\) given by

$$\begin{aligned} P({\varvec{\alpha }},{\varvec{\alpha }}')\mathrel {\mathrel {\mathop :}=}\bigg \{&\big ((\mathbf{k}_1,{\varvec{\beta }}_1), \ldots ,(\mathbf{k}_n,{\varvec{\beta }}_n)\big )\in (\mathbb {N}_0^d\times \mathbb {N}_0^M)^n: \sum _{i=1}^n |\mathbf{k}_i|{\varvec{\beta }}_i={\varvec{\alpha }},\ \sum _{i=1}^n \mathbf{k}_i={\varvec{\alpha }}',\\&\text { and }\ \exists \,1\le s\le n : |\mathbf{k}_i|=|{\varvec{\beta }}_i|=0\quad \text {for all}\ 1\le i\le n-s,\\&|\mathbf{k}_i|\ne 0\quad \text {for all}\ n-s+1\le i\le n\quad \text { and } \quad \mathbf{0}\prec {\varvec{\beta }}_{n-s+1}\prec \dots \prec {\varvec{\beta }}_n\bigg \} \end{aligned}$$

with \(n=|{\varvec{\alpha }}|\). The application of the multivariate Faà di Bruno formula yields now

$$\begin{aligned}&\big \Vert \partial ^{{\varvec{\alpha }}}_\mathbf{y}\hat{f}\big \Vert _{ L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\\&\quad \le \sum _{1\le |{\varvec{\alpha }}'|\le n}\big \Vert \partial _\mathbf{x}^{{\varvec{\alpha }}'} f\big \Vert _{ L^\infty (\Box ;L^\infty (\mathcal {D}))} \sum _{P({\varvec{\alpha }},{\varvec{\alpha }}')}{\varvec{\alpha }}!\prod _{j=1}^n \frac{\big \Vert \big (\partial ^{{\varvec{\beta }}_j}_\mathbf{y}{} \mathbf{V}\big )^{\mathbf{k}_j}\big \Vert _{ L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}}{\mathbf{k}_j!({\varvec{\beta }}_j!)^{|\mathbf{k}_j|}}\\&\quad \le \sum _{1\le |{\varvec{\alpha }}'|\le n}{\varvec{\alpha }}'!\rho ^{-|{\varvec{\alpha }}'|} c_f \sum _{P({\varvec{\alpha }},{\varvec{\alpha }}')}{\varvec{\alpha }}!\prod _{j=1}^n \frac{\big ({{\varvec{\gamma }}}^{{\varvec{\beta }}_j}\big )^{|\mathbf{k}_j|}}{\mathbf{k}_j!({\varvec{\beta }}_j!)^{|\mathbf{k}_j|}}\\&\quad =c_f{\varvec{\gamma }}^{\varvec{\alpha }}\sum _{1\le |{\varvec{\alpha }}'|\le n}{\varvec{\alpha }}'!\rho ^{-|{\varvec{\alpha }}'|} \sum _{P({\varvec{\alpha }},{\varvec{\alpha }}')}{\varvec{\alpha }}!\prod _{j=1}^n \frac{1}{\mathbf{k}_j!({\varvec{\beta }}_j!)^{|\mathbf{k}_j|}}. \end{aligned}$$

From [9], we know that

$$\begin{aligned} \sum _{|{\varvec{\alpha }}'|=r}\sum _{P({\varvec{\alpha }},{\varvec{\alpha }}')}{\varvec{\alpha }}!\prod _{j=1}^n \frac{1}{\mathbf{k}_j!({\varvec{\beta }}_j!)^{|\mathbf{k}_j|}}=d^r S_{n,r}, \end{aligned}$$

where again \(S_{n,r}\) is the Stirling number of the second kind. Thus, we obtain

$$\begin{aligned} \big \Vert \partial ^{{\varvec{\alpha }}}_\mathbf{y}\hat{f}\big \Vert _{ L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\le c_f{\varvec{\gamma }}^{\varvec{\alpha }}\sum _{r=1}^{n}\bigg (\frac{d}{\rho }\bigg )^r r!S_{n,r} \le c_f{\varvec{\gamma }}^{\varvec{\alpha }}\bigg (\frac{d}{\rho }\bigg )^{|{\varvec{\alpha }}|}\sum _{r=0}^{n}r! S_{n,r}. \end{aligned}$$

Analogously to the proof of Lemma 3, we finally arrive at the assertion. \(\Box \)

Now, in complete analogy to Theorem 3, we have the following regularity result for the right hand side \(f_{\mathrm{ref}}\).

Theorem 4

The derivatives of the right hand side \(f_{\mathrm{ref}}(\mathbf{x},\mathbf{y})\) defined in (8) satisfy

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}{f}_{\mathrm{ref}}\big \Vert _{ L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))} \le (|{\varvec{\alpha }}|+1)!c_fC_{\det } \bigg (\frac{d}{\rho \log 2}\bigg )^{|{\varvec{\alpha }}|}{{\varvec{\gamma }}}^{\varvec{\alpha }}. \end{aligned}$$

Finally, we establish the dependency between the solution \(\hat{u}\) to (9) and the data \({f}_{\mathrm{ref}}\).

Lemma 6

Let \(\hat{u}(\mathbf{y})\) be the solution to (9) and \({f}_{\mathrm{ref}}\in L^\infty \big (\Box ;L^\infty (D_{\mathrm{ref}})\big )\). Then, there holds

$$\begin{aligned} \Vert \hat{u}(\mathbf{y})\Vert _{H^1(D_{\mathrm{ref}})}\le \frac{\overline{\sigma }^2}{\underline{\sigma }^d} c_{{D}} \Vert {f}_{\mathrm{ref}}\Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))} \end{aligned}$$
(26)

with a constant \(c_{{D}}\) which depends only on \(D_{\mathrm{ref}}\), for almost every \(\mathbf{y}\in \Box \).

Proof

The bilinear form

$$\begin{aligned} (\mathbf{A}\nabla \,\cdot \,,\nabla \,\cdot \,)_{L^2(D_{\mathrm{ref}};\mathbb {R}^d)}:H^1_0(D_{\mathrm{ref}})\times H^1_0(D_{\mathrm{ref}})\rightarrow \mathbb {R} \end{aligned}$$

is coercive and bounded according to (4) and \(\underline{\sigma }^d\le \det \mathbf{J}(\mathbf{x},\mathbf{y})\le \overline{\sigma }^d\). It holds

$$\begin{aligned} \frac{\underline{\sigma }^d}{\overline{\sigma }^2}\Vert \hat{u}\Vert _{H^1(D_{\mathrm{ref}})}^2\le (\mathbf{A}\nabla \hat{u},\nabla \hat{u})_{L^2(D_{\mathrm{ref}};\mathbb {R}^d)} \end{aligned}$$

and

$$\begin{aligned} (\mathbf{A}\nabla \hat{u},\nabla \hat{v})_{L^2(D_{\mathrm{ref}};\mathbb {R}^d)}\le \frac{\overline{\sigma }^d}{\underline{\sigma }^2}\Vert \hat{u}\Vert _{H^1(D_{\mathrm{ref}})}\Vert \hat{v}\Vert _{H^1(D_{\mathrm{ref}})} \end{aligned}$$

for all \(\hat{u},\hat{v}\in H^1_0(D_{\mathrm{ref}})\) and almost every \(\mathbf{y}\in \Box \). The assertion follows now by the application of the Lax-Milgram lemma and the observation that

$$\begin{aligned} \Vert {f}_{\mathrm{ref}}\Vert _{L^\infty (\Box ;H^{-1}(D_{\mathrm{ref}}))} \le \sqrt{|D_{\mathrm{ref}}|}c_P\Vert {f}_{\mathrm{ref}}\Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}, \end{aligned}$$

where \(c_P\) denotes the Poincaré constant of \(D_{\mathrm{ref}}\). \(\Box \)

Combining the constants arising from Theorems 3 and 4 leads to the modified sequence

$$\begin{aligned} \{\mu _k\}_k\mathrel {\mathrel {\mathop :}=}\bigg \{2\max \bigg (\frac{d}{\rho \log 2}, \frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2\log 2}\bigg ){\gamma _k}\bigg \}_k \end{aligned}$$

such that

$$\begin{aligned} {\big \vert \big \vert \big \vert \partial ^{\varvec{\alpha }}_\mathbf{y}{} \mathbf{A} \big \vert \big \vert \big \vert }_{d\times d}\le C|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\quad \text {and}\quad \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}f_{\mathrm{ref}}\big \Vert _{L^\infty (\Box ;L^\infty (D_{\mathrm{ref}}))}\le C|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}. \end{aligned}$$

Herein, we set \(C\mathrel {\mathrel {\mathop :}=}C_{\det }\max (c_f,1/\underline{\sigma }^2)\). Notice that we have also introduced an additional factor 2 in order to obtain the factor \(|{\varvec{\alpha }}|!\) in the derivative bounds instead of the factor \((|{\varvec{\alpha }}|+1)!\).

Theorem 5

Under the conditions of Assumption 1.3, the derivatives of the solution \(\hat{u}\) to (9) satisfy

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})} \le |{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\bigg (4\frac{\overline{\sigma }^2}{\underline{\sigma }^d}C\max \{1,c_D\}\bigg )^{|{\varvec{\alpha }}|+1}, \end{aligned}$$

where \(c_D\) denotes the constant from Lemma 6.

Proof

Differentiating the variational formulation (9) with respect to \(\mathbf{y}\) leads to

$$\begin{aligned} \Big (\partial ^{\varvec{\alpha }}_\mathbf{y}\big (\mathbf{A}(\mathbf{y})\nabla _\mathbf{x}\hat{u}(\mathbf{y})\big ), \nabla _\mathbf{x}\hat{v}\Big )_{L^2(D_{\mathrm{ref}};\mathbb {R}^d)}= \big (\partial ^{\varvec{\alpha }}_\mathbf{y}{f}_{\mathrm{ref}}(\mathbf{y}),\hat{v}\big )_{L^2(D_{\mathrm{ref}};\mathbb {R})}. \end{aligned}$$

The isomorphism between the spaces \(H^1_0(D_{\mathrm{ref}})\) and \(H^1_0\big (D(\mathbf{y})\big )\) from Lemma 1 allows us to choose the test functions v independently of \(\mathbf{y}\). Furthermore, the application of the Leibniz rule to the expression \( \partial ^{\varvec{\alpha }}_\mathbf{y}\big (\mathbf{A}(\mathbf{y})\nabla _\mathbf{x}\hat{u}(\mathbf{y})\big ) \) results in

$$\begin{aligned} \partial ^{\varvec{\alpha }}_\mathbf{y}\big (\mathbf{A}(\mathbf{y})\nabla _\mathbf{x}\hat{u}(\mathbf{y})\big ) =\sum _{{\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} \partial ^{{\varvec{\alpha }}'}_\mathbf{y}{} \mathbf{A}(\mathbf{y}) \partial _\mathbf{y}^{{\varvec{\alpha }}-{\varvec{\alpha }}'}\nabla _\mathbf{x}\hat{u}(\mathbf{y}). \end{aligned}$$

Thus, rearranging the preceding expression and using the linearity of the gradient, we arrive at

$$\begin{aligned} \int _{D_{\mathrm{ref}}}{} \mathbf{A}(\mathbf{y})\nabla _\mathbf{x}\partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y}) \nabla _\mathbf{x}v\mathrm{d}\mathbf{x}&= \int _{D_{\mathrm{ref}}}\partial ^{\varvec{\alpha }}_\mathbf{y}{f}_{\mathrm{ref}}(\mathbf{y})v\mathrm{d}\mathbf{x}\\&\quad -\sum _{{\varvec{\alpha }}\ne {\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} \int _{D_{\mathrm{ref}}}\partial ^{{\varvec{\alpha }}-{\varvec{\alpha }}'}_\mathbf{y}{} \mathbf{A}(\mathbf{y}) \nabla _\mathbf{x}\partial _\mathbf{y}^{{\varvec{\alpha }}'} \hat{u}(\mathbf{y})\nabla _\mathbf{x}v\mathrm{d}\mathbf{x}. \end{aligned}$$

By choosing \(v=\partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\) and by employing the estimates from Theorems 3 and 4, it follows that

$$\begin{aligned}&\frac{\underline{\sigma }^d}{\overline{\sigma }^2}\big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})}^2\le \int _{D_{\mathrm{ref}}}\partial ^{\varvec{\alpha }}_\mathbf{y}{f}_{\mathrm{ref}}(\mathbf{y}) \partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\mathrm{d}\mathbf{x} \\&\qquad -\sum _{{\varvec{\alpha }}\ne {\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} \int _{D_{\mathrm{ref}}}\partial ^{{\varvec{\alpha }}-{\varvec{\alpha }}'}_\mathbf{y}{} \mathbf{A}(\mathbf{y}) \nabla _\mathbf{x}\partial _\mathbf{y}^{{\varvec{\alpha }}'} \hat{u}(\mathbf{y})\nabla _\mathbf{x}\partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\mathrm{d}\mathbf{x}\\&\quad \le c_DC|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})}\\&\qquad + \sum _{{\varvec{\alpha }}\ne {\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} C|{\varvec{\alpha }}-{\varvec{\alpha }}'|! {\varvec{\mu }}^{{\varvec{\alpha }}-{\varvec{\alpha }}'} \big \Vert \partial ^{{\varvec{\alpha }}'}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})}. \end{aligned}$$

From this, we obtain

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})} \le \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\!+\! \frac{\tilde{C}}{4}\!\sum _{{\varvec{\alpha }}\ne {\varvec{\alpha }}'\le {\varvec{\alpha }}}\!\begin{pmatrix}{\varvec{\alpha }}\\ {\varvec{\alpha }}'\end{pmatrix} |{\varvec{\alpha }}-{\varvec{\alpha }}'|! {\varvec{\mu }}^{{\varvec{\alpha }}-{\varvec{\alpha }}'} \big \Vert \partial ^{{\varvec{\alpha }}'}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})} \end{aligned}$$

by setting

$$\begin{aligned} \tilde{C}\mathrel {\mathrel {\mathop :}=}4\frac{\overline{\sigma }^2}{\underline{\sigma }^d}C\max (1,c_D). \end{aligned}$$

The proof is now by induction on \(|{\varvec{\alpha }}|\). The induction hypothesis is given by

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})}\le |{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\tilde{C}^{|{\varvec{\alpha }}|+1}. \end{aligned}$$

For \(|{\varvec{\alpha }}|=0\), this is just the stability estimate (26) with the right hand side scaled by the factor 4. Therefore, let the assertion hold for all \(|{\varvec{\alpha }}|\le n-1\) for some \(n\ge 1\). Then, we have

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})}&\le \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}+ \frac{\tilde{C}}{4}\sum _{{\varvec{\alpha }}\ne {\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} |{\varvec{\alpha }}-{\varvec{\alpha }}'|! {\varvec{\mu }}^{{\varvec{\alpha }}-{\varvec{\alpha }}'} |{\varvec{\alpha }}'|!{\varvec{\mu }}^{{\varvec{\alpha }}'}\tilde{C}^{|{\varvec{\alpha }}'|+1}\\&\le \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}+ \frac{\tilde{C}}{4}{\varvec{\mu }}^{{\varvec{\alpha }}} \sum _{{\varvec{\alpha }}\ne {\varvec{\alpha }}'\le {\varvec{\alpha }}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} |{\varvec{\alpha }}-{\varvec{\alpha }}'|!|{\varvec{\alpha }}'|!\tilde{C}^{|{\varvec{\alpha }}'|+1}\\&=\frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}+ \frac{\tilde{C}}{4}{\varvec{\mu }}^{\varvec{\alpha }}\sum _{j=0}^{n-1} \sum _{{\genfrac{}{}{0.0pt}{}{{\varvec{\alpha }}'\le {\varvec{\alpha }}}{|{\varvec{\alpha }}'|=j}}} \begin{pmatrix} {\varvec{\alpha }}\\ {\varvec{\alpha }}' \end{pmatrix} |{\varvec{\alpha }}-{\varvec{\alpha }}'|!|{\varvec{\alpha }}'|!\tilde{C}^{|{\varvec{\alpha }}'|+1}. \end{aligned}$$

Again, we make use of the combinatorial identity (25) and obtain the estimate

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})}&\le \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{{\varvec{\alpha }}}+ \frac{\tilde{C}}{4}{\varvec{\mu }}^{\varvec{\alpha }}\sum _{j=0}^{n-1} \begin{pmatrix} |{\varvec{\alpha }}|\\ j \end{pmatrix} (|{\varvec{\alpha }}|-j)!j!\tilde{C}^{j+1}\\&= \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{{\varvec{\alpha }}}+ \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\tilde{C}\sum _{j=0}^{n-1}\tilde{C}^{j}\\&\le \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{{\varvec{\alpha }}}+ \frac{\tilde{C}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\tilde{C}\frac{\tilde{C}^{|{\varvec{\alpha }}|}}{\tilde{C}-1}. \end{aligned}$$

Now, the application of Lemma 9 from the Appendix gives us

$$\begin{aligned} \frac{\tilde{C}}{2}\frac{\tilde{C}^{|{\varvec{\alpha }}|}}{\tilde{C}-1}\le \tilde{C}^{|{\varvec{\alpha }}|}. \end{aligned}$$

Since \(\tilde{C}>1\), we conclude

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})} \le \frac{\tilde{C}^{|{\varvec{\alpha }}|+1}}{4}|{\varvec{\alpha }}|!{\varvec{\mu }}^{{\varvec{\alpha }}}+ \frac{\tilde{C}^{|{\varvec{\alpha }}|+1}}{2}|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\le \tilde{C}^{|{\varvec{\alpha }}|+1}|{\varvec{\alpha }}|!{\varvec{\mu }}^{{\varvec{\alpha }}}. \end{aligned}$$

This completes the proof. \(\Box \)
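Although not needed for the argument, the induction can be illustrated numerically: in a single parameter dimension, the recursion for the derivative norms, taken with equality, majorizes the true norms and stays below the claimed bound. A minimal sketch, with arbitrary illustrative values for \(\mu \) and \(\tilde{C}\):

```python
# Numerical illustration of the induction in Theorem 5 for one parameter
# dimension (M = 1): the recursion is taken with equality, which majorizes
# the actual derivative norms, and compared against n! * mu^n * Ct^(n+1).
# The values of mu and Ct are arbitrary illustrative choices.
from math import comb, factorial

mu, Ct = 0.5, 4.0   # Ct plays the role of \tilde{C}
b = [Ct / 4.0]      # b_0: the stability estimate scaled by the factor 4
for n in range(1, 15):
    s = sum(comb(n, j) * factorial(n - j) * mu ** (n - j) * b[j]
            for j in range(n))
    b.append(Ct / 4.0 * (factorial(n) * mu ** n + s))

assert all(b[n] <= factorial(n) * mu ** n * Ct ** (n + 1) for n in range(15))
print("bound verified for n = 0,...,14")
```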

Taking into account the additional factor provided by the theorem, we end up with the sequence

$$\begin{aligned} \{\mu _k\}_k\mathrel {\mathrel {\mathop :}=}\bigg \{\frac{8\overline{\sigma }^2}{\underline{\sigma }^d} C\max (1,c_D)\max \bigg (\frac{d}{\rho \log 2}, \frac{2(1+c_{\varvec{\gamma }})}{\underline{\sigma }^2\log 2}\bigg ){\gamma _k}\bigg \}_k, \end{aligned}$$

which yields in view of Theorem 5 that

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})} \le C|{\varvec{\alpha }}|!{\varvec{\mu }}^{\varvec{\alpha }}\end{aligned}$$

with a constant \(C>0\) independent of the dimension M. Moreover, we observe \(\mu _k\eqsim \gamma _k\). Therefore, for \(\gamma _k\lesssim k^{-1-\delta }\) with arbitrary \(\delta >0\), we obtain the analyticity of \(\hat{u}\) by Lemma 8 from the Appendix.

Remark 2

The discussion in this section refers only to the case of the Poisson equation. Of course, the analysis presented here carries over straightforwardly to the more general diffusion problem

$$\begin{aligned} -\mathrm{div}\big (\alpha (\mathbf{x})\nabla u(\mathbf{x},\mathbf{y})\big )=f(\mathbf{x})\quad \text {for }{} \mathbf{x}\in D(\mathbf{y}). \end{aligned}$$

In this case, one has to impose the restriction that \(\alpha (\mathbf{x})\) is an analytic function which is bounded from above and bounded away from 0 from below. Then, an estimate analogous to Lemma 5 applies to \(\hat{\alpha }(\mathbf{x},\mathbf{y})\). The proof of the analogue of Theorem 3 for \(\hat{\alpha }(\mathbf{x},\mathbf{y}) \mathbf{A}(\mathbf{x},\mathbf{y})\) then involves an additional application of the Leibniz rule.

Remark 3

We can obtain similar approximation results for the moments of \(\hat{u}\), i.e. for \(\hat{u}^p\) with \(p\in \mathbb {N}\), possibly with worse constants. To that end, one has to bound the derivatives of \(\hat{u}^p\) with respect to \(\mathbf{y}\), too. This is also achieved by the application of Faà di Bruno’s formula. For an idea of the related proofs, we refer to [16], where this topic is discussed in the case of a random diffusion coefficient.

5 Curved domains and parametric finite elements

For the regularity analysis in the preceding section, we have exploited that there exists a one-to-one correspondence between the deterministic problem on the random domain and the random problem on the reference domain. For the computations, in contrast to [6, 36], we do not, however, aim at mapping the equation to the reference domain \(D_{\mathrm{ref}}\), but rather solve the equation on each particular realization \(D(\mathbf{y}_i)=\mathbf{V} (D_{\mathrm{ref}},\mathbf{y}_i)\) for a suitable set of samples \(\{\mathbf{y}_i\}_{i=1}^N \subset \Box \). A first step towards this approach is made in [26], where a random boundary variation is assumed and a mesh on the realization \(D(\mathbf{y}_i)\) is generated via the solution of the Laplacian. Here, under the assumption that the random domain is obtained by a sufficiently smooth mapping \(\mathbf{V}(\mathbf{y}_i)\), we will employ parametric finite elements to map the mesh on \(D_{\mathrm{ref}}\) onto a mesh on \(D(\mathbf{y}_i)\).

We assume that the domain \(D_{\mathrm{ref}}\) is given as a collection of simplicial smooth patches. More precisely, let \(\triangle \) denote the reference simplex in \(\mathbb {R}^d\). We assume that the domain \(D_{\mathrm{ref}}\) is partitioned into K patches

$$\begin{aligned} \overline{D_{\mathrm{ref}}} = \bigcup _{j=1}^K \tau _{0,j}, \quad \tau _{0,j} = \varvec{\kappa }_j(\triangle ), \quad j = 1,2,\ldots ,K, \end{aligned}$$
(27)

where each \(\varvec{\kappa }_j:\triangle \rightarrow \tau _{0,j}\) defines a diffeomorphism of \(\triangle \) onto \(\tau _{0,j}\). Thus, we have in particular that

$$\begin{aligned} \frac{\sup \{\Vert {\varvec{\kappa }_j'}(\mathbf{s})\mathbf{x}\Vert _2: \mathbf{s}\in \triangle ,\Vert \mathbf{x}\Vert _2=1\}}{\inf \{\Vert {\varvec{\kappa }_j'}(\mathbf{s})\mathbf{x}\Vert _2: \mathbf{s}\in \triangle ,\Vert \mathbf{x}\Vert _2=1\}}\le \rho _j\quad \text { for all } j=1,\ldots ,K, \end{aligned}$$
(28)

where \(\varvec{\kappa }_j'\) denotes, as before, the Jacobian of \(\varvec{\kappa }_j\). Since there are only finitely many patches, we may set \(\rho \mathrel {\mathrel {\mathop :}=}\max _{j=1}^K\rho _j\). The intersection \(\tau _{0,j}\cap \tau _{0,j'}\), \(j\ne j'\), of any two patches \(\tau _{0,j}\) and \(\tau _{0,j'}\) is supposed to be either empty or a common lower-dimensional face.

A mesh on level \(\ell \) on \(D_{\mathrm{ref}}\) is now obtained by regular subdivisions of depth \(\ell \) of the reference simplex into \(2^{\ell d}\) sub-simplices. This generates the \(2^{\ell d}\) elements \(\{\tau _{\ell ,j}\}_j\). In order to ensure that the triangulation \(\mathcal {T}_\ell \mathrel {\mathrel {\mathop :}=}\{\tau _{\ell ,j}\}_j\) on the level \(\ell \) forms a regular mesh on \(D_{\mathrm{ref}}\), the parametrizations \(\{ \varvec{\kappa }_j\}_j\) are assumed to be \(C^0\) compatible in the following sense: there exists a bijective, affine mapping \(\varvec{\Xi }:\triangle \rightarrow \triangle \) such that for all \(\mathbf{x} = \varvec{\kappa }_j(\mathbf{s})\) on a common interface of \(\tau _{0,j}\) and \(\tau _{0,j'}\) it holds that \(\varvec{\kappa }_j(\mathbf{s}) = (\varvec{\kappa }_{j'}\circ \varvec{\Xi })(\mathbf{s})\). In other words, the diffeomorphisms \(\varvec{\kappa }_j\) and \(\varvec{\kappa }_{j'}\) coincide at the common interface except for orientation. An illustration of such a triangulation is found in Fig. 1. Notice that in our construction the local element mappings \(\triangle \rightarrow \tau _{\ell ,j}\) satisfy the same bound (28) by definition. In particular, the uniformity condition for (iso-)parametric finite elements is therefore fulfilled, cf. [4, 22].
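A minimal sketch of this construction for d = 2: the reference triangle is refined uniformly \(\ell \) times, each step splitting a triangle into four congruent children via the edge midpoints, which produces the \(4^\ell =2^{\ell d}\) elements; the vertices are then pushed forward by a patch mapping. The concrete mapping \(\varvec{\kappa }\) used below is a hypothetical smooth example.

```python
# Sketch of the parametric mesh construction for d = 2: uniform refinement
# of the reference triangle into 4^l sub-triangles, followed by the push-
# forward of all vertices under a (here: hypothetical) patch mapping kappa.
import numpy as np

def refine(tris):
    """One uniform refinement step: split each triangle into 4 children."""
    out = []
    for a, b, c in tris:
        ab, bc, ca = (a + b) / 2, (b + c) / 2, (c + a) / 2
        out += [(a, ab, ca), (ab, b, bc), (ca, bc, c), (ab, bc, ca)]
    return out

# the reference simplex in R^2
tris = [(np.array([0., 0.]), np.array([1., 0.]), np.array([0., 1.]))]
for _ in range(3):                     # level l = 3: 4^3 = 64 elements
    tris = refine(tris)

# illustrative smooth patch mapping kappa: triangle -> curved patch
kappa = lambda s: np.array([s[0] + 0.1 * s[1] ** 2, s[1]])
mesh = [tuple(kappa(v) for v in tri) for tri in tris]
print(len(mesh), "elements on level 3")
```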

Fig. 1: Construction of parametric finite elements

Finally, we define the finite element ansatz functions via the parametrizations \(\{ \varvec{\kappa }_j\}_j\) in the usual fashion, i.e. by lifting Lagrangian finite elements from \(\triangle \) to the domain \(D_{\mathrm{ref}}\) by using the mappings \(\varvec{\kappa }_j\). To that end, we define on the \(\ell \)-th subdivision \(\triangle _\ell \) of the reference domain the standard Lagrangian piecewise polynomial continuous finite elements \(\Phi _\ell =\{\varphi _{\ell ,i}:i\in \mathcal {I}_\ell \}\), where \(\mathcal {I}_\ell \) denotes an appropriate index set. The corresponding finite element space is then given by

$$\begin{aligned} {V}_{\triangle ,\ell }= \mathrm{span}\{\varphi _{\ell ,j}:j\in \mathcal {I}_\ell \} = \{ u\in C(\triangle ): u|_\tau \in \Pi _n\ \text {for all}\ \tau \in \triangle _\ell \} \end{aligned}$$

with \(\dim {V}_{\triangle ,\ell }\eqsim 2^{\ell d}\) and \(\Pi _n\) denoting the space of polynomials of degree at most n. Continuous basis functions whose support overlaps with several patches are obtained by gluing across patch boundaries, using the \(C^0\) inter-patch compatibility. This yields a (nested) sequence of finite element spaces

$$\begin{aligned} V_{{\mathrm{ref}},\ell }\mathrel {\mathrel {\mathop :}=}\{v\in C(D_{\mathrm{ref}}): v|_{{\varvec{\kappa }}_j(\triangle )}=\varphi \circ {\varvec{\kappa }}^{-1}_j, \varphi \in {V}_{\triangle ,\ell },\quad \ j=1,\ldots ,K\}\subset H^1(D_{\mathrm{ref}}) \end{aligned}$$

with \(\dim V_{{\mathrm{ref}},\ell }\eqsim 2^{\ell d}\). It is well known that the spaces \(V_{{\mathrm{ref}},\ell }\) satisfy the following Jackson and Bernstein type estimates for all \(0\le s\le t<3/2\), \(t\le q\le n+1\)

$$\begin{aligned} \inf _{v_\ell \in V_{{\mathrm{ref}},\ell }}\Vert u-v_\ell \Vert _{H^t(D_{\mathrm{ref}})} \lesssim h_\ell ^{q-t}\Vert u\Vert _{H^q(D_{\mathrm{ref}})}, \quad u\in H^q(D_{\mathrm{ref}}), \end{aligned}$$
(29)

and

$$\begin{aligned} \Vert v_\ell \Vert _{H^t(D_{\mathrm{ref}})}\lesssim h_\ell ^{s-t} \Vert v_\ell \Vert _{H^s(D_{\mathrm{ref}})},\quad v_\ell \in V_{{\mathrm{ref}},\ell }, \end{aligned}$$
(30)

uniformly in \(\ell \), where we set \(h_\ell \mathrel {\mathrel {\mathop :}=}2^{-\ell }\). Note that, by construction, \(h_\ell \) scales like the mesh size \(\max _k\{\mathrm{diam}\tau _{\ell ,k}\}\), i.e. it holds \(h_\ell \eqsim \max _k\{\mathrm{diam}\tau _{\ell ,k}\}\) uniformly in \(\ell \in \mathbb {N}\) due to (28).

We can employ the same argument to map the finite elements from the reference domain \(D_{\mathrm{ref}}\) to the particular realization \(D(\mathbf{y})=\mathbf{V}(D_{\mathrm{ref}},\mathbf{y})\) for \(\mathbf{y}\in \Box \). The ellipticity condition (4) on the Jacobian \(\mathbf{J}(\mathbf{x},\omega )\) of the random vector field guarantees that (28) is satisfied with \(\rho =\overline{\sigma }/\underline{\sigma }\). The Jackson and Bernstein type estimates (29) and (30) are also still valid, where the only limitation is imposed by the smoothness of \(\mathbf{V}(\mathbf{x},\mathbf{y})\). If, for example, \(\mathbf{V}(\mathbf{x},\mathbf{y})\) is of class \(C^2\), then we have the restriction \(q\le 2\) such that

$$\begin{aligned} \inf _{v_\ell \in {V}_\ell (\mathbf{y})}\Vert u-v_\ell \Vert _{H^t(D(\mathbf{y}))} \lesssim h_\ell ^{q-t}\Vert u\Vert _{H^q(D(\mathbf{y}))} \end{aligned}$$

for all \(0\le t<3/2\) and \(t\le q\le 2\), where \( {V_\ell }(\mathbf{y})\mathrel {\mathrel {\mathop :}=}\{\varphi \circ \mathbf{V}(\mathbf{y})^{-1}: \varphi \in {V}_{{\mathrm{ref}},\ell }\}\subset H^1\big (D(\mathbf{y})\big ). \)

The one-to-one correspondence between the solution \(u_\ell (\mathbf{y})\in {V}_\ell (\mathbf{y})\) to (6) and the solution \(\hat{u}_\ell (\mathbf{y})\in V_{{\mathrm{ref}},\ell }\) to (9) is given by the following theorem.

Theorem 6

Let \(u_\ell (\mathbf{y})\in {V}_\ell (\mathbf{y})\) and \(\hat{u}_\ell (\mathbf{y})\in V_{{\mathrm{ref}},\ell }\) be the Galerkin solutions to (6) and (9), respectively. Then, it holds

$$\begin{aligned} \hat{u}_\ell (\mathbf{y})=u_\ell \circ \mathbf{V}(\mathbf{y})\quad \text {and}\quad {u}_\ell (\mathbf{y})=\hat{u}_\ell \circ \mathbf{V}(\mathbf{y})^{-1}. \end{aligned}$$

Proof

The proof is a straightforward consequence of the construction of the spaces \({V}_\ell (\mathbf{y})\) and the equivalence of the problems (6) and (9), see also (10). \(\Box \)

Remark 4

The \(H^2\)-regularity of the mapped problem, i.e. on \(D(\mathbf{y})\), follows from the \(H^2\)-regularity of the problem on the reference domain \(D_{\mathrm{ref}}\) if the vector field \(\mathbf{V}(\mathbf{x},\mathbf{y})\) is at least a \(C^2\)-diffeomorphism. In particular, if \(\mathbf{V}(\mathbf{x},\mathbf{y})=\mathbf{x}+\mathbf{V}_0(\mathbf{x},\mathbf{y})\) is a perturbation of the identity as in (16) and \(\mathbf{V}_0(\mathbf{x},\mathbf{y})\) is of class \(C^2\), then \(\mathbf{V}(\mathbf{x},\mathbf{y})^{-1}\) is also a \(C^2\)-diffeomorphism provided that \(\Vert \mathbf{V}_0(\cdot ,\mathbf{y})\Vert _{C^2(D_{\mathrm{ref}})}<1/2\), cf. [31].

6 Stochastic interface problems

As a special case of a diffusion problem on a random domain, we shall focus on the stochastic interface problem as already discussed in e.g. [14].

Fig. 2: Visualization of the domain \(\mathcal {D}\) and the random interface \(\Gamma (\mathbf{y})\)

6.1 Problem formulation

Let the hold-all \(\mathcal {D}\subset \mathbb {R}^d\), cf. (3), be a simply connected, convex domain with Lipschitz continuous boundary \(\partial \mathcal {D}\). Inscribed into \(\mathcal {D}\), we have a randomly varying inclusion \(D^-(\mathbf{y})\subsetneq \mathcal {D}\) for \(\mathbf{y}\in \Box \) with a \(C^{2}\)-smooth boundary \(\Gamma (\mathbf{y})\mathrel {\mathrel {\mathop :}=}\partial D^-(\mathbf{y})\). The complement of \(\overline{D^-(\mathbf{y})}\) will be denoted by \(D^+(\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathcal {D}{{\setminus }}\overline{D^-(\mathbf{y})}\). A visualization of this setup is found in Fig. 2. For given \(\mathbf{y}\in \Box \), we can state the stochastic elliptic interface problem as follows:

$$\begin{aligned} -\mathrm{div}\big (\alpha (\mathbf{x},\mathbf{y})\nabla u(\mathbf x,\mathbf{y})\big )&=f(\mathbf{x})&\quad \text {in }\mathcal {D}{\setminus }\Gamma (\mathbf{y}),\end{aligned}$$
(31)
$$\begin{aligned}{}[\![u(\mathbf{x},\mathbf{y})]\!]&=0&\quad \text {on }\Gamma (\mathbf{y}),\end{aligned}$$
(32)
$$\begin{aligned} \bigg [\!\!\bigg [\alpha (\mathbf{x},\mathbf{y}) \frac{\partial u}{\partial \mathbf{n}}(\mathbf{x},\mathbf{y})\bigg ]\!\!\bigg ]&=0&\quad \text {on }\Gamma (\mathbf{y}),\end{aligned}$$
(33)
$$\begin{aligned} u(\mathbf{x},\mathbf{y})&= 0&\quad \text {on }\partial \mathcal {D}. \end{aligned}$$
(34)

Here, \(\mathbf{n}\) denotes the outward normal vector on \(\Gamma (\mathbf{y})\). Furthermore, the diffusion coefficient is given by

$$\begin{aligned} \alpha (\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\chi _{D^+(\mathbf{y})}(\mathbf{x})\alpha ^+(\mathbf{x})+ \chi _{D^-(\mathbf{y})}(\mathbf{x})\alpha ^-(\mathbf{x})\quad \text {for }{} \mathbf{x}\in \mathcal {D}, \end{aligned}$$

where \(\chi _{D^+(\mathbf{y})}\) and \(\chi _{D^-(\mathbf{y})}\) are the characteristic functions of \(D^+(\mathbf{y})\) and \(D^-(\mathbf{y})\), respectively, and \(\alpha ^+,\alpha ^-\) are smooth deterministic functions with

$$\begin{aligned} 0<\underline{\alpha }\le \alpha ^-(\mathbf{x}),\alpha ^+(\mathbf{x}) \le \overline{\alpha } <\infty \quad \text {for almost every }\mathbf{x}\in \mathcal {D}. \end{aligned}$$

By \([\![u(\mathbf{x},\mathbf{y})]\!]\mathrel {\mathrel {\mathop :}=}u^+(\mathbf{x},\mathbf{y})-u^-(\mathbf{x},\mathbf{y})\), we denote the jump of the solution u across \(\Gamma (\mathbf{y})\), where \(u^-(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}u|_{D^-(\mathbf{y})}\) and \(u^+(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}u|_{D^+(\mathbf{y})}\), respectively. Analogously, we define the jump of the co-normal derivative across \(\Gamma (\mathbf{y})\) via

$$\begin{aligned} \bigg [\!\!\bigg [\alpha (\mathbf{x},\mathbf{y}) \frac{\partial u}{\partial \mathbf{n}}(\mathbf{x},\mathbf{y})\bigg ]\!\!\bigg ] \mathrel {\mathrel {\mathop :}=}\alpha ^+(\mathbf{x})\frac{\partial u^+}{\partial \mathbf{n}}(\mathbf{x},\mathbf{y}) -\alpha ^-(\mathbf{x})\frac{\partial u^-}{\partial \mathbf{n}}(\mathbf{x},\mathbf{y}). \end{aligned}$$

Remark 5

This formulation of the stochastic interface problem also covers the case of elliptic equations on random domains. For example, for \(\alpha ^+{(\mathbf x)}\equiv 0\) and \(\alpha ^-(\mathbf{x})\equiv 1\) (perfect insulation), we have the Poisson equation on \(D^-(\mathbf{y})\) with homogeneous Neumann data on \(\Gamma (\mathbf{y})\) while, for \(\alpha ^+{(\mathbf x)}\equiv \infty \) and \(\alpha ^-(\mathbf{x})\equiv 1\) (perfect conduction), we have the Poisson equation on \(D^-(\mathbf{y})\) with homogeneous Dirichlet data on \(\Gamma (\mathbf{y})\).

6.2 Modeling the stochastic interface

Instead of solving the stochastic interface problem by the perturbation method by means of shape sensitivity analysis as in [14, 18], we propose here to apply the domain mapping approach. To that end, let \(\Gamma _{\mathrm{ref}}\subset \mathcal {D}\) denote a reference interface of class \(C^{2}\) and co-dimension 1 which separates the interior domain \(D^-_{\mathrm{ref}}\) and the outer domain \(D^+_{\mathrm{ref}}\). We assume that \(\Gamma (\mathbf{y})\) is prescribed by the application of a vector field \( \mathbf{V}:\mathcal {D}\times \Box \rightarrow \mathcal {D}, \) i.e. \(\Gamma (\mathbf{y})=\mathbf{V}(\Gamma _{\mathrm{ref}},\mathbf{y})\), which is a uniform \(C^{2}\)-diffeomorphism in the sense of Sect. 2. Furthermore, let the Jacobian of \(\mathbf{V}\) satisfy the ellipticity condition (4).

As an example, we can consider here an extension of the vector field in [14], which only prescribes the perturbation at the boundary: If \(\Gamma _{\mathrm{ref}}\) is of class \(C^{3}\), then its outward normal \(\mathbf{n}\) is of class \(C^{2}\). Thus, given a random field \(\kappa :\Gamma _{\mathrm{ref}}\times \Box \rightarrow \mathbb {R}\) which satisfies \(|\kappa (\mathbf{x},\mathbf{y})|\le \overline{\kappa }<1\) almost surely, we can define \( \mathbf{V}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathbf{x}+\kappa (\mathbf{x},\mathbf{y})\mathbf{n}(\mathbf{x}) \) for \(\mathbf{x}\in \Gamma _{\mathrm{ref}}\). A suitable extension of this vector field to the whole domain \(\mathcal {D}\) is given by \( \mathbf{V}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathbf{x}+\kappa (P\mathbf{x},\mathbf{y}) \mathbf{n}(P\mathbf{x})B(\Vert \mathbf{x}-P\mathbf{x}\Vert _2), \) where \(P\mathbf{x}\) is the orthogonal projection of \(\mathbf{x}\) onto \(\Gamma _{\mathrm{ref}}\) and \(B:[0,\infty )\rightarrow [0,1]\) is a smooth blending function with \(B(0)=1\) and \(B(t)=0\) for all \(t\ge c\) for some constant \(c\in (0,\infty )\). Notice that, if \(\Gamma _{\mathrm{ref}}\) is of class \(C^{3}\), the orthogonal projection P onto \(\Gamma _{\mathrm{ref}}\), and thus \(\mathbf{V}(\mathbf{x},\mathbf{y})\), is at least of class \(C^{2}\), cf. [19].
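For the special case where \(\Gamma _{\mathrm{ref}}\) is the circle of radius r (as in the example in Sect. 7.1), the projection and the normal are explicit, namely \(P\mathbf{x}=r\mathbf{x}/\Vert \mathbf{x}\Vert _2\) and \(\mathbf{n}(P\mathbf{x})=\mathbf{x}/\Vert \mathbf{x}\Vert _2\), and the extension can be sketched as follows; the blending function and the realization of \(\kappa \) below are illustrative choices.

```python
# Sketch of the extension V(x,y) = x + kappa(Px,y) n(Px) B(||x - Px||_2)
# for the special case where Gamma_ref is the circle of radius r, so that
# Px = r*x/||x|| and n(Px) = x/||x||. kappa and B are illustrative choices.
import numpy as np

r = 0.7

def B(t, c=0.3):
    """Smooth blending function with B(0) = 1 and B(t) = 0 for t >= c."""
    if t >= c:
        return 0.0
    return float(np.exp(1.0 - 1.0 / (1.0 - (t / c) ** 2)))

def V(x, kappa):
    """Apply the extended perturbation field to a point x != 0."""
    nrm = np.linalg.norm(x)
    n = x / nrm                  # outward normal of Gamma_ref at Px
    Px = r * n                   # orthogonal projection onto Gamma_ref
    return x + kappa(Px) * n * B(abs(nrm - r))

# one illustrative realization of the random field kappa on Gamma_ref
kappa = lambda p: 0.05 * np.cos(3.0 * np.arctan2(p[1], p[0]))
print(V(np.array([0.5, 0.5]), kappa))
```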

6.3 Reformulation for the reference interface

For \(\mathbf{y}\in \Box \), the variational formulation of the interface problem (31)–(34) is given as follows: Find \(u\in H^1_0(\mathcal {D})\) such that

$$\begin{aligned} \int _{D^-(\mathbf{y})\cup D^+(\mathbf{y})}\alpha \langle \nabla u,\nabla v\rangle \mathrm{d}\mathbf{x} =\int _{\mathcal {D}} fv\mathrm{d}\mathbf{x}\quad \text {for all }v\in H^1_0(\mathcal {D}). \end{aligned}$$

As in Sect. 2, we can reformulate this variational formulation relative to the reference interface. As we have for the transported coefficient

$$\begin{aligned} \hat{\alpha }(\mathbf{x},\mathbf{y})&= \chi _{\mathbf{V}(D^+_{\mathrm{ref}},\mathbf{y})}\big (\mathbf{V}(\mathbf{x},\mathbf{y})\big )\hat{\alpha }^+(\mathbf{x},\mathbf{y})+ \chi _{\mathbf{V}(D^-_{\mathrm{ref}},\mathbf{y})}\big (\mathbf{V}(\mathbf{x},\mathbf{y})\big )\hat{\alpha }^-(\mathbf{x},\mathbf{y})\\&=\chi _{D^+_{\mathrm{ref}}}(\mathbf{x})\hat{\alpha }^+(\mathbf{x},\mathbf{y})+ \chi _{D^-_{\mathrm{ref}}}(\mathbf{x})\hat{\alpha }^-(\mathbf{x},\mathbf{y}), \end{aligned}$$

we obtain the following variational formulation with the definition (7) of the diffusion matrix \(\mathbf{A}(\mathbf{x},\mathbf{y})\): Find \(\hat{u}(\mathbf{y})\in H^1_0(\mathcal {D})\) such that

$$\begin{aligned} \int _{D^-_{\mathrm{ref}}\cup D^+_{\mathrm{ref}}}\hat{\alpha }(\mathbf{y})\langle \mathbf{A}(\mathbf{y})\nabla \hat{u}(\mathbf{y}),\nabla {v} \rangle \mathrm{d}\mathbf{x} =\int _{\mathcal {D}}\hat{f}(\mathbf{y}){v}\det \mathbf{J}(\mathbf{y})\mathrm{d}\mathbf{x} \end{aligned}$$
(35)

for all \(v\in H^1_0(\mathcal {D})\). Since \(\hat{\alpha }(\mathbf{x},\mathbf{y})\) is a smooth function with respect to \(\mathbf{y}\), the regularity results from Sect. 4 remain valid here.

6.4 Finite element approximation for the stochastic interface problem

In particular, the application of parametric finite elements yields an interface-resolved triangulation for the discretization of the stochastic interface problem (31)–(34). By “interface-resolved” we mean that the vertices of elements adjacent to the interface lie exactly on the interface, cf. [7, 23]. Thus, the approximation error for a particular realization of the solution \(u(\mathbf{y})\) to the stochastic interface problem (31)–(34) can be quantified by the following theorem, adopted from [23, Theorem 4.1].

Theorem 7

For \(\mathbf{y}\in \Box \), let \(\{ \mathcal {T}_{\ell }\} _{\ell >0}\) be a family of interface resolved triangulations for \(\mathbf{V}(\mathcal {D},\mathbf{y})\) and \(\{{V}_{\ell }(\mathbf{y})\} _{\ell >0}\) the associated finite element spaces. Let \({u}_{\ell }(\mathbf{y})\) be the finite element solution corresponding to the realization \({u}(\mathbf{y})\) of the solution to the elliptic problem (31)–(34). Then, for \(s=0,1\), there holds that

$$\begin{aligned} \left\| {u}(\mathbf{y})-{u}_{\ell }(\mathbf{y})\right\| _{H^s(\mathcal {D})}\lesssim h_\ell ^{2-s}\left\| {u}(\mathbf{y})\right\| _{H^2(D^-(\mathbf{y}))\cup H^2(D^+(\mathbf{y}))}, \end{aligned}$$
(36)

where \({H^2\big (D^-(\mathbf{y})\big )}\cup H^2\big (D^+(\mathbf{y})\big )\) is the broken Sobolev space equipped with the norm

$$\begin{aligned} \Vert \cdot \Vert _{H^2(D^-(\mathbf{y}))\cup H^2(D^+(\mathbf{y}))} \mathrel {\mathrel {\mathop :}=}\sqrt{\Vert \cdot \Vert _{H^2(D^-(\mathbf{y}))}^2 + \Vert \cdot \Vert _{H^2(D^+(\mathbf{y}))}^2}. \end{aligned}$$

In view of Theorem 6, the statement of the previous theorem is also valid for the realization of the solution which is pulled back to the domain \(\mathcal {D}\) relative to the reference interface \(\Gamma _{\mathrm{ref}}\).

7 Numerical examples

In this section, we consider two examples of boundary value problems on random domains: on the one hand, a stochastic interface problem and, on the other hand, the Poisson equation on a random domain. In both examples, we employ the pivoted Cholesky decomposition, cf. [15, 17], in order to approximate the Karhunen-Loève expansion of \(\mathbf{V}\). The spatial discretization is performed by using piecewise linear parametric finite elements on the mapped domain \(\mathbf{V}(D_{\mathrm{ref}},\mathbf{y}_i)\) for each sample \(\mathbf{y}_i\). It would, of course, also be possible to perform the computations on the reference domain. In this case, the diffusion matrix \(\mathbf{A}\) has to be computed from the Karhunen-Loève expansion of \(\mathbf{V}\) for each particular sample.

For the stochastic approximation, we employ a quasi-Monte Carlo quadrature based on N Halton points \(\{{\varvec{\xi }}_i\}_{i=1}^N\) mapped to the hypercube \([-1,1]^M\), i.e.

$$\begin{aligned} \mathbb {E}[\hat{u}](\mathbf{x})\approx (Q\hat{u})(\mathbf{x})\mathrel {\mathrel {\mathop :}=}\frac{1}{N}\sum _{i=1}^N \hat{u}(\mathbf{x},2{\varvec{\xi }}_i-\mathbf{1}). \end{aligned}$$
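This estimator is straightforward to realize. The following sketch generates the Halton points via the radical inverse in the first M prime bases, maps them to \([-1,1]^M\) and averages; the function u_hat is a hypothetical stand-in for the evaluation of the finite element solution at a fixed spatial point \(\mathbf{x}\).

```python
# Sketch of the quasi-Monte Carlo estimator: Halton points on [0,1]^M are
# mapped to [-1,1]^M and averaged. u_hat is a hypothetical stand-in for
# the solution evaluated at a fixed spatial point x.
from sympy import prime

def radical_inverse(i, b):
    """Van der Corput radical inverse of the integer i in base b."""
    x, f = 0.0, 1.0 / b
    while i > 0:
        x += (i % b) * f
        i //= b
        f /= b
    return x

def halton(i, M):
    return [radical_inverse(i, prime(k + 1)) for k in range(M)]

def qmc_mean(u_hat, M, N):
    return sum(u_hat([2.0 * c - 1.0 for c in halton(i, M)])
               for i in range(1, N + 1)) / N

# toy parametric quantity with known mean E[u_hat] = 1.0
u_hat = lambda y: 1.0 + sum(yk / 2.0 ** (k + 2) for k, yk in enumerate(y))
print(qmc_mean(u_hat, M=4, N=10000))
```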

In accordance with [16], we have the following convergence result for this quasi-Monte Carlo quadrature, which is valid for the variance of \(\hat{u}\) as well.

Lemma 7

The quasi-Monte Carlo quadrature with Halton points for the mean of the solution \(\hat{u}\) to (9) converges independently of the stochastic dimension M if \({\gamma }_k\lesssim k^{-3-\varepsilon }\) for some \(\varepsilon >0\). More precisely, for all \(\delta >0\), there exists a constant \(C(\delta )\) such that the quasi-Monte Carlo quadrature based on N Halton points satisfies

$$\begin{aligned} \Vert \mathbb {E}[\hat{u}]-Q\hat{u}\Vert _{H^1(D_{\mathrm{ref}})}\le C(\delta ) N^{\delta -1}, \end{aligned}$$

where \(C(\delta )\rightarrow \infty \) as \(\delta \rightarrow 0\).

Proof

From [16, 21], we know that the error of the quasi-Monte Carlo quadrature can be estimated by the weighted Koksma-Hlawka inequality, cf. [27],

$$\begin{aligned}&\big \Vert (\mathbb {E}-Q )\hat{u}\big \Vert _{H_0^1(D)}\le \Bigg (\sup \limits _{\Vert {\varvec{\alpha }}\Vert _\infty =1}w_{{\varvec{\alpha }}}^{-\frac{1}{2}} 2^{|{\varvec{\alpha }}|} \sup \limits _{\mathbf{y}\in [-1,1]^M}\big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y} \hat{u}(\mathbf{y})\big \Vert _{H_0^1(D)}\Bigg )\nonumber \\&\quad \times \, \Bigg (\sum \limits _{\Vert {\varvec{\alpha }}\Vert _\infty =1} w_{{\varvec{\alpha }}}^{\frac{1}{2}}\mathcal {D}^{\star }(\Xi _{\varvec{\alpha }})\Bigg ). \end{aligned}$$
(37)

Herein, we denote by \(\mathcal {D}^{\star }(\Xi _{\varvec{\alpha }})\) the star-discrepancy of the set of Halton points on \([0,1]^M\) projected onto the dimensions where \(\alpha _k=1\). Additionally, the factor \(2^{|{\varvec{\alpha }}|}\) appears due to the transport of \(\hat{u}\) to the unit cube \([0,1]^M\). It is shown in [34] that the second factor in (37) is bounded by

$$\begin{aligned} \left\{ \sum _{\Vert {\varvec{\alpha }}\Vert _\infty =1} w_{{\varvec{\alpha }}}^{\frac{1}{2}}\mathcal {D}^{\star }(\Xi _{\varvec{\alpha }})\right\} \le C(\delta ) N^{-1+\delta } \end{aligned}$$

with a constant \(C(\delta )\) which is independent of M if the weights \(w_{{\varvec{\alpha }}}\) are product weights, i.e. \(w_{{\varvec{\alpha }}}=\prod _{k=1}^M w_k^{\alpha _k}\), and satisfy

$$\begin{aligned} \sum _{k=1}^{\infty }w_k^{\frac{1}{2}}k\log k<\infty . \end{aligned}$$
(38)

In order to bound the first factor in (37), we employ the estimate

$$\begin{aligned} \big \Vert \partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\big \Vert _{H^1(D_{\mathrm{ref}})} \le C|{\varvec{\alpha }}|! c^{|{\varvec{\alpha }}|}{\varvec{\gamma }}^{\varvec{\alpha }}\le C\prod _{k=1}^{M} k c \gamma _k \end{aligned}$$

from Theorem 5 and choose the weights accordingly as \(w_k^{1/2}=2c k \gamma _k\). Then, the condition (38) can be rewritten as

$$\begin{aligned} \sum _{k=1}^{\infty }2c\gamma _k k^2\log k<\infty , \end{aligned}$$

which is satisfied if \(\gamma _k\lesssim k^{-3-\varepsilon }\). \(\Box \)
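Although not part of the proof, the condition (38) with the above product weights, i.e. \(\sum _k 2c\gamma _k k^2\log k<\infty \), is easy to inspect numerically: for \(\gamma _k=k^{-3-\varepsilon }\) with \(\varepsilon >0\) the partial sums stabilize, whereas for \(\varepsilon =0\) they keep growing. A small sketch:

```python
# Numerical illustration of condition (38) with w_k^(1/2) = 2*c*k*gamma_k:
# gamma_k ~ k^(-3-eps) with eps > 0 yields a convergent series, eps = 0 not.
from math import log

def partial_sum(gamma, K, c=1.0):
    return sum(2.0 * c * gamma(k) * k ** 2 * log(k) for k in range(2, K + 1))

for eps in (0.5, 0.1, 0.0):            # eps = 0 violates the condition
    gamma = lambda k, e=eps: k ** (-3.0 - e)
    print(f"eps={eps}:",
          [round(partial_sum(gamma, 10 ** j), 3) for j in (2, 3, 4, 5)])
```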

All computations have been carried out on a computing server consisting of four nodes with up to 64 threads.

7.1 The stochastic interface problem

We consider the stochastic interface problem from [14] where the hold-all is given as \(\mathcal {D}=[-1,1]^2\) and the reference interface is given as \(\Gamma _{\mathrm{ref}}=\{\mathbf{x}\in \mathcal {D}:\Vert \mathbf{x}\Vert _2=0.7\}\). Thus, the outward normal is \(\mathbf{n}(\mathbf{x})=[\cos (\theta ),\sin (\theta )]^\intercal \) where \(\mathbf{x}=r[\cos (\theta ),\sin (\theta )]^\intercal \) is the representation of \(\mathbf{x}\) in polar coordinates. The random field under consideration reads

$$\begin{aligned} \kappa (\theta ,\omega )=\frac{1}{80}\sum _{k=0}^5\big (\cos (k\theta )X_{2k}(\omega )+\sin (k\theta )X_{2k+1}(\omega )\big ). \end{aligned}$$
(39)

Here, \(X_0,\ldots ,X_{11}\) are independent, uniformly distributed random variables with variance 1, i.e. their range is \([-\sqrt{3},\sqrt{3}]\). The diffusion coefficient is given as \(\alpha ^-(\mathbf{x})\equiv 2\) in the interior part of the domain and as \(\alpha ^+(\mathbf{x})\equiv 1\) in the remaining part of the domain. The right hand side is chosen as \(f(\mathbf{x})\equiv 1\).
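A realization of (39) is straightforward to draw; the following sketch samples the twelve random variables and evaluates \(\kappa \) on a grid of angles (the seed is arbitrary):

```python
# Drawing one realization of the random field (39): X_0,...,X_11 are i.i.d.,
# uniform on [-sqrt(3), sqrt(3)], hence centered with unit variance.
import numpy as np

rng = np.random.default_rng(0)                      # arbitrary seed
X = rng.uniform(-np.sqrt(3.0), np.sqrt(3.0), size=12)

theta = np.linspace(0.0, 2.0 * np.pi, 200)
kappa = sum(np.cos(k * theta) * X[2 * k] + np.sin(k * theta) * X[2 * k + 1]
            for k in range(6)) / 80.0
print(kappa.min(), kappa.max())                     # perturbation amplitude
```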

In this example, only the perturbation at the random interface is known. Thus, the solution of the associated diffusion problem depends on the particular extension of the vector field, and it is reasonable to consider a quantity of interest (QoI) that does not depend on this extension. Specifically, the QoI is given by the solution on a non-varying part of the domain, namely on \(\{\Vert \mathbf{x}\Vert _2\le 0.4\}\). We therefore extend the random field (39) onto \(\mathcal {D}\) as described in Sect. 6.2 by using the quadratic B-spline blending function \(B(t) = \frac{4}{3} B_2(5t)\), evaluated at \(t=\Vert \mathbf{x}-P\mathbf{x}\Vert _2\). Hence, the random perturbation is localized in the annulus \(\{0.4<\Vert \mathbf{x}\Vert _2<1\}\) and we end up with the covariance

$$\begin{aligned} \mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})=B(\mathbf{x})B(\mathbf{y})\mathrm{Cov}_\kappa (\theta _\mathbf{x},\theta _\mathbf{y}) \begin{bmatrix} \cos (\theta _\mathbf{x})\cos (\theta _\mathbf{y})&\cos (\theta _\mathbf{x})\sin (\theta _\mathbf{y})\\ \sin (\theta _\mathbf{x})\cos (\theta _\mathbf{y})&\sin (\theta _\mathbf{x})\sin (\theta _\mathbf{y}) \end{bmatrix} \end{aligned}$$

with

$$\begin{aligned} \mathrm{Cov}_\kappa (\theta _\mathbf{x},\theta _\mathbf{y})=\frac{1}{6400}\sum _{k=0}^5\big (\cos (k\theta _\mathbf{x})\cos (k\theta _\mathbf{y}) +\sin (k\theta _\mathbf{x})\sin (k\theta _\mathbf{y})\big ). \end{aligned}$$

Furthermore, we set \(\mathbb {E}[\mathbf{V}](\mathbf{x})\mathrel {\mathrel {\mathop :}=}\mathbf{x}\). A visualization of the reference interface with a particular displacement field \(\mathbf{V}(\mathbf{x},\mathbf{y}_i)-\mathbf{x}\) and the resulting perturbed interface is found in Fig. 4.

Fig. 3: Mean (left) and variance (right) of the QoI of the stochastic interface problem

Fig. 4: Realization of the displacement \(\mathbf{V}(\mathbf{x},\mathbf{y}_i)-\mathbf{x}\) (left) and the related mapped interface (right)

A visualization of the QoI’s mean and variance computed by \(N=10^6\) quasi-Monte Carlo samples and 1048576 finite elements (level 8) is shown in Fig. 3. This approximation serves as a reference in order to examine the convergence behavior of the quasi-Monte Carlo method. According to Lemma 7, we expect a rate of convergence of \(N^{\delta -1}\) for any \(\delta >0\). In our experiments, we thus apply \(N_\ell =2^{\ell /(1-\delta )}\) Halton points on the finite element level \(\ell =1,\ldots ,7\) for the choices \(\delta =0.5,0.4,0.3,0.2\). Although all choices of \(\delta > 0\) would asymptotically result in an almost linear rate of convergence, the constant in the error estimate still depends on the particular choice.
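For concreteness, the resulting numbers of Halton points per level are easily tabulated (a small sketch):

```python
# Number of Halton points per finite element level, N_l = 2^(l/(1-delta)),
# for the four choices of delta used in the experiments (rounded).
for delta in (0.5, 0.4, 0.3, 0.2):
    print(delta, [round(2.0 ** (l / (1.0 - delta))) for l in range(1, 8)])
```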

Fig. 5: Error in the mean measured in \(H^1\) (left) and in the variance measured in \(W^{1,1}\) (right)

Figure 5 depicts the error of the QoI’s mean measured in the \(H^1\)-norm (left) and the error of the QoI’s variance measured in the \(W^{1,1}\)-norm (right), each versus the related cost, which is given by the number \(N_\ell \) of samples times the number of degrees of freedom of the finite element approximation on level \(\ell \). As can be seen, the approximation of the QoI’s mean exhibits similar errors for all choices of \(\delta \). This suggests that the finite element error limits the overall approximation error; the choice \(\delta =0.2\) is already sufficient here and results in the lowest cost. For the QoI’s variance, we observe successively smaller errors for increasing \(\delta \). Since at least the error for the QoI’s mean seems to be dominated by the finite element discretization, we found it instructive to present also the respective errors measured in the \(L^2\)-norm; they are plotted in Fig. 6. Here, the smallest error is obtained for \(\delta =0.5\). Nevertheless, the best error versus cost rate is provided by \(\delta =0.2\). The situation changes for the variance: the error again becomes successively smaller for increasing values of \(\delta \), resulting in the lowest error for \(\delta =0.5\).

As a comparison, and in order to validate the reference, we have also computed the approximate mean and variance on each level by the Monte Carlo method. Here, in order to maintain the linear approximation rate of the finite element method in the energy norm, we approximate the root mean square error by five realizations, each of which is computed with \(N_\ell =2^{2\ell }\) samples.

Fig. 6: Error in the mean (left) and in the variance (right) measured in \(L^2\)

7.2 The Poisson equation on a random domain

For our second example, we consider an infinite dimensional random field described by its mean \(\mathbb {E}[\mathbf{V}](\mathbf{x})=\mathbf{x}\) and its covariance function

$$\begin{aligned} \mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})=\frac{1}{100} \begin{bmatrix} 5\exp (-4\Vert \mathbf{x}-\mathbf{y}\Vert _2^2)&\exp (-0.1\Vert 2\mathbf{x}-\mathbf{y}\Vert _2^2)\\ \exp (-0.1\Vert \mathbf{x}-2\mathbf{y}\Vert _2^2)&5 \exp (-\Vert \mathbf{x}-\mathbf{y}\Vert _2^2) \end{bmatrix}. \end{aligned}$$

Furthermore, we consider the random variables in the Karhunen-Loève expansion to be uniformly distributed. The unit disc \(D_{\mathrm{ref}}=\{\mathbf{x}\in \mathbb {R}^2:\Vert \mathbf{x}\Vert _2<1\}\) serves as reference domain and the load is set to \(f(\mathbf{x})\equiv 1\). Figure 8 shows the reference domain with a particular displacement field and the resulting perturbed domain. In this example, the covariance between any two points in \(D_{\mathrm{ref}}\) is actually known and can thus be incorporated into our model. In particular, there is no point inside the reference domain that is kept fixed by the random vector field. Therefore, we consider here the entire solution \(\hat{u}\) as QoI and approximate its mean and its variance.
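For reference, the following is a minimal sketch of the pivoted Cholesky decomposition, cf. [15, 17], which we use to approximate the Karhunen-Loève expansion; the point cloud and the tolerance are illustrative, and the kernel is the (1,1)-component of the covariance above.

```python
# Sketch of the pivoted Cholesky decomposition: a low-rank factor L with
# A ~ L @ L.T is built until the trace error drops below tol. Applied to
# the (1,1)-component of Cov[V] on an illustrative point cloud.
import numpy as np

def pivoted_cholesky(A, tol):
    n = A.shape[0]
    d = np.diag(A).astype(float).copy()
    L = np.zeros((0, n))
    while d.sum() > tol and L.shape[0] < n:
        i = int(np.argmax(d))            # pivot: largest remaining diagonal
        if d[i] <= 0.0:
            break
        row = (A[i, :] - L[:, i] @ L) / np.sqrt(d[i])
        L = np.vstack([L, row])
        d = np.maximum(d - row ** 2, 0.0)
    return L.T                           # A ~ L @ L.T

rng = np.random.default_rng(1)
pts = rng.uniform(-1.0, 1.0, size=(500, 2))          # sample points
sq = ((pts[:, None, :] - pts[None, :, :]) ** 2).sum(axis=-1)
A = 0.05 * np.exp(-4.0 * sq)                          # (1,1)-entry of Cov[V]
L = pivoted_cholesky(A, tol=1e-6)
print("rank:", L.shape[1], " trace error:", float(np.trace(A - L @ L.T)))
```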

Fig. 7: Mean (left) and variance (right) of the solution \(\hat{u}\) to the Poisson equation on the randomly varying disc

Fig. 8: Realization of the displacement \(\mathbf{V}(\mathbf{x},\mathbf{y}_i)-\mathbf{x}\) (left) and the related mapped domain (right)

Fig. 9: Error in the mean measured in \(H^1\) (left) and in the variance measured in \(W^{1,1}\) (right)

In Fig. 7, a visualization of the mean and the variance computed by \(N=10^6\) quasi-Monte Carlo samples and 1048576 finite elements (level 9) is found. Here, the Karhunen-Loève expansion has been truncated after \(M=303\) terms, which yields a truncation error, cf. (14), smaller than \(10^{-6}\). For the convergence study, however, we have coupled the truncation error of the Karhunen-Loève expansion to the spatial discretization error of order \(2^{-\ell }\) on the finite element level \(\ell \). It is observed that the truncation rank M grows linearly in the level \(\ell \); namely, it holds \(M=10,23,37,49,64,79,91,108\) for \(\ell =1,2,3,4,5,6,7,8\).

The number of samples of the quadrature methods under consideration has been chosen depending on the finite element level \(\ell \) as in the previous example. Figure 9 shows the error of the solution’s mean and variance measured in the \(H^1\)-norm and the \(W^{1,1}\)-norm, respectively, each versus the cost. Except for \(\delta =0.2\), we observe comparable errors for the approximation of the mean for the quasi-Monte Carlo quadrature as well as for the Monte Carlo quadrature. In view of the cost, \(\delta =0.2,0.3\) perform best here. In case of the variance, we obtain again successively smaller errors for increasing values of \(\delta \). Again, we have also provided the respective errors with respect to the \(L^2\)-norm; the related plots are found in Fig. 10. Here, for the mean and the variance, \(\delta =0.5\) provides asymptotically the lowest error.

Fig. 10: Error in the mean (left) and in the variance (right) measured in \(L^2\)

8 Conclusion

In this article, we have provided regularity results for the domain mapping method for elliptic boundary value problems on random domains. Based on the decay of the random vector field’s Karhunen-Loève expansion, we have derived related decay rates for the solution’s derivatives. In particular, the presented framework is directly applicable to stochastic interface problems. The regularity results provide dimension independent convergence of the quasi-Monte Carlo quadrature and also allow for the use of (anisotropic) quadrature methods to approximate quantities of interest that involve an integration of the solution with respect to the random parameter. The numerical examples corroborate the theoretical results and demonstrate the flexibility of the approach.