Abstract
In this article, we provide a rigorous analysis of the solution to elliptic diffusion problems on random domains. In particular, based on the decay of the Karhunen-Loève expansion of the domain perturbation field, we establish decay rates for the derivatives of the random solution that are independent of the stochastic dimension. For the implementation of a related approximation scheme, like quasi-Monte Carlo quadrature, stochastic collocation, etc., we propose parametric finite elements to compute the solution of the diffusion problem on each individual realization of the domain generated by the perturbation field. This simplifies the implementation and yields a non-intrusive approach. Having this machinery at hand, we can easily transfer it to stochastic interface problems. The theoretical findings are complemented by numerical examples for both, stochastic interface problems and boundary value problems on random domains.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Many problems in science and engineering lead to boundary value problems for an unknown function. In general, the numerical simulation is well understood provided that the input parameters are given exactly. Often, however, the input parameters are not known exactly. Especially, the treatment of uncertainties in the computational domain has become of growing interest, see e.g. [5, 18, 33, 36]. In this article, we consider the elliptic diffusion equation
as a model problem where the underlying domain \(D\subset \mathbb {R}^d\) or respectively its boundary \(\partial D\) are random. For example, one might think of tolerances in the shape of products fabricated by line production or shapes which stem from inverse problems, like e.g. tomography. Besides the fictitious domain approach considered in [5], one might essentially distinguish two approaches: the perturbation method and the domain mapping method.
The perturbation method starts with a prescribed perturbation field
at the boundary \(\partial D_{\mathrm{ref}}\) of a reference configuration and uses a shape Taylor expansion with respect to this perturbation field to represent the solution to (1), cf. [14, 18]. Whereas, the domain mapping method requires that the perturbation field is also known in the interior of the domain \(D_{\mathrm{ref}}\), i.e.
Then, the problem may be transformed to the fixed reference domain \(D_{\mathrm{ref}}\). This yields a partial differential equation with correlated random diffusion matrix and right hand side, cf. [6, 26, 33, 36].
The major drawback of the perturbation method is that it is only feasible for relatively small perturbations. Thus, in order to treat larger perturbations, the domain mapping method is the method of choice. Nevertheless, it might in practice be much easier to obtain measurements from the outside of a workpiece to estimate the perturbation field \(\mathbf{V}(\omega )\) rather than from its interior. If no information of the vector field inside the domain is available, it has to be extended appropriately, e.g. by the Laplacian, as proposed in [26, 36].
The perturbation method relies on a description in spatial or Eulerian coordinates. To that end, a compactum inside the domain is fixed and the domain deformation is considered relative to this compactum. The compactum has to be chosen in such a way that it is not intersected by the realizations of the domain’s boundary, cf. [18]. This particularly limits the magnitude of the boundary variation. The domain mapping method is based on a description in material or Lagrangian coordinates. Here, starting from the reference configuration \(D_{\mathrm{ref}}\), the trajectory of each particular point is tracked. In the domain mapping method, the notions of Eulerian and Lagrangian coordinates coincide on compacta, where the deformation is zero. Thus, in this sense, the domain mapping method provides the more general framework. The correspondence between the perturbation method and the domain mapping method can be expressed in terms of the local shape derivative \(\delta u[\mathbf{V}(\omega )]\) and the material derivative \(\dot{u}[\mathbf{V}(\omega )]\) of a given function u which differ by a transport term, cf. [32]:
In this article, we focus on the domain mapping method. In [6], it is shown for a specific class of variation fields that the solution to (1) provides analytic regularity with respect to the random parameter. We will generalize the result from [6] to arbitrary domain perturbation fields which are described by their mean \( \mathbb {E}[\mathbf{V}]:\!\! D_{\mathrm{ref}}\!\rightarrow \!\mathbb {R}^d,~\mathbb {E}[\mathbf{V}](\mathbf{x})\!=\!\big [\mathbb {E}[{v_1}](\mathbf{x}),\ldots ,\mathbb {E}[{v_d}](\mathbf{x})\big ]^\intercal \) and their (matrix-valued) covariance function
Note that the covariance function describes the covariance between any pair \((\mathbf{x},\mathbf{x}')\) of points in \(D_{\mathrm{ref}}\) and facilitates thus a modeling in terms of Lagrangian coordinates. Taking the Karhunen-Loève expansion of \(\mathbf{V}(\omega )\) as the starting point, we show decay rates for the derivatives of the solution to (1) with respect to the random parameter. Given that the Karhunen-Loève expansion decays fast enough, our results imply the dimension independent convergence of the quasi-Monte Carlo method based on the Halton sequence, cf. [13, 16, 34]. Moreover, our results are convenient for the convergence theory of the anisotropic sparse collocation, cf. [28], and best N-term approximations, cf. [8]. Although the presented results allow for a broad variety of methods for the stochastic approximation, we employ the quasi-Monte Carlo method in our numerical examples for the sake of simplicity.
For the spatial approximation, we propose to use parametric finite elements. Then, we are able to approximate the mean and the variance of the solution to (1) or a related quantity of interest by computing each sample on the particular realization \(D(\omega _i)=\mathbf{V}(D_{\mathrm{ref}},\omega _i)\) of the random domain rather than on the reference domain \(D_{\mathrm{ref}}\). This yields a non-intrusive approach to solve the problem at hand. In fact, any available finite element solver can be employed to compute the particular samples. Following this approach rather than mapping the diffusion problem always to the reference domain, we can easily treat also stochastic interface problems, cf. [14].
The rest of this article is organized as follows. In Sect. 2, we introduce some basic definitions and notation. Section 3 is dedicated to the Karhunen-Loève expansion of vector fields. Although this is a straightforward adaption of the state of the art literature [29], we think that it is sensible to explicitly introduce the related spaces, norms and operators. In Sect. 4, we present the essential contribution of this article: the regularity of the solution to the model problem defined in Sect. 2 with respect to the Karhunen-Loève expansion of the perturbation field. Section 5 introduces parametric finite elements which are the basic ingredient for the numerical realization of our approach. In Sect. 6, we extend our approach to stochastic interface problems. Finally, Sect. 7 provides numerical examples to validate and quantify the theoretical findings.
In the following, in order to avoid the repeated use of generic but unspecified constants, by \(C\lesssim D\) we mean that C can be bounded by a multiple of D, independently of parameters which C and D may depend on. Obviously, \(C\gtrsim D\) is defined as \(D\lesssim C\) and we write \(C\eqsim D\) if \(C\lesssim D\) and \(C\gtrsim D\).
2 Problem formulation
Let \(D_{\mathrm{ref}}\subset \mathbb {R}^d\) for \(d\in \mathbb {N}\) (of special interest are the cases \(d=2,3\)) denote a domain with Lipschitz continuous boundary \(\partial D_{\mathrm{ref}}\) and let \((\Omega ,\mathcal {F},\mathbb {P})\) be a complete probability space with \(\sigma \)-field \(\mathcal {F}\subset 2^\Omega \) and probability measure \(\mathbb {P}\). In order to guarantee that \(L^2_\mathbb {P}(\Omega )\) exhibits an orthonormal basis, we further assume that \(\Omega \) is a separable set. Let \(\mathbf{V}:\overline{D_{\mathrm{ref}}}\times \Omega \rightarrow \mathbb {R}^d\) be an invertible vector field of class \(C^2\), i.e. V is twice continuously differentiable with respect to \(\mathbf{x}\) for almost every \(\omega \in \Omega \). Moreover, we impose the uniformity condition
for some \(C\in (0,\infty )\) and almost every \(\omega \in \Omega \).Footnote 1 Thus, \(\mathbf{V}\) defines a family of domains
For the subsequent analysis, we restrict ourselves to the case of the Poisson equation, i.e. \(\alpha \equiv 1\),
This considerably simplifies the analysis and the extension to non-constant diffusion coefficients is straightforward, cf. Remark 2. In order to guarantee solvability for almost every \(\omega \in \Omega \), we consider the right hand side to be defined on the hold-all domain
From the uniformity condition, we infer for almost every \(\omega \in \Omega \) and every \(\mathbf{x}\in D\) that the singular-values of the vector field \(\mathbf{V}\)’s Jacobian \(\mathbf{J}(\omega ,\mathbf{x})\) satisfy
In particular, we assume without loss of generality that \(\underline{\sigma }\le 1\) and \(\overline{\sigma }\ge 1\).
2.1 Reformulation on the reference domain
In the sequel, we consider the spaces \(H^1_0\big (D(\omega )\big )\) and \(H^1_0(D_{\mathrm{ref}})\) to be equipped with the norms \( \Vert \cdot \Vert _{H^1(D(\omega ))}\mathrel {\mathrel {\mathop :}=}\Vert \nabla \cdot \Vert _{L^2(D(\omega );\mathbb {R}^d)} \text { and } \Vert \cdot \Vert _{H^1(D_{\mathrm{ref}})}\mathrel {\mathrel {\mathop :}=}\Vert \nabla \cdot \Vert _{L^2(D_{\mathrm{ref}};\mathbb {R}^d)}, \) respectively. Furthermore, we assume that the related dual spaces \(H^{-1}\big (D(\omega )\big )\) and \(H^{-1}(D_{\mathrm{ref}})\) are defined with respect to these norms. The main tool we use in the convergence analysis for the model problem (2) is the one-to-one correspondence between the problem which is pulled back to the reference domain \(D_{\mathrm{ref}}\) and the problem on the actual realization \(D(\omega )\). The equivalence between those two problems is described by the vector field \(\mathbf{V}(\mathbf{x},\omega )\). For an arbitrary function v on \(D(\omega )\), we denote the transported function by \(\hat{v}(\mathbf{x},\omega )\mathrel {\mathrel {\mathop :}=}(v\circ \mathbf{V})(\mathbf{x},\omega )\). According to the chain rule, we have for \(v\in C^1\big (D(\omega )\big )\) that
For given \(\omega \in \Omega \), the variational formulation for the model problem (2) reads as follows: Find \(u({\omega })\in H^1_0\big (D(\omega )\big )\) such that
Thus, with
and
we obtain the following variational formulation with respect to the reference domain: Find \(\hat{u}({\omega }) \in H^1_0(D_{\mathrm{ref}})\) such that
Here and afterwards, \(\langle \cdot ,\cdot \rangle \) denotes the canonical inner product for \(\mathbb {R}^d\).
Remark 1
Since \(\mathbf{V}\) is assumed to be a \(C^2\)-diffeomorphism, we have for almost every \(\omega \in \Omega \) that
Herein, \(\mathbf{I}\in \mathbb {R}^{d\times d}\) denotes the identity matrix. Especially, we infer \(\det \mathbf{J}^{-1},\det \mathbf{J}\ne 0\). The continuity of \(\mathbf{J},\mathbf{J}^{-1}\) and of the determinant function imply now that either \(\det \mathbf{J}^{-1},\det \mathbf{J}>0\) or \(\det \mathbf{J}^{-1},\det \mathbf{J}<0\) for all \(\mathbf{x}\). Therefore, without loss of generality, we will assume the positiveness of the determinants.
Notice that Eq. (9) contains for fixed \(v\in H^1_0\big (D(\omega )\big )\) the related transported test function \(\hat{v}(\omega )\).
The connection between the spaces \(H^1_0(D_{\mathrm{ref}})\) and \(H^1_0\big (D(\omega )\big )\) is given by the following
Lemma 1
The spaces \(H^1_0(D_{\mathrm{ref}})\) and \(H^1_0\big (D(\omega )\big )\) are isomorphic by the isomorphism
The inverse mapping is given by
Proof
The proof of this lemma is a consequence of the chain rule (5) and the ellipticity Assumption (4). \(\Box \)
This lemma implies that the space of test functions is not dependent on \(\omega \in \Omega \) at all: Obviously, we have \(H^1_0\big (D(\omega )\big ) =\left\{ \mathcal {E}(v):v\in H^1_0(D_{\mathrm{ref}})\right\} \). Thus, for an arbitrary function \(\mathcal {E}(v)\in H^1_0 \big (D(\omega )\big )\) it holds \(\widehat{\mathcal {E}(v)} = \mathcal {E}(v)\circ \mathbf{V}=v\circ \mathbf{V}^{-1}\circ \mathbf{V} = v\in H^1_0(D_{\mathrm{ref}})\) independent of \(\omega \in \Omega \). In particular, the solutions u to (6) and \(\hat{u}\) to (9) satisfy
3 Karhunen-Loève expansion
In order to make the random vector field \(\mathbf{V}(\mathbf{x},\omega )\) feasible for computations, we consider here its Karhunen-Loève expansion, cf. [25]. This section shall give a brief overview of the relevant facts concerning the Karhunen-Loève expansion of vector valued random fields. Especially, we introduce here the related function spaces which are used in the rest of this article. For further details on the Karhunen-Loève expansion in general and also on computational aspects, we refer to [10, 11, 17, 29].
Let \(D\subset \mathbb {R}^d\) always denote a domain. Then, we define \(L^2(D;\mathbb {R}^d)\) to be the Hilbert space which consists of all equivalence classes of square integrable functions \(\mathbf{v}:D\rightarrow \mathbb {R}^d\) equipped with the inner product
We assume that the vector field \(\mathbf{V}\) satisfies
Here and in the sequel, given a Banach space B and \(1\le p\le \infty \), the Lebesgue-Bochner space \(L_{\mathbb {P}}^p(\Omega ;B)\) consists of all equivalence classes of strongly measurable functions \(v:\Omega \rightarrow B\) whose norm
is finite. If \(B=H\) is a separable Hilbert space and \(p=2\), then the Lebesgue-Bochner space is isomorphic to the tensor product space \(L_{\mathbb {P}}^2(\Omega )\otimes H\) equipped with the inner product
The mean of \(\mathbf{V}\) is given by \( \mathbb {E}[\mathbf{V}](\mathbf{x}) = \big [\mathbb {E}[{v}_1](\mathbf{x}),\ldots , \mathbb {E}[{v}_d](\mathbf{x})\big ]^\intercal \) with
From the theory of Bochner integrals, see e.g. [24], it follows that \(\mathbb {E}[{v}_i](\mathbf{x})\in L^2(D)\) and thus \(\mathbb {E}[\mathbf{V}](\mathbf{x})\in L^2(D;\mathbb {R}^d)\). Furthermore, the (matrix-valued) covariance function of \(\mathbf{V}\) is given by \( \mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})=[\mathrm{Cov}_{i,j}(\mathbf{x},\mathbf{y})]_{i,j=1}^d \) with
We have \(\mathrm{Cov}_{i,j}(\mathbf{x},\mathbf{y})\in L^2(D\times D)\) which also follows from the properties of the Bochner integral and the application of the Cauchy-Schwarz inequality. We therefore conclude \(\mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})\in L^2(D\times D;\mathbb {R}^{d\times d})\) where we equip the space \(\mathbb {R}^{d\times d}\) with the inner product
This particularly induces the inner product on \(L^2(D\times D;\mathbb {R}^{d\times d})\) given by
Now, we shall introduce the operator
and its adjoint
Then, there holds the following
Lemma 2
The operators \(\mathcal {S}\) and \(\mathcal {S}^\star \) given by (11) and (12), respectively, are bounded with Hilbert-Schmidt norms \(\Vert \mathcal {S}\Vert _{\mathrm{HS}}=\Vert \mathcal {S}^\star \Vert _{\mathrm{HS}} =\Vert \mathbf{V}-\mathbb {E}[\mathbf{V}]\Vert _{L^2_{\mathbb {P}}(\Omega ;L^2(D;\mathbb {R}^d))}\). Moreover, the covariance operator
is a non-negative, symmetric, trace class operator with trace \(\Vert \mathbf{V}-\mathbb {E}[\mathbf{V}]\Vert _{L^2_{\mathbb {P}}(\Omega ;L^2(D;\mathbb {R}^d))}^2\).
Proof
The statement on the norms of \(\mathcal {S}\) and \(\mathcal {S}^\star \) follows by the application of Parseval’s identity, see the last part of the proof. Moreover, we have for all \(\mathbf{u}\in L^2(D;\mathbb {R}^d)\) that
In particular, \(\mathcal {C}\) is non-negative and symmetric according to
Finally, to show that \(\mathcal {C}\) is of trace class, let \(\{{\varvec{\varphi }}_k\}_k\) be an arbitrary orthonormal basis in \(L^2(D;\mathbb {R}^d)\). We thus have
where we employed Parseval’s identity in the second last step. \(\Box \)
Trace class operators are especially compact, see e.g. [20, 30], and exhibit hence a spectral decomposition.
Theorem 1
Let \(\mathcal {C}:L^2(D;\mathbb {R}^d)\rightarrow L^2(D;\mathbb {R}^d)\) be the covariance operator related to \(\mathbf{V}(\mathbf{x},\omega )\in L^2_\mathbb {P}\big (\Omega ;L^2(D;\mathbb {R}^d)\big )\). Then, there exists an orthonormal set \(\{{\varvec{\varphi }}_k\}_k\) and a sequence \(\lambda _1\ge \lambda _2\ge \cdots \ge 0\) such that \(\mathcal {C}{\varvec{\varphi }}_k=\lambda _k{\varvec{\varphi }}_k\) for all \(k=1,2,\ldots .\) Furthermore, it holds
Proof
For a proof of this theorem, we refer to [2]. \(\Box \)
We have now all prerequisites at hand to define the Karhunen-Loève expansion of the vector field \(\mathbf{V}(\mathbf{x},\omega )\in L^2_{\mathbb {P}}\big (\Omega ;L^2(D;\mathbb {R}^d)\big )\).
Definition 1
Let \(\mathbf{V}(\mathbf{x},\omega )\) be a vector field in \(L^2_{\mathbb {P}}\big (\Omega ;L^2(D;\mathbb {R}^d)\big )\). The expansion
with \(\sigma _k=\sqrt{\lambda _k}\) and \(X_k=\mathcal {S}^\star {\varvec{\varphi }}_k/\sigma _k\), where \(\{(\lambda _k,{\varvec{\varphi }}_k)\}_k\) is the sequence of eigenpairs of the underlying covariance operator \(\mathcal {C}=\mathcal {S}\mathcal {S}^\star \), is called Karhunen-Loève expansion of \(\mathbf{V}(\mathbf{x},\omega )\).
The space \(L^2(D;\mathbb {R}^d)\) served as pivot space for our considerations in the preceding derivation of the Karhunen-Loève expansion. In order to control the error of truncating the expansion after \(M\in \mathbb {N}\) terms, i.e.
one has to study the decay of the singular values \(\sigma _k\) in the representation (13). The particular rate of decay is known to depend on the spatial regularity of \(\mathbf{V}(\mathbf{x},{\omega })\). To that end, we consider the Sobolev space \(H^p(D;\mathbb {R}^d)\) for \(p>0\). The related inner product is given by
for \(p\in \mathbb {N}\) and
for \(p=\lfloor p\rfloor +s\) with \(s\in (0,1)\). Its dual space with respect to the \(L^2\)-duality pairing ist denoted as \(\tilde{H}^{-p}(D;\mathbb {R}^d)\).
For given \(\mathbf{V}(\mathbf{x},{\omega }) \in L^2_\mathbb {P}\big (\Omega ;H^p(D;\mathbb {R}^d)\big )\), it obviously holds
cf. [11]. Therefore, the following theorem is a straightforward modification of [11, Theorem 3.3] for the vector valued case.
Theorem 2
Let \(\mathbf{V}(\mathbf{x},{\omega }) \in L^2_\mathbb {P}\big (\Omega ;H^p(D;\mathbb {R}^d)\big )\). Then, the eigenvalues of the covariance operator \(\mathcal {C}:\tilde{H}^{-p}(D;\mathbb {R}^d)\rightarrow H^p(D;\mathbb {R}^d)\) decay like \(\lambda _k\lesssim (k/d)^{-2p/d}\) as \(k\rightarrow \infty \).
We may summarize the results of this section as follows. If the mean \(\mathbb {E}[\mathbf{V}](\mathbf{x})\) and the covariance function \(\mathrm{Cov}[\mathbf{V}](\mathbf{x},\mathbf{y})\) as well as the distribution of \(\mathbf{V}(\mathbf{x},{\omega })\) are known or appropriately estimated, cf. [29], we are able to reconstruct the vector field \(\mathbf{V}(\mathbf{x},{\omega })\) from its Karhunen-Loève expansion. In the following, in order to make the Karhunen-Loève expansion feasible for computations, we make some common assumptions.
Assumption 1
-
(1)
The random variables \(\{X_k\}_k\) are centered and take values in \([-1,1]\), i.e. \(X_k(\omega )\in [-1,1]\) for all k and almost every \(\omega \in \Omega \).
-
(2)
The random variables \(\{X_k\}_k\) are independent and identically distributed.
-
(3)
The sequence
$$\begin{aligned} \{\gamma _k\}_k \mathrel {\mathrel {\mathop :}=}\big \{\Vert \sigma _k{\varvec{\varphi }}_k\Vert _{ W^{1,\infty }(D;\mathbb {R}^{d})}\big \}_k \end{aligned}$$(15)is at least in \(\ell ^1(\mathbb {N})\). We denote its \(\ell ^1\)-norm by \(c_{\varvec{\gamma }}\mathrel {\mathrel {\mathop :}=}\sum _{k=1}^\infty \gamma _k\).
Here and hereafter, we shall equip the space \(W^{1,\infty }(D;\mathbb {R}^d)\) with the equivalent norm \( \Vert \mathbf{v}\Vert _{W^{1,\infty }(D;\mathbb {R}^d)}=\max \big \{\Vert \mathbf{v}\Vert _{L^\infty (D;\mathbb {R}^d)}, \Vert \mathbf{v}'\Vert _{L^\infty (D;\mathbb {R}^{d\times d})}\big \}, \) where \(\mathbf{v}'\) denotes the Jacobian of \(\mathbf{v}\) and \( \Vert \mathbf{v}'\Vert _{L^\infty (D;\mathbb {R}^{d\times d})}\mathrel {\mathrel {\mathop :}=}\mathrm{ess\,sup}_{\mathbf{x}\in D}\Vert \mathbf{v}'(\mathbf{x})\Vert _2. \) Herein, \(\Vert \cdot \Vert _2\) is the usual 2-norm of matrices, i.e. the largest singular value.
4 Regularity of the solution
In this section, we assume that the vector field \(\mathbf{V}(\mathbf{x},\mathbf{y})\) is given by a finite rank Karhunen-Loève expansion, i.e.
otherwise it has to be truncated appropriately. Nevertheless, we provide in this section estimates which are independent of \(M\in \mathbb {N}\). Thus, we explicitly allow M to become arbitrarily large.
For the rest of this article, we will refer to the randomness only via the coordinates \(\mathbf{y}\in \Box \mathrel {\mathrel {\mathop :}=}[-1,1]^M\), where \(\mathbf{y}=[y_1,\ldots ,y_M]\). Notice that due to the independence of the random variables, the related push-forward measure \(\mathbb {P}_\mathbf{X}\mathrel {\mathrel {\mathop :}=}\mathbb {P}\circ \mathbf{X}^{-1}\) where \(\mathbf{X}(\omega )\mathrel {\mathrel {\mathop :}=}[X_1(\omega ),\ldots X_M(\omega )]\) is of product structure. Furthermore, we always think of the spaces \(L^p(\Box )\) for \(p\in [1,\infty ]\) to be equipped with the measure \(\mathbb {P}_\mathbf{X}\). Moreover, we set \({\varvec{\gamma }}=[\gamma _k]_{k=1}^M\), cf. (15).
Without loss of generality, we may assume that \(\mathbb {E}[\mathbf{V}](\mathbf{x})=\mathbf{x}\) is the identity mapping. Otherwise, we replace \(D_{\mathrm{ref}}\) by
Therefore, we obtain
In the subsequent regularity results, we shall refer to the following Lebesgue-Bochner spaces. We define the space \(L^\infty \big (\Box ;L^\infty (D_{\mathrm{ref}};\mathbb {R}^d)\big )\) as the set of all equivalence classes of strongly measurable functions \(\mathbf{V}:\Box \rightarrow L^\infty (D_{\mathrm{ref}}; \mathbb {R}^d)\) with finite norm
Furthermore, the space \(L^\infty \big (\Box ;L^\infty (D_{\mathrm{ref}};\mathbb {R}^{d\times d})\big )\) consists of all equivalence classes of strongly measurable functions \(\mathbf{M}:\Box \rightarrow L^\infty (D_{\mathrm{ref}};\mathbb {R}^{d\times d})\) with finite norm
We start by providing bounds on the derivatives of \(\big (\mathbf{J}(\mathbf{x},\mathbf{y})^\intercal \mathbf{J}(\mathbf{x},\mathbf{y})\big )^{-1}\).
Lemma 3
Let \(\mathbf{J}:D_{\mathrm{ref}}\times \Box \rightarrow \mathbb {R}^{d\times d}\) be defined as in (16). Then, it holds for the derivatives of
under the conditions of Assumption 1.3 that
Proof
We define \( \mathbf{B}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathbf{J}(\mathbf{x},\mathbf{y})^\intercal \mathbf{J}(\mathbf{x},\mathbf{y})\) and \(\tilde{\mathbf{A}}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\big (\mathbf{B}(\mathbf{x},\mathbf{y})\big )^{-1}. \) Expanding the expression for \(\mathbf{B}(\mathbf{x},\mathbf{y})\) yields
Thus, the first order derivatives of \(\mathbf{B}(\mathbf{x},\mathbf{y})\) are given by
and the second order derivatives according to
Obviously, all higher order derivatives with respect to \(\mathbf{y}\) vanish.
The ellipticity Assumption (4) now yields the following bounds:
respectively. Furthermore, we derive from (17) that
and from (18) that \( {\big \vert \big \vert \big \vert {\partial _{y_j}\partial _{y_i}}{} \mathbf{B} \big \vert \big \vert \big \vert }_{d\times d}\le 2\gamma _i\gamma _j. \) Thus, we have
Since \(\tilde{\mathbf{A}}=v\circ \mathbf{B}\) is a composite function with \(v(x)=x^{-1}\), we may employ Faà di Bruno’s formula, cf. [9], which is a generalization of the chain rule, to compute its derivatives. For \(n=|{\varvec{\alpha }}|\) Faà di Bruno’s formula formally yieldsFootnote 2
Here, the set \(P({\varvec{\alpha }},r)\) contains restricted integer partitions of a multiindex \({\varvec{\alpha }}\) into r non-vanishing multiindices, i.e.
Herein, for multiindices \({\varvec{\beta }},{\varvec{\beta }}'\in \mathbb {N}_0^M\), the relation \({\varvec{\beta }}\prec {\varvec{\beta }}'\) means either \(|{\varvec{\beta }}|<|{\varvec{\beta }}'|\) or, if \(|{\varvec{\beta }}|=|{\varvec{\beta }}'|\), it denotes the lexicographical order which means that it holds that \(\beta _1=\beta _1',\ldots ,\beta _k=\beta _k'\) and \(\beta _{k+1}<\beta _{k+1}'\) for some \(0\le k< M\).
Taking the norm in (20), we derive the estimate
From [9] we know that
where \(S_{n,r}\) are the Stirling numbers of the second kind, cf. [1]. Thus, we obtain
The term \( \tilde{b}(n)\mathrel {\mathrel {\mathop :}=}\sum _{r=0}^n r!S_{n,r} \) coincides with the n-th ordered Bell number. The ordered Bell numbers satisfy the recurrence relation
see [12], and may be estimated as follows,Footnote 3 cf. [3],
This finally proves the assertion. \(\Box \)
The next lemma bounds the derivatives of \(\det \mathbf{J}(\mathbf{x},\mathbf{y})\).
Lemma 4
Let \(\mathbf{J}:\Box \rightarrow L^\infty (D_{\mathrm{ref}};\mathbb {R}^{d\times d})\) be defined as in (16). Then, it holds for the derivatives of \(\det \mathbf{J}(\mathbf{x},\mathbf{y})\) that
Proof
The proof is by induction on the minors of \(\mathbf{J}(\mathbf{x},\mathbf{y})=[j(\mathbf{x},\mathbf{y})_{k,\ell }]_{k,\ell =1}^d\in \mathbb {R}^{d\times d}\). For the \((1\times 1)\)-minors, we obviously obtain
For \(m\le d\), we set \(\mathbf{J}^{\mathbf{k},{\varvec{\ell }}}\mathrel {\mathrel {\mathop :}=}[j_{k,\ell }]_{k\in \mathbf{k},\ell \in {\varvec{\ell }}}\in \mathbb {R}^{m\times m}\), where \(\mathbf{k}=\{k_1,\ldots , k_m\}\) and \({\varvec{\ell }}=[\ell _1,\ldots , \ell _m]\) with \(1\le k_1<\cdots <k_m\le d\) and \(1\le \ell _1<\cdots < \ell _m\le d\). Now, let the assertion (23) hold for some \(m-1<d\). Then, Laplace’s rule for determinants yields
where \(\mathbf{k}'\mathrel {\mathrel {\mathop :}=}\mathbf{k}{\setminus }\{k_{k'}\}\) and \({\varvec{\ell }}'\mathrel {\mathrel {\mathop :}=}{\varvec{\ell }}{\setminus }\{\ell _{\ell '}\}\). The triangle inequality and Leibniz rule for differentiation give us
since \(j_{k_{k'},\ell _{\ell '}}\) is an affine function with respect to \(\mathbf{y}\) and all higher order derivatives, i.e. \(|{\varvec{\alpha }}'|>1\), vanish, see (24). A reapplication of the triangle inequality together with the induction hypothesis and the sub-multiplicativity of the \(L^\infty \)-norm hence provides
where we exploited that
\(\Box \)
The application of the Leibniz rule yields now a regularity estimate for the diffusion matrix \(\mathbf{A}(\mathbf{x},\mathbf{y})\).
Theorem 3
The derivatives of the diffusion matrix \(\mathbf{A}(\mathbf{x},\mathbf{y})\) defined in (7) satisfy under the conditions of Assumption 1.3 that
Proof
The Leibniz rule for \( \partial ^{\varvec{\alpha }}_\mathbf{y}{} \mathbf{A}(\mathbf{x},\mathbf{y}) \) reads as
Inserting the results of Lemmas 3 and 4 yields with \(C_{\det }\mathrel {\mathrel {\mathop :}=}d!(1+\overline{\sigma })^{d}\) that
Now, we employ the combinatorial identity
and obtain
\(\Box \)
In order to prove regularity results for the right hand side \(f_{\mathrm{ref}}\) in (9), we have to assume that f is a smooth function.
Lemma 5
Let \(f\in C^\infty (\mathcal {D})\) be analytic, i.e. \(\Vert \partial _\mathbf{x}^{{\varvec{\alpha }}}f\Vert _{L^\infty (\mathcal {D}; \mathbb {R}^d)}\le {\varvec{\alpha }}!\rho ^{-|{\varvec{\alpha }}|} c_f\) for all \({\varvec{\alpha }}\in \mathbb {N}_0^d\) and some \(\rho \in (0,1]\). Then, the derivatives of \(\hat{f}=f\circ \mathbf{V}\) are bounded by
Proof
In view of (16), differentiation of \(\mathbf{V}(\mathbf{x},\mathbf{y})\) yields \( {\partial _{y_i}}{} \mathbf{V}(\mathbf{x},\mathbf{y}) =\sigma _i{\varvec{\varphi }}_i(\mathbf{x}). \) Thus, all higher order derivatives with respect to an arbitrary direction \(y_j\) vanish. The norm of the first order derivatives is bounded by \( {\big \vert \big \vert \big \vert {\partial _{y_i}}{} \mathbf{V} \big \vert \big \vert \big \vert }_{d}\le \gamma _i. \)
The rest of the proof is also based on the application of Faà di Bruno’s formula. Nevertheless, we have this time to consider the multivariate case. To that end, we define the set \(P({\varvec{\alpha }},{\varvec{\alpha }}^\prime )\) given by
with \(n=|{\varvec{\alpha }}|\). The application of the multivariate Faà di Bruno formula yields now
From [9], we know that
where again \(S_{n,r}\) is the Stirling number of the second kind. Thus, we obtain
Analogously to the proof of Lemma 3, we finally arrive at the assertion. \(\Box \)
Now, in complete analogy to Theorem 3, we have the following regularity result for the right hand side \(f_{\mathrm{ref}}\).
Theorem 4
The derivatives of the right hand side \(f_{\mathrm{ref}}(\mathbf{x},\mathbf{y})\) defined in (8) satisfy
Finally, we establish the dependency between the solution \(\hat{u}\) to (9) and the data \({f}_{\mathrm{ref}}\).
Lemma 6
Let \(\hat{u}(\mathbf{y})\) be the solution to (9) and \({f}_{\mathrm{ref}}\in L^\infty \big (\Box ;L^\infty (D_{\mathrm{ref}})\big )\). Then, there holds
with a constant \(c_{{D}}\) only dependent on \({D}_{\mathrm{ref}}\) for almost every \(\mathbf{y}\in \Box \).
Proof
The bilinear form
is coercive and bounded according to (4) and \(\underline{\sigma }^d\le \det \mathbf{J}(\mathbf{x},\mathbf{y})\le \overline{\sigma }^d\). It holds
and
for all \(\hat{u},\hat{v}\in H^1(D_{\mathrm{ref}})\) and almost every \(\mathbf{y}\in \Box \). The assertion follows now by the application of the Lax-Milgram Lemma and the observation that
where \(c_P\) denotes the Poincaré constant of \(D_{\mathrm{ref}}\). \(\Box \)
Combining the constants arising from Theorems 3 and 4 leads to the modified sequence
such that
Herein, we set \(C\mathrel {\mathrel {\mathop :}=}C_{\det }\max (c_f,1/\underline{\sigma }^2)\). Notice that we introduced also the additional factor 2 in order to obtain the factor \(|{\varvec{\alpha }}|!\) in the derivatives instead of the factor \((|{\varvec{\alpha }}|+1)!\).
Theorem 5
The derivatives of the solution u to (9) satisfy under the conditions of Assumption 1.3 that
where \(c_D\) denotes the constant from the previous theorem.
Proof
Differentiating the variational formulation (9) with respect to \(\mathbf{y}\) leads to
The isomorphism of the spaces \(H^1_0(D_{\mathrm{ref}})\) and \(H^1_0\big (D(\mathbf{y})\big )\) from Lemma 1 allows us to consider the test functions v to be independent of \(\mathbf{y}\). Furthermore, the application of the Leibniz rule for the expression \( \partial ^{\varvec{\alpha }}_\mathbf{y}\big (\mathbf{A}(\mathbf{y})\nabla _\mathbf{x}\hat{u}(\mathbf{y})\big ) \) results in
Thus, rearranging the preceding expression and using the linearity of the gradient, we arrive at
By choosing \(v=\partial ^{\varvec{\alpha }}_\mathbf{y}\hat{u}(\mathbf{y})\) and by employing the estimates from Theorems 3 and 4, it follows that
From this, we obtain
by setting
The proof is now by induction on \(|{\varvec{\alpha }}|\). The induction hypothesis is given by
For \(|{\varvec{\alpha }}|=0\), we conclude just the stability estimate (26), where the right hand side of the inequality is scaled by the factor 4. Therefore, let the assertion hold for all \(|{\varvec{\alpha }}|\le n-1\) for some \(n\ge 1\). Then, we have
Again, we make use of the combinatorial identity (25) and obtain the estimate
Now, the application of Lemma 9 from the Appendix gives us
Since \(\tilde{C}>1\), we conclude
This completes the proof. \(\Box \)
Taking into account the additional factor provided by the theorem, we end up with the sequence
which yields in view of Theorem 5 that
with a constant \(C>0\) independent of the dimension M. Moreover, we observe \(\mu _k\eqsim \gamma _k\). Therefore, we obtain for \(\gamma _k\lesssim k^{-1-\delta }\) the analyticity of \(\hat{u}\) by Lemma 8 from the Appendix for any \(\delta >0\).
Remark 2
The discussion in this section only refers to the case of the Poisson equation. Of course, the analysis presented here straightforwardly applies also to the more general diffusion problem
In this case, one has to impose the restriction that \(\alpha (\mathbf{x})\) is an analytic function which is bounded from above and below away from 0. Then, an estimate analogous to Lemma 5 applies for \(\hat{\alpha }(\mathbf{x},\mathbf{y})\). The proof of a related Theorem 3 for \(\hat{\alpha }(\mathbf{x},\mathbf{y}) \mathbf{A}(\mathbf{x},\mathbf{y})\) then involves an additional application of the Leibniz rule.
Remark 3
We can obtain similar approximation results for the moments of \(\hat{u}\), i.e. for \(\hat{u}^p\) with \(p\in \mathbb {N}\), possibly with worse constants. To that end, one has to bound the derivatives of \(\hat{u}^p\) with respect to \(\mathbf{y}\), too. This is also achieved by the application of Faà di Bruno’s formula. For an idea of the related proofs, we refer to [16] where this topic is discussed in case of a random diffusion coefficient.
5 Curved domains and parametric finite elements
For the analysis of the regularity in the preceding section, we have exploited that there exists a one-to-one correspondence between the deterministic problem on the random domain and the random problem on the reference domain. For the computations, in contrast to [6, 36], we do however not aim at mapping the equation to the reference domain \(D_{\mathrm{ref}}\) but rather to solve the equation on each particular realization \(D(\mathbf{y}_i)=\mathbf{V} (D_{\mathrm{ref}},\mathbf{y}_i)\) for a suitable set of samples \(\{\mathbf{y}_i\}_{i=1}^N \subset \Box \). A first step towards this approach is made by [26], where a random boundary variation is assumed and a mesh on the realization \(D(\mathbf{y}_i)\) is generated via the solution of the Laplacian. Here, under the assumption that the random domain is obtained by a sufficiently smooth mapping \(\mathbf{V}(\mathbf{y}_i)\), we will employ parametric finite elements to map the mesh on \(D_{\mathrm{ref}}\) onto a mesh on \(D(\mathbf{y}_i)\).
We assume that the domain \(D_{\mathrm{ref}}\) is given as a collection of simplicial smooth patches. More precisely, let \(\triangle \) denote the reference simplex in \(\mathbb {R}^d\). We assume that the domain \(D_{\mathrm{ref}}\) is partitioned into K patches
where each \(\varvec{\kappa }_j:\triangle \rightarrow \tau _{0,j}\) defines a diffeomorphism of \(\triangle \) onto \(\tau _{0,j}\). Thus, we have especially that
where \(\varvec{\kappa }_j'\) denotes as before the Jacobian of \(\varvec{\kappa }_j\). Since there are only finitely many patches, we may set \(\rho \mathrel {\mathrel {\mathop :}=}\max _{j=1}^K\rho _j.\) The intersection \(\tau _{0,j}\cap \tau _{0,j'}\), \(j\ne j'\), of any two patches \(\tau _{0,j}\) and \(\tau _{0,j'}\) is supposed to be either \(\emptyset \), or a common lower dimensional face.
A mesh on level \(\ell \) on \(D_{\mathrm{ref}}\) is now obtained by regular subdivisions of depth \(\ell \) of the reference simplex into \(2^{\ell d}\) sub-simplices. This generates the \(2^{\ell d}\) elements \(\{\tau _{\ell ,j}\}_j\). In order to ensure that the triangulation \(\mathcal {T}_\ell \mathrel {\mathrel {\mathop :}=}\{\tau _{\ell ,j}\}_j\) on the level \(\ell \) forms a regular mesh on \(D_{\mathrm{ref}}\), the parametrizations \(\{ \varvec{\kappa }_j\}_j\) are assumed to be \(C^0\) compatible in the following sense: there exists a bijective, affine mapping \(\varvec{\Xi }:\triangle \rightarrow \triangle \) such that for all \(\mathbf{x} = \varvec{\kappa }_i(\mathbf{s})\) on a common interface of \(\tau _{0,j}\) and \(\tau _{0,j'}\) it holds that \(\varvec{\kappa }_j(\mathbf{s}) = (\varvec{\kappa }_{j'}\circ \varvec{\Xi })(\mathbf{s})\). In other words, the diffeomorphisms \(\varvec{\kappa }_j\) and \(\varvec{\kappa }_{j'}\) coincide at the common interface except for orientation. An illustration of such a triangulation is found in Fig. 1. Notice that in our construction the local element mappings \(\triangle \rightarrow \tau _{\ell ,j}\) satisfy the same bound (28) by definition. Therefore, especially the uniformity condition for (iso-) parametric finite elements is fulfilled, cf. [4, 22].
Finally, we define the finite element ansatz functions via the parametrizations \(\{ \varvec{\kappa }_j\}_j\) in the usual fashion, i.e. by lifting Lagrangian finite elements from \(\triangle \) to the domain \(D_{\mathrm{ref}}\) by using the mappings \(\varvec{\kappa }_j\). To that end, we define on the \(\ell \)-th subdivision \(\triangle _\ell \) of the reference domain the standard Lagrangian piecewise polynomial continuous finite elements \(\Phi _\ell =\{\varphi _{\ell ,i}:i\in \mathcal {I}_\ell \}\), where \(\mathcal {I}_\ell \) denotes an appropriate index set. The corresponding finite element space is then given by
with \(\dim {V}_{\triangle ,\ell }\eqsim 2^{\ell d}\) and \(\Pi _n\) denoting the space of polynomials of degree at most n. Continuous basis functions whose support overlaps with several patches are obtained by gluing across patch boundaries, using the \(C^0\) inter-patch compatibility. This yields a (nested) sequence of finite element spaces
with \(\dim V_{{\mathrm{ref}},\ell }\eqsim 2^{\ell d}\). It is well known that the spaces \(V_{{\mathrm{ref}},\ell }\) satisfy the following Jackson and Bernstein type estimates for all \(0\le s\le t<3/2\), \(t\le q\le n+1\)
and
uniformly in \(\ell \), where we set \(h_\ell \mathrel {\mathrel {\mathop :}=}2^{-\ell }\). Note that, by construction, \(h_\ell \) scales like the mesh size \(\max _k\{\mathrm{diam}\tau _{\ell ,k}\}\), i.e. it holds \(h_\ell \eqsim \max _k\{\mathrm{diam}\tau _{\ell ,k}\}\) uniformly in \(\ell \in \mathbb {N}\) due to (28).
We can employ the same argumentation to map the finite elements from the reference domain \(D_{\mathrm{ref}}\) to the particular realization \(D(\mathbf{y})=\mathbf{V}(D_{\mathrm{ref}},\mathbf{y})\) for \(\mathbf{y}\in \Box \). The ellipticity condition (4) on the Jacobian \(\mathbf{J}(\mathbf{x},\omega )\) of the random vector field guarantees that (28) is satisfied with \(\rho =\overline{\sigma }/\underline{\sigma }\). Also the Jackson and Bernstein type estimates (29) and (30) are still valid, where the only limitation is imposed by the smoothness of \(\mathbf{V}(\mathbf{x},\mathbf{y})\). If for example \(\mathbf{V}(\mathbf{x},\mathbf{y})\) is of class \(C^2\), then we have the restriction \(q\le 2\) such that
for all \(0\le t\le 3/2\), \(t\le q\le 2\) where \( {V_\ell }(\mathbf{y})\mathrel {\mathrel {\mathop :}=}\{\varphi \circ \mathbf{V}(\mathbf{y})^{-1}: \varphi \in {V}_{{\mathrm{ref}},\ell }\}\subset H^1\big (D(\mathbf{y})\big ). \)
The one-to-one correspondence between the solution \(u_\ell (\mathbf{y})\in {V}_\ell (\mathbf{y})\) to (6) and the solution \(\hat{u}_\ell (\mathbf{y})\in V_{{\mathrm{ref}},\ell }\) to (9) is given by the following
Theorem 6
Let \(u_\ell (\mathbf{y})\in {V}_\ell (\mathbf{y})\) be the Galerkin solution to (6) and \(\hat{u}_\ell (\mathbf{y})\in V_{{\mathrm{ref}},\ell }\) the Galerkin solution to (9), respectively. Then, it holds
Proof
The proof is a straightforward consequence of the construction of the spaces \({V}_\ell (\mathbf{y})\) and the equivalence of the problems (6) and (9), see also (10). \(\Box \)
Remark 4
The \(H^2\)-regularity of the mapped problem, i.e. on \(D(\mathbf{y})\), follows from the \(H^2\)-regularity of the problem on the reference domain \(D_{\mathrm{ref}}\) if the vector field \(\mathbf{V}(\mathbf{x},\mathbf{y})\) is at least a \(C^2\)-diffeomorphism. Especially, if \(\mathbf{V}(\mathbf{x},\mathbf{y})=\mathbf{x}+\mathbf{V}_0(\mathbf{x},\mathbf{y})\) is a perturbation of the identity as in (16) and \(\mathbf{V}_0(\mathbf{x},\mathbf{y})\) is of class \(C^2\), then \(\mathbf{V}(\mathbf{x},\mathbf{y})^{-1}\) is also a \(C^2\)-diffeomorphism provided that \(\Vert \mathbf{V}_0(\cdot ,\mathbf{y})\Vert _{C^2(D_{\mathrm{ref}})}<1/2\), cf. [31].
6 Stochastic interface problems
As a special case of a diffusion problem on a random domain, we shall focus on the stochastic interface problem as already discussed in e.g. [14].
6.1 Problem formulation
Let the hold-all \(\mathcal {D}\subset \mathbb {R}^d\), cf. (3), be a simply-connected and convex domain with Lipschitz continuous boundary \(\partial \mathcal {D}\). Inscribed into \(\mathcal {D}\), we have a randomly varying inclusion \(D^-(\mathbf{y})\subsetneq \mathcal {D}\) for \(\mathbf{y}\in \Box \) with a \(C^{2}\)-smooth boundary \(\Gamma (\mathbf{y})\mathrel {\mathrel {\mathop :}=}\partial D^-(\mathbf{y})\). The complement of \(\overline{D^-(\mathbf{y})}\) will be denoted by \(D^+(\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathcal {D}{{\setminus }}\overline{D^-(\mathbf{y})}\). A visualization of this setup is found in Fig. 2. For given \(\mathbf{y}\in \Box \), we can state the stochastic elliptic interface problem as follows:
Here, \(\mathbf{n}\) denotes the outward normal vector on \(\Gamma (\mathbf{y})\). Furthermore, the diffusion coefficient is given by
where \(\chi _{D^-(\mathbf{y})}\) is the characteristic function of \(D^-(\mathbf{y})\) and \(\alpha ^+,\alpha ^-\) are smooth deterministic functions with
By \([\![u(\mathbf{x},\mathbf{y})]\!]\mathrel {\mathrel {\mathop :}=}u^+(\mathbf{x},\mathbf{y})-u^-(\mathbf{x},\mathbf{y})\), we denote the jump of the solution u across \(\Gamma (\mathbf{y})\), where \(u^-(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}u|_{D^-(\mathbf{y})}\) and \(u^+(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}u|_{D^+(\mathbf{y})}\), respectively. Analogously, we define the jump of the co-normal derivative across \(\Gamma (\mathbf{y})\) via
Remark 5
This formulation of the stochastic interface problem also covers the case of elliptic equations on random domains. For example, for \(\alpha ^+{(\mathbf x)}\equiv 0\) and \(\alpha ^-(\mathbf{x})\equiv 1\) (perfect insulation), we have the Poisson equation on \(D^-(\mathbf{y})\) with homogeneous Neumann data on \(\Gamma (\mathbf{y})\) while, for \(\alpha ^+{(\mathbf x)}\equiv \infty \) and \(\alpha ^-(\mathbf{x})\equiv 1\) (perfect conduction), we have the Poisson equation on \(D^-(\mathbf{y})\) with homogeneous Dirichlet data on \(\Gamma (\mathbf{y})\).
6.2 Modeling the stochastic interface
Instead of solving the stochastic interface problem by the perturbation method by means of shape sensitivity analysis as in [14, 18], we propose here to apply the domain mapping approach. To that end, let \(\Gamma _{\mathrm{ref}}\subset \mathcal {D}\) denote a reference interface of class \(C^{2}\) and co-dimension 1 which separates the interior domain \(D^-_{\mathrm{ref}}\) and the outer domain \(D^+_{\mathrm{ref}}\). We assume that \(\Gamma (\mathbf{y})\) is prescribed by the application of a vector field \( \mathbf{V}:\mathcal {D}\times \Box \rightarrow \mathcal {D}, \) i.e. \(\Gamma (\mathbf{y})=\mathbf{V}(\Gamma _{\mathrm{ref}},\mathbf{y})\), which is a uniform \(C^{2}\)-diffeomorphism in the sense of Sect. 2. Furthermore, let the Jacobian of \(\mathbf{V}\) satisfy the ellipticity condition (4).
As an example, we can consider here an extension of the vector field in [14], which only prescribes the perturbation at the boundary: If \(\Gamma _{\mathrm{ref}}\) is of class \(C^{3}\), then its outward normal \(\mathbf{n}\) is of class \(C^{2}\). Thus, given a random field \(\kappa :\Gamma _{\mathrm{ref}}\times \Box \rightarrow \mathbb {R}\) which satisfies \(|\kappa (\mathbf{x},\mathbf{y})|\le \overline{\kappa }<1\) almost surely, we can define \( \mathbf{V}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathbf{x}+\kappa (\mathbf{x},\mathbf{y})\mathbf{n}(\mathbf{x}) \) for \(\mathbf{x}\in \Gamma _{\mathrm{ref}}\). A suitable extension of this vector field to the whole domain \(\mathcal {D}\) is given by \( \mathbf{V}(\mathbf{x},\mathbf{y})\mathrel {\mathrel {\mathop :}=}\mathbf{x}+\kappa (P\mathbf{x},\mathbf{y}) \mathbf{n}(P\mathbf{x})B(\Vert \mathbf{x}-P\mathbf{x}\Vert _2), \) where \(P\mathbf{x}\) is the orthogonal projection of \(\mathbf{x}\) onto \(\Gamma _{\mathrm{ref}}\) and \(B:[0,\infty )\rightarrow [0,1]\) is a smooth blending function with \(B(0)=1\) and \(B(t)=0\) for all \(t\ge c\) for some constant \(c\in (0,\infty )\). Notice that, if \(\Gamma _{\mathrm{ref}}\) is of class \(C^{3}\), the orthogonal projection P onto \(\Gamma _{\mathrm{ref}}\) and thus \(\mathbf{V}(\mathbf{x},\mathbf{y})\) is at least of class \(C^{2}\), cf. [19].
6.3 Reformulation for the reference interface
For \(\mathbf{y}\in \Box \), the variational formulation of the interface problem (31)–(34) is given as follows: Find \(u\in H^1_0(\mathcal {D})\) such that
As in Sect. 2, we can reformulate this variational formulation relative to the reference interface. As we have for the transported coefficient
we obtain the following variational formulation with the definition (7) of the diffusion matrix \(\mathbf{A}(\mathbf{x},\mathbf{y})\): Find \(\hat{u}(\mathbf{y})\in H^1_0(\mathcal {D})\) such that
for all \(v\in H^1_0(\mathcal {D})\). Since \(\hat{\alpha }(\mathbf{x},\mathbf{y})\) is a smooth function with respect to \(\mathbf{y}\), the regularity results from Sect. 4 remain valid here.
6.4 Finite element approximation for the stochastic interface problem
The application of parametric finite elements yields especially an interface-resolved triangulation for the discretization of the stochastic interface problem (31)–(34). By “interface-resolved” we mean that the vertices of elements around the interface lie exactly on the interface, cf. [7, 23]. Thus, the approximation error for a particular realization of the solution \(u(\mathbf{y})\) to the stochastic interface problem (31)–(34) can be quantified by the following theorem adopted from [23, Theorem 4.1].
Theorem 7
For \(\mathbf{y}\in \Box \), let \(\{ \mathcal {T}_{\ell }\} _{\ell >0}\) be a family of interface resolved triangulations for \(\mathbf{V}(\mathcal {D},\mathbf{y})\) and \(\{{V}_{\ell }(\mathbf{y})\} _{\ell >0}\) the associated finite element spaces. Let \({u}_{\ell }(\mathbf{y})\) be the finite element solution corresponding to the realization \({u}(\mathbf{y})\) of the solution to the elliptic problem (31)–(34). Then, for \(s=0,1\), there holds that
where \({H^2\big (D^-(\mathbf{y})\big )}\cup H^2\big (D^+(\mathbf{y})\big )\) is the broken Sobolev space equipped by the norm
In view of Theorem 6, the statement of the previous theorem is also valid for the realization of the solution which is pulled back to the domain \(\mathcal {D}\) relative to the reference interface \(\Gamma _{\mathrm{ref}}\).
7 Numerical examples
In this section, we consider two examples for boundary value problems on random domains. On the one hand, we consider a stochastic interface problem, and on the other hand, we consider the Laplace equation on a random domain. In both examples, we employ the pivoted Cholesky decomposition, cf. [15, 17], in order to approximate the Karhunen-Loève expansion of V. The spatial discretization is performed by using piecewise linear parametric finite elements on the mapped domain \(\mathbf{V}(D_{\mathrm{ref}},\mathbf{y}_i)\) for each sample \(\mathbf{y}_i\). It would of course be also possible to perform the computations on the reference domain. In this case, the diffusion matrix \(\mathbf{A}\) has to be computed from Karhunen-Loève expansion of \(\mathbf{V}\) for each particular sample.
For the stochastic approximation, we employ a quasi-Monte Carlo quadrature based on N Halton points \(\{{\varvec{\xi }}_i\}_{i=1}^N\) mapped to the hypercube \([-1,1]^M\), i.e.
In accordance with [16], we have the following convergence result for this quasi-Monte Carlo quadrature, which is valid for the variance of \(\hat{u}\) as well.
Lemma 7
The quasi-Monte Carlo quadrature with Halton points converges for the mean of the solution \(\hat{u}\) to (9) independent of the stochastic dimension M if \({\gamma }_k\lesssim k^{-3-\varepsilon }\). More precisely, for all \(\delta >0\), there exists a constant such that the quasi-Monte Carlo quadrature based on N Halton points satisfies
where \(C(\delta )\rightarrow \infty \) as \(\delta \rightarrow 0\).
Proof
From [16, 21], we know that the error of the quasi-Monte Carlo quadrature can be estimated by the weighted Koksma-Hlawka inequality, cf. [27],
Herein, we denote by \(\mathcal {D}^{\star }(\Xi _{\varvec{\alpha }})\) the star-discrepancy of the set of Halton-points on \([0,1]^M\) which are projected onto the dimensions where \(\alpha _k=1\). Additionally, the factor \(2^{|{\varvec{\alpha }}|}\) appears due to the transport of \(\hat{u}\) to the unit cube \([0,1]^M\). It is shown in [34] that the second factor in (37) is bounded by
with a constant \(C(\delta )\) which is independent of M if the weights \(w_{{\varvec{\alpha }}}\) are product weights, i.e. \(w_{{\varvec{\alpha }}}=\prod _{k=1}^M w_k^{\alpha _k}\), and satisfy
In order to bound the first product in (37), we employ the estimate
from Theorem 5 and choose the weights accordingly as \(w_k^{1/2}=2c k \gamma _k\). Then, the condition (38) can be rewritten as
which is satisfied if \(\gamma _k\lesssim k^{-3-\varepsilon }\). \(\Box \)
All computations have been carried out on a computing server consisting of four nodesFootnote 4 with up to 64 threads.
7.1 The stochastic interface problem
We consider the stochastic interface problem from [14] where the hold-all is given as \(\mathcal {D}=[-1,1]^2\) and the reference interface is given as \(\Gamma _{\mathrm{ref}}=\{\mathbf{x}\in \mathcal {D}:\Vert \mathbf{x}\Vert _2=0.7\}\). Thus, the outward normal is \(\mathbf{n}(\mathbf{x})=[\cos (\theta ),\sin (\theta )]^\intercal \) where \(\mathbf{x}=r[\cos (\theta ),\sin (\theta )]^\intercal \) is the representation of \(\mathbf{x}\) in polar coordinates. The random field under consideration reads
Here, \(X_0,\ldots ,X_{11}\) are independent, uniformly distributed random variables with variance 1, i.e. their range is \([-\sqrt{3},\sqrt{3}]\). The diffusion coefficient is given as \(\alpha ^-(\mathbf{x})\equiv 2\) in the interior part of the domain and as \(\alpha ^+(\mathbf{x})\equiv 1\) in the remaining part of the domain. The right hand side is chosen as \(f(\mathbf{x})\equiv 1\).
In this example, only the perturbation at the random interface is known. Thus, the solution of the associated diffusion problem depends on the particular extension of the vector field and it is reasonable to consider a quantity of interest (QoI) that does not depend on this extension. Specifically, the QoI is given by the solution on a non-varying part of the domain, namely on \(\{\Vert \mathbf{x}\Vert _2\le 0.4\}\). We therefore extend the random field (39) onto \(\mathcal {D}\) as described in Sect. 6.2 by using the quadratic B-spline \(B(\mathbf{x}) = \frac{4}{3} B_2(5\Vert \mathbf{x}-P\mathbf{x}\Vert _2)\) as blending function. Hence, the random perturbation is localized in the annulus \(\{0.4<\Vert \mathbf{x}\Vert _2<1\}\) and we end up with the covariance
with
Furthermore, we set \(\mathbf{E}[\mathbf{V}](\mathbf{x})\mathrel {\mathrel {\mathop :}=}\mathbf{x}\). A visualization of the reference interface with a particular displacement field \(\mathbf{V}(\mathbf{x},\mathbf{y}_i)-\mathbf{x}\) and the resulting perturbed interface is found in Fig. 4.
A visualization of the QoI’s mean and variance computed by \(N=10^6\) quasi-Monte Carlo samples and 1048576 finite elements (level 8) is shown in Fig. 3. This approximation serves as a reference in order to examine the convergence behavior of the quasi-Monte Carlo method. According to Lemma 7, we expect a rate of convergence of \(N^{\delta -1}\) for any \(\delta >0\). In our experiments, we thus apply \(N_\ell =2^{\ell /(1-\delta )}\) Halton points on the finite element level \(\ell =1,\ldots ,7\) for the choices \(\delta =0.5,0.4,0.3,0.2\). Although all choices of \(\delta > 0\) would asymptotically result in an almost linear rate of convergence, the constant in the error estimate is still dependent on the particular choice.
Figure 5 depicts the error of the QoI’s mean measured in the \(H^1\)-norm on the right hand side and the error of the QoI’s variance measured in the \(W^{1,1}\)-norm on the left hand side each versus the related cost, which is given by the number \(N_\ell \) of samples times the degrees of freedom in the finite element approximation on level \(\ell \). As can be seen, the error of the QoI’s mean provides similar errors for all choices of \(\delta \). This suggests that the finite element error limits the overall approximation error. The choice \(\delta =0.2\) is already sufficient here and results in the lowest cost. For the QoI’s variance, we observe successively smaller errors for increasing \(\delta \). At least the error for the QoI’s mean seems to be dominated by the finite element discretization. Therefore, we found it instructive to present also the respective errors measured in the \(L^2\)-norm. They are plotted in Fig. 6. Here, the smallest error is obtained for \(\delta =0.5\). Nevertheless, the best error versus cost rate is provided by \(\delta =0.2\). The situation changes for the variance. Here, the error gets again successively smaller for increasing values of \(\delta \). Resulting in the lowest error for \(\delta =0.5\).
As a comparison and in order to validate the reference, we have also computed the approximate mean and variance on each level by the Monte Carlo method. Here, in order to maintain the linear approximation rate of the finite element method in the energy norm, we approximate the root mean square error by five realizations each of which being computed with \(N_\ell =2^{2\ell }\) samples.
7.2 The Poisson equation on a random domain
For our second example, we consider an infinite dimensional random field described by its mean \(\mathbb {E}[\mathbf{V}](\mathbf{x})=\mathbf{x}\) and its covariance function
Furthermore, we consider the random variables in the Karhunen-Loève expansion to be uniformly distributed. The unit disc \(D_{\mathrm{ref}}=\{\mathbf{x}\in \mathbb {R}^2:\Vert \mathbf{x}\Vert _2<1\}\) serves as reference domain and the load is set to \(f(\mathbf{x})\equiv 1\). Figure 8 shows the reference domain with a particular displacement field and the resulting perturbed domain. In this example, the covariance between any two points in \(D_{\mathrm{ref}}\) is actually known and can thus be incorporated into our model. Especially, there is no point inside the reference domain that is kept fixed by the random vector field. Therefore, we consider here the entire solution \(\hat{u}\) as QoI and approximate its mean and its variance.
In Fig. 7, a visualization of the mean and the variance computed by \(N=10^6\) quasi-Monte Carlo samples 1048576 finite elements (level 9) are found. Here, the Karhunen-Loève expansion has been truncated after \(M=303\) terms which yields a truncation error, cf. (14), smaller than \(10^{-6}\). For the convergence study, however, we have coupled the truncation error of the Karhunen-Loève expansion to the spatial discretization error of order \(2^{-\ell }\) on the finite element level \(\ell \). It is observed that the truncation rank M grows linearly in the level \(\ell \), namely it holds \(M=10,23,37,49,64,79,91,108\) for \(\ell =1,2,3,4,5,6,7,8\).
The number of samples of the quadrature methods under consideration has been chosen in dependence on the finite element level \(\ell \) as in the previous example. Figure 9 shows the error of the solution’s mean and variance measured in the \(H^1\)-norm and the \(W^{1,1}\)-norm, respectively, each versus the cost. Except for \(\delta =0.2\), we observe for the quasi-Monte Carlo quadrature as well as for the Monte Carlo quadrature comparable errors for the approximation of the mean. In view of the cost, \(\delta =0.2,0.3\) perform best here. In case of the variance, we obtain again successively smaller errors for increasing values of \(\delta \). Again, we have also provided the respective errors with respect to the \(L^2\)-norm. The related plots are found in Fig. 10. Here, for the mean and the variance, \(\delta =0.5\) provides asymptotically the lowest error.
8 Conclusion
In this article, we have provided regularity results for the domain mapping method for elliptic boundary value problems on random domains. Based on the decay of the random vector field’s Karhunen-Loève expansion, we have derived related decay rates for the solution’s derivatives. In particular, the presented framework is directly applicable to stochastic interface problems. The regularity results provide dimension independent convergence of the quasi-Monte Carlo quadrature and allow also for the use of (anisotropic) quadrature methods to approximate quantities of interest that involve an integration of the solution with respect to the random parameter. The numerical examples corroborate the theoretical results and demonstrate the flexibility of the approach.
Notes
It is sufficient to assume that \(\mathbf{V}\) is a \(C^1\)-diffeomorphism and satisfies the uniformity in \({C^1(\overline{D_{\mathrm{ref}}};\mathbb {R}^d)}\). Nevertheless, in order to obtain \(H^2\)-regularity of the model problem, we make this stronger assumption.
With “formally” we mean that we ignore here the fact that the product of matrices is in general not Abelian. Nevertheless, a differentiation yields exactly the appearing products in a permuted order. The formal representation is justified since we only consider the norm of the representation in the sequel.
Each node consists of two quad-core Intel(R) Xeon(R) X5550 CPUs with a clock rate of 2.67GHz (hyperthreading enabled) and 48GB of main memory.
References
Abramowitz, M., Stegun, I.A.: Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables. Applied mathematics series. Dover Publications, N. Chemsford (1964)
Alt, H.W.: Lineare Funktionalanalysis. Springer, London (2007)
Beck, J., Tempone, R., Nobile, F., Tamellini, L.: On the optimal polynomial approximation of stochastic pdes by galerkin and collocation methods. Math. Models Methods Appl. Sci. 22(9), 1250023 (2012)
Braess, D.: Finite Elemente: Theorie Schnelle Löser und Anwendungen in der Elastizitätstheorie. Springer, London (2007)
Canuto, C., Kozubek, T.: A fictitious domain approach to the numerical solution of PDEs in stochastic domains. Numerische Mathematik 107(2), 257–293 (2007)
Castrillon-Candas, J.E., Nobile, F., Tempone, R.F.: Analytic regularity and collocation approximation for PDEs with random domain deformations. ArXiv e-prints 1312.7845 (2013)
Chen, Z., Zou, J.: Finite element methods and their convergence for elliptic and parabolic interface problems. Numerische Mathematik 79(2), 175–202 (1998)
Cohen, A., DeVore, R., Schwab, C.: Convergence rates of best \(N\)-term Galerkin approximations for a class of elliptic sPDEs. Found. Comput. Math. 10, 615–646 (2010)
Constantine, G.M., Savits, T.H.: A multivariate Faà di Bruno formula with applications. Trans. Am. Math. Soc. 248, 503–520 (1996)
Ghanem, R., Spanos, P.: Stochastic finite elements: A spectral approach. Springer, New York (1991)
Griebel, M., Harbrecht, H.: Approximation of bi-variate functions: singular value decomposition versus sparse grids. IMA J. Numer. Anal. 34(1), 28–54 (2014)
Gross, O.A.: Preferential arrangements. Am. Math. Month. pp. 4–8 (1962)
Halton, J.H.: On the efficiency of certain quasi-random sequences of points in evaluating multi-dimensional integrals. Numerische Mathematik 2(1), 84–90 (1960)
Harbrecht, H., Li, J.: First order second moment analysis for stochastic interface problems based on low-rank approximation. ESAIM. Math. Model. Numer. Anal. 47, 1533–1552 (2013)
Harbrecht, H., Peters, M., Schneider, R.: On the low-rank approximation by the pivoted Cholesky decomposition. Appl. Numer. Math. 62, 28–440 (2012)
Harbrecht, H., Peters, M., Siebenmorgen, M.: On the quasi-Monte Carlo method with Halton points for elliptic PDEs with log-normal diffusion. Preprint 2013-28, Mathematisches Institut Universität Basel (to appear in Mathematics of Computation) (2013)
Harbrecht, H., Peters, M., Siebenmorgen, M.: Efficient approximation of random fields for numerical applications. Numer. Linear Algebra Appl. 22(4), 596–617 (2015)
Harbrecht, H., Schneider, R., Schwab, C.: Sparse second moment analysis for elliptic problems in stochastic domains. Numerische Mathematik 109(3), 385–414 (2008)
Holmes, R.B.: Smoothness of certain metric projections on Hilbert space. Trans. Am. Math. Soc. 184, 87–100 (1973)
Kadison, R.V., Ringrose, J.R.: Fundamentals of the theory of operator algebras. V1: Elementary theory. Academic Press, New York (1986)
Kuo, F.Y., Schwab, C., Sloan, I.H.: Quasi-Monte Carlo methods for high-dimensional integration: the standard (weighted Hilbert space) setting and beyond. ANZIAM J. 53, 1–37 (2012)
Lenoir, M.: Optimal isoparametric finite elements and error estimates for domains involving curved boundaries. SIAM J. Numer. Anal. 23(3), 562–580 (1986)
Li, J., Melenk, J.M., Wohlmuth, B., Zou, J.: Optimal a priori estimates for higher order finite elements for elliptic interface problems. Appl. Numer. Math. 60(1), 19–37 (2010)
Light, W.A., Cheney, E.W.: Approximation theory in tensor product spaces. Lecture notes in mathematics Volume 1169. Springer, New York (1985)
Loève, M.: Probability theory. I+II, Graduate Texts in Mathematics, vol. 45, 4th edn. Springer, New York (1977)
Mohan, P.S., Nair, P.B., Keane, A.J.: Stochastic projection schemes for deterministic linear elliptic partial differential equations on random domains. Int. J. Numer. Methods Eng. 85(7), 874–895 (2011)
Niederreiter, H.: Random number generation and Quasi-Monte Carlo methods. Society for Industrial and Applied Mathematics, Philadelphia (1992)
Nobile, F., Tempone, R., Webster, C.G.: An anisotropic sparse grid stochastic collocation method for partial differential equations with random input data. SIAM J. Numer. Anal. 46(5), 2411–2442 (2008)
Schwab, C., Todor, R.: Karhunen-Loève approximation of random fields by generalized fast multipole methods. J. Comput. Phys. 217, 100–122 (2006)
Simon, B.: Methods of modern mathematical physics: functional analysis, vol. 1. Academic Press, San Diego (1980)
Simon, J.: Differentiation with respect to the domain in boundary value problems. Numer. Funct. Anal. Optim. 2(7–8), 649–687 (1980)
Sokołowski, J., Zolésio, J.P.: Introduction to shape optimization. Shape sensitivity analysis. Springer series in computational mathematics. Springer, Berlin Heidelberg (1992)
Tartakovsky, D.M., Xiu, D.: Stochastic analysis of transport in tubes with rough walls. J. Comput. Phys. 217(1), 248–259 (2006)
Wang, X.: A constructive approach to strong tractability using quasi-Monte Carlo algorithms. J. Complex. 18, 683–701 (2002)
Wilf, H.S.: Generating functionology. A. K. Peters Ltd, Natick (2006)
Xiu, D., Tartakovsky, D.M.: Numerical methods for differential equations in random domains. SIAM J. Sci. Comput. 28(3), 1167–1185 (2006)
Author information
Authors and Affiliations
Corresponding author
Additional information
This research has been supported by the Swiss National Science Foundation (SNSF) through the project “Rapid Solution of Boundary Value Problems on Stochastic Domains”.
Appendix
Appendix
Lemma 8
Let \({\varvec{\gamma }}=\{\gamma _k\}_k\in \ell ^1(\mathbb {N})\) with finite support \(\mathcal {I}\subset \mathbb {N}\) and \(\gamma _k\ge 0\). Moreover, assume that \(c_{\varvec{\gamma }}\mathrel {\mathrel {\mathop :}=}\sum _{k\in \mathcal {I}}\gamma _k<1\). Then, it holds
and therefore there exists a constant with \(|{\varvec{\alpha }}|!/{\varvec{\alpha }}!{\varvec{\gamma }}^{\varvec{\alpha }}\le c\) for all \({\varvec{\alpha }}\in \mathbb {N}^M_0\), where we set \(M\mathrel {\mathrel {\mathop :}=}|\mathcal {I}|\) and \(0^0=1\).
Proof
It holds
by the multinomial theorem and the limit of the geometric series. \(\Box \)
Lemma 9
Let \(c,m\in \mathbb {R}\) with \(m\ge 2\) and \(c\ge m/(m-1)\). It holds for \(n\in \mathbb {N}\) that
Proof
It holds
Omitting the second summand together with the condition \(c\ge m/(m-1)\) yields the assertion. \(\Box \)
Rights and permissions
About this article
Cite this article
Harbrecht, H., Peters, M. & Siebenmorgen, M. Analysis of the domain mapping method for elliptic diffusion problems on random domains. Numer. Math. 134, 823–856 (2016). https://doi.org/10.1007/s00211-016-0791-4
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00211-016-0791-4