A mixed finite element method for thin film epitaxy

Chen, Wenbin; Wang, Yanqiu

doi:10.1007/s00211-012-0473-9

A mixed finite element method for thin film epitaxy

Published: 15 June 2012

Volume 122, pages 771–793, (2012)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Numerische Mathematik Aims and scope Submit manuscript

A mixed finite element method for thin film epitaxy

Download PDF

Wenbin Chen¹ &
Yanqiu Wang²

606 Accesses
19 Citations
Explore all metrics

Abstract

We present a mixed finite element method for the thin film epitaxy problem. Comparing to the primal formulation which requires $C^2$ elements in the discretization, the mixed formulation only needs to use $C^1$ elements, by introducing proper dual variables. The dual variable in our method is defined naturally from the nonlinear term in the equation, and its accurate approximation will be essential for understanding the long-time effect of the nonlinear term. For time-discretization, we use a backward-Euler semi-implicit scheme, which involves a convex–concave decomposition of the nonlinear term. The scheme is proved to be unconditionally stable and its convergence rate is analyzed.

A Second Order Energy Stable Linear Scheme for a Thin Film Model Without Slope Selection

Article 10 March 2018

Highly Efficient and Accurate Numerical Schemes for the Epitaxial Thin Film Growth Models by Using the SAV Approach

Article 20 September 2018

An adaptive BDF2 implicit time-stepping method for the no-slope-selection epitaxial thin film model

Article 23 March 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Molecular beam epitaxy (MBE) [11, 12] is a technology of depositing high-purity crystalline films with atomic thicknesses onto the surface of a base material. One distinguishing feature of MBE is the slow deposition rate of atoms or molecules, which allows the thin film on surface to grow epitaxially, or in other words, to grow as organized high-quality crystal. In this process, it is essential to have precise control on the surface morphology during epitaxial growth. This requires mathematical modeling, over multiple temporal and spatial scales, of particle adsorption, desorption, surface diffusion, and step dynamics (the Ehrlich–Schwoebel barrier). Many different models, describing part or all of the above-mentioned physical phenomena, have been developed. Generally speaking, these models can be classified into three categories. The atomistic models [13, 26, 38] describe molecular dynamics using kinetic Monte Carlo methods. However, their applicability is limited due to high computational costs. The continuum models ([28, 29, 32, 33, 36, 37, 42, 45] and references therein) are based on partial differential equations and the conservation of mass. They are able to capture large scale features of the crystal growth, and hence are interesting to physicists and mathematicians as well. There are also the hybrid models [9, 22] that seek a compromise between atomistic and continuum models.

Here we consider a continuum model. Although the continuum model contains many inherent simplifications and heuristics, it can still provide a unique insight into the long-term evolution of the physical problem, especially into certain types of instabilities during the epitaxial growth. For decades, there have been large amount of research work on building the continuum model for thin film epitaxial growth. Most of them are conducted by physicists. Only in recent years, there has been an emerging trend of mathematician’s involvement in this area. Most of these mathematical research has been focused on the existence, uniqueness and regularity of the solution to different types of governing evolution equations for MBE.

In 2002, Blömker and Gugg [6] have proved the global existence of the solution to a solid-on-solid model equation derived from [41]. The proof is based on Galerkin approximation and a priori estimates, using techniques similar to the proof of 2D Navier–Stokes equations. Later, Hoppe and Nash [24] proposed a combined spectral element/finite element approach for Blömker and Gugg’s model. In 2003, King, Stein and Winkler [27] have studied the existence, uniqueness and regularity of the fourth-order governing equation proposed by Ortiz, Repetto and Si [36]. Their proof is based on an asymptotic analysis. However, the continuum model that probably has received most of the attention from mathematicians is a simplification with linearized surface diffusion [28, 30, 31, 32, 33]. In [28], Kohn and Yan proved there is an upper bound of the averaged coarsening rate for this model with finite Ehrlich–Schwoebel barrier and with slope selection. Another important work was done by Li and Liu [32], in which they have proved the well-posedness of the model for finite Ehrlich–Schwoebel barrier, with or without slope selection. The proof is based on a Galerkin spectral approximation and its a priori bounds. Numerical results using this spectral method are presented. Also given in [32] is the regularity of the global solution, which lays the foundation for further numerical study of the model problem. For the case of finite Ehrlich–Schwoebel barrier without slope selection, Li and Liu in another paper [33] has obtained two main theoretical results. First, by using the energy method and the convexity argument, they have derived the bounds for several important physical quantities including the interface width, average slope and average energy. These theoretical predictions agree with heuristic arguments. Second, by using the perturbation theory, they have shown that the system evolves in such a way that it always stays close to a sequence of periodic equilibria. In [30, 31], Li has made further progress by generalizing the above results to the case of infinite Ehrlich–Schwoebel barriers and also for higher-order surface diffusion.

Numerical schemes for the simplified model problem proposed in [32] has been studied in [10, 43, 44, 45], for the finite Ehrlich–Schwoebel barrier case, either with or without slope selection. All these previous research are based on the primal formulation. In [10], an energy-stable semi-implicit scheme, which has linear implicit parts, was developed for the without-slope-selection case only. In [43], an unconditionally stable semi-implicit scheme has been developed for both with and without slope selection cases. In [44], a fully implicit, stable scheme was analyzed for the without-slope-selection case only. And in [45], the authors studied the time-stability of the large time-stepping method. We mention that for the semi-implicit schemes presented in [10, 43], a convex–concave decomposition is the key to the time-stability analysis. Indeed, we will also use this technique in our numerical method, and details shall be given later.

In this paper, we consider a mixed finite element method for the model problem, for the finite Ehrlich–Schwoebel barrier case and either with or without slope selection, as presented in [32]. Let $\varOmega $ be a rectangular domain and $u(\varvec{x}, t)$ be the height function of the thin film. The thin film epitaxy problem defined on $\varOmega \times (0, T]$ can be written as

$$\begin{aligned} \partial _t u&= -\delta \varDelta ^2 u + \nabla \cdot \nabla _{\mathbb F } G(\nabla u)\nonumber \\&= \nabla \cdot \left[\nabla _{\mathbb F } G(\nabla u) - \delta \nabla \varDelta u\right], \end{aligned}$$

(1)

where $\delta $ is a positive constant, $\nabla _{\mathbb F }$ is the Fréchet gradient, and $G(\nabla u)$ is defined by

$$\begin{aligned} G(\nabla u) = {\left\{ \begin{array}{ll} -\frac{1}{2}\ln (1+ |\nabla u|^2)&\text{ without} \text{ slope} \text{ selection},\\ \frac{1}{4} (|\nabla u|^2 - 1)^2&\text{ with} \text{ slope} \text{ selection}. \end{array}\right.} \end{aligned}$$

It is easy to check that their Fréchet gradients are, respectively

$$\begin{aligned} \nabla _{\mathbb F } G(\nabla u) = {\left\{ \begin{array}{ll} -\frac{\nabla u}{1+|\nabla u|^2}&\text{ without} \text{ slope} \text{ selection},\\ (|\nabla u|^2-1)\nabla u&\text{ with} \text{ slope} \text{ selection}. \end{array}\right.} \end{aligned}$$

Throughout the paper, we adopt the convention that a bold Latin or Greek character denotes a vector. Let $\varvec{n}$ be the unit outward normal on $\partial \varOmega $. To close the problem, we impose the following initial and boundary conditions. At $t=0$, let $u=u_0$. Two different type of boundary conditions will be considered:

1.
Dirichlet boundary condition. Let $u = \frac{\partial u}{\partial \varvec{n}} = 0$ on $\partial \varOmega $ for all time $t$.
2.
Periodic boundary condition, where $u$ is $\varOmega $-periodic for all time $t$. Since $u$ is unique up to a constant, it is convenient to set it to be mean value zero.

Obviously, the initial condition $u_0$ should satisfy the same boundary condition for compatibility.

We adopt the usual notation $H^s(\varOmega )$ for the Sobolev space with index $s$, equipped with the norm $\Vert \cdot \Vert _{H^s(\varOmega )}$ and sometimes also the semi-norm $|\cdot |_{H^s(\varOmega )}$. When $s=0, H^0(\varOmega )$ coincides with $L^2(\varOmega )$, and for simplicity of the notation, we suppress the subscript in $\Vert \cdot \Vert _{L^2(\varOmega )}$ and denote the norm by $\Vert \cdot \Vert $. Denote $(\cdot ,\,\cdot )$ to be the $L^2$ inner-product on $\varOmega $. Define $L^p(0,T; H^{s}(\varOmega )), 1\le p\le \infty $, to be the space of functions which are $H^s$ in space and $L^p$ in time. Finally, notice that all these notations can easily be extended to vector functions, by using product spaces. For convenience, when it is not ambiguous, some notations for product spaces will appear the same as those for a single space. For example, we write $\Vert \nabla u\Vert _{H^1(\varOmega )}$ instead of $\Vert \nabla u\Vert _{(H^1(\varOmega ))^2}$, and $\Vert \nabla u\Vert _{L^{\infty }(0,T; H^{1}(\varOmega ))}$ instead of $\Vert \nabla u\Vert _{L^{\infty }(0,T; (H^{1}(\varOmega ))^2)}$.

For the periodic boundary problem, it has been proved [32] that for $u_0\in H^s(\varOmega ), s\ge 2$,

$$\begin{aligned}&\text{ the} \text{ initial-boundary} \text{ value} \text{ problem} \text{ of}\ (1) \text{ has} \text{ a} \text{ unique} \text{ solution}\ u, \nonumber \\&u \in L^{\infty }(0,T; H^s(\varOmega )) \cap L^2(0,T; H^{s+2}(\varOmega )),\nonumber \\&\partial _t u \in L^2(0,T; H^{s-2}(\varOmega )). \end{aligned}$$

(2)

Such a result has not yet been proved for the Dirichlet boundary problem. However, the analysis in this paper will be based on the existence and regularity assumption (2), which is known to be true for at least the periodic boundary problem.

An important observation is that, the operator $G$ can be decomposed into a convex (+) and a concave ($-$) part [43]:

$$\begin{aligned} G(\varvec{w}) = G_+(\varvec{w}) + G_-(\varvec{w}), \end{aligned}$$

such that the Fréchet Hessian $\nabla _{\mathbb F }^2 G_+$ and $\nabla _{\mathbb F }^2 G_-$ are positively and negatively semi-definite, respectively. Moreover, similar to [43], we assume both $\nabla _{\mathbb F }^2 G_+(\varvec{w})$ and $\nabla _{\mathbb F }^2 G_-(\varvec{w})$ have at most polynomial growth in $\varvec{w}$, that is, there exists a positive integer $m$ such that

$$\begin{aligned} |\nabla _{\mathbb F }^2 G_+(\varvec{w})| + |\nabla _{\mathbb F }^2 G_-(\varvec{w})| \le C_G(1+|\varvec{w}|^m), \end{aligned}$$

(3)

where $C_G$ is a positive constant independent of $\varvec{w}, |\varvec{w}|$ is the vector 2-norm and $|\nabla _{\mathbb F }^2 G_+(\varvec{w})|, |\nabla _{\mathbb F }^2 G_-(\varvec{w})|$ are matrix 2-norms. An example of such a decomposition is to simply set [43]

$$\begin{aligned} G_-(\varvec{w}) = -\frac{1}{2} |\varvec{w}|^2,\quad G_+(\varvec{w}) = G(\varvec{w}) - G_-(\varvec{w}). \end{aligned}$$

(4)

Then $\nabla _{\mathbb F }^2 G_+$ and $\nabla _{\mathbb F }^2 G_-$ are positively and negatively semi-definite, respectively, and satisfy Inequality (3). Furthermore, it is easy to check that for both with and without slope selection, the decomposition defined in (4) satisfies

$$\begin{aligned} G_+(\varvec{w}) \ge 0 \end{aligned}$$

(5)

and

$$\begin{aligned}&(\nabla _{\mathbb F }G(\varvec{w})-\nabla _{\mathbb F }G(\varvec{\varphi }), \varvec{w}-\varvec{\varphi }) \nonumber \\&\quad =(\nabla _{\mathbb F }G_+(\varvec{w})-\nabla _{\mathbb F }G_+(\varvec{\varphi }), \varvec{w}-\varvec{\varphi })+ (\nabla _{\mathbb F }G_-(\varvec{w})-\nabla _{\mathbb F }G_-(\varvec{\varphi }), \varvec{w}-\varvec{\varphi }) \nonumber \\&\quad \ge (\nabla _{\mathbb F }G_-(\varvec{w})-\nabla _{\mathbb F }G_-(\varvec{\varphi }), \varvec{w}-\varvec{\varphi }) \nonumber \\&\quad = - \Vert \varvec{w}-\varvec{\varphi }\Vert ^2. \end{aligned}$$

(6)

The convex–concave decomposition defined above is essential in developing stable numerical schemes for problem (1). In the time discretization, the convex term will be approximated implicitly and the concave term explicitly. Such technique has been widely used in the time-discretization for Cahn–Hilliard equations [14, 15, 17, 18, 19, 20, 21]. For the thin film epitaxy problem, the use of convex–concave decomposition was first proposed in [43] for discretizing the primal formulation of problem (1). In this paper, we shall combine this decomposition with the mixed finite element method, and develop stable numerical schemes.

Although many ideas are borrowed from the previous research on primal finite element methods for the thin file epitaxy problem, we would like to point out that, the analysis of mixed finite element methods is quite different, due to its different finite element space settings. The time-stability and convergence analysis in this paper is relatively complicated. We are not sure whether an easier proof is possible or not, or whether a better convergence rate estimate can be achieved. The main contribution of this paper lies in that, it is the first in developing a mixed finite element method for thin film epitaxy model (1). New schemes, ideas and tools are introduced. Notice that the model problem (1) is essentially a fourth-order equation. A mixed formulation will break the fourth-order equation into more than one lower-order equations, hence avoiding the use of $C^1$ conforming or non-conforming finite elements in the numerical approximation. Also, as it will be explained in the next section, our mixed method involves a dual variable $\nabla u$, which provides a natural and accurate approximation to the nonlinear term $G(\nabla u)$.

The paper is organized as follows. In Sect. 2, we introduce a mixed formulation for problem (1). Its finite element discretization, together with its time-stability, will be discussed in Sect. 3. Finally, in Sect. 4, we analyze the convergence rate of the discrete scheme.

2 The mixed formulation

In this section, we consider a mixed formulation for Eq. (1). Equation (1) is essentially a time-dependent fourth-order problem with a nonlinear second order term. Let us first recall the mixed formulation for the biharmonic problem

$$\begin{aligned} \varDelta ^2 u = f. \end{aligned}$$

One popular method [39] is to define $w = \varDelta u$. Then the biharmonic problem can be rewritten into

$$\begin{aligned} {\left\{ \begin{array}{ll} w - \varDelta u = 0, \\ \varDelta w = f. \end{array}\right.} \end{aligned}$$

Another [23, 25] is to define $\varvec{w}= \nabla u, \varvec{\lambda }= \varDelta \varvec{w}$ and it gives

$$\begin{aligned} {\left\{ \begin{array}{ll} \varvec{w}- \nabla u = 0, \\ \varvec{\lambda }- \varDelta \varvec{w}= 0, \\ \nabla \cdot \varvec{\lambda }= f, \end{array}\right.} \end{aligned}$$

where the last equation follows from $\nabla \cdot (\varDelta \varvec{w}) = \varDelta (\nabla \cdot \varvec{w}) = \varDelta ^2 u$. This mixed formulation is similar to the reduced integration method proposed in [25, 35] for the biharmonic problem, which is also a popular numerical method for approximating the Reissner-Mindlin plate problems [1, 2, 3, 4, 7, 8, 16]. Indeed, we will use some existing theoretical results from these works. However, our analysis shall concentrate on the nonlinear well-posedness and the time-stability issue.

Since the nonlinear term in Eq. (1) depends solely on $\nabla u$, it will be natural to use the second mixed formulation. Indeed, by defining $\varvec{w}= \nabla u$ and $\varvec{\lambda }= \delta \varDelta \varvec{w}- \nabla _{\mathbb F } G(\varvec{w})$, Equation (1) can be rewritten into

$$\begin{aligned} {\left\{ \begin{array}{ll} -\delta \varDelta \varvec{w}+ \nabla _{\mathbb F } G(\varvec{w}) + \varvec{\lambda }= \varvec{0}, \\ \partial _t u + \nabla \cdot \varvec{\lambda }= 0, \\ \varvec{w}- \nabla u = 0. \end{array}\right.} \end{aligned}$$

(7)

It is not hard to see that for the Dirichlet boundary problem, $\varvec{w}= \varvec{0}$ on the boundary, and for the periodic boundary problem, $\varvec{w}=\nabla u$ is also periodic and each of its entries has mean value zero in $\varOmega $. Let $\dot{C}_{per}^{\infty }(\varOmega )$ be the space of infinitely differentiable periodic functions with mean value zero in $\varOmega $. Define $H_{per}^1(\varOmega )$ to be the closure of $\dot{C}_{per}^{\infty }(\varOmega )$ in $H^1(\varOmega )$. Denote spaces

$$\begin{aligned} S = {\left\{ \begin{array}{ll} H_0^1(\varOmega )&\text{ for} \text{ the} \text{ Dirichlet} \text{ boundary} \text{ problem}, \\ H_{per}^1(\varOmega )&\text{ for} \text{ the} \text{ periodic} \text{ boundary} \text{ problem}, \end{array}\right.} \end{aligned}$$

and $\varvec{Q}=(L^2(\varOmega ))^2$. We have the Poincaré inequality in $S$, that is, there exists a positive constant $C$ such that

$$\begin{aligned} \Vert v\Vert \le C \Vert \nabla v\Vert \quad \mathrm{for\; all }\ v\in S. \end{aligned}$$

By testing system (7) with $(v, \varvec{\varphi }, \varvec{\mu })\in S\times S^2\times \varvec{Q}$, we end up with the following weak formulation: find $(u,\varvec{w}, \varvec{\lambda })\in L^2(0,T;S)\times L^2(0,T;S^2)\times L^2(0,T;\varvec{Q})$ such that

$$\begin{aligned} {\left\{ \begin{array}{ll} \delta (\nabla \varvec{w}, \nabla \varvec{\varphi }) + (\nabla _{\mathbb F } G(\varvec{w}), \varvec{\varphi }) + (\varvec{\lambda }, \varvec{\varphi }) = 0&\mathrm{for\; all }\ \varvec{\varphi }\in S^2, \\ (\partial _t u, v) - (\varvec{\lambda }, \nabla v) = 0&\mathrm{for\; all }\ v\in S, \\ -(\varvec{w}- \nabla u, \varvec{\mu }) = 0,&\mathrm{for\; all }\ \varvec{\mu }\in \varvec{Q}, \end{array}\right.} \end{aligned}$$

(8)

almost everywhere for $t\in (0, T]$. Notice that the weak solution should satisfy the initial condition

$$\begin{aligned} u|_{t = 0} = u_0,\quad \varvec{w}|_{t =0} = \nabla u_0,\quad \varvec{\lambda }|_{t=0} = \delta \varDelta (\nabla u_0) - \nabla _{\mathbb F } G(\nabla u_0). \end{aligned}$$

Hence by the compatibility requirement, the entire mixed formulation is well-posed only when

$$\begin{aligned} u_0 \in H^2(\varOmega )\quad \text{ and}\quad \delta \varDelta (\nabla u_0) - \nabla _{\mathbb F } G(\nabla u_0) \in L^2(\varOmega ). \end{aligned}$$

(9)

This is mainly because of the introduction of the auxiliary variable $\varvec{\lambda }$. For simplicity, we assume $u_0\in H^3(\varOmega )$ in this paper, which is enough to guarantee (9).

Theorem 2.1

Given $u_0\in H^3(\varOmega )$, system (8) has a unique weak solution.

Proof

The existence of the solution follows from the existence and regularity assumptio n (2). By defining $\varvec{w}= \nabla u$ and $\varvec{\lambda }= \delta \varDelta \varvec{w}- \nabla _{\mathbb F } G(\varvec{w})$, one immediately ends up with a weak solution for (8).

The uniqueness of the solution follows from a stability result: let $u_0^{(i)}\in H^3(\varOmega ), i=1,2$, be two initial data, and $(u^{(i)}, \varvec{w}^{(i)}, \varvec{\lambda }^{(i)})$ be the corresponding weak solutions. Then

$$\begin{aligned} \Vert u^{(1)}\!-\!u^{(2)}\Vert _{L^{\infty }(0,T;L^2(\varOmega ))} \!+\! \Vert \varvec{w}^{(1)}\!-\!\varvec{w}^{(2)}\Vert _{L^2(0,T;H^1(\varOmega ))} \!\le \!C(\delta , T) \Vert u^{(1)}_0 \!-\! u^{(2)}_0\Vert _{L^2(\varOmega )}, \end{aligned}$$

where $C(\delta , T)$ is a positive constant. Next, we shall prove this stability result.

Denote $\tilde{u}=u^{(1)}-u^{(2)}, \tilde{\varvec{w}}=\varvec{w}^{(1)}-\varvec{w}^{(2)}$ and $\tilde{\varvec{\lambda }} = \varvec{\lambda }^{(1)} - \varvec{\lambda }^{(2)}$. Clearly,

$$\begin{aligned} {\left\{ \begin{array}{ll} \delta (\nabla \tilde{\varvec{w}}, \nabla \varvec{\varphi }) + (\nabla _{\mathbb F } G(\varvec{w}^{(1)}) - \nabla _{\mathbb F } G(\varvec{w}^{(2)}), \varvec{\varphi }) + (\tilde{\varvec{\lambda }}, \varvec{\varphi }) = 0, \\ (\partial _t \tilde{u}, v) - (\tilde{\varvec{\lambda }}, \nabla v) = 0, \\ -(\tilde{\varvec{w}} - \nabla \tilde{u}, \varvec{\mu }) = 0. \end{array}\right.} \end{aligned}$$

By setting $\varvec{\varphi }= \tilde{\varvec{w}}, v = \tilde{u}$ and $\varvec{\mu }= \tilde{\varvec{\lambda }}$, and adding up all three equations, one gets

$$\begin{aligned} \frac{1}{2}\frac{d}{dt}\Vert \tilde{u}\Vert ^2 + \delta \Vert \nabla \tilde{\varvec{w}}\Vert ^2 + (\nabla _{\mathbb F } G(\varvec{w}^{(1)}) - \nabla _{\mathbb F } G(\varvec{w}^{(2)}), \tilde{\varvec{w}}) = 0. \end{aligned}$$

By the lower bound (6), we have

$$\begin{aligned} \frac{1}{2}\frac{d}{dt}\Vert \tilde{u}\Vert ^2 + \delta \Vert \nabla \tilde{\varvec{w}}\Vert ^2&\le (\tilde{\varvec{w}}, \tilde{\varvec{w}}) = (\nabla \tilde{u}, \tilde{\varvec{w}}) \\&= -(\tilde{u}, \nabla \cdot \tilde{\varvec{w}}) \le \Vert \tilde{u}\Vert \left(2\Vert \nabla \tilde{\varvec{w}}\Vert \right) \\&\le \frac{2}{\delta } \Vert \tilde{u}\Vert ^2 + \frac{\delta }{2} \Vert \nabla \tilde{\varvec{w}}\Vert ^2. \end{aligned}$$

The stability result then follows from the Gronwall’s inequality. This completes the proof of the theorem. $\square $

3 Finite element discretization

We use the rectangular finite element spaces defined in [23] to discretize the mixed problem (8). Given a quasi-uniform rectangular mesh $\mathcal T _h$ in $\varOmega $ with characteristic mesh size $h$. Define $S_h\in S$ and $\varvec{Q}_h\in \varvec{Q}$ as follows:

$$\begin{aligned} S_h&= \{v\in S, v|_K\in Q_1(K) \text{ for} \text{ all}\ K\in \mathcal T _h \},\\ \varvec{Q}_h&= \{\varvec{\mu }\in \varvec{Q}, \varvec{\mu }|_K = \begin{pmatrix}a+by\\ c+dx\end{pmatrix} \text{ for} \text{ all}\ K\in \mathcal T _h\}, \end{aligned}$$

where $Q_1(K)$ is the space of bilinear polynomials on $K$. It is clear that $\nabla S_h \subset \varvec{Q}_h$. Let $I_h:\, S\cap H^2(\varOmega ) \rightarrow S_h$ be the nodal value interpolation and $\varvec{P}_h:\, \varvec{Q}\rightarrow \varvec{Q}_h$ be the $L^2$ orthogonal projection. We have the following approximation properties [23]:

$$\begin{aligned} \begin{array}{rl@{\quad }l} \Vert v - I_h v\Vert + h \Vert \nabla (v-I_h v)\Vert&\le C h^2 |v|_{H^2(\varOmega )}&\mathrm{for\; all }\ v\in S\cap H^2(\varOmega ),\\ \Vert \varvec{\mu }- \varvec{P}_h \varvec{\mu }\Vert&\le Ch |\varvec{\mu }|_{H^1(\varOmega )}&\mathrm{for\; all }\ \varvec{\mu }\in (H^1(\varOmega ))^2, \\ (\nabla (v-I_h v), \varvec{\mu }_h)&\le C h^2|v|_{H^3(\varOmega )} \Vert \varvec{\mu }_h\Vert&\mathrm{for\; all }\ v\in S\cap H^3(\varOmega ) \\&\quad \text{ and} \text{ all}\ \varvec{\mu }_h\in \varvec{Q}_h, \end{array}\qquad \quad \end{aligned}$$

(10)

where $C>0$ is a general constant independent of $h$.

Now we are able to introduce a fully-discrete scheme for the mixed problem (8). A convex-splitting semi-implicit scheme will be considered, whose main idea is to use implicit time discretization in $G_+$ and the fourth-order term, and to use explicit time discretization in $G_-$. Such an idea has been used in [14, 15, 17, 18, 19, 20, 21] for the Cahn–Hilliard flow, and in [43] for the primal formulation of the thin film epitaxy.

Denote $(u^n_h, \varvec{w}^n_h, \varvec{\lambda }^n_h) \in S_h\times S_h^2\times \varvec{Q}_h$ to be the approximation to the weak solution at time $t_n = n\varDelta t$, the discrete problem for (8) can be written as:

$$\begin{aligned} {\left\{ \begin{array}{ll} \delta (\nabla \varvec{w}_h^{n+1}, \nabla \varvec{\varphi }_h) + (\nabla _{\mathbb F } G_+(\varvec{w}_h^{n+1}), \varvec{\varphi }_h) + (\varvec{\lambda }_h^{n+1},\varvec{\varphi }_h)&\\ \quad = -(\nabla _{\mathbb F } G_-(\varvec{w}_h^{n}), \varvec{\varphi }_h)&\mathrm{for\; all }\ \varvec{\varphi }_h\in S_h^2, \\ \Big (\frac{u_h^{n+1}-u_h^n}{\varDelta t}, v_h\Big ) - (\varvec{\lambda }_h^{n+1}, \nabla v_h) = 0&\mathrm{for\; all }\ v_h\in S_h, \\ \varepsilon (\varvec{\lambda }_h^{n+1}, \varvec{\mu }_h)-(\varvec{w}_h^{n+1} - \nabla u_h^{n+1}, \varvec{\mu }_h) = 0,&\mathrm{for\; all }\ \varvec{\mu }_h\in \varvec{Q}_h, \end{array}\right.} \end{aligned}$$

(11)

where $\varepsilon =O(h^2)$ is a penalty constant which is needed to ensure the solvability of the discrete problem [23, 25]. Notice that given $u_h^{n+1}$ and $\varvec{w}_h^{n+1}, \varvec{\lambda }_h^{n+1}$ is uniquely solvable from the third equation of (11). In other words, the third equation can be decoupled from the system.

Equation (11) is a stabilized formulation. In practice, any stabilized finite element spaces [23, 25, 35] for fourth-order elliptic equations can be used to discretize problem (8) and the discretization leads to system (11). There are also stable finite element spaces available for the mixed formulation of Reissner–Mindlin plate [1, 2, 3, 4, 7, 8, 16], which can be adopted for discretizing problem (8). However, here we prefer the stabilized finite elements, because system (11) can easily be reduced to a simple minimization problem, which will be discussed later. The drawback is that, the stabilization term limits the finite element approximation rate. In the future, researchers can work towards increasing the approximation rate or adopting stable finite elements.

Define functional

$$\begin{aligned} \mathcal F ^{n+1}(u,\varvec{w},\varvec{\lambda })&= \int _{\varOmega } \left( G_+(\varvec{w}) + \frac{\delta }{2}|\nabla \varvec{w}|^2 + \left[(\varvec{w}-\nabla u)\cdot \varvec{\lambda }- \frac{\varepsilon }{2}|\varvec{\lambda }|^2\right]\right. \\&\qquad \quad \left.+ \frac{1}{2\varDelta t} |u|^2 + \nabla _{\mathbb F } G_-(\varvec{w}_h^n)\cdot \varvec{w}- \frac{1}{\varDelta t} u_h^n u \right)\,dx. \end{aligned}$$

Then the Fréchet gradient $\nabla _{\mathbb F } \mathcal F ^{n+1} = \varvec{0}$, which can be written as

$$\begin{aligned} {\left\{ \begin{array}{ll} \left[\mathcal F ^{n+1}_u(u, \varvec{w},\varvec{\lambda })\right](v_h) = \frac{d}{dk} \mathcal F ^{n+1}(u+kv_h, \varvec{w}, \varvec{\lambda }) \big |_{k=0} = 0&\text{ for} \text{ all}\ v_h\in S_h, \\ \left[\mathcal F ^{n+1}_{\varvec{w}}(u, \varvec{w}, \varvec{\lambda })\right](\varvec{\varphi }_h) = \frac{d}{dk} \mathcal F ^{n+1}(u, \varvec{w}+k\varvec{\varphi }_h, \varvec{\lambda }) \big |_{k=0} = 0&\text{ for} \text{ all}\ \varvec{\varphi }_h\in S_h^2, \\ {[}\mathcal F ^{n+1}_{\varvec{\lambda }}(u, \varvec{w},\varvec{\lambda }){]}(\varvec{\mu }_h) = \frac{d}{dk} \mathcal F ^{n+1}(u, \varvec{w},\varvec{\lambda }+k\varvec{\mu }_h) \big |_{k=0} = 0&\text{ for} \text{ all}\ \varvec{\mu }_h\in \varvec{Q}_h, \end{array}\right.} \end{aligned}$$

leads to exactly system (11) when being expanded. Indeed, given $(u_h^n, \varvec{w}_h^n)$, the solution $(u_h^{n+1}, \varvec{w}_h^{n+1}, \varvec{\lambda }_h^{n+1})$ for system (11) can be characterized as the solution to the following saddle point problem

$$\begin{aligned} \min _{\tiny \begin{matrix}u\in S_h\\ \varvec{w}\in S_h^2\end{matrix}} \max _{\varvec{\lambda }\in \varvec{Q}_h} \mathcal F ^{n+1}(u, \varvec{w},\varvec{\lambda }). \end{aligned}$$

(12)

It is not hard to see that $\max _{\varvec{\lambda }\in \varvec{Q}_h} \mathcal F ^{n+1}(u, \varvec{w},\varvec{\lambda })$ is reached at $\varvec{\lambda }= \varvec{P}_h(\varvec{w}-\nabla u)/\varepsilon $, hence the saddle problem (12) is also equivalent to the following minimization problem

$$\begin{aligned} \min _{\tiny \begin{matrix}{u\in S_h}\\ {\varvec{w}\in S_h^2}\end{matrix}} F^{n+1}(u, \varvec{w}) \end{aligned}$$

(13)

where

$$\begin{aligned} \begin{aligned} F^{n+1}(u, \varvec{w})&= \int _{\varOmega } \left(G_+(\varvec{w}) + \frac{\delta }{2}|\nabla \varvec{w}|^2 + \frac{1}{2\varepsilon }|\varvec{P}_h(\varvec{w}-\nabla u)|^2 \right.\\&\qquad \qquad \left.+\frac{1}{2\varDelta t} |u|^2 + \nabla _{\mathbb F } G_-(\varvec{w}_h^n)\cdot \varvec{w}- \frac{1}{\varDelta t} u_h^n u \right)\,dx. \end{aligned} \end{aligned}$$

Therefore, to prove the existence and uniqueness of the solution to problem (11), we only need to show that problem (13) has a unique solution. Indeed, we have the following theorem:

Theorem 3.1

Given $(u_h^n, \varvec{w}_h^n)$, the minimization problem (13) has a unique solution at $t_{n+1}$.

Proof

The minimization problem is an unconstrained convex optimization problem on finite dimensional spaces. According to the standard theory [5], we only need to prove the coercivity, which implies that $F^{n+1}(u, \varvec{w})$ goes to infinity as $\Vert u\Vert _{H^1}$ or $\Vert \varvec{w}\Vert _{H^1}$ goes to infinity, and the strict convexity of $F^{n+1}(u, \varvec{w})$.

We first prove the coercivity of $F^{n+1}(u, \varvec{w})$. Let $c_1$ be a positive constant such that

$$\begin{aligned} c_1 \Vert \varvec{w}\Vert ^2 \le \frac{\delta }{2}\Vert \nabla \varvec{w}\Vert ^2 \quad \mathrm{for\; all }\ \varvec{w}\in S_h^2. \end{aligned}$$

This is possible because of the Poincaré inequality. Then, by using the Schwarz inequality and the Young’s inequality,

$$\begin{aligned} F^{n+1}(u, \varvec{w})&\ge \int _{\varOmega } \left( G_+(\varvec{w}) + \frac{\delta }{2}|\nabla \varvec{w}|^2 + \frac{1}{2\varepsilon }|\varvec{P}_h(\varvec{w}-\nabla u)|^2 + \frac{1}{2\varDelta t} |u|^2 \right.\\&\qquad \quad \left. -\frac{1}{2c_1} |\nabla _{\mathbb F } G_-(\varvec{w}_h^n)|^2 -\frac{c_1}{2}|\varvec{w}|^2 - \frac{1}{\varDelta t}|u_h^n|^2-\frac{1}{4\varDelta t}|u|^2 \right)dx \\&\ge \int _{\varOmega } \left( \frac{\delta }{4}|\nabla \varvec{w}|^2 + \frac{1}{2\varepsilon }|\varvec{P}_h(\varvec{w}-\nabla u)|^2 + \frac{1}{4\varDelta t} |u|^2 -\beta \right)dx, \end{aligned}$$

where $\beta $ is a constant depending only on $\varvec{w}_h^n$ and $u_h^n$. Let $c_2>1$ be a constant satisfying

$$\begin{aligned} \frac{c_2 - 1}{2\varepsilon } \Vert \varvec{P}_h\varvec{w}\Vert ^2 \le \frac{\delta }{8} \Vert \nabla \varvec{w}\Vert ^2 \quad \mathrm{for\; all }\ \varvec{w}\in S_h^2. \end{aligned}$$

Again, this is possible by the stability of the $L^2$ projection $\varvec{P}_h$ and the Poincaré inequality. Clearly, $c_2$ is independent of the mesh size $h$. Then, since $\varvec{P}_h\nabla u = \nabla u$ for all $u\in S_h$,

$$\begin{aligned} F^{n+1}(u, \varvec{w})&\ge \int _{\varOmega } \left( \frac{\delta }{4}|\nabla \varvec{w}|^2 + \frac{1}{2\varepsilon }(|\varvec{P}_h\varvec{w}|^2 - 2\varvec{P}_h\varvec{w}\cdot \nabla u + |\nabla u|^2) \right.\\&\qquad \quad \left.+\frac{1}{4\varDelta t} |u|^2 -\beta \right)dx, \\&\ge \int _{\varOmega } \left(\frac{\delta }{4}|\nabla \varvec{w}|^2 + \frac{1}{2\varepsilon }\left((1-c_2)|\varvec{P}_h\varvec{w}|^2 + \left(1-\frac{1}{c_2}\right)|\nabla u|^2\right)\right.\\&\qquad \quad \left.+ \frac{1}{4\varDelta t} |u|^2 -\beta \right)dx, \\&\ge \int _{\varOmega } \left(\frac{\delta }{8}|\nabla \varvec{w}|^2 + \frac{c_2-1}{2\varepsilon c_2} |\nabla u|^2 + \frac{1}{4\varDelta t} |u|^2 -\beta \right)\, dx. \end{aligned}$$

This completes the proof of coercivity.

Next, we prove that the functional $F^{n+1}(u, \varvec{w})$ is strictly convex on $S_h\times S_h^2$, then the minimization problem (13) admits a unique solution. This can be done by showing that the Hessian of $F^{n+1}$ is positively definite. Indeed, for any $(v,\, \varvec{\varphi })\in S_h\times S_h^2$,

$$\begin{aligned}&\nabla _{\mathbb F } F^{n+1}(u,\varvec{w}) \begin{pmatrix}v\\ \varvec{\varphi }\end{pmatrix} = \frac{d}{dk} F^{n+1}(u+kv, \varvec{w}+k\varvec{\varphi })\bigg |_{k=0} \\&\quad = \int _{\varOmega } \left[ \nabla _{\mathbb F }G_+(\varvec{w})\cdot \varvec{\varphi }+ \delta \nabla \varvec{w}\cdot \nabla \varvec{\varphi }+ \frac{1}{\varepsilon } \varvec{P}_h(\varvec{w}-\nabla u)\cdot \varvec{P}_h(\varvec{\varphi }-\nabla v)\right.\\&\qquad \qquad \quad \left.+ \frac{1}{\nabla t} uv + \nabla _{\mathbb F } G_-(\varvec{w}_h^n)\cdot \varvec{\varphi }- \frac{1}{\varDelta t}u_h^n v\right] dx. \end{aligned}$$

Therefore

$$\begin{aligned}&\left[\nabla _{\mathbb F }^2F^{n+1}(u, \varvec{w}) \begin{pmatrix}v\\ \varvec{\varphi }\end{pmatrix} \right] \begin{pmatrix}v\\ \varvec{\varphi }\end{pmatrix} = \frac{d}{dk} \left( \nabla _{\mathbb F } F^{n+1}(u+kv, \varvec{w}+ k\varvec{\varphi }) \begin{pmatrix}v\\ \varvec{\varphi }\end{pmatrix} \right)\bigg |_{k=0} \\&\quad = \frac{d}{dk} \int _{\varOmega } \left[ \nabla _{\mathbb F }G_+(\varvec{w}+k\varvec{\varphi })\cdot \varvec{\varphi }+ \delta \nabla (\varvec{w}+k\varvec{\varphi })\cdot \nabla \varvec{\varphi }\right.\\&\qquad \qquad \qquad + \frac{1}{\varepsilon } \varvec{P}_h\left(\varvec{w}-\nabla u + k(\varvec{\varphi }-\nabla v)\right)\cdot \varvec{P}_h(\varvec{\varphi }-\nabla v) \\&\qquad \qquad \qquad +\left. \frac{1}{\nabla t} (u+kv)v + \nabla _{\mathbb F } G_-(\varvec{w}_h^n)\cdot \varvec{\varphi }- \frac{1}{\varDelta t}u_h^n v\right]\,dx \bigg |_{k=0} \\&\quad =\int _{\varOmega }\left[ \varvec{\varphi }\cdot \nabla _{\mathbb F }^2G_+(\varvec{w})\cdot \varvec{\varphi }+ \delta |\nabla \varvec{\varphi }|^2 + \frac{1}{\varepsilon } |\varvec{P}_h(\varvec{\varphi }-\nabla v)|^2 + \frac{1}{\varDelta t}v^2 \right]\,dx \\&\quad \ge \int _{\varOmega }\left[ \delta |\nabla \varvec{\varphi }|^2 + \frac{1}{\varepsilon } \left((1-c_3)|\varvec{P}_h\varvec{\varphi }|^2+\left(1-\frac{1}{c_3}\right)|\nabla v|^2\right) + \frac{1}{\varDelta t}v^2 \right]\,dx \\&\quad \ge \int _{\varOmega }\left[ \frac{\delta }{2} |\nabla \varvec{\varphi }|^2 +\frac{c_3-1}{\varepsilon c_3}|\nabla v|^2 + \frac{1}{\varDelta t}v^2 \right]\,dx \\&\quad \ge \min \left(\frac{\delta }{2}, \frac{c_3-1}{\varepsilon c_3}\right) (\Vert \nabla \varvec{\varphi }\Vert ^2 + \Vert \nabla v\Vert ^2), \end{aligned}$$

where $c_3>1$ is a constant satisfying

$$\begin{aligned} \frac{c_3-1}{\varepsilon } \Vert \varvec{P}_h\varvec{\varphi }\Vert ^2 \le \frac{\delta }{2} \Vert \nabla \varvec{\varphi }\Vert ^2 \quad \mathrm{for\; all }\ \varvec{\varphi }\in S_h^2. \end{aligned}$$

Finally, by applying the Poincaré inequality, we have shown that $F^{n+1}(u, \varvec{w})$ is strictly convex. This completes the proof of the theorem.$\square $

Next, we show that the numerical scheme is time-stable. Let $(u_h^n, \varvec{w}_h^n, \varvec{\lambda }_h^n)$ be the solution to problem (11). Define an “energy” functional

$$\begin{aligned} E^n&= \int _{\varOmega } \left( G(\varvec{w}_h^n)+ \frac{\delta }{2}|\nabla \varvec{w}_h^n|^2 + \frac{1}{2\varepsilon } |\varvec{P}_h(\varvec{w}_h^n - \nabla u_h^n)|^2 \right)dx \\&= \int _{\varOmega } \left( G(\varvec{w}_h^n)+ \frac{\delta }{2}|\nabla \varvec{w}_h^n|^2 + \frac{\varepsilon }{2} |\varvec{\lambda }_h^n|^2 \right)dx. \end{aligned}$$

Note that by definition, $G(\varvec{w}_h^n)$ is always non-negative in the case with slope selection, but is negative in the case without slope selection. However, even when $G(\varvec{w}_h^n)$ is negative, as long as $E^n$ satisfies certain conditions, it can still be considered an “energy” functional and thus be used to prove the time-stability. We shall explain this in details below. For the case with slope selection, clearly,

$$\begin{aligned} E^n\ge \int _{\varOmega } \left( \frac{\delta }{2}|\nabla \varvec{w}_h^n|^2 + \frac{\varepsilon }{2} |\varvec{\lambda }_h^n|^2 \right)\, dx. \end{aligned}$$

(14)

Now consider the case without slope selection. Using elementary Calculus, one can show that for any constant $c>0$ and $x\ge 0$,

$$\begin{aligned} c x - \frac{1}{2}\ln (1+x) \ge {\left\{ \begin{array}{ll} \frac{1}{2}-c+\frac{1}{2}\ln (2c) >\frac{1}{2}\ln (2c)&\text{ for}\ 0<c<\frac{1}{2} \\ 0&\text{ for}\ c\ge \frac{1}{2} \end{array}\right.}\!\!. \end{aligned}$$

Set $c = \delta /4$, then the “energy” functional $E^n$ for the case without slope selection satisfies

$$\begin{aligned} E^n&\ge |\varOmega | \min \left\{ \frac{1}{2}\ln \frac{\delta }{2}, 0\right\} + \int _{\varOmega } \left( \frac{\delta }{4}|\nabla \varvec{w}_h^n|^2 + \frac{\varepsilon }{2} |\varvec{\lambda }_h^n|^2 \right)dx \nonumber \\&\ge \int _{\varOmega } \left( \frac{\delta }{4}|\nabla \varvec{w}_h^n|^2 + \frac{\varepsilon }{2} |\varvec{\lambda }_h^n|^2 \right)dx - C_{\delta }, \end{aligned}$$

(15)

where $|\varOmega |$ is the measure of domain $\varOmega $, and $C_{\delta }$ is a non-negative constant that only depends on $\delta $ and $\varOmega $. We point out that such an observation has been used before in [43] to define a similar “energy” functional for thin film epitaxy in the primal formulation. Next, we prove that the “energy” functional $E^n$ is non-increasing and thus the numerical scheme (11) is time-stable.

Theorem 3.2

The energy functional $E^{n}$ is non-increasing in time. Indeed,

$$\begin{aligned} E^{n+1} \le E^n - \frac{1}{2\varDelta t} \Vert u_h^{n+1}-u_h^n\Vert ^2. \end{aligned}$$

(16)

Consequently,

$$\begin{aligned} \Vert u_h^n\Vert _{H^1(\varOmega )}^2 + \Vert \varvec{w}_h^n\Vert _{H^1(\varOmega )}^2 + \varepsilon \Vert \varvec{\lambda }_h^n\Vert ^2 \le C, \end{aligned}$$

(17)

where $C$ is a positive constant depending on $\varOmega , \delta $ and $E^0$, but not on $h, n$ or $\varDelta t$.

Proof

Since $F^{n+1}(u_h^{n+1},\varvec{w}_h^{n+1}) \le F^{n+1}(u_h^{n},\varvec{w}_h^{n})$, we have

$$\begin{aligned}&E^{n+1} + \int _{\varOmega } \left( -G_-(\varvec{w}_h^{n+1}) + \frac{1}{2\varDelta t}|u_h^{n+1}|^2 + \nabla _{\mathbb F }G_-(\varvec{w}_h^{n})\cdot \varvec{w}_h^{n+1}\right.-\left. \frac{1}{\varDelta t} u_h^n u_h^{n+1} \right)dx \\&\quad \le E^n + \int _{\varOmega } \left( -G_-(\varvec{w}_h^{n}) + \frac{1}{2\varDelta t}|u_h^{n}|^2 + \nabla _{\mathbb F }G_-(\varvec{w}_h^{n})\cdot \varvec{w}_h^{n} - \frac{1}{\varDelta t} |u_h^n|^2 \right)dx. \end{aligned}$$

Then, Inequality (16) follows from

$$\begin{aligned}&\int _{\varOmega } (G_-(\varvec{w}_h^{n+1})-G_-(\varvec{w}_h^{n}) - \nabla _{\mathbb F }G_-(\varvec{w}_h^{n})\cdot (\varvec{w}_h^{n+1}-\varvec{w}_h^n) )\, dx \\&\quad = \int _{\varOmega } ( (\nabla _{\mathbb F }G_-(\varvec{w}_h^{n}+ s_1(\varvec{w}_h^{n+1}-\varvec{w}_h^n)) - \nabla _{\mathbb F }G_-(\varvec{w}_h^{n}))\cdot (\varvec{w}_h^{n+1}-\varvec{w}_h^n))\, dx \\&\quad = \int _{\varOmega } s_1 (\varvec{w}_h^{n+1}-\varvec{w}_h^n) \cdot \nabla _{\mathbb F }^2 G_-(\varvec{w}_h^{n}+ s_2(\varvec{w}_h^{n+1}-\varvec{w}_h^n)) \cdot (\varvec{w}_h^{n+1}-\varvec{w}_h^n) \, dx \\&\quad \le 0, \end{aligned}$$

where $0\le s_2\le s_1\le 1$ are constants from the mean-value theorem.

For (17), by using (14) and (15), we only need to prove that $\Vert u_h^n\Vert _{H^1(\varOmega )}$ is bounded. This is because

$$\begin{aligned} \Vert u_h^n\Vert _{H^1(\varOmega )}^2 \le C \Vert \nabla u_h^n\Vert ^2 = C\Vert \varvec{P}_h\varvec{w}_h^n - \varepsilon \varvec{\lambda }_h^n\Vert ^2 \le C\Vert \varvec{w}_h^n\Vert ^2 + C\varepsilon ^2 \Vert \varvec{\lambda }_h^n\Vert ^2. \end{aligned}$$

As long as $\varepsilon \le O(1)$, Inequality (17) is true. $\square $

4 Convergence

In this section, we analyze the convergence rate of the fully-discrete, mixed finite element approximation (11). Notice that the well-posedness and time-stability results in the previous section is proved for arbitrary convex–concave decomposition $G = G_+ + G_-$. However, for the convergence rate, so far we limit our analysis for the special decomposition defined in (4). Convergence rate analysis for general convex–concave decomposition can be non-trivial.

Let $(u,\varvec{w},\lambda )$ be the solution to (8) and $(u_h^n,\varvec{w}_h^n,\varvec{\lambda }_h^n)$ be the solution to (11). Denote $u^n=u(\cdot ,t_n), \varvec{w}^n=\varvec{w}(\cdot ,t_n)$ and $\varvec{\lambda }^n=\varvec{\lambda }(\cdot ,t_n)$. Define the error terms

$$\begin{aligned} \underline{u}^n = u^n-u_h^n,\quad \underline{\varvec{w}}^n = \varvec{w}^n - \varvec{w}_h^n, \quad \underline{\varvec{\lambda }}^n = \varvec{\lambda }^n - \varvec{\lambda }_h^n. \end{aligned}$$

By subtracting (11) from (8), we have

$$\begin{aligned} {\left\{ \begin{array}{ll} \delta (\nabla \underline{\varvec{w}}^{n+1}, \nabla \varvec{\varphi }_h) + (\nabla _{\mathbb F } G_+(\varvec{w}^{n+1})-\nabla _{\mathbb F } G_+(\varvec{w}_h^{n+1}), \\ \quad \varvec{\varphi }_h)+ (\underline{\varvec{\lambda }}^{n+1},\varvec{\varphi }_h)= -(\nabla _{\mathbb F } G_-(\varvec{w}^{n+1})-\nabla _{\mathbb F } G_-(\varvec{w}_h^{n}), \varvec{\varphi }_h)&\mathrm{for\; all }\ \varvec{\varphi }_h\in S_h^2,\\ (\partial _tu^{n+1} - \frac{u_h^{n+1}-u_h^n}{\varDelta t}, v_h) - (\underline{\varvec{\lambda }}^{n+1}, \nabla v_h) = 0&\mathrm{for\; all }\ v_h\in S_h, \\ -\varepsilon (\varvec{\lambda }_h^{n+1}, \varvec{\mu }_h)-(\underline{\varvec{w}}^{n+1} - \nabla \underline{u}^{n+1}, \varvec{\mu }_h) = 0,&\mathrm{for\; all }\ \varvec{\mu }_h\in \varvec{Q}_h. \end{array}\right.}\nonumber \\ \end{aligned}$$

(18)

We first introduce several technical lemmas. For simplicity, use $C$ to denote a general positive constant that depends only on $\delta , C_G, m, \varOmega , T, \Vert u\Vert _{L^{\infty }(0,T;H^3(\varOmega ))}, \Vert \varvec{w}\Vert _{L^{\infty }(0,T;H^2(\varOmega ))}, \Vert \varvec{\lambda }\Vert _{L^{\infty }(0,T;H^1(\varOmega ))}$ and $E_0$.

Lemma 4.1

$\Vert \nabla _{\mathbb F } G_+(\varvec{w}^{n})-\nabla _{\mathbb F } G_+(\varvec{w}_h^{n})\Vert \le C \Vert \nabla \underline{\varvec{w}}^{n}\Vert . $

Proof

In two-dimension, $H^1(\varOmega )$ is continuously embedded in the Hölder space $L^q(\varOmega )$ for all $1\le q< \infty $. By Assumption (3), the Sobolev embedding theorem, Poincaré inequality, and Inequality (17),

$$\begin{aligned}&\Vert \nabla _{\mathbb F } G_+(\varvec{w}^{n})-\nabla _{\mathbb F } G_+(\varvec{w}_h^{n})\Vert \\&\quad \le \left\Vert\max _{0\le s\le 1} |\nabla _{\mathbb F }^2 G_+ (s\varvec{w}^{n}+(1-s)\varvec{w}_h^{n}) |\, |\underline{\varvec{w}}^n| \right\Vert \\&\quad \le C \left\Vert (1+|\varvec{w}^{n}|^m+|\varvec{w}_h^{n}|^m) |\underline{\varvec{w}}^n| \right\Vert \\&\quad \le C(1+\Vert \varvec{w}^{n}\Vert _{L^{4m}(\varOmega )}^m+\Vert \varvec{w}_h^{n}\Vert _{L^{4m}(\varOmega )}^m ) \Vert \underline{\varvec{w}}^n\Vert _{L^4(\varOmega )} \\&\quad \le C(1+\Vert \varvec{w}^{n}\Vert _{H^1(\varOmega )}^m+\Vert \varvec{w}_h^{n}\Vert _{H^1(\varOmega )}^m ) \Vert \nabla \underline{\varvec{w}}^{n}\Vert \\&\quad \le C(1+\Vert \varvec{w}\Vert _{L^{\infty }(0,T;H^1(\varOmega ))}^m) \Vert \nabla \underline{\varvec{w}}^{n}\Vert . \end{aligned}$$

Here $\varvec{w}$ is the solution to the mixed problem (8). By the regularity assumption (2), $\Vert \varvec{w}\Vert _{L^{\infty }(0,T;H^1(\varOmega ))}$ is bounded as long as $u_0\in H^3(\varOmega )$. This completes the proof of the lemma. $\square $

Lemma 4.2

$\Vert \nabla \underline{u}^n\Vert \le \Vert \underline{\varvec{w}}^n\Vert + C(h+\sqrt{\varepsilon })$.

Proof

Note that by definition, $\nabla u^n = \varvec{w}^n$. By Eq. (11) and the fact that $\varvec{\lambda }_h^n\in \varvec{Q}_h, \nabla u_h^n\in \varvec{Q}_h$, we have

$$\begin{aligned} \varvec{0}= \varvec{P}_h(\varepsilon \varvec{\lambda }_h^n -\varvec{w}_h^n + \nabla u_h^n) = \varepsilon \varvec{\lambda }_h^n - \varvec{P}_h \varvec{w}_h^n + \nabla u_h^n. \end{aligned}$$

Combining the above and using the triangle inequality, the property of the $L^2$ orthogonal projection $\varvec{P}_h$, and Inequality (17),

$$\begin{aligned} \Vert \nabla \underline{u}^n\Vert&= \Vert \nabla u^n - \nabla u_h^n\Vert = \Vert \varvec{w}^n - (\varvec{P}_h\varvec{w}_h^n - \varepsilon \varvec{\lambda }_h^n)\Vert \\&\le \Vert \underline{\varvec{w}}^n\Vert + \Vert (I-\varvec{P}_h)\varvec{w}_h^n\Vert + \varepsilon \Vert \varvec{\lambda }_h^n\Vert \le \Vert \underline{\varvec{w}}^n\Vert + C(h+\sqrt{\varepsilon }). \end{aligned}$$

$\square $

Lemma 4.3

Let $G_-$ be defined as in (4), then

$$\begin{aligned}&\frac{1}{4}(3\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 - \Vert \nabla \underline{\varvec{w}}^{n}\Vert ^2)+ \frac{\varepsilon }{2\delta } \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 \\&\quad \le C\left(h^2 + \frac{h^4}{\varepsilon } + \varepsilon \right) + C\varDelta t \int _{t_n}^{t_{n+1}} \Vert \varvec{w}_t(\cdot ,s)\Vert ^2\,ds + C \Vert \underline{u}^{n+1}\Vert ^2 \\&\qquad - \frac{2}{\delta }(\nabla \underline{u}^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}). \end{aligned}$$

Proof

For all $\varvec{\psi }_h\in S_h^2$,

$$\begin{aligned}&\Vert \nabla (\varvec{w}^{n+1}-\varvec{\psi }_h)\Vert ^2 \\&\quad = \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + \Vert \nabla (\varvec{w}_h^{n+1}-\varvec{\psi }_h)\Vert ^2 + 2(\nabla \underline{\varvec{w}}^{n+1}, \nabla (\varvec{w}_h^{n+1}-\varvec{\psi }_h)). \end{aligned}$$

By setting $\varvec{\psi }_h$ to be the nodal value interpolation of $\varvec{w}^{n+1}$ and the test function $\varvec{\varphi }_h=\varvec{w}_h^{n+1}-\varvec{\psi }_h$ in (18), we have

$$\begin{aligned}&\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 \nonumber \\&\quad \le \Vert \nabla (\varvec{w}^{n+1}-\varvec{\psi }_h)\Vert ^2 - 2(\nabla \underline{\varvec{w}}^{n+1}, \nabla (\varvec{w}_h^{n+1}-\varvec{\psi }_h))\nonumber \\&\quad \le Ch^2 + \frac{2}{\delta }(\nabla _{\mathbb F } G_+(\varvec{w}^{n+1})-\nabla _{\mathbb F } G_+(\varvec{w}_h^{n+1}), \varvec{w}_h^{n+1}-\varvec{\psi }_h) \nonumber \\&\qquad + \frac{2}{\delta }(\underline{\varvec{\lambda }}^{n+1},\varvec{w}_h^{n+1}-\varvec{\psi }_h) + \frac{2}{\delta }(\nabla _{\mathbb F } G_-(\varvec{w}^{n+1})-\nabla _{\mathbb F } G_-(\varvec{w}_h^{n}), \varvec{w}_h^{n+1}-\varvec{\psi }_h) \nonumber \\&\quad \le Ch^2 + \frac{2}{\delta }(\nabla _{\mathbb F } G_+(\varvec{w}^{n+1})-\nabla _{\mathbb F } G_+(\varvec{w}_h^{n+1}), \varvec{w}^{n+1}-\varvec{\psi }_h) \nonumber \\&\qquad + \frac{2}{\delta }(\underline{\varvec{\lambda }}^{n+1}, \varvec{w}_h^{n+1}-\varvec{\psi }_h) + \frac{2}{\delta }(\nabla _{\mathbb F } G_-(\varvec{w}^{n+1})-\nabla _{\mathbb F } G_-(\varvec{w}_h^{n}), \varvec{w}_h^{n+1}-\varvec{\psi }_h) \nonumber \\&\quad = Ch^2 + I_1 + I_2 + I_3, \end{aligned}$$

(19)

where the second last step follows from the fact that $G_+$ is convex.

For $I_1$, by Lemma 4.1, we have

$$\begin{aligned} I_1&\le C \Vert \nabla _{\mathbb F } G_+(\varvec{w}^{n+1})-\nabla _{\mathbb F } G_+(\varvec{w}_h^{n+1})\Vert \,\Vert \varvec{w}^{n+1}-\varvec{\psi }_h\Vert \nonumber \\&\le C\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert \, \Vert \varvec{w}^{n+1}-\varvec{\psi }_h\Vert \nonumber \\&\le Ch^2. \end{aligned}$$

(20)

Now we consider $I_2$. By the triangle inequality, Schwarz inequality, Poincaré inequality, and the Young’s inequality, we have

$$\begin{aligned}&(\underline{\varvec{\lambda }}^{n+1}, \varvec{w}_h^{n+1}-\varvec{\psi }_h) = (\varvec{\lambda }^{n+1} - \varvec{P}_h\varvec{\lambda }^{n+1}, \varvec{w}_h^{n+1}-\varvec{\psi }_h) \\&\qquad + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \varvec{w}_h^{n+1} - \varvec{w}^{n+1}) + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \varvec{w}^{n+1}-\varvec{\psi }_h) \\&\quad \le \Vert (I-\varvec{P}_h)\varvec{\lambda }^{n+1}\Vert (\Vert \varvec{w}_h^{n+1}-\varvec{w}^{n+1}\Vert + \Vert \varvec{w}^{n+1}-\varvec{\psi }_h\Vert ) \\&\qquad + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \varvec{w}_h^{n+1} - \varvec{w}^{n+1}) + \frac{\varepsilon }{4} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 + \frac{1}{\varepsilon } \Vert \varvec{w}^{n+1}-\varvec{\psi }_h\Vert ^2 \\&\quad \le \frac{\delta }{16} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + \frac{1}{2}\Vert \varvec{w}^{n+1}-\varvec{\psi }_h\Vert ^2 + C \Vert (I-\varvec{P}_h)\varvec{\lambda }^{n+1}\Vert ^2 \\&\qquad + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \varvec{w}_h^{n+1} - \varvec{w}^{n+1}) + \frac{\varepsilon }{4} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 + \frac{1}{\varepsilon } \Vert \varvec{w}^{n+1}-\varvec{\psi }_h\Vert ^2 \\&\quad \le \frac{\delta }{16} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C\left(h^2 + \frac{h^4}{\varepsilon }\right) + \frac{\varepsilon }{4} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 \\&\qquad + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \varvec{w}_h^{n+1} - \varvec{w}^{n+1}). \end{aligned}$$

By setting $\varvec{\mu }_h = \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}$ in (18), we have

$$\begin{aligned}&(\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \varvec{w}_h^{n+1} - \varvec{w}^{n+1}) \\&\quad = \varepsilon (\varvec{\lambda }_h^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}) -(\nabla \underline{u}^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}) \\&\quad =-\varepsilon \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 + \varepsilon (\varvec{P}_h\varvec{\lambda }^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}) - (\nabla \underline{u}^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}) \\&\quad \le -\frac{\varepsilon }{2} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 + \frac{\varepsilon }{2}\Vert \varvec{P}_h\varvec{\lambda }^{n+1}\Vert ^2 - (\nabla \underline{u}^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}) \\&\quad \le -\frac{\varepsilon }{2} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 + C\varepsilon - (\nabla \underline{u}^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}). \end{aligned}$$

Combining the above, we have

$$\begin{aligned} I_2&\le \frac{1}{8} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C\left(h^2 + \frac{h^4}{\varepsilon } + \varepsilon \right) - \frac{\varepsilon }{2\delta } \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 - \frac{2}{\delta }(\nabla \underline{u}^{n+1}, \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}).\nonumber \\ \end{aligned}$$

(21)

Finally, we consider $I_3$. The analysis of $I_3$ depends on the definition of $G_-$. It is not trivial to get an upper bound of $I_3$ when $G_-$ satisfying only (3), without making further assumptions. However, if $G_-$ is defined as in (4), then $\nabla _{\mathbb F } G_-(\varvec{w}) = -\varvec{w}$ and the analysis is given as following:

$$\begin{aligned} I_3&= C (\varvec{w}_h^n - \varvec{w}^{n+1}, \varvec{w}_h^{n+1}-\varvec{\psi }_h) \\&= C( (\varvec{w}^n - \varvec{w}^{n+1}, \varvec{w}^{n+1}-\varvec{\psi }_h) - (\varvec{w}^n - \varvec{w}^{n+1}, \underline{\varvec{w}}^{n+1}) \\&\quad + (\underline{\varvec{w}}^n, \underline{\varvec{w}}^{n+1}) - (\underline{\varvec{w}}^n, \varvec{w}^{n+1}-\varvec{\psi }_h) )\\&\le Ch^2+ C(\varvec{w}^{n+1} - \varvec{w}^{n}, \underline{\varvec{w}}^{n+1}) + C\Vert \underline{\varvec{w}}^n\Vert \, \Vert \varvec{w}^{n+1}-\varvec{\psi }_h\Vert \\&+ C ((I-\varvec{P}_h)\underline{\varvec{w}}^n, \underline{\varvec{w}}^{n+1}) + C (\varvec{P}_h\underline{\varvec{w}}^n, \underline{\varvec{w}}^{n+1}) \\&\le Ch^2 + C (\varvec{w}^{n+1} - \varvec{w}^{n}, \underline{\varvec{w}}^{n+1}) + \frac{1}{16} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C (\varvec{P}_h\underline{\varvec{w}}^n, \underline{\varvec{w}}^{n+1}). \end{aligned}$$

Notice that by (18), the triangle inequality, Schwarz inequality, Poincaré inequality, Young’s inequality, Theorem 3.2 and Lemma 4.2,

$$\begin{aligned}&C (\varvec{P}_h\underline{\varvec{w}}^n, \underline{\varvec{w}}^{n+1})\\&\quad =C( (\nabla \underline{u}^{n+1}, \varvec{P}_h\underline{\varvec{w}}^n) - \varepsilon (\varvec{\lambda }_h^{n+1}, \varvec{P}_h \underline{\varvec{w}}^n) )\\&\quad =C( (\nabla \underline{u}^{n+1}, \underline{\varvec{w}}^n) - (\nabla \underline{u}^{n+1}, (I-\varvec{P}_h) \underline{\varvec{w}}^n) - \varepsilon (\varvec{\lambda }_h^{n+1}, \underline{\varvec{w}}^n) ) \\&\quad =C( -(\underline{u}^{n+1}, \nabla \cdot \underline{\varvec{w}}^n) - (\nabla \underline{u}^{n+1}, (I-\varvec{P}_h) \underline{\varvec{w}}^n) - \varepsilon (\varvec{\lambda }_h^{n+1}, \underline{\varvec{w}}^n) ) \\&\quad \le \frac{1}{4} \Vert \nabla \underline{\varvec{w}}^n\Vert ^2 + C \Vert \underline{u}^{n+1}\Vert ^2 + C \varepsilon ^2 \Vert \varvec{\lambda }_h^{n+1}\Vert ^2 \\&\qquad + C( \Vert \underline{\varvec{w}}^{n+1}\Vert + C(h+\sqrt{\varepsilon }) )\Vert (I-\varvec{P}_h) \underline{\varvec{w}}^n\Vert \\&\quad \le \frac{1}{4} \Vert \nabla \underline{\varvec{w}}^n\Vert ^2 + C \Vert \underline{u}^{n+1}\Vert ^2 + \frac{1}{32}\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C(h^2 + \varepsilon ). \end{aligned}$$

Also,

$$\begin{aligned} C (\varvec{w}^{n+1} - \varvec{w}^{n}, \underline{\varvec{w}}^{n+1})&= C \left(\int _{t_n}^{t_{n+1}} \varvec{w}_t(\cdot ,s)\,ds, \, \underline{\varvec{w}}^{n+1}\right)\\&\le C \int _{t_n}^{t_{n+1}} \Vert \varvec{w}_t(\cdot ,s)\Vert \,\Vert \underline{\varvec{w}}^{n+1}\Vert \, ds \\&\le \frac{1}{32}\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C\varDelta t \int _{t_n}^{t_{n+1}} \Vert \varvec{w}_t(\cdot ,s)\Vert ^2\, ds. \end{aligned}$$

Combining the above,

$$\begin{aligned} I_3&\le C \varDelta t \int _{t_n}^{t_{n+1}} \Vert \varvec{w}_t(\cdot ,s)\Vert ^2\, ds + \frac{1}{4} \Vert \nabla \underline{\varvec{w}}^n\Vert ^2 + \frac{1}{8} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 \nonumber \\&+ C \Vert \underline{u}^{n+1}\Vert ^2 + C(h^2 + \varepsilon ). \end{aligned}$$

(22)

Combining (19), (20), (21), (22), we have proved the lemma.$\square $

Finally, we are able to prove the main result of this section. The following discrete Gronwall’s inequality will be needed [34]: let $y^n, a^n, b^n, c^n$ , be non-negative sequences satisfying

$$\begin{aligned} y^n + \varDelta t \sum _{i=1}^n a^i \le y^0 + \varDelta t \sum _{i=1}^n (b^iy^i+c^i) \end{aligned}$$

with $\varDelta t b^i <1$, then

$$\begin{aligned} y^n + \varDelta t \sum _{i=1}^n a^i \le e^{C_b\varDelta t \sum _{i=1}^n b^i} \left(\varDelta t \sum _{i=1}^n c^i + y^0\right), \end{aligned}$$

where $C_b=\max _{0\le i\le n} (1-\varDelta t b^i)^{-1}$.

Theorem 4.4

Let $G_-$ be defined as in (4). Then there exists a constant $C$ independent of $h$ or $\varepsilon $ such that for $O(h^2)\le \varDelta t \le C$,

$$\begin{aligned}&\Vert \underline{u}^n\Vert ^2 + \sum _{i=1}^n \varDelta t \Vert \nabla \underline{\varvec{w}}^i\Vert ^2 + \sum _{i=1}^n \varDelta t \varepsilon \Vert \varvec{P}_h \underline{\varvec{\lambda }}^i\Vert ^2 \nonumber \\&\quad \le C e^{C t_n } \left(\Vert \underline{u}^0\Vert ^2 + \varDelta t \Vert \nabla \underline{\varvec{w}}^0\Vert ^2 + \varDelta t \varepsilon \Vert \varvec{P}_h \underline{\varvec{\lambda }}^0\Vert ^2 + t_n \left(h^2+\frac{h^4}{\varepsilon } + \varepsilon \right) \right.\nonumber \\&\qquad \qquad \qquad \quad \left.+\varDelta t^2 \int _0^{t_n} (\Vert u_{tt}(\cdot ,s)\Vert ^2+\Vert \varvec{w}_t(\cdot ,s)\Vert ^2 ) \, ds \right)\!. \end{aligned}$$

(23)

Here again, all general constant $C$ may depend on $\delta , C_G, m, \varOmega , \Vert u\Vert _{L^{\infty }(0,T;H^3(\varOmega ))}, \Vert \varvec{w}\Vert _{L^{\infty }(0,T;H^2(\varOmega ))}, \Vert \varvec{\lambda }\Vert _{L^{\infty }(0,T;H^1(\varOmega ))}$ and $E_0$, but not on $h, \varDelta t$ or $\varepsilon $.

Proof

By setting $v_h = I_h \underline{u}^{n+1}$ in (18), we have

$$\begin{aligned} \frac{1}{\varDelta t} (\underline{u}^{n+1}-\underline{u}^n, I_h\underline{u}^{n+1})&= (\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})) - (\xi ^{n+1}, I_h \underline{u}^{n+1}) \nonumber \\&= (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})) - (\xi ^{n+1}, I_h \underline{u}^{n+1}), \end{aligned}$$

(24)

where the local truncation error $\xi ^{n+1}$ is

$$\begin{aligned} \xi ^{n+1}= \partial _t u^{n+1} - \frac{u^{n+1}-u^n}{\varDelta t} = \frac{1}{\varDelta t} \int _{t_n}^{t_{n+1}} (s-t_{n}) u_{tt}(\cdot ,s)\, ds. \end{aligned}$$

By the Schwarz inequality, Lemma 4.2, Poincaré inequality and Young’s inequality, it is not hard to see that

$$\begin{aligned}&(\xi ^{n+1}, I_h \underline{u}^{n+1})\\&\quad \le \frac{1}{\varDelta t} \int _{t_n}^{t_{n+1}} \Vert (s-t_{n}) u_{tt}(\cdot ,s)\Vert \Vert I_h \underline{u}^{n+1}\Vert \, ds \\&\quad \le \frac{C}{\varDelta t} \int _{t_n}^{t_{n+1}} \Vert (s-t_{n}) u_{tt}(\cdot ,s)\Vert ( \Vert \nabla \underline{u}^{n+1}\Vert + \Vert \underline{u}^{n+1}-I_h \underline{u}^{n+1}\Vert ) \, ds \\&\quad \le \frac{C}{\varDelta t} \int _{t_n}^{t_{n+1}} \Vert (s-t_{n}) u_{tt}(\cdot ,s)\Vert (\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert + (h+\sqrt{\varepsilon })) \, ds \\&\quad \le \frac{\delta }{8} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C(h^2+\varepsilon ) + \frac{C}{\varDelta t} \int _{t_n}^{t_{n+1}} \Vert (s-t_{n}) u_{tt}(\cdot ,s)\Vert ^2 \, ds \\&\quad \le \frac{\delta }{8} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C(h^2+\varepsilon ) + C\varDelta t \int _{t_n}^{t_{n+1}} \Vert u_{tt}(\cdot ,s)\Vert ^2 \, ds. \end{aligned}$$

Then by using the Schwarz inequality and the Young’s inequality,

$$\begin{aligned}&\frac{1}{2\varDelta t}(\Vert \underline{u}^{n+1}\Vert ^2 - \Vert \underline{u}^n\Vert ^2+ \Vert \underline{u}^{n+1}-\underline{u}^n\Vert ^2 ) \nonumber \\&\quad = \frac{1}{\varDelta t} (\underline{u}^{n+1}-\underline{u}^n, (I-I_h)\underline{u}^{n+1}) + \frac{1}{\varDelta t} (\underline{u}^{n+1}-\underline{u}^n, I_h\underline{u}^{n+1}) \nonumber \\&\quad = \frac{1}{\varDelta t} (\underline{u}^{n+1}-\underline{u}^n, (I-I_h)u^{n+1})+ \frac{1}{\varDelta t} (\underline{u}^{n+1}-\underline{u}^n, I_h\underline{u}^{n+1}) \nonumber \\&\quad \le \frac{Ch^2}{\varDelta t^2} (\Vert \underline{u}^n\Vert ^2 + \Vert \underline{u}^{n+1}\Vert ^2) + Ch^2 + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})) \nonumber \\&\qquad +\frac{\delta }{8} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C(h^2+\varepsilon ) + C\varDelta t \int _{t_n}^{t_{n+1}} \Vert u_{tt}(\cdot ,s)\Vert ^2 \, ds. \end{aligned}$$

(25)

Combine the above with Lemma 4.3, we have

$$\begin{aligned}&\frac{1}{2\varDelta t}(\Vert \underline{u}^{n+1}\Vert ^2 - \Vert \underline{u}^n\Vert ^2+ \Vert \underline{u}^{n+1}-\underline{u}^n\Vert ^2) \\&\qquad + \frac{\delta }{8}(2\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 - \Vert \nabla \underline{\varvec{w}}^{n}\Vert ^2) + \frac{\varepsilon }{4} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 \\&\quad \le \frac{Ch^2}{\varDelta t^2} (\Vert \underline{u}^n\Vert ^2 + \Vert \underline{u}^{n+1}\Vert ^2) + C\varDelta t \int _{t_n}^{t_{n+1}} \Vert u_{tt}(\cdot ,s)\Vert ^2 \, ds\\&\qquad +C\left(h^2 + \frac{h^4}{\varepsilon } + \varepsilon \right) + C\varDelta t \int _{t_n}^{t_{n+1}} \Vert \varvec{w}_t(\cdot ,s)\Vert ^2\, ds + C \Vert \underline{u}^{n+1}\Vert ^2 \\&\qquad + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})-\nabla \underline{u}^{n+1}) \end{aligned}$$

Since by Inequality (10) and the Young’s inequality

$$\begin{aligned} (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})-\nabla \underline{u}^{n+1})&= (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h u^{n+1}-u^{n+1})) \\&\le Ch^2\Vert u^{n+1}\Vert _{H^3} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ,\\&\le C\frac{h^4}{\varepsilon } + \frac{\varepsilon }{8} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2, \end{aligned}$$

then we can conclude that

$$\begin{aligned}&\frac{1}{2\varDelta t}(\Vert \underline{u}^{n+1}\Vert ^2 - \Vert \underline{u}^n\Vert ^2 )+ \frac{\delta }{8}(2\Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 - \Vert \nabla \underline{\varvec{w}}^{n}\Vert ^2) + \frac{\varepsilon }{8} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{n+1}\Vert ^2 \\&\quad \le C\left(1+\frac{h^2}{\varDelta t^2}\right)(\Vert \underline{u}^n\Vert ^2 + \Vert \underline{u}^{n+1}\Vert ^2)+ C\left(h^2 + \frac{h^4}{\varepsilon } + \varepsilon \right) \\&\qquad + C\varDelta t \int _{t_n}^{t_{n+1}} (\Vert u_{tt}(\cdot ,s)\Vert ^2+\Vert \varvec{w}_t(\cdot ,s)\Vert ^2) \, ds. \end{aligned}$$

Finally, by taking summation of the above inequality with respect to $n$, and setting

$$\begin{aligned} y^0 = \frac{1}{2}\Vert \underline{u}^0\Vert ^2 + \frac{\delta \varDelta t}{4} \Vert \nabla \underline{\varvec{w}}^0\Vert ^2 + \frac{\varepsilon \varDelta t}{8} \Vert \varvec{P}_h\underline{\varvec{\lambda }}^{0}\Vert ^2, \end{aligned}$$

and for $n\ge 1$,

$$\begin{aligned} y^n&= \frac{1}{2}\Vert \underline{u}^n\Vert ^2,\\ a^n&= \frac{\varepsilon }{8} \Vert \varvec{P}_h \underline{\varvec{\lambda }}^n\Vert ^2 + \frac{\delta }{8} \Vert \nabla \underline{\varvec{w}}^n\Vert ^2, \\ b^n&= C\left(1+\frac{h^2}{\varDelta t^2}\right), \\ c^n&= C\left(h^2 + \frac{h^4}{\varepsilon } + \varepsilon \right) + C\varDelta t \int _{t_n}^{t_{n+1}} (\Vert u_{tt}(\cdot ,s)\Vert ^2+\Vert \varvec{w}_t(\cdot ,s)\Vert ^2) \, ds, \end{aligned}$$

and using the Gronwall’s inequality, we get Inequality (23). Notice that in order to guarantee $\varDelta t b^n < 1$, we need $O(h^2)<\varDelta t < C$. However, the upper bound of $\varDelta t$ does not depend on $h$.$\square $

The constraint $O(h^2)\le \varDelta t$ is unusual, since most numerical schemes performs better when fixing $h$ and decreasing $\varDelta t$. Indeed, if we assume further regularity of the solution, then this condition can be dropped.

Theorem 4.5

Let $G_-$ be defined as in (4) and assume $u_t\in L^{\infty }(0,T; H^{1+s}(\varOmega ))$, where $s>0$. Then there exists a constant $C$ independent of $h$ or $\varepsilon $ such that for all $\varDelta t\le C$,

$$\begin{aligned}&\Vert \underline{u}^n\Vert ^2 + \sum _{i=1}^n \varDelta t \Vert \nabla \underline{\varvec{w}}^i\Vert ^2 + \sum _{i=1}^n \varDelta t \varepsilon \Vert \varvec{P}_h \underline{\varvec{\lambda }}^i\Vert ^2 \\&\quad \le C e^{C t_n } \left(\Vert \underline{u}^0\Vert ^2 + \varDelta t \Vert \nabla \underline{\varvec{w}}^0\Vert ^2 + \varDelta t \varepsilon \Vert \varvec{P}_h \underline{\varvec{\lambda }}^0\Vert ^2 + t_n \left(h^2+\frac{h^4}{\varepsilon } + \varepsilon \right) \right.\\&\qquad \qquad \qquad \left.+ \varDelta t^2 \int _0^{t_n} (\Vert I_h u_{tt}(\cdot ,s)\Vert ^2+\Vert \varvec{w}_t(\cdot ,s)\Vert ^2 ) \, ds \right). \end{aligned}$$

The general constant in the above inequality may depend on all parameters mentioned as in Theorem 4.4 plus $\Vert u_t\Vert _{L^{\infty }(0,T; H^{1+s}(\varOmega ))}$, but not on $h, \varDelta t$, or $\varepsilon $.

Proof

Similar to Eq. (24) in the beginning of the proof for Theorem 4.4, we have

$$\begin{aligned}&\frac{1}{\varDelta t} (I_h \underline{u}^{n+1}-I_h \underline{u}^n, I_h\underline{u}^{n+1})\\&\quad = \frac{1}{\varDelta t} (I_h (u^{n+1}-u^n), I_h\underline{u}^{n+1}) - \frac{1}{\varDelta t}(u_h^{n+1}-u_h^n, \nabla (I_h\underline{u}^{n+1}))\\&\quad =(I_h (\partial _t u^{n+1} - \xi ^{n+1}), I_h\underline{u}^{n+1}) - (\varvec{\lambda }_h^{n+1}, \nabla (I_h\underline{u}^{n+1})) \\&\quad =(\varvec{\lambda }^{n+1}, \nabla (I_h\underline{u}^{n+1})) - ((I-I_h)\partial _t u^{n+1}, I_h\underline{u}^{n+1}) - (I_h\xi ^{n+1}, I_h \underline{u}^{n+1}) \\&\qquad - (\varvec{\lambda }_h^{n+1}, \nabla (I_h\underline{u}^{n+1})) \\&\quad = (\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})) - (I_h\xi ^{n+1}, I_h \underline{u}^{n+1}) - ((I-I_h)\partial _t u^{n+1}, I_h\underline{u}^{n+1})\\&\quad = (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})) - (I_h\xi ^{n+1}, I_h \underline{u}^{n+1}) - ((I-I_h)\partial _t u^{n+1}, I_h\underline{u}^{n+1}). \end{aligned}$$

Here $(I_h\xi ^{n+1}, I_h \underline{u}^{n+1})$ can be bounded similarly as the term $(\xi ^{n+1}, I_h \underline{u}^{n+1})$ in the proof of Theorem 4.4, with $\Vert u_{tt}(\cdot ,s)\Vert $ being replaced by $\Vert I_h u_{tt}(\cdot ,s)\Vert $. We also have an extra term, which can be bounded by

$$\begin{aligned} ((I-I_h)\partial _t u^{n+1}, I_h\underline{u}^{n+1})&\le Ch\Vert u_t\Vert _{L^{\infty }(0,T;H^{1+s}(\varOmega ))} \Vert I_h\underline{u}^{n+1}\Vert \\&\le Ch^2 + \Vert I_h\underline{u}^{n+1}\Vert ^2. \end{aligned}$$

Then, Eq. (25) in the proof of Theorem 4.4 can then be rewritten as

$$\begin{aligned}&\frac{1}{2\varDelta }(\Vert I_h\underline{u}^{n+1}\Vert ^2 - \Vert I_h\underline{u}^n\Vert ^2 + \Vert I_h\underline{u}^{n+1}-I_h\underline{u}^n\Vert ^2 )\\&\quad = \frac{1}{\varDelta t} (I_h\underline{u}^{n+1}-I_h\underline{u}^n, I_h\underline{u}^{n+1}) \\&\quad \le Ch^2 + \Vert I_h\underline{u}^{n+1}\Vert ^2 + (\varvec{P}_h\underline{\varvec{\lambda }}^{n+1}, \nabla (I_h\underline{u}^{n+1})) \\&\qquad +\frac{\delta }{8} \Vert \nabla \underline{\varvec{w}}^{n+1}\Vert ^2 + C(h^2+\varepsilon ) + C\varDelta t \int _{t_n}^{t_{n+1}} \Vert I_h u_{tt}(\cdot ,s)\Vert ^2 \, ds. \end{aligned}$$

The rest of the proof is the same as the proof of Theorem 4.4, with the only differences that $\Vert \underline{u}^{n+1}\Vert ^2$ and $\Vert \underline{u}^n\Vert ^2$ are now substituted by $\Vert I_h\underline{u}^{n+1}\Vert ^2$ and $\Vert I_h\underline{u}^n\Vert ^2$ and we no longer have the term $1+\frac{h^2}{\varDelta t^2}$. The Gronwall’s inequality will now be applied with $y^n = \frac{1}{2}\Vert I_h\underline{u}^n\Vert ^2$ and $b^n = C$. Since

$$\begin{aligned} \Vert \underline{u}^n\Vert&\le \Vert (I-I_h)\underline{u}^n\Vert + \Vert I_h\underline{u}^n\Vert ^2 \le Ch^2 + \Vert I_h\underline{u}^n\Vert , \\ \Vert I_h\underline{u}^n\Vert&\le \Vert (I-I_h)\underline{u}^n\Vert + \Vert \underline{u}^n\Vert ^2 \le Ch^2 + \Vert \underline{u}^n\Vert , \end{aligned}$$

we will be able to get the error estimation in this theorem.$\square $

Remark 4.6

If

$$\begin{aligned} \Vert \underline{u}^0\Vert ^2 + \varDelta t \Vert \nabla \underline{\varvec{w}}^0\Vert ^2 + \varDelta t \varepsilon \Vert \varvec{P}_h \underline{\varvec{\lambda }}^0\Vert ^2 \le Ch^2,\quad \varepsilon = Ch^2, \end{aligned}$$

and

$$\begin{aligned} \int _{0}^{t_{n}} (\Vert I_h u_{tt}(\cdot ,s)\Vert ^2+\Vert \varvec{w}_t(\cdot ,s)\Vert ^2)\, ds \le C, \end{aligned}$$

then we have

$$\begin{aligned} \Vert \underline{u}^n\Vert ^2 + \sum _{i=1}^n \varDelta t \Vert \nabla \underline{\varvec{w}}^i\Vert ^2 + \sum _{i=1}^n \varDelta t \varepsilon \Vert \varvec{P}_h \underline{\varvec{\lambda }}^i\Vert ^2 \le C(h^2 + \varDelta t^2). \end{aligned}$$

Furthermore, by Lemma 4.2 and the Poincaré inequality, it is easy to see that

$$\begin{aligned} \sum _{i=1}^n \varDelta t \Vert \nabla \underline{u}^i\Vert ^2 \le C(h^2 + \varDelta t^2). \end{aligned}$$

Remark 4.7

One may be able to slightly improve the result in Theorem 4.5, by defining $I_h$ to be a Clément-type interpolation preserving homogeneous or periodic boundary conditions. Such an interpolation has been constructed in [40]. However, the advantage of doing so is not very obvious. For smooth solutions, Theorem 4.5 already gives the optimal convergence rate. For the future research, a more interesting direction would be to explore the role of parameters $\delta $ and $\varepsilon $.

References

Arnold, D.N., Falk, R.S.: A uniformly accurate finite element method for the Reissner–Mindlin plate. SIAM J. Numer. Anal. 26, 1276–1290 (1989)
Article MathSciNet MATH Google Scholar
Bathe, K.J., Dvorkin, E.N.: A four-node plate bending element based on Mindlin–Reissner plate theory and a mixed interpolation. J. Numer. Methods Eng. 21, 367–383 (1985)
Article MATH Google Scholar
Bathe, K.J., Brezzi, F.: On the convergence of a four-node plate bending element based on Mindlin–Reissner plate theory and a mixed interpolation. In: Whiteman, J.R. (ed.) MAFELAP V, pp. 491–503. Academic Press, London (1985)
Google Scholar
Bathe, K.J., Brezzi, F.: A simplified analysis of two plate-bending elements-the MITC4 and MITC9 elements. In: Pande, G.N., Middleton, J. (eds.) MUNETA 87. Numerical Techniques for Engineering Analysis and Design, vol. 1 (1987)
Berkovitz, L.D.: Convexity and optimization in ${\mathbb{R}}^n$. Wiley, New York (2002)
Book MATH Google Scholar
Blomker, D., Gugg, C.: On the existence of solutions for amorphous molecular beam epitaxy. Nonlinear Anal. Real World Appl. 3, 61–73 (2002)
Article MathSciNet Google Scholar
Brezzi, F., Fortin, M.: Numerical approximation of Mindlin-Reissner plates. Math. Comp. 47, 151–158 (1986)
Article MathSciNet MATH Google Scholar
Brezzi, F., Bathe, K.J., Fortin, M.: Mixed interpolated elements for Reissner–Mindlin plates. J. Numer. Methods Eng. 28, 1787–1801 (1989)
Article MathSciNet MATH Google Scholar
Caflisch, R.E., Gyure, M.F., Merriman, B., Osher, S., Ratsch, C., Vvedensky, D.D.: Island dynamics and the level set method for epitaxial growth. Appl. Math. Lett. 12, 13–22 (1999)
Article MathSciNet MATH Google Scholar
Chen, W., Conde, S., Wang, C., Wang, X., Wise, S.M.: A linear energy stable scheme for a thin film model without slope selection. J. Sci. Comput. 26, 1–17 (2011)
MATH Google Scholar
Cho, A.: Film deposition by molecular beam techniques. J. Vac. Sci. Technol. 8, S31–S38 (1971)
Article Google Scholar
Cho, A., Arthur, J.: Molecular beam epitaxy. Prog. Solid State Chem. 10, 157–192 (1975)
Article Google Scholar
Clarke, S., Vvedensky, D.D.: Origin of reflection high-energy electron-diffraction intensity oscillations during molecular-beam epitaxy: a computational modeling approach. Phys. Rev. Lett. 58, 2235–2238 (1987)
Article Google Scholar
Copetti, M.I.M., Elliot, C.M.: Numerical Analysis of the Cahn–Hilliard equation with a logarithmic free energy. Numer. Math. 63, 39–65 (1992)
Article MathSciNet MATH Google Scholar
Du, Q., Nicolaides, R.A.: Numerical analysis of a continuum model of phase transition. SIAM J. Numer. Anal. 28, 1310–1322 (1991)
Article MathSciNet MATH Google Scholar
Duran, R., Liberman, E.: On mixed finite element methods for the Reissner–Mindlin plate model. Math. Comp. 58, 561–573 (1992)
Article MathSciNet MATH Google Scholar
Elliot, C.M., French, D.A.: Numerical studies of the Cahn–Hilliard equation for phase separation. IMA J. Appl. Math. 38, 97–128 (1987)
Article MathSciNet Google Scholar
Elliot, C.M., French, D.A.: A nonconforming finite-element method for the two-dimensional Cahn–Hilliard equation. SIAM J. Numer. Anal. 26, 884–903 (1989)
Article MathSciNet Google Scholar
Elliot, C.M., French, D.A.: A second order splitting method for the Cahn–Hilliard equation. Numer. Math. 54, 575–590 (1989)
Article MathSciNet Google Scholar
Eyre, D.J.: Unconditionally gradient stable time marching the Cahn–Hilliard equation. In: Bullard, J.W., Kalia, R., Stoneham, M., , Chen, L.Q. (eds.) Computational and Mathematical Models of Microstructural Evolution, p. 1712. Materials Research Society, Warrendale (1998)
Feng, X., Prohl, A.: Error analysis of a mixed finite element method for the Cahn–Hilliard equation. Numer. Math. 99, 47–84 (2004)
Article MathSciNet MATH Google Scholar
Gyure, M.F., Ratsch, C., Merriman, B., Caflisch, R.E., Osher, S.: Level-set methods for the simulation of epitaxial phenomena. Phys. Rev. E 58, R6927–R6930 (1998)
Article Google Scholar
Han, W., Cheng, X., Huang, H.: Some mixed finite element methods for biharmonic equation. J. Comp. Appl. Math. 126, 91–109 (1999)
MathSciNet Google Scholar
Hoppe, R.H., Nash, E.M.: A combined spectral element/finite element approach to the numerical solution of a nonlinear evolution equation describing amorphous surface growth of thin films. J. Numer. Math. 10, 127–136 (2002)
Article MathSciNet MATH Google Scholar
Johnson, C., Pitkäranta, J.: Analysis of some mixed finite element methods related to reduced integration. Math. Comp. 38, 375–400 (1982)
Article MathSciNet MATH Google Scholar
Kang, H.C., Weinberg, W.H.: Dynamic Monte Carlo with a proper energy barrier: surface diffusion and two-dimensional domain ordering. J. Chem. Phys. 90, 2824–2830 (1989)
Article Google Scholar
King, B.B., Stein, O., Winkler, M.: A fourth-order parabolic equation modeling epitaxial thin film growth. J. Math. Anal. Appl. 286, 459–490 (2003)
Article MathSciNet MATH Google Scholar
Kohn, R.V., Yan, X.: Upper bounds on the coarsening rate for an epitaxial growth model. Commun. Pure Appl. Math. 56, 1549–1564 (2003)
Article MathSciNet MATH Google Scholar
Krug, J.: Origins of scale invariance in growth processes. Adv. Phys. 46, 139–282 (1997)
Article Google Scholar
Li, B.: High-order surface relaxation versus the Ehrlich–Schwoebel effect. Nonlinearity 19, 2581–2603 (2006)
Article MathSciNet MATH Google Scholar
Li, B.: Variational properties of unbounded order parameters. SIAM J. Math. Anal. 38, 16–36 (2006)
Article MathSciNet MATH Google Scholar
Li, B., Liu, J.: Thin film epitaxy with or without slope selection. Eur. J. Appl. Math. 14, 713–743 (2003)
Article MATH Google Scholar
Li, B., Liu, J.: Epitaxial growth without slope selection: energetics, coarsening, and dynamic scaling. J. Nonlinear Sci. 14, 429–451 (2004)
Article MathSciNet MATH Google Scholar
Lu, X., Lin, P., Liu, J.: Analysis of a sequential regularization method for the unsteady Navier–Stokes equations. Math. Comp. 77, 1467–1494 (2008)
Article MathSciNet MATH Google Scholar
Malkus, D.S., Hughes, T.J.R.: Mixed finite element methods-reduced and selective integration techniques: a unification of concepts. Comput. Methods Appl. Mech. Eng. 15, 63–81 (1978)
Article MATH Google Scholar
Ortiz, M., Repetto, E., Si, H.: A continuum model of kinetic roughening and coarsening in thin films. J. Mech. Phys. Solids 47, 697–730 (1999)
Article MathSciNet MATH Google Scholar
Rost, M.: Continuum models for surface growth. Int. Ser. Numer. Math. 149, 195–208 (2005)
Article MathSciNet Google Scholar
Schneider, M., Schuller, I.K., Rahman, A.: Epitaxial growth of silicon: a molecular-dynamics simulation. Phys. Rev. B 36, 1340–1343 (1987)
Article Google Scholar
Scholtz, R.: A mixed method for fourth-order problems using the linear finite elements. RAIRO Numer. Anal. 15, 85–90 (1978)
Google Scholar
Scott, L.R., Zhang, S.: Finite element interpolation of nonsmooth function satisfying boundary conditions. Math. Comp. 54, 483–493 (1990)
Article MathSciNet MATH Google Scholar
Siegert, M., Plischke, M.: Solid-on-solid models of molecular-beam epitaxy. Phys. Rev. E 50, 917–931 (1994)
Article Google Scholar
Villain, J.: Continuum models of crystal growth from atomistic beams with and without desorption. J. Phys. I 1, 19–42 (1991)
Google Scholar
Wang, C., Wang, X., Wise, S.: Unconditionally stable schemes for equations of thin film epitaxy. Discrete Contin. Dyn. Syst. 28, 405–423 (2010)
Article MathSciNet MATH Google Scholar
Xia, X., Chen, W., Liu, J.: Convergence analysis of implicit full discretization for the epitaxial growth model of thin films. Numer. Math. J. Chin. Univ. 34(1), 30–51 (2012). (in Chinese)
Google Scholar
Xu, C., Tang, T.: Stability analysis of large time-stepping methods for epitaxial growth models. SIAM J. Numer. Anal. 44, 1759–1779 (2006)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

Chen was supported by the 111 project, Key Project National Science Foundation of China (91130004) and the Natural Science Foundation of China (11171077). He also thanks Jianguo Liu in Duke University and Xiaoming Wang in Florida State University for the fruitful discussions. Wang thanks the Key Laboratory of Mathematics for Nonlinear Sciences (EYH1140070), Fudan University, for the support during her visit. The authors are also grateful to the anonymous referees for their helpful comments and suggestions which greatly improved the quality of this paper.

Author information

Authors and Affiliations

School of Mathematical Sciences, Fudan University, Shanghai, China
Wenbin Chen
Department of Mathematics, Oklahoma State University, Stillwater, OK, USA
Yanqiu Wang

Authors

Wenbin Chen
View author publications
You can also search for this author in PubMed Google Scholar
Yanqiu Wang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Yanqiu Wang.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chen, W., Wang, Y. A mixed finite element method for thin film epitaxy. Numer. Math. 122, 771–793 (2012). https://doi.org/10.1007/s00211-012-0473-9

Download citation

Received: 20 July 2011
Revised: 25 April 2012
Published: 15 June 2012
Issue Date: December 2012
DOI: https://doi.org/10.1007/s00211-012-0473-9

Mathematics Subject Classification (2000)

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

A mixed finite element method for thin film epitaxy

Abstract

Similar content being viewed by others

A Second Order Energy Stable Linear Scheme for a Thin Film Model Without Slope Selection

Highly Efficient and Accurate Numerical Schemes for the Epitaxial Thin Film Growth Models by Using the SAV Approach

An adaptive BDF2 implicit time-stepping method for the no-slope-selection epitaxial thin film model

1 Introduction

2 The mixed formulation

Theorem 2.1

Proof

3 Finite element discretization

Theorem 3.1

Proof

Theorem 3.2

Proof

4 Convergence

Lemma 4.1

Proof

Lemma 4.2

Proof

Lemma 4.3

Proof

Theorem 4.4

Proof

Theorem 4.5

Proof

Remark 4.6

Remark 4.7

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Mathematics Subject Classification (2000)

Navigation

A mixed finite element method for thin film epitaxy

Abstract

Similar content being viewed by others

A Second Order Energy Stable Linear Scheme for a Thin Film Model Without Slope Selection

Highly Efficient and Accurate Numerical Schemes for the Epitaxial Thin Film Growth Models by Using the SAV Approach

An adaptive BDF2 implicit time-stepping method for the no-slope-selection epitaxial thin film model

1 Introduction

2 The mixed formulation

Theorem 2.1

Proof

3 Finite element discretization

Theorem 3.1

Proof

Theorem 3.2

Proof

4 Convergence

Lemma 4.1

Proof

Lemma 4.2

Proof

Lemma 4.3

Proof

Theorem 4.4

Proof

Theorem 4.5

Proof

Remark 4.6

Remark 4.7

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Mathematics Subject Classification (2000)

Search

Navigation