Abstract
In this contribution, we discuss some basic mechanical and mathematical features of the finite element technology for elliptic boundary value problems. Originating from an engineering perspective, we will introduce step by step of some basic mathematical concepts in order to set a basis for a deeper discussion of the rigorous mathematical approaches. In this context, we consider the boundedness of functions, the classification of the smoothness of functions, classical and mixed variational formulations as well as the \(H^{-1}\)-FEM in linear elasticity. Another focus is on the analysis of saddle point problems occurring in several mixed finite element formulations, especially on the solvability and stability of the associated discretized versions.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
1 Introduction
This chapter deals with some fundamental concepts needed for the understanding of the mathematical background of the finite element method (FEM). Starting from a one-dimensional boundary value problem, we motivate the formulation of an abstract minimization problem in order to generalize the problems occurring in the numerical approximation of elliptic boundary value problems. The presented general explanations originate from an engineering point of view and are consulted of the mathematical framework needed for a deeper understanding. Of course, there are a variety of excellent textbooks dealing with this topic, from the engineering as well as from the mathematical point of view. Textbooks with a more mechanical motivation are (amongst many others) e.g. Hughes (1987), Wriggers (2008), Auricchio et al. (2004), Gockenbach (2006), Berdichevsky (2009), Becker et al. (1981), and Oden and Carey (1983); representatives with a mathematical background are e.g. Braess (1997), Boffi et al. (2013), Oden and Reddy (1976), Brenner and Scott (2002), and Ern and Guermond (2013).
2 Introductory Example and Propaedeutic Remarks
Let \({\mathcal B}\subset \mathbb {R}^d\) be the body of interest parametrized in \(x \in \mathbb {R}^d\) with \(d=1,\,2,\,3\). The boundary \(\partial {\mathcal B}\) of \({\mathcal B}\) is decomposed into \(\partial {\mathcal B}_{N}\) and \(\partial {\mathcal B}_{D}\), where Neumann and Dirichlet boundary conditions are prescribed, respectively. They satisfy
The boundary value problem is typically defined by a set of partial differential equations (PDEs) on the open domain \({\mathcal B}\) and boundary conditions.
For simplicity we start with the simple one-dimensional (\(d=1\)) boundary value problem
with Young’s modulus \(E > 0\), cross section \(A > 0\), and continuous elastic support \(K_s > 0\), with units \([E]=\mathrm {N}/\mathrm {m^2}\), \([A]=\mathrm {m^2}\), \([K_s]=\mathrm {N}/\mathrm {m^2}\), \([u]=\mathrm {m}\), \([f]=\mathrm {N}/\mathrm {m}\), see Fig. 1. At \(x = 0\) a Dirichlet and at \(x = l\) a Neumann boundary condition is applied:
respectively. For the following explanations, we assume that the solution u(x) and the distributed loading f(x) are sufficiently regular.
Analytical solution. The general solution of (2) for constant EA and \(f(x)=f_0 + \Delta f \, x/l\) is based on the ansatz
Substituting these expressions into the homogeneous part of (2), denoted by \(\tilde{u}(x)\), yields
i.e., the solution is of the form
Adding the particular solution \(f_0/K_s + (\Delta f \cdot x)/(K_s \, l)\) yields
Evaluating the boundary conditions yields the analytical expressions for the constants \(c_1\) and \(c_2\):
A weak formulation of the boundary value problem is obtained by multiplying (2) with a test function \(\delta u\) and partial integration:
where V is a suitable space of functions, e.g., a Hilbert space. All test functions \(\delta u \in V\) have to vanish at the Dirichlet boundary condition \(\delta u (0) = 0\). In the variational formulation u is an element of the class of trial functions \(V_{trial}\). The collection of both functions are denoted as admissible functions; in this simple case, V and \(V_{trial}\) coincide and have to satisfy
The idea of approximation methods is to compute an approximate solution \(u_h \in V_h\) in the finite-dimensional subspace \(V_h \subset V\), based on
In order to do this within the finite element method, we have to subdivide the domain in \({\text {num}}_{\text {ele}}\) subsections, here we choose individual finite elements with (for simplicity reasons) unit length \(l^e = l / {\text {num}}_{\text {ele}}\). On this individual elements we define a set of ansatz functions, i.e., shape functions \(N_i | i = 1, \ldots k\) with local support. We use piecewise polynomial functions which are globally \(C^0\) continuous, as depicted in Fig. 2.
In this case, the continuity can be easily enforced by sharing the degrees of freedom at the interface between two neighboring elements. With this definitions we approximate the individual fields on element level as follows:
with the matrix of shape functions \(\underline{{{\varvec{N}}}}^e\), the matrix containing the derivatives of the shape functions \(\underline{{{\varvec{B}}}}^e\), and the vectors of nodal (virtual) degrees of freedom (\(\underline{\delta {{\varvec{d}}}}^e\)) \(\underline{{{\varvec{d}}}} {}^e\) of the element \(e \in \{1, 2, \ldots , {\text {num}}_{\text {ele}} \}\):
After substituting these approximations equation (11) is reformulated into
Assembling the element matrices,
yields
where \(\underline{{{\varvec{K}}}}\) is the global element stiffness matrix, \(\underline{{{\varvec{R}}}}\) the global right-hand side, \(\underline{{{\varvec{D}}}}\) the global vector of unknowns, and \( \underline{\delta {{\varvec{D}}}}\) the global vector of virtual node displacements. The FEM solution is depicted for a various number of elements num\(_\text {ele}\) based on a constant \(K_s\) in Fig. 3. Figure 4 compares the approximation with num\(_\text {ele}=16\) to the analytical solution considering different values of \(K_s\).
Generalizations: In order to formulate an abstract minimization problem we define a quadratic energy functional J(u), e.g., the total potential energy,
with the symmetric form \( a(u,v) = a (v,u)\). We assume that a(u, u) is positive definite, i.e.,
A bilinear form \( {a}:{V \times V} \rightarrow {\mathbb {R}} \) is called H-elliptic (or simply elliptic) if there exists a constant \(c_\alpha > 0\) such that
The H-elliptic bilinear form induces the so-called energy norm
which is equivalent to a norm of the associated Hilbert space H. Under this assumptions the minimization problem
where the minimum is characterized by \(\langle J'(u),v \rangle := \displaystyle \int _0^l J^{\prime }(u) \, v \, \text {d} x=0\), which is equivalent to the variational problem
Identifying our model problem with the abstract formulation yields the quadratic functional
and the linear functional
with the Dirichlet boundary condition \(u(0) = 0\), to be satisfied by the function u(x), and the Neumann boundary condition \(EA\, u^{\prime } (l) = t_l\), appearing as a natural boundary condition in the functional.
Modus operandi. There are several direct methods for the computation of the approximate solution. Beyond this, there are several mathematical frameworks for the qualitative analysis of the existence and uniqueness of solutions. Beside well-known direct methods for the treatment of established models, described by partial differential equations, this topic is rather important for the derivation of new models in continuum thermodynamics. In this contribution, we want to motivate the main ideas of this scientific branch.
A functional is called bounded from below on the space V, if there exist a constant \(c \in \mathbb {R}\), such that
This requirement can be violated if the functional is not bounded by below on V or if it is bounded by below but its minimum is not reached on V, for a physical interpretation see Berdichevsky (2009), Chap. 5.
We assume that in the quadratic functional a(u, u) of our model problem (23) u(x) is a differentiable function and that the integrals exists. We conclude with the meaningful engineering constants
that a(u, u) is obviously nonnegative and therefore bounded from below. A linear functional \(L(u)=<l,u>\) is bounded (from above) if for \(C_L>0\)
Now we have to answer the question, if the functional (17) is bounded from below or not. Let us consider our model problem. Our functional (17) can now be estimated, compare Braess (1997) Chap. 2.5, by
Obviously, the functional is bounded from below.
Lax–Milgram Theorem (existence of classical solutions); Let \(V'\) be a Hilbert space, \( {a}:{V \times V} \rightarrow {\mathbb {R}} \) a continuous and H-elliptic bilinear form defined on V, \(L \in V^{\prime }\) any continuous linear functional. Subject to these conditions there exists a unique solution
such that
\(\blacksquare \)
Reminder, the properties of the bilinear and linear form are:
-
the bilinear form has to be continuous (bounded from above), i.e., there exists a constant \(C_a \in \mathbb {R}^+\)Footnote 1 such that
$$\begin{aligned} |a(w,v)| \le C_a \, \Vert w\Vert _V \Vert v\Vert _V \quad \forall \, w, v \in V \, , \end{aligned}$$ -
the bilinear form has to be H-elliptic, i.e., there exists a constant \(c_a \in \mathbb {R}^+\) such that
$$\begin{aligned} a(v,v) \ge c_a \, \Vert v\Vert ^2_V \quad \forall \, v \in V \, , \end{aligned}$$ -
the linear functional L is continuous, i.e., there exists a constant \(C_L \in \mathbb {R}^+\) such that
$$\begin{aligned} |L(v)| \le C_L \, \Vert v\Vert _V \quad \forall \, v \in V \, . \end{aligned}$$
Note: From the continuity of the bilinear form \( a (\cdot , \cdot ) \), discussed in exercise 1, we obtain \(|a(u,u)| \le C_a \, \Vert u\Vert ^2 \). The continuity of the linear form yields \( |L(u)| \le C_L \, \Vert u\Vert \). From the H-ellipticity of the bilinear form \( a (\cdot , \cdot ) \), see (19), we deduce
Exploiting the continuity of the linear form allows with \(C\in \mathbb {R}^+\) for
and we deduce from \( \langle l , u \rangle \le \Vert l\Vert \, \Vert u\Vert \)
and
In this sense the bounded linear functional is generated by the continuity and H-ellipticity of the bilinear form \( a (\cdot , \cdot ) \).
Approximate solutions: In general it is cumbersome or even impossible to find exact solutions \( u \in V \), therefore we are interested in approximate solution concepts. Applying Ritz method we seek a solution \(u_h \in V_h\) with the discrete subspace \(V_h \subset V\), i.e.,
The Ritz approach, based on our technical assumptions, is equivalent to the Galerkin method of the variational counterpart
where \(a(u_h,v_h)\) is a bilinear functional (linear in both arguments).
Exercise 1
Show that the bilinear form of our model problem in Eq. (9)
with \(EA\in (0,\infty )\) and \(K_s\in (0,\infty )\), is continuous!
Remark: Definition of continuous bilinear forms \( {a}:{U \times V} \rightarrow {\mathbb {R}} \) on linear normed spaces U and V: A bilinear form \(a (\cdot , \cdot )\) is a continuous bilinear form, if there exists a constant \(C_a \in \mathbb {R}^+\) such that
In anticipation of the following chapters we introduce the norm
obviously we obtain the inequalities
Solution. In order to show that the bilinear form is continuous, consider
Applying the Cauchy–Schwarz inequality yields
Using the inequalities (38) yields the final estimation
For our bilinear form we write
Thus, a(u, v) is continuous, or in other words it is bounded by above. \(\spadesuit \)
3 Classification of the Smoothness of Functions
In this section we discuss the classification of functions and their derivatives with respect to Hilbert spaces. For this we first set a few notations, a more detailed summary is given in Appendix A.
characterizes the space of square integrable functions on \({\mathcal B}\). At this point, it seems to be meaningful to give some remarks concerning the Riemann integral and the Lebesgue integral. The Riemann integral has some disqualifications if we would like to use it for a satisfactory theory of PDEs.
In order to obtain a satisfactory theory of PDEs, we must—for technical reasons—integrate certain singular functions. If functions are regular enough to integrate them they are called Lebesgue measurable. An introduction to this topic is given in Royden (1968). For \(m\in \mathrm{I\! N}_0\)Footnote 2 we define
with the multi-index notation for the derivatives of u, with the 3-tuple of nonnegative integers
Thus the \(\alpha \)-th derivative of u with respect to \((x_1,x_2,x_3)\) is defined by
Explanatory examples:
The introduction of the Sobolev spaces \(H^m({\mathcal B})\) allows for the quantification of the smoothness (regularity) of functions. Let \(C^m({\mathcal B})\) be the linear space of functions u with continuous derivatives \(D^{|{\varvec{\alpha }}|}u\) of the order \(0\le |{\varvec{\alpha }}| \le m\). The Sobolev spaces \(H^m({\mathcal B})\) are related with the \(C^k(\overline{{\mathcal B}})\) spaces by the Sobolev embedding theorem: Let \(\overline{{\mathcal B}}={\mathcal B}\cup \partial {\mathcal B}\) be a bounded domain with a Lipschitz boundary. Every function in \(H^m({\mathcal B})\) belongs to \(C^k(\overline{B})\) if
\(\blacksquare \)
It should be noted that the embedding is continuous:
Furthermore we introduce the notation
3.1 One-Dimensional Example
To discuss the smoothness of functions, we consider the two functions depicted in Fig. 5.
For the function u in Fig. 5a we observe
because u is twice continuously differentiable and \(u\in H^2({\mathcal B})\). In contrast, the function v depicted in Fig. 5b is
because already its first derivative is not continuous. Obviously, \(v\in H^{1}({\mathcal B})\), due to the fact that its first derivative is square integrable,Footnote 3 i.e., \(v'\in L^2({\mathcal B})\). Although the classical derivative of v(x) does not exist at \(x=l/2\) we can define the weak derivative of v. Consider
with the infinitely differentiable function \(\eta \), satisfying \(\eta (0)=\eta (l)=0\). Integration by part, i.e.,
yields
The function \(v'(x)\) in (53) is denoted as the weak derivative of v(x).
Let us now consider the function in Fig. 6 which is a delta function representing a force acting at a point. This function (distribution) is not square integrable. Before we are able to quantify its smoothness it has to be integrated. In order to generalize the discussion we introduce the antiderivative \(D^{-1}\), by means of
The meaning of this operator becomes clear if we consider again the function \(\widetilde{h}(x)\) in Fig. 5b:
The calculation of the antiderivative
yields the hat function depicted in Fig. 5b up to a constant. Switching back to our function (distribution) shown in Fig. 6a: Evaluating the antiderivative of the delta function leads to
then we conclude that h(x) is a square integrable function. In other words its first antiderivative, i.e., its “first integral”, is in \(L^2({\mathcal B})\). Therefore we define
The question is: What are negative Sobolev spaces?
Let m be a positive integer, then the negative Sobolev space \(H^{-m}({\mathcal B})\) is defined as the dual of \(H_0^m({\mathcal B})\), i.e.,
The associated norm, exemplarily for \(m=1\) is
Based on the relations
we conclude the inclusion properties
3.2 \(H({\text {div}},{\mathcal B})\) Hilbert Spaces
The introduction of special Hilbert spaces related to vector or tensor valued fields is expedient for the suitable description of many engineering problems. A frequent representative, which is especially of importance in the field of elasticity, heat conduction, flow problems, etc. is the \(H({\text {div}},{\mathcal B})\) space which demands a \(L^2({\mathcal B})\)-measurable weak divergence. The corresponding space in introduced by
whereas d denotes the dimension of vector \({{\varvec{u}}}\).
4 Variational Formulations of Linear Elasticity
In the following chapters, we concentrate on the formulation of elasticity. Let \({\mathcal B}\subset \mathrm{I\! R}^3\) be the body of interest, parametrized in \({{\varvec{x}}}\in \mathrm{I\! R}^3\), \({\varvec{\sigma }}\) the second-order stress tensor, \({\varvec{\varepsilon }}=\nabla _s {{\varvec{u}}}\) the symmetric second-order strain tensor, \({{\varvec{u}}}\) the displacement field, \({{\varvec{f}}}\) the given body force per unit volume, the fourth-order elasticity tensor and \(\overline{{{\varvec{t}}}}\) the prescribed Neumann boundary conditions. Then the governing equations in linear elasticity are given byFootnote 4
4.1 Classical (Bubnov-)Galerkin Formulation
A direct substitution of (62)\(_2\)–(62)\(_4\) into (62)\(_1\) leads to a variational formulation where solely the displacements are solved in a weak form. Multiplication with a test function \(\delta {{\varvec{u}}}\) and integration over the domain leads to the problem of seeking \({{\varvec{u}}}\) such that
Integration by parts and the insertion of the important test function property \(\delta {{\varvec{u}}}=\varvec{0}\) on \(\partial {\mathcal B}_D\) leads to the formulation of seeking \({{\varvec{u}}}\in H^1_{0,D}({\mathcal B})\) of
It can be recognized, that for the latter weak formulation, the corresponding function space of the trial function \({{\varvec{u}}}\) and the test function \(\delta {{\varvec{u}}}\) coincide, which is the classical characteristic of the (Bubnov-)Galerkin method. The solution of (64) is equivalent to the minimizer \({{\varvec{u}}}\in H^1_{0,D}({\mathcal B})\) of the potential energy
and constitutes the basis of the well-known displacement based FEM for linear elasticity.
4.2 Alternative Methods
In the previously discussed approach, Green’s theorem has been applied to shift a derivative from the trial to the test functions. Particularly in the framework of finite elements, this is the prevalent approach. However, various alternative approximation techniques are available. In these formulations, the space of the approximative solution is distinct to the space of test functions. A first example is represented by the variational problem in Eq. (63) which is e.g. the basis of collocation methods. In the corresponding discrete formulations, the approximative solution is sought in a subspace of \(H^2({\mathcal B})\) whereas the admissible test space corresponds to \(L^2({\mathcal B})\).
In contrast to these formulations where broken (discontinuous) test spaces are appropriate, both derivatives may be transferred to the test spaces. This is executed by means of successive application of Green‘s theorem. The corresponding formulation is established as the so-called \(H^{-1}\)-method, proposed by Rachford et al. (1974). In this \(H^{-1}\)-Galerkin method different subspaces for the space of trial functions (approximation functions) \(U_{k}\) and the space of test functions (weighting functions) \(W_{h}\) are used, i.e.,
In Kendall and Wheeler (1976), the authors adopted the procedure to a Crank–Nicolson–\(H^{-1}\)-Galerkin procedure and investigates single space variables in parabolic problems. The ansatz was recapitulated in Thomée (2006), Chap. 16. A discussion of negative norm error estimates for semi-discrete Galerkin-type Finite Element formulations for nonheterogenous parabolic equations is given in Thomée (1980). A more recent approach can be found in Goebbels (2015).
The basis of the approach is the theory of distributional differential equations. Let u be a distribution, v a test function and the equation of interest is the second-order ordinary differential equation
with appropriate boundary conditions.
Based on this assumptions we can set up a family of distributional differential equations, see Oden and Reddy (1976), page 365 ff:
with the space of distribution \(\mathcal D(\mathbb R)\). From the distributional point of view all equations can be interpreted as equivalent. Equation (68)\(_{1}\) is the basis for collocation methods, (68)\(_{2}\) for the classical Galerkin method and (68)\(_{3}\) for the \(H^{-1}\)-method.
A descriptive explanation of the \(H^{-1}\)-method on the basis of a one-dimensional boundary value problem
with \(u(0)=u(l)=0\) is discussed in Sect. 5.2.
4.3 Mixed Variational Frameworks for Linear Elasticity
Considering again the governing equations in linear elasticity in Eq. (62) it is apparent that the direct substitution of (62)\(_2\)–(62)\(_4\) into (62)\(_1\) is not mandatory. Alternatively it is possible to solve another set of equations of (62) in a weak sense. Here, especially two common variants are considered in the following.
Hellinger–Reissner Formulation: The stress–displacement based formulation solves (62)\(_1\) and (62)\(_2\) in a weak sense, seeking \({\varvec{\sigma }}\in H({\text {div}},{\mathcal B})\) and \({{\varvec{u}}}\in H^1_{0,D}({\mathcal B})\) such that
On this basis, two additional variational formulations can be achieved which differ in their corresponding solution spaces.
Application of integration by parts in (70)\(_1\) leads to the so-called primal version of the Hellinger–Reissner formulation. This yields a the saddle point problem seeking for \({\varvec{\sigma }}\in L^2({\mathcal B})\) and \({{\varvec{u}}}\in H^1_{0,D}({\mathcal B})\) such that
Equivalently, this problem can be described by the potential
The expression
represents the free energy \(\psi ({\varvec{\varepsilon }})\) in terms of the complementary potential \(\psi ^\star ({\varvec{\sigma }})\), i.e., \( \psi ({\varvec{\varepsilon }})={\varvec{\sigma }}:\nabla _s {{\varvec{u}}}-\psi ^{\star }({\varvec{\sigma }}). \)
In contrast, integration by parts may be applied to (70)\(_2\), which yields the so-called dual Hellinger–Reissner formulation, seeking the saddle point \({\varvec{\sigma }}\in H({\text {div}},{\mathcal B})\) and \({{\varvec{u}}}\in L^2({\mathcal B})\) such that
It should be remarked, that in this formulation the traction boundary condition (62)\(_6\) has to be incorporated into the solution space of the stresses, since they do not appear in the weak form. In addition the stress symmetry condition has to be enforced.
Hu-Washizu Functional – Three Field Formulation: A third option is the independent interpolation of all variables entering the elasticity problem. This formulation solves (62)\(_1\)–(62)\(_3\) in a weak sense. The optimization problem is: seek a saddle point \({\varvec{\varepsilon }}\in L^2(\mathcal B)\), \({\varvec{\sigma }}\in L^2(\mathcal B)\) and \({{\varvec{u}}}\in H^1_{0,D}(\mathcal B)\) such that
An equivalent potential formulation can be given by
with .
5 Finite Element Method
The finite element method constitutes the most prevalent discretization technique for the approximation of boundary value problems in the field of computational mechanics. As discussed in the previous chapters, the solution of the variational equations are in their corresponding Sobolev space. For the numerical treatment this solution space is restrained to a finite-dimensional space, in the following called finite element space and is denoted by the subscript h, e.g., \(V_h\).
5.1 Conforming and Non-conforming Finite Elements
In case of a conforming discretization, the finite element space is a discrete subspace of the corresponding Sobolev space. Considering the problem of linear elasticity with the displacements as the only unknown, we seek \({{\varvec{u}}}\in H^1_{0,D}({\mathcal B})\)
A conforming discretization of \({{\varvec{u}}}_h \in V_h\) demands in this case
It can be shown that \({{\varvec{u}}}_h\) of a conforming finite element converges monotonically to \({{\varvec{u}}}\) with increasing mesh density, if it is in addition able to represent the rigid body displacements and the constant strain states, see Bathe (1996).
Standard \(H^1({\mathcal B})\) conforming finite elements on triangles \(\mathbb {P}_k\) or quadrilaterals \(\mathbb {Q}_k\) are assigned with \(k+1\) nodes on each edge of the element, see Fig. 7. Continuity of the approximated variable is enforced, when these nodes are shared with the adjacent elements.
\(H({\text {div}},{\mathcal B})\) conforming elements can be constructed, for example, with help of the Raviart–Thomas functions. In case of triangles the \(\mathbb {RT}_k\) elements have \(k+1\) vector-valued sampling points on each edge and in addition k (\(k+1\)) vector-valued sampling points in the interior of the element, as exemplary depicted in Fig. 8.
In contrast, the finite element space of non-conforming elements is not a subspace of the appropriate Sobolev solution space and convergence is not obvious. Due to the non-conforming discretization, an additional error is introduced and has to be controlled. However, this leads to additional flexibility in the design of the finite element. The simplest non-conforming element is the Crouzeix–Raviart element, see Crouzeix and Raviart (1973). Here, we only assign k nodes on each edge and generally do not have continuity across inter-element boundaries; thus, these are non-conforming elements. The Crouzeix–Raviart finite elements are depicted in Fig. 9.
5.2 Example of \(H^{-1}\)-FEM for 1D Elliptic Problem
In order to approach the \(H^{-1}\)-method we analyze a one-dimensional truss element, in analogy to the one examined in Rachford et al. (1974):
where EA(x)u(x) characterize the normal force in the straight bar with the longitudinal stiffness
where \(\alpha \) and \(\bar{x}\) are constant parameters and the right- hand side is given by
The Dirichlet boundary conditions are defined by
The closed-form solution of this problem is given by
and is explicitly depicted for two different sets of \(\alpha \) and \(\bar{x}\) in Fig. 10c and 11c. Considering the plots of the longitudinal stiffness and the applied load in Fig. 10, where the parameter are chosen as \(\alpha =5\) and \(\bar{x}=0.5\). I would like to draw the reader’s attention to the low stiffness in the middle of the domain. In the case \(\alpha =1000\) the domain responds to this with a rapid, jump like rising displacement.
Variational approach: The solution in terms of a variational weak form is obtained via
A reformulation using integration by parts and exploiting \(v(0)=v(1)=0\) yields
The classical FE discretization with \(u_h \in U_h \subset H^{1}_{0}\) and \(u_h \in {\mathbb P}_{1}\) or \(u_h \in {\mathbb P}_{2}\), yields an approximation of the displacements as illustrated in Figs. 12 and 13. This standard displacement FEM ansatz even with second-order interpolation is inaccurate in an extreme edge case. It is also worth mentioning that the normal force computed from these element are also very inaccurate in comparison to the analytical solution.
Repeated application of integration by parts leads to another weak form, which constitutes the basis of the \(H^{-1}\)-FE approach
In this case the natural discretization is of the form \(u_h \in U_h \subset H^{0}(\mathcal B)\), i.e., it is possible to choose discontinuous approximations of \(u_h\), denoted by \(u_h \in {\text {d}}\!{\mathbb P}\). This reduces the coupling between the elements. The associated subspace consists of all piecewise polynomial functions in \(C^{k}\). Simultaneously the continuity requirements regarding the test space \(V_h\) are increased. Let \({\mathcal B}_{h}\) denote the discretization of \({\mathcal B}\), with
By setting \(r \ge 1\) and \(-1\le k \le r-2\) we define the trial space
and the test space
For \(k=-1\) the trial space \(U_{h}\) exhibits discontinuities at the node of the partition, where the functions in the test space \(V_{h}\) are continuously differentiable. Thus we have
whereas \(C^{-1}({\mathcal B}_{h})\) considers all functions whose antiderivative is in \(C^0({\mathcal B}_h)\), which means we do not require continuity at the nodal points. For convenience we define this space by
to enforce that the trial functions are discontinuous at the exterior nodes.
This leads to the corresponding space for the test functions as
The numerical results obtained from this discretization are given in Fig. 14 for the displacements and for the stresses. The discontinuity in the displacements is clearly visible, especially for the coarse discretization. The method, however, shows significant advantage for this model problem in comparison to the standard FE method. This is even more significant in terms of the normal force. The interested reader is referred to the error plots of each solution space in Figs. 15, 16, 17 and 18, with respect to both considered loading cases.
6 Analysis of Mixed Finite Elements
For the existence, uniqueness, and approximation of saddle point, problems arise from Lagrangian multipliers see Brezzi (1974). The following explanations are mainly based on the excellent treatises of Auricchio et al. (2004) and Boffi et al. (2013).
6.1 Theoretical Framework
The idea of mixed methods is based on the introduction of Lagrangian multipliers in order to relax several constraints denoted by \(constr(v)=0\), e.g., the incompressibility condition \({\text {div}}{{\varvec{u}}}=0\). Let’s start from the constrained minimization problem
This can be reformulated by means of a Lagrangian functional of the form
with
where q denotes the Lagrange multiplier. The solution of this abstract optimization problem is (u, p) if the condition
is fulfilled, V and \(\Sigma \) are suitable Hilbert spaces.
are continuous bilinear forms, and \(L(v):V\rightarrow \mathrm{I\! R}\) is a continuous linear form. It should be noted that the classical Lax–Milgram Lemma cannot be applied. In fact we should apply the so-called Banach–Nečas–Babuška theorem also known as the generalized Lax–Milgram theorem, see, e.g., Ern and Guermond (2013). In summary the variational formulation has a unique solution if
-
1.
The continuous linear form \(a(\cdot ,\cdot )\) is coercive on
$$\begin{aligned} K=\big \{ v\in V:\, b(q,v)=0\quad \forall \, q\in \Sigma \big \}, \end{aligned}$$i.e., there exist an \(\alpha \in \mathrm{I\! R}_+\), such that
$$\begin{aligned} a(v,v)\ge \alpha \Vert v\Vert ^2_V \qquad \forall \, v\in K\,, \end{aligned}$$and
-
2.
the inf-sup condition, also known as LBB-condition (Ladyzhenskaya–Babuška–Brezzi), is verified, i.e., there exists a \(\beta \in \mathrm{I\! R}^+\), such that
$$\begin{aligned} \underset{q\in \Sigma \backslash 0}{\text {inf}}\; \underset{v\in V\backslash 0}{\text {sup}}\; \frac{b(v,q)}{\Vert q\Vert _\Sigma \,\Vert v\Vert _V} \ge \beta \,. \end{aligned}$$
Furthermore, there exists the a priori estimate for the solution
and
6.2 Treatment of Saddle Point Problems, Sensitization
The discrete mixed problem is given by the matrix representation
with \({{\varvec{A}}}\in \mathbb {R}^{n \times n} \), \({{\varvec{B}}}\in \mathbb {R}^{m \times n} \), \({{\varvec{B}}}^T \in \mathbb {R}^{n \times m} \), \(({{\varvec{d}}}_u , {{\varvec{r}}}_u) \in \mathbb {R}^{n} \), \(({{\varvec{d}}}_p , {{\varvec{r}}}_p) \in \mathbb {R}^{m} \), \(\widehat{{{\varvec{K}}}}\in \mathbb {R}^{(n + m) \times (n + m)} \), and \((\widehat{{{\varvec{d}}}} , \widehat{{{\varvec{r}}}}) \in \mathbb {R}^{n + m} \).
For the solvability of (100) we postulate that the system has a unique solution for every right-hand side \({{\varvec{r}}}_u\) and \({{\varvec{r}}}_p\). Obviously, this condition is fulfilled if \(\widehat{{{\varvec{K}}}}\) is nonsingular. In other words, we must have a continuous dependency of the solution upon the right-hand side. Therefore, the existence of a constant c, satisfying
is required. However, the existence of c does not depend on the chosen norms, because in finite dimensions all norms are equivalent. Indeed the numerical values will depend on the dimension of the system. As examples consider \({{\varvec{u}}}\in \mathbb {R}^n \) with the equivalent norms
For \(n<\infty \) there exist the two positive constants \(c_1\) and \(c_2\) satisfying
For \(n \rightarrow \infty \) the latter inequality becomes unbounded from above.
In addition to the solvability condition we are interested in an estimate of the stability of (100): In general we consider a sequence of discrete saddle point problems with increasing mesh densities \(h \rightarrow 0 \) and therefore with increasing dimensions. Let \( k = 0, 1, 2, 3, ..\) denote a sequence of discretizations with increasing mesh densities, i.e., we consider
with \({{\varvec{A}}}_k \in \mathbb {R}^{n_k \times n_k} \), \({{\varvec{B}}}_k \in \mathbb {R}^{m_k \times n_k} \), ..., where the dimensions \(n_k, m_k\) increase with the sequence of k. In addition to the solvability condition we are interested in an estimate
with a constant c independent on k, i.e., independent of the increasing mesh densities. For a meaningful analysis we have to specify the norms entering (105) carefully. For the stability requirement this choice is rather important, because constants appearing in the relations between equivalent (discrete) norms depend on the dimension of the problem, which goes to infinity with \(h \rightarrow 0 \). Obviously, the stability is a concept which has to be applied to a sequence of discretized boundary value problems.
Incidental remark: In order to get an impression on the well-known dependency of some characteristic values of a discretized system on the mesh density we perform an eigenvalue analysis of a one-dimensional elasticity problem, minimizing the discrete energy potential functional on \({{\mathcal B}\in [0,1] = {\mathcal B}_h =\displaystyle \bigcup _e {\mathcal B}^e}\)
In order to find the minimum we compute \(\delta \Pi (u_h, \delta u_h ) = 0\), with
We investigate the evolution of the minimal eigenvalue \(\lambda _{\text {min}}\) for the eigenvalue problems
where \({{\varvec{K}}}_k\) is the stiffness matrix, \(\lambda _k\) the associated eigenvalue to the eigenvector \({{\varvec{v}}}^k\), \({{\varvec{I}}}_k\) the identity and \({{\varvec{M}}}_k\) the mass matrix. We consider a truss clamped at the edges, i.e., the Dirichlet boundary conditions \(u (0) = u (1) = 0\), a Young’s modulus \(E=1\), a cross section \(A=1\). Figure 19 shows the evolution of the minimal eigenvalues with respect to mesh refinement.
Obviously, the eigenvalue problem (108)\(_1\) exhibits a decrease of the amplitude of the lowest eigenvalue with increasing mesh density (from \(h = 1/2 \rightarrow h = 1/500 \)) whereas the formulation (108)\(_2\) seems to offer a lower bound for \(\text {min} (\lambda _k) \). \(\blacksquare \)
6.3 A Saddle Point Problem-Finite-Dimensional Case
Let us concentrate on the saddle point problem for a linear incompressible material behavior. Starting from the general strong form of elasticity given in (62) and substituting the pressure as an additional unknown field as \(p=\lambda {\text {tr}}({\varvec{\varepsilon }}({{\varvec{u}}}))\) leads for the incompressible case \(\lambda \rightarrow \infty \) to
The solution of this problem is similar to the saddle point of the potential
The variational approach and a finite element discretization leads to the finite-dimensional saddle point problem
The equivalent problem is the stationarity requirement of the discrete Lagrange functional
i.e., \(\delta _{{{\varvec{d}}}_u}{\mathcal L}_h=0\) and \(\delta _{{{\varvec{d}}}_p}{\mathcal L}_h=0\), which leads to
where the field quantities have been substituted by the approximations
with \(\mathrm{I\! N}\) and \({{\varvec{d}}}\) denoting the shape functions and nodal values corresponding to the displacements, pressure and its virtual counterparts. The solution of this set of algebraic equation follows from
with \({{\varvec{A}}}_h\in \mathrm{I\! R}^{n\times n}\), \({{\varvec{A}}}_h={{\varvec{A}}}_h^T\) and positive definite, \({{\varvec{B}}}_h\in \mathrm{I\! R}^{m\times n}\), \({{\varvec{f}}}_n\in \mathrm{I\! R}^n\) and \(m<n\). The physical interpretation of \(m<n\) is obvious, there must be less constraints than “free” variables. Obviously, we have the “identical” structure as we obtain from equation (111). Now we have to ensure that (115) is solvable for all right-hand sides \({{\varvec{f}}}_h\), following the remarks of Devendran et al. (2009). This is of course the fact if the whole matrix is invertible, i.e., nonsingular. Let us consider the congruent transformation, known as Sylvester’s law of inertia
this transformation preserves the number of positive and negative eigenvalues (but not their numerical values). However, our system has full rank if the Schur complement
is nonsingular. In this case \({{\varvec{S}}}_h\) is invertible and we can solve system (115). The full rank requirement is equivalent to
i.e., the Schur complement is negative definite. Due to the assumption that \({{\varvec{A}}}_h\) and therefore \({{\varvec{A}}}_h^{-1}\) is positive definite we argue
This means that the kernel of \({{\varvec{B}}}^T\), i.e.,
is trivial, i.e., the image of \({{\varvec{B}}}^T\) is
in other words \({{\varvec{B}}}_h^T \in \mathrm{I\! R}^{n\times m}\), with \(m<n\) has full column rank. If this conditions are fulfilled the system (115) is invertible, i.e.,
Let \(\beta ^2>0\) denote the smallest singular value of \({{\varvec{B}}}_h\). The condition that the smallest eigenvalue \(\beta \), is greater than zero is directly related to the inf-sup condition of saddle point problems, which states
or equivalently
The independency of the mesh size, as discussed for Eq. (105), demands here \(\beta \) to be bounded above zero for \(h\rightarrow 0\).
Furthermore, we obtain the bounds
with the energy norm . Obviously, if \(\beta \) is small the bound for \({{\varvec{d}}}_p\) gets large.
Numerical Inf-Sup Test The numerical inf-sup test was proposed by Chapelle and Bathe (1993). In order to evaluate the inf-sup constant we use the fact that it is equivalent to the square root of the smallest eigenvalue of
with the global mass matrices \({{\varvec{M}}}_{u,h}\), \({{\varvec{M}}}_{p,h}\) as
whereas, \(\mathrm{I\! B}_u\) contains spatial derivatives of the shape functions such that it holds \({\varvec{\varepsilon }}_h=\mathrm{I\! B}_u\,{{\varvec{d}}}_u\). For exemplary purposes, the inf-sup stability is investigated by means of an inf-sup test on the example of the well-known Q\(_1\)P\(_0\) and T\(_2\)P\(_0\), representing elements with a discontinuous pressure approximation, and the T\(_2\)P\(_1\), representing an element with a continuous pressure approximation, see Hood and Taylor (1974). The considered boundary value problem is a simple supported rectangle in 2D and brick in 3D, whereas a consecutive number of mesh refinements is considered. The statement on the inf-sup criterium of the considered elements are well known and the formal proofs can be found in the literature, e.g., Boffi et al. (2009). The T\(_2\)P\(_1\) element is a famous representative of the Taylor–Hood family, which is well known to be inf-sup stable. In contrast the Q\(_1\)P\(_0\) formulation is a text book example for an element which does not satisfy the inf-sup criterium neither in the two-dimensional nor in the three-dimensional case. Interestingly the T\(_2\)P\(_0\) formulation fulfills the inf-sup condition in the two-dimensional case but fails in three dimensions. The depicted results in Fig. 20 approve numerically the statements on the inf-sup stability of the elements: In 2D the T\(_2\)P\(_0\) and the T\(_2\)P\(_1\) elements have an approximately constant \(\Lambda >0\), whereas \(\Lambda \) tends to zero for the Q\(_1\)P\(_0\) element. In 3D, only the T\(_2\)P\(_1\) seems to have a bounded value for \(\Lambda \). Of course, a purely numerical check of the LBB condition is not sufficient, but it gives a first impression of the properties of the solution. To be save, a rigorous mathematical proof is needed.
Notes
- 1.
nonnegative real values \(\mathbb {R}^+_0\), positive real values \(\mathbb {R}^+=\mathbb {R}^+_0\backslash 0\).
- 2.
positive integers \(\mathrm{I\! N}_+=\{1,2,3,\dots \},\) nonnegative integers \(\mathrm{I\! N}_0=\{0,1,2,3,\dots \}=\mathrm{I\! N}_+ \cup \{0\}\).
- 3.
The derivatives occurring in \(H^m({\mathcal B})\) have to be interpreted as weak or generalized derivatives. Classical derivatives are functions defined pointwise on an interval. A weak derivative need only to be locally integrable. If the function is sufficiently smooth, e.g., \(v\in C^m(\overline{{\mathcal B}})\), then its weak derivatives \(D^\alpha u\) coincide with the classical ones for \(|\alpha |\le m\).
- 4.
Note that a restriction to homogeneous Dirichlet boundary conditions is only of technical nature and does not constitute a loss of generality, see, e.g., Braess (1997).
References
Auricchio, F., Brezzi, F., & Lovadina, C. (2004). Mixed finite element methods. In E. Stein, R. de Borst, & T. J. R. Hughes (Eds.), Encyclopedia of computational mechanics (Chap. 9, pp. 238–277). Wiley and Sons.
Bathe, K.-J. (1996). Finite element procedures. New Jersey: Prentice Hall.
Becker, E. B., Carey, G. F. & Oden, J. T. (1981). Finite elements, an introduction: Volume I. Prentice-Hall.
Berdichevsky, V. L. (2009). Variational principles of continuum mechanics. Springer.
Boffi, D., Brezzi, F., & Fortin, M. (2009). Reduced symmetry elements in linear elasticity. Communications on Pure and Applied Analysis, 8, 95–121.
Boffi, D., Brezzi, F., & Fortin, M. (2013). Mixed finite element methods and applications. Heidelberg: Springer.
Braess, D. (1997). Finite elemente (2nd ed.). Berlin: Springer.
Brenner, S. C. & Scott, L. R. (2002). The mathematical theory of finite element methods. In Texts in applied mathematics (Vol. 15, 2nd edition). New York: Springer.
Brezzi F. (1974). On the existence, uniqueness and approximation of saddle-point problems arising from lagrangian multipliers. Revue française d’automatique, informatique, recherche opérationnelle. Analyse numérique, 8(2), 129–151.
Chapelle, D., & Bathe, K.-J. (1993). The inf-sup test. Computers and Structures, 47, 537–545.
Crouzeix, M., & Raviart, P.-A. (1973). Conforming and nonconforming finite element methods for solving the stationary stokes equations i. Revue francaise d’automatique, informatique, recherche operationnelle, 7(3), 33–75.
Devendran, D., May, S., & Corona, E. (2009). Computational fluid dynamics reading group: Finite element methods for stokes and the infamous inf-sup condition.
Ern, A., & Guermond, J.-L. (2013). Theory and practice of finite elements (Vol. 159). Springer Science & Business Media.
Gockenbach, M. S. (2006). Understanding and implementing the finite element method. SIAM.
Goebbels, S. (2015). An inequality for negative norms with application to errors of finite element methods.
Hood, P., & Taylor, C. (1974). Navier-stokes equations using mixed interpolation. In J. T. Oden, O. C. Zienkiewicz, R. H. Gallagher, & C. Taylor (Eds.), Finite element methods in flow problems (pp. 121–132). UAH Press.
Hughes, T. J. R. (1987). The finite element method. Englewood Cliffs, New Jersey: Prentice Hall.
Kendall, R. P., & Wheeler, M. F. (1976). A Crank-Nicolson H\(^{-1}\)-Galerkin procedure for parabolic problems in a single-space variable. SIAM Journal on Numerical Analysis, 13, 861–876.
Oden, J. T., & Carey, G. F. (1983). Finite elements. Mathematical aspects. Volume IV. Prentice-Hall.
Oden, J. T., & Reddy, J. N. (1976). An introduction to the mathematical theory of finite elements. Wiley.
Rachford, H. H., Wheeler, J. R., & Wheeler, M. F. (1974). An H\(^{-1}\)-Galerkin procedure for the two-point boundary value problem. In C. de Boor (Ed.), Mathematical Aspects of Finite Elements in Partial Differential Equations, Proceedings of Symposium, Conducted by the Mathematics Research Center, The University of Wisconsin-Madison. Academic Press.
Royden, H. L. (1968). Real analysis. New York: Macmillan.
Thomée, V. (1980). Negative norm estimates and superconvergence in Galerkin methods for parabolic problems. Mathematics of Computation, 34(149), 93–113.
Thomée, V. (2006). Galerkin finite element methods for parabolic problems. Berlin, Heidelberg: Springer.
Wriggers, P. (2008). Nonlinear finite element methods. Springer.
Acknowledgements
We thank the DFG for the financial support within the SPP 1748 Reliable Simulation Techniques in Solid Mechanics. Development of Non-standard Discretization Methods, Mechanical and Mathematical Analysis, project Novel finite elements— Mixed, Hybrid and Virtual Element formulations (Projectnumber: 255432295) (SCHR 570/23-2). I would also like to thank Nils Viebahn for helpful discussions and his help with the manuscript and Sascha Maassen and Rainer Niekamp for the implementation of the \(H^{-1}\)procedure and accompanying discussions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
A Sobolev and Hilbert Spaces
A Sobolev and Hilbert Spaces
In the following we will use the Sobolev and Hilbert Spaces, they are based on the space of square integrable functions on \({\mathcal B}\):
Let \(s \ge 0\) be a real number, the standard notation for a Sobolev space is \(H^s ({\mathcal B}) \) and \(H^s (\partial {\mathcal B}) \) with the inner products and norm
respectively. For \(s = 0\) the space \(H^0 ({\mathcal B}) \) represents the Hilbert space \(L^2 ({\mathcal B}) \) of all square integrable functions, i.e.,
If s is a positive integer the spaces \(H^s ({\mathcal B}) \) consist of all square integrable functions whose derivatives up to the order s are also square integrable, i.e.,
Here we shall use the semi-norms
and the norm
Critism: This expression for the norm does not take into account a typical length scale l of the problem, i.e., we are adding, for example, a square integrable function \( |u|^2_{L^2 ({\mathcal B})}\) and its square integrable derivative \( |u^{\prime }|^2_{L^2 ({\mathcal B})}\). Without any physically meaningful parameters these expression is hardly to interpret. This could be avoided by using the expression
where d characterizes the dimension of \({\mathcal B}\subset \mathbb {R}^{\text {d} }\), Boffi et al. (2013).
With \({\text {D}}^{\alpha }\) as the \(\alpha \)-st weak differential operator. Thus the often used spaces \( H^1 ({\mathcal B}) \) and \( H^1_0 ({\mathcal B}) \) are defined by
and
For completeness we introduce the spaces \( H^2 ({\mathcal B}) \) and \( H^2_0 ({\mathcal B}) \) defined by
and
For negative superscripts, i.e., \(H^{-s} ({\mathcal B}) \) with \( s > 0\), the spaces are identified with the duals of \(H^{s}_0 ({\mathcal B}) \):
For example, the norm associated to \(H^{-1} ({\mathcal B}) \), which is the dual of \(H^{1}_0 ({\mathcal B}) \), is defined as
The norm associated to \(H^{-{}^{1}\!/_{2}} (\partial {\mathcal B}) \), the dual of \(H^{{}^{1}\!/_{2}}_0 (\partial {\mathcal B}) \), is defined as
The Hilbert space \(H_0^{m} ({\mathcal B}) \) is a closed subspace of \(H^{m} ({\mathcal B}) \); furthermore is \(H_0^{0} ({\mathcal B}) = L_{2} ({\mathcal B}) \).
For tensorial Sobolev spaces, e.g., the three-dimensional tensor product space
we use the abbreviation
Let \({{\varvec{u}}}\in \mathbb {R}^3\) and set the Hilbert space
with the associated norm
Rights and permissions
Copyright information
© 2020 CISM International Centre for Mechanical Sciences
About this chapter
Cite this chapter
Schröder, J. (2020). Engineering Notes on Concepts of the Finite Element Method for Elliptic Problems. In: Schröder, J., de Mattos Pimenta, P. (eds) Novel Finite Element Technologies for Solids and Structures. CISM International Centre for Mechanical Sciences, vol 597. Springer, Cham. https://doi.org/10.1007/978-3-030-33520-5_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-33520-5_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33519-9
Online ISBN: 978-3-030-33520-5
eBook Packages: EngineeringEngineering (R0)