An unbiased Nitsche’s approximation of the frictional contact between two elastic structures

Chouly, Franz; Mlika, Rabii; Renard, Yves

doi:10.1007/s00211-018-0950-x


Published: 17 February 2018

Volume 139, pages 593–631, (2018)

Numerische Mathematik


Abstract

Most of the numerical methods dedicated to the contact problem involving two elastic bodies are based on the master/slave paradigm. This results in important detection difficulties in the case of self-contact and multi-body contact, where it may be impractical, if not impossible, to a priori nominate a master surface and a slave one. In this work we introduce an unbiased finite element method for the approximation of frictional contact between two elastic bodies in the small deformation framework. In the proposed method the two bodies expected to come into contact are treated in the same way (no master and slave surfaces). The key ingredient is a Nitsche-based formulation of contact conditions, as in Chouly et al. (Math Comput 84:1089–1112, 2015). We carry out the numerical analysis of the method, and prove its well-posedness and optimal convergence in the $H^1$-norm. Numerical experiments are performed to illustrate the theoretical results and the performance of the method.


1 Introduction

Although contact mechanics for deformable bodies has been studied extensively, its computational treatment in small or large strain is still the subject of intensive research. The most common paradigm for treating two deformable bodies in contact is the master/slave formulation. In this approach one distinguishes between a master surface and a slave surface, and the non-penetration condition is prescribed on the slave surface. A presentation of this formulation and of the contact problem can be found in Laursen’s work [18, 19] (see also [20]), and a presentation of discretization schemes and numerical algorithms for mechanical contact is given in [28]. This approach faces important difficulties, especially in the case of self-contact and multi-body contact, where it is impossible or impractical to a priori nominate a master surface and a slave one. Automating the detection of, and the separation between, slave and master surfaces in these cases may cause a lack of robustness.

Although the master/slave formulation is a natural extension of the contact treatment between a deformable body and a rigid ground, it has no complete theoretical justification. To avoid these difficulties, we provide in this article an unbiased formulation of the contact problem between two elastic bodies in the small strain framework. In this formulation we do not distinguish between a master surface and a slave one, since we impose the non-penetration and the friction conditions on both of them. Unbiased contact and friction formulations have been considered before in [26] and references therein. There, the authors present a numerical study of the method and make use of a penalized formulation of contact and friction. The terms two-pass and two-half-pass are also used in the literature to describe this type of method.

This study can be seen as a first step in the construction of a method taking into account contact between two elastic solids and self-contact in large transformations in the same formalism. The present formulation, in small deformations, allows us to ensure the consistency, the convergence and the optimality of the method. In this context, the aim of this paper is to provide an unbiased description of the contact and Tresca friction conditions that relies on a Nitsche-type treatment of the contact conditions.

Nitsche’s treatment of contact is an extension of the method proposed in 1971 by J. Nitsche to impose Dirichlet conditions within the variational formulation without adding Lagrange multipliers [23]. Nitsche’s method has been widely applied to problems involving linear conditions on the boundary of a domain or at the interface between sub-domains: see, e.g. [27] for the Dirichlet problem or [1] for domain decomposition with non-matching meshes. More recently, in [13] and [15] it has been adapted for bilateral (persistent) contact, which still involves linear boundary conditions on the contact zone. A Nitsche-based formulation for the Finite Element discretization of the unilateral (non-linear) contact problem in linear elasticity was introduced in [6] and generalized in [8] to encompass symmetric and non-symmetric variants. A simple adaptation of the Nitsche-based Finite Element Method to Tresca’s friction is proposed in [5]. In contrast to standard penalization techniques (see [7, 17]), the resulting method is consistent. Moreover, unlike mixed methods (see [14, 16]), no additional unknown (Lagrange multiplier) is needed.

Other possibilities for contact discretization are, for instance, node-to-segment techniques or the mortar method. Note that the mortar method is an efficient alternative that has been widely applied to contact problems (see [2, 21, 24]). The mortar technique makes it possible to match independent discretizations of the contacting solids and takes into account the unilateral contact conditions in a convenient way. The procedure provides variationally consistent contact pressures. However, mortar methods normally lead to asymmetric formulations, since they distinguish between a master (or mortar) and a slave (or non-mortar) surface. Thus, the adaptation to an unbiased contact description is considerably easier with Nitsche’s method than with a mortar one. In fact, since Nitsche’s method uses the contact stress as a multiplier, it is very simple to divide this contact effort equally between the two contact surfaces. A comparison between Nitsche’s method and mortar-type ones for linear elasticity is provided in [12].

The formulation described in this paper uses an additional parameter $\theta $ as in [8], allowing us to introduce variants that differ in the symmetry, skew-symmetry or non-symmetry of the discrete formulation. Moreover, a unified analysis of all these variants can be performed. We also provide theoretical and numerical verifications of the proposed method. First, we prove the consistency of the method, its well-posedness and its optimal convergence. Then, numerical tests are performed to confirm the theoretical results.

In Sect. 2 we build an unbiased formulation of the frictional (Tresca) contact problem between two elastic bodies. This formulation is based on Nitsche’s method. To prove the efficiency of the method (15), we carry out some mathematical analysis in Sect. 3. In Sect. 4, the last section of this paper, we present the results of several two- and three-dimensional numerical tests. The tests cover a convergence study of the global relative error of the displacement in the $H^1$-norm and of the contact pressure error in the $L^2$-norm, for different values of the parameter $\theta $ and of the Nitsche parameter $\gamma _0$. The open source environment GetFEM++ (see Footnote 1) is used to perform the tests.

2 Setting of the problem

2.1 Formal statement of the two bodies contact problem

We consider two elastic bodies expected to come into contact. To simplify notations, a general index i is used to represent indifferently the 1st or the 2nd body. Let $\Omega ^i$ be the domain in $\mathbb {R}^d$ occupied by the reference configuration of the i-th body, with $d=2$ or 3. The small strain assumption is made, as well as plane strain when $d=2$. We suppose that the boundary $\partial \Omega ^i$ of each body consists of three non-overlapping parts ${\Gamma }^i_D$, ${\Gamma }^i_N$ and ${\Gamma }^i_C$. On ${\Gamma }^i_D$ (resp. ${\Gamma }^i_N$) displacements $\mathbf{u}^i$ (resp. tractions $\mathbf{t}^i$) are given. The body is clamped on ${\Gamma }^i_D$ for the sake of simplicity. In addition, each body can be subjected to a volume force $\mathbf{f}^i$ (such as gravity). We denote by ${\Gamma }^i_C$ a portion of the boundary of the i-th body which is a candidate contact surface, with outward unit normal vector $\mathbf{n}^i$. The actual surface on which a body comes into contact with the other one is not known in advance, but is contained in the portion ${\Gamma }^i_C$ of $\partial \Omega ^i$.

Furthermore, let us suppose that ${\Gamma }^i_C$ is smooth. For the contact surfaces, let us assume that there exists a sufficiently smooth one-to-one mapping (a projection, for instance) sending each point of the first contact surface to a point of the second one:

$$\begin{aligned} \Pi ^1: {\Gamma }^1_C \rightarrow {\Gamma }^2_C. \end{aligned}$$

Let $ J^1$ be the Jacobian determinant of the transformation $\Pi ^1$ and $\displaystyle J^2=\frac{1}{J^1}$ the Jacobian determinant of $\Pi ^2=(\Pi ^1)^{-1}$. We suppose in the following that $J^1>0$.

We define on each contact surface a normal vector $\tilde{\mathbf{n}}^i$ (see Fig. 1) such that:

$$\begin{aligned} \tilde{\mathbf{n}}^i(\mathbf{x})= {\left\{ \begin{array}{ll} \displaystyle \frac{ \Pi ^i(\mathbf{x}) -\mathbf{x}}{\Vert \Pi ^i(\mathbf{x}) -\mathbf{x}\Vert } &{} \text {if}\quad \mathbf{x}\ne \Pi ^i(\mathbf{x}), \\ \mathbf{n}^i &{} \text {if}\quad \mathbf{x}=\Pi ^i(\mathbf{x}). \end{array}\right. } \end{aligned}$$

Note that $\tilde{\mathbf{n}}^1=-\tilde{\mathbf{n}}^2\circ \Pi ^1$ and $\tilde{\mathbf{n}}^2=-\tilde{\mathbf{n}}^1\circ \Pi ^2$.

The displacements of the bodies, relatively to the fixed spatial frame are represented by $\mathbf{u}= (\mathbf{u}^1, \mathbf{u}^2) $, where $\mathbf{u}^i$ is the displacement field of the i-th body.

The contact problem in linear elasticity consists in finding the displacement field $\mathbf{u}$ satisfying Eqs. (1a)–(1d) and the contact conditions described hereafter:

$$\begin{aligned}&\mathbf{div}\, {{\varvec{\sigma }}}^i(\mathbf{u}^i) + \mathbf{f}^i = \mathbf{0} \text { in } \Omega ^i , \end{aligned}$$

(1a)

$$\begin{aligned}&{{\varvec{\sigma }}}^i(\mathbf{u}^i)= A^i {{\varvec{\varepsilon }}}(\mathbf{u}^i) \text { in } \Omega ^i , \end{aligned}$$

(1b)

$$\begin{aligned}&\mathbf{u}^i = \mathbf{0} \text { on } {\Gamma }^i_D , \end{aligned}$$

(1c)

$$\begin{aligned}&{{\varvec{\sigma }}}^i(\mathbf{u}^i) \mathbf{n}^i = \mathbf{t}^i \text { on } {\Gamma }^i_N, \end{aligned}$$

(1d)

where ${{\varvec{\sigma }}}^i= ({\sigma }^i_{jk})_{1 \le j,k \le d}$ stands for the stress tensor field and $\mathbf{div}$ denotes the divergence operator of tensor-valued functions. The notation ${{\varvec{\varepsilon }}}(\mathbf{v}) = \tfrac{1}{2}({{\varvec{\nabla }}}\mathbf{v}+ {{\varvec{\nabla }}}\mathbf{v}^{T})$ represents the linearized strain tensor field and $A^i$ is the fourth-order symmetric elasticity tensor on $\Omega ^i$, having the usual uniform ellipticity and boundedness properties.

For any displacement field $\mathbf{v}^i$ and for any density of surface forces ${{\varvec{\sigma }}}^i(\mathbf{v}^i)\mathbf{n}^i$ defined on $\partial \Omega ^i$ we adopt the following notation:

$$\begin{aligned} \mathbf{v}^i = v^i_n\tilde{\mathbf{n}}^i+ \mathbf{v}^i_t \text { and } {{\varvec{\sigma }}}^i(\mathbf{v}^i)\mathbf{n}^i = {\sigma }^i_n(\mathbf{v}^i)\tilde{\mathbf{n}}^i + {{\varvec{\sigma }}}^i_t(\mathbf{v}^i), \end{aligned}$$

where $ \mathbf{v}^i_t$ (resp. ${{\varvec{\sigma }}}^i_t(\mathbf{v}^i)$) is the tangential component of $\mathbf{v}^i$ (resp. ${{\varvec{\sigma }}}^i(\mathbf{v}^i)\mathbf{n}^i$).

We define an initial normal gap representing the normal distance between a point $\mathbf{x}$ of ${\Gamma }^i_C$ and its image on the other body: $g^i_n=(\Pi ^i(\mathbf{x})-\mathbf{x})\cdot \tilde{\mathbf{n}}^i$.

We define, as well, the relative normal displacements $\llbracket u \rrbracket ^1_n= (\mathbf{u}^1-\mathbf{u}^2\circ \Pi ^1)\cdot \tilde{\mathbf{n}}^1$ and $\llbracket u \rrbracket ^2_n= (\mathbf{u}^2-\mathbf{u}^1\circ \Pi ^2)\cdot \tilde{\mathbf{n}}^2$.

Remark 1.1

Note that: $g_n^1\circ \Pi ^2=g_n^2$ and $g_n^2\circ \Pi ^1=g_n^1$; $\llbracket u \rrbracket ^1_n\circ \Pi ^2 = \llbracket u \rrbracket ^2_n$ and $\llbracket u \rrbracket ^2_n\circ \Pi ^1=\llbracket u \rrbracket ^1_n$.
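The definitions of $\tilde{\mathbf{n}}^i$, $g^i_n$ and $\llbracket u \rrbracket ^i_n$ can be illustrated with a minimal sketch. It assumes a hypothetical configuration that is not taken from the paper: two flat, parallel contact surfaces, with $\Pi ^1$ the vertical projection, and arbitrary sample displacements. The function names are illustrative only.

```python
import math

def dot(a, b):
    """Euclidean scalar product of two vectors given as sequences."""
    return sum(ai * bi for ai, bi in zip(a, b))

def unit_direction(x, px, fallback):
    """Return (px - x)/||px - x||, or `fallback` when the two points coincide."""
    d = [b - a for a, b in zip(x, px)]
    norm = math.sqrt(dot(d, d))
    if norm == 0.0:
        return list(fallback)
    return [c / norm for c in d]

def normal_gap(x, px, n_tilde):
    """Initial normal gap g_n = (Pi(x) - x) . n_tilde."""
    return dot([b - a for a, b in zip(x, px)], n_tilde)

# Assumed geometry: Gamma_C^1 is the line y = 0, Gamma_C^2 the line y = 0.1.
x1 = (0.3, 0.0)
x2 = (0.3, 0.1)        # x2 = Pi^1(x1), hence x1 = Pi^2(x2)
n1_out = (0.0, 1.0)    # outward unit normal of body 1
n2_out = (0.0, -1.0)   # outward unit normal of body 2

n1_tilde = unit_direction(x1, x2, n1_out)
n2_tilde = unit_direction(x2, x1, n2_out)
g1 = normal_gap(x1, x2, n1_tilde)
g2 = normal_gap(x2, x1, n2_tilde)

# Arbitrary sample displacements of the two matched points.
u1 = (0.0, 0.02)
u2 = (0.0, -0.03)
jump1 = dot([a - b for a, b in zip(u1, u2)], n1_tilde)  # [[u]]^1_n at x1
jump2 = dot([b - a for a, b in zip(u1, u2)], n2_tilde)  # [[u]]^2_n at x2
```

In this configuration the two normals are opposite, and both the gap and the relative normal displacement take the same value when evaluated from either surface, as stated in Remark 1.1.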

In order to obtain an unbiased formulation of the contact problem we prescribe the contact conditions deduced from the Signorini problem conditions (see [17]) on the two surfaces in a symmetric way. Thus, the conditions describing contact on ${\Gamma }^1_C$ and ${\Gamma }^2_C$ are:

$$\begin{aligned} \llbracket u \rrbracket ^1_n\le & {} g^1_n \end{aligned}$$

(2a)

$$\begin{aligned} {\sigma }^1_n(\mathbf{u}^1)\le & {} 0 \quad \text { on }{\Gamma }^1_C, \end{aligned}$$

(2b)

$$\begin{aligned} {\sigma }^1_n(\mathbf{u}^1)(\llbracket u\rrbracket ^1_n-g^1_n)= & {} 0 \end{aligned}$$

(2c)

$$\begin{aligned} \llbracket u \rrbracket ^2_n\le & {} g^2_n \end{aligned}$$

(3a)

$$\begin{aligned} {\sigma }^2_n(\mathbf{u}^2)\le & {} 0 \quad \text { on }{\Gamma }^2_C. \end{aligned}$$

(3b)

$$\begin{aligned} {\sigma }^2_n(\mathbf{u}^2)(\llbracket u \rrbracket ^2_n-g^2_n)= & {} 0 \end{aligned}$$

(3c)

Let $s^i \in L^2({\Gamma }^i_C) $, $s^i\ge 0$, be the Tresca friction threshold associated with the physical properties of the i-th surface, and define the relative tangential displacements $\llbracket \mathbf{u}\rrbracket ^1_t = \mathbf{u}_t^1-\mathbf{u}_t^2\circ \Pi ^1$ and $\llbracket \mathbf{u}\rrbracket ^2_t = \mathbf{u}_t^2-\mathbf{u}_t^1\circ \Pi ^2=- \llbracket \mathbf{u}\rrbracket ^1_t\circ \Pi ^2$.

The Tresca friction condition on ${\Gamma }^1_C$ and ${\Gamma }^2_C$ reads:

$$\begin{aligned} {\left\{ \begin{array}{ll} \Vert {{\varvec{\sigma }}}^i_t(\mathbf{u}^i)\Vert \le s^i &{} \text {if}\quad \llbracket \mathbf{u}\rrbracket ^i_t = 0,\\ \displaystyle {{\varvec{\sigma }}}^i_t(\mathbf{u}^i)= -s^i\frac{\llbracket \mathbf{u}\rrbracket ^i_t}{\Vert \llbracket \mathbf{u}\rrbracket ^i_t\Vert } &{} \text {otherwise}, \end{array}\right. } \end{aligned}$$

(4)

where $\Vert \cdot \Vert $ stands for the Euclidean norm in $\mathbb {R}^{d-1}$.

Remark 1.2

In the frictionless contact case this condition is simply replaced by $ {{\varvec{\sigma }}}^i_t=0$.

Finally, we need to consider Newton’s third law (action and reaction) between the two bodies:

$$\begin{aligned} {\left\{ \begin{array}{ll} \displaystyle \int _{\gamma ^1_C}{\sigma }^1_n(\mathbf{u}^1) \mathrm{d}s - \int _{\gamma ^2_C}{\sigma }^2_n(\mathbf{u}^2) \mathrm{d}s=0,\\ \displaystyle \int _{\gamma ^1_C}{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \mathrm{d}s+\int _{\gamma ^2_C}{{\varvec{\sigma }}}^2_t(\mathbf{u}^2) \mathrm{d}s=0, \end{array}\right. } \end{aligned}$$

where $\gamma ^1_C$ is any subset of ${\Gamma }^1_C$ and $\gamma ^2_C=\Pi ^1(\gamma _C^1)$. Mapping all terms on $\gamma _C^1$ allows writing:

$$\begin{aligned} {\left\{ \begin{array}{ll} \displaystyle \int _{\gamma ^1_C}{\sigma }^1_n(\mathbf{u}^1) - J^1{\sigma }^2_n(\mathbf{u}^2\circ \Pi ^1) \mathrm{d}s = 0,\\ \displaystyle \int _{\gamma ^1_C}{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) + J^1{{\varvec{\sigma }}}^2_t(\mathbf{u}^2\circ \Pi ^1) \mathrm{d}s = 0, \end{array}\right. } \forall \gamma _C^1 \subset {\Gamma }^1_C \end{aligned}$$

so we obtain:

$$\begin{aligned} {\left\{ \begin{array}{ll} \displaystyle {\sigma }^1_n(\mathbf{u}^1)- J^1 {\sigma }^2_n(\mathbf{u}^2\circ \Pi ^1)= 0 , \\ \displaystyle {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)+ J^1 {{\varvec{\sigma }}}^2_t(\mathbf{u}^2\circ \Pi ^1) = 0, \end{array}\right. } \text {on }{\Gamma }_C^1. \end{aligned}$$

(5)

Remark 1.3

A similar condition holds on ${\Gamma }_C^2$:

$$\begin{aligned} {\left\{ \begin{array}{ll} \displaystyle {\sigma }^2_n(\mathbf{u}^2) - J^2 {\sigma }^1_n(\mathbf{u}^1\circ \Pi ^2) = 0,\\ \displaystyle {{\varvec{\sigma }}}^2_t(\mathbf{u}^2) +J^2 {{\varvec{\sigma }}}^1_t(\mathbf{u}^1\circ \Pi ^2) = 0. \end{array}\right. } \end{aligned}$$

It is important to mention that, due to Newton’s third law, the thresholds $s^1$ and $s^2$ must be chosen consistently. Indeed, when sliding occurs, (4) and (5) give: $\displaystyle -s^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\Vert \llbracket \mathbf{u}\rrbracket ^1_t\Vert }={{\varvec{\sigma }}}^1_t(\mathbf{u}^1)= -J^1{{\varvec{\sigma }}}^2_t(\mathbf{u}^2\circ \Pi ^1) = J^1 s^2\frac{\llbracket \mathbf{u}\rrbracket ^2_t\circ \Pi ^1}{\Vert \llbracket \mathbf{u}\rrbracket ^2_t\circ \Pi ^1\Vert }= -J^1 s^2\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\Vert \llbracket \mathbf{u}\rrbracket ^1_t\Vert }.$

And so:

$$\begin{aligned} s^1=J^1s^2. \end{aligned}$$

(6)
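As a simple illustration of (6), consider the following configuration, which is an assumption made here and is not taken from the paper: in dimension $d=2$, the two contact surfaces are concentric circular arcs of radii $r_1$ and $r_2$ spanning the same angle, and $\Pi ^1$ is the radial projection. Arc lengths are then scaled by the factor $r_2/r_1$, so that:

$$\begin{aligned} J^1=\frac{r_2}{r_1},\qquad J^2=\frac{r_1}{r_2},\qquad s^1=\frac{r_2}{r_1}\,s^2. \end{aligned}$$

For a constant threshold $s^2$, the integral of $s^1$ over ${\Gamma }^1_C$ then equals the integral of $s^2$ over ${\Gamma }^2_C$, which is the balance that (6) expresses pointwise.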

2.2 Variational formulation using Nitsche’s method

In this section, we establish the weak formulation of problem (1)–(5) using Nitsche’s method and the unbiased form of the contact and friction conditions given in Sect. 2.1.

As in [8], we introduce an additional parameter $\theta $. This generalization allows several variants, depending on the value of $\theta $. The symmetric case is obtained when $\theta = 1$. The advantage of the symmetric formulation is that it derives from an energy potential (see Sect. 2.3). This feature is lost when $\theta \ne 1$. Nevertheless, the variants $\theta =-1$ and $\theta =0$ present some other advantages, mostly from the numerical viewpoint. In particular, the case $\theta = 0$ involves a reduced number of terms, which makes it easier to implement and to extend to contact problems involving non-linear elasticity. Also, for $\theta =-1$, the well-posedness of the discrete formulation and the optimal convergence are preserved irrespective of the value of the Nitsche parameter $\gamma ^i$. Some general guidelines on how to choose $\gamma _0$ and $\theta $ are provided in Sect. 3.5. First, we introduce the Hilbert space

$$\begin{aligned} \mathbf{V}=\Big \{\mathbf{v}= (\mathbf{v}^1,\mathbf{v}^2) \in H^1(\Omega ^1)^d \times H^1(\Omega ^2)^d : \quad \mathbf{v}^1= \mathbf{0} \text { on }\Gamma ^1_D \text { and } \mathbf{v}^2= \mathbf{0} \text { on }\Gamma ^2_D\Big \}. \end{aligned}$$

Let $\mathbf{u}=(\mathbf{u}^1,\mathbf{u}^2)$ be the solution of the contact problem in its strong form (1)–(5). We assume that $\mathbf{u}$ is sufficiently regular so that all the following calculations make sense.

The derivation of a Nitsche-based method comes from a reformulation of the contact conditions (2a)–(2c) and (3a)–(3c) (see for instance [6] and [8]). This reformulation is similar to the augmented Lagrangian formulation of contact problems. For a given positive function $\gamma ^i$, these contact conditions are equivalent to Eq. (7):

$$\begin{aligned} {\sigma }^i_n\left( \mathbf{u}^i\right) = -\frac{1}{\gamma ^i}\left[ (\llbracket u \rrbracket ^i_n - g^i_n) - \gamma ^i {\sigma }^i_n(\mathbf{u}^i)\right] _{+}, \end{aligned}$$

(7)

where the notation $[\cdot ]_+$ refers to the positive part of a scalar quantity. Similarly, as in [5], the Tresca friction condition is equivalent to the equation

$$\begin{aligned} {{\varvec{\sigma }}}^i_t\left( \mathbf{u}^i\right) = -\frac{1}{\gamma ^i}\left[ \llbracket \mathbf{u}\rrbracket ^i_t-\gamma ^i{{\varvec{\sigma }}}^i_t(\mathbf{u}^i)\right] _{\gamma ^i s^i }, \end{aligned}$$

(8)

where, for any $\alpha \in \mathbb {R}^+$, the notation $[\cdot ]_{\alpha }$ refers to the orthogonal projection onto $ \mathcal B(0,\alpha )\subset \mathbb {R}^{d-1}$, the closed ball centered at the origin and of radius $\alpha $. In what follows some properties of the positive part and the projection are mentioned. Those properties will be useful in the analysis of the method.
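The equivalences behind (7) and (8) can be checked pointwise with a small sketch. The numerical values, state names and function names below are illustrative assumptions; the code only evaluates the formulas at a single point and is not the discretization used in the paper.

```python
import math

def positive_part(a):
    """[a]_+ for a scalar a."""
    return a if a > 0.0 else 0.0

def project_ball(x, alpha):
    """Orthogonal projection of the vector x onto the closed ball B(0, alpha)."""
    norm = math.sqrt(sum(c * c for c in x))
    if norm <= alpha:
        return list(x)
    return [alpha * c / norm for c in x]

def satisfies_signorini(jump, gap, sigma_n, tol=1e-12):
    """Conditions (2a)-(2c): jump <= gap, sigma_n <= 0, sigma_n*(jump - gap) = 0."""
    return (jump - gap <= tol and sigma_n <= tol
            and abs(sigma_n * (jump - gap)) <= tol)

def satisfies_eq7(jump, gap, sigma_n, gamma, tol=1e-12):
    """Eq. (7): sigma_n = -(1/gamma) [(jump - gap) - gamma*sigma_n]_+."""
    rhs = -positive_part((jump - gap) - gamma * sigma_n) / gamma
    return abs(sigma_n - rhs) <= tol

def tresca_residual(jump_t, sigma_t, gamma, s):
    """Norm of sigma_t + (1/gamma) [jump_t - gamma*sigma_t]_{gamma*s}; zero iff (8) holds."""
    arg = [j - gamma * sg for j, sg in zip(jump_t, sigma_t)]
    proj = project_ball(arg, gamma * s)
    return math.sqrt(sum((sg + p / gamma) ** 2 for sg, p in zip(sigma_t, proj)))

gamma, gap = 0.5, 0.1
# (jump, sigma_n): two admissible states followed by two inadmissible ones.
states = {
    "separated": (0.04, 0.0),
    "in contact": (0.1, -3.0),
    "penetration": (0.15, 0.0),
    "adhesion": (0.1, 2.0),
}
results = {name: (satisfies_signorini(j, gap, sig), satisfies_eq7(j, gap, sig, gamma))
           for name, (j, sig) in states.items()}

gamma_t, s = 0.25, 2.0
# Stick: no tangential jump, tangential stress below the threshold s.
stick = tresca_residual([0.0, 0.0], [1.2, -0.5], gamma_t, s)
# Slip: sigma_t = -s * jump_t / ||jump_t|| with jump_t = (0.3, 0.4).
slip = tresca_residual([0.3, 0.4], [-s * 0.6, -s * 0.8], gamma_t, s)
# Inadmissible: sliding with a stress pointing along the jump.
bad = tresca_residual([0.3, 0.4], [s * 0.6, s * 0.8], gamma_t, s)
```

For each sample state the two Boolean entries of `results` agree, and the Tresca residual vanishes (up to rounding) exactly for the stick and slip states that satisfy (4).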

Since $a\le [a]_+$ and $a[a]_+=[a]_+^2$ $ \forall a\in \mathbb {R}$, we can write that for all $a,b \in \mathbb {R}$:

$$\begin{aligned} ([a]_+-[b]_+)(a-b)= & {} a[a]_++b[b]_+-b[a]_+-a[b]_+ \nonumber \\\ge & {} [a]_+^2+[b]_+^2-2[a]_+[b]_+\nonumber \\= & {} ([a]_+-[b]_+)^2. \end{aligned}$$

(9)

We note, also, the following classical property for a projection for all $\mathbf{x},\mathbf{y}\in \mathbb {R}^{d-1}$:

$$\begin{aligned} (\mathbf{y}-\mathbf{x}) \cdot ([\mathbf{y}]_\alpha -[\mathbf{x}]_\alpha )\ge \Vert [\mathbf{y}]_\alpha -[\mathbf{x}]_\alpha \Vert ^2. \end{aligned}$$

(10)
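Inequalities (9) and (10) can be spot-checked on random samples. The sketch below is only a numerical sanity check under assumed sample ranges and an assumed radius $\alpha =1$; it is not a proof, and the variable names are illustrative.

```python
import math
import random

def positive_part(a):
    """[a]_+ for a scalar a."""
    return max(a, 0.0)

def project_ball(x, alpha):
    """Orthogonal projection of x onto the closed ball B(0, alpha)."""
    norm = math.sqrt(sum(c * c for c in x))
    return list(x) if norm <= alpha else [alpha * c / norm for c in x]

random.seed(0)
alpha = 1.0
min_slack_9 = float("inf")    # smallest value of (lhs - rhs) observed for (9)
min_slack_10 = float("inf")   # smallest value of (lhs - rhs) observed for (10)

for _ in range(10000):
    # Scalar inequality (9).
    a, b = random.uniform(-2.0, 2.0), random.uniform(-2.0, 2.0)
    pa, pb = positive_part(a), positive_part(b)
    min_slack_9 = min(min_slack_9, (pa - pb) * (a - b) - (pa - pb) ** 2)

    # Vector inequality (10), here in R^2 (tangential plane when d = 3).
    x = [random.uniform(-2.0, 2.0) for _ in range(2)]
    y = [random.uniform(-2.0, 2.0) for _ in range(2)]
    px, py = project_ball(x, alpha), project_ball(y, alpha)
    lhs = sum((yi - xi) * (pyi - pxi) for xi, yi, pxi, pyi in zip(x, y, px, py))
    rhs = sum((pyi - pxi) ** 2 for pxi, pyi in zip(px, py))
    min_slack_10 = min(min_slack_10, lhs - rhs)
```

Both slack values stay non-negative up to floating-point rounding, consistent with (9) and (10).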

From the Green formula and Eqs. (1a)–(1d), we get for every $\mathbf{v}\in \mathbf{V}$:

$$\begin{aligned}&\int _{\Omega ^1}{{\varvec{\sigma }}}^1(\mathbf{u}^1):{{\varvec{\varepsilon }}}(\mathbf{v}^1) \mathrm{d}\Omega +\int _{\Omega ^2}{{\varvec{\sigma }}}^2(\mathbf{u}^2):{{\varvec{\varepsilon }}}(\mathbf{v}^2) \mathrm{d}\Omega = \int _{\Omega ^1}\mathbf{f}^1\cdot \mathbf{v}^1\mathrm{d}\Omega +\int _{\Omega ^2}\mathbf{f}^2\cdot \mathbf{v}^2 \mathrm{d}\Omega \\&\quad + \int _{{\Gamma }^1_N}\mathbf{t}^1\cdot \mathbf{v}^1 \mathrm{d}{\Gamma }+\int _{{\Gamma }^2_N}\mathbf{t}^2\cdot \mathbf{v}^2\mathrm{d}{\Gamma }+ \int _{{\Gamma }^1_C}{{\varvec{\sigma }}}^1(\mathbf{u}^1)\mathbf{n}^1\cdot \mathbf{v}^1\mathrm{d}{\Gamma }+ \int _{{\Gamma }^2_C}{{\varvec{\sigma }}}^2(\mathbf{u}^2)\mathbf{n}^2\cdot \mathbf{v}^2\mathrm{d}{\Gamma }. \end{aligned}$$

We define

$$\begin{aligned} a(\mathbf{u},\mathbf{v})= & {} \int _{\Omega ^1}{{\varvec{\sigma }}}^1(\mathbf{u}^1):{{\varvec{\varepsilon }}}(\mathbf{v}^1) \mathrm{d}\Omega +\int _{\Omega ^2}{{\varvec{\sigma }}}^2(\mathbf{u}^2):{{\varvec{\varepsilon }}}(\mathbf{v}^2) \mathrm{d}\Omega ,\\ \text {and}\\ L(\mathbf{v})= & {} \int _{\Omega ^1}\mathbf{f}^1\cdot \mathbf{v}^1\mathrm{d}\Omega +\int _{\Omega ^2}\mathbf{f}^2\cdot \mathbf{v}^2 \mathrm{d}\Omega + \int _{{\Gamma }^1_N}\mathbf{t}^1\cdot \mathbf{v}^1 \mathrm{d}{\Gamma }+\int _{{\Gamma }^2_N}\mathbf{t}^2\cdot \mathbf{v}^2\mathrm{d}{\Gamma }. \end{aligned}$$

So, there holds:

$$\begin{aligned}&a(\mathbf{u},\mathbf{v})- \int _{{\Gamma }^1_C}{\sigma }_n^1(\mathbf{u}^1)v_n^1\mathrm{d}{\Gamma }- \int _{{\Gamma }^2_C}{\sigma }_n^2(\mathbf{u}^2)v_n^2\mathrm{d}{\Gamma }-\int _{{\Gamma }^1_C}{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot \mathbf{v}_t^1\mathrm{d}{\Gamma }\\&\quad - \int _{{\Gamma }^2_C}{{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot \mathbf{v}_t^2\mathrm{d}{\Gamma }= L(\mathbf{v}). \end{aligned}$$

Using condition (5) we can write

$$\begin{aligned}&\displaystyle a(\mathbf{u},\mathbf{v})- \frac{1}{2}\int _{{\Gamma }^1_C}\left( {\sigma }_n^1(\mathbf{u}^1) + J^1 {\sigma }_n^2(\mathbf{u}^2\circ \Pi ^1)\right) v_n^1\mathrm{d}{\Gamma }- \frac{1}{2}\int _{{\Gamma }^2_C}\left( {\sigma }_n^2(\mathbf{u}^2)\right. \\&\left. \quad +\, J^2{\sigma }_n^1(\mathbf{u}^1\circ \Pi ^2)\right) v_n^2\mathrm{d}{\Gamma }\\&\quad \displaystyle - \frac{1}{2}\int _{{\Gamma }^1_C}\left( {{\varvec{\sigma }}}_t^1(\mathbf{u}^1) - J^1 {{\varvec{\sigma }}}_t^2(\mathbf{u}^2\circ \Pi ^1)\right) \cdot \mathbf{v}_t^1\mathrm{d}{\Gamma }- \frac{1}{2}\int _{{\Gamma }^2_C}\left( {{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\right. \\&\left. \quad - J^2{{\varvec{\sigma }}}_t^1(\mathbf{u}^1\circ \Pi ^2)\right) \cdot \mathbf{v}_t^2\mathrm{d}{\Gamma }= L(\mathbf{v}). \end{aligned}$$

So, using the property $\displaystyle \int _{{\Gamma }^1_C}J^1f\mathrm{d}{\Gamma }=\int _{{\Gamma }^2_C}f\circ \Pi ^2\mathrm{d}{\Gamma }$, we have

$$\begin{aligned}&\displaystyle a(\mathbf{u},\mathbf{v})- \frac{1}{2}\int _{{\Gamma }^1_C} {\sigma }_n^1(\mathbf{u}^1) v_n^1\mathrm{d}{\Gamma }- \frac{1}{2}\int _{{\Gamma }^1_C}{\sigma }_n^1(\mathbf{u}^1)(v_n^2\circ \Pi ^1)\mathrm{d}{\Gamma }- \frac{1}{2}\int _{{\Gamma }^2_C}{\sigma }_n^2(\mathbf{u}^2)v_n^2\mathrm{d}{\Gamma }\\&\quad \displaystyle - \frac{1}{2}\int _{{\Gamma }^2_C} {\sigma }_n^2(\mathbf{u}^2)(v_n^1\circ \Pi ^2)\mathrm{d}{\Gamma }- \frac{1}{2}\int _{{\Gamma }^1_C}{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot \mathbf{v}_t^1\mathrm{d}{\Gamma }+ \frac{1}{2}\int _{{\Gamma }^1_C} {{\varvec{\sigma }}}_t^1(\mathbf{u}^1) \cdot (\mathbf{v}_t^2\circ \Pi ^1)\mathrm{d}{\Gamma }\\&\quad \displaystyle - \frac{1}{2}\int _{{\Gamma }^2_C}{{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot \mathbf{v}_t^2\mathrm{d}{\Gamma }+\frac{1}{2}\int _{{\Gamma }^2_C} {{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot (\mathbf{v}_t^1\circ \Pi ^2)\mathrm{d}{\Gamma }= L(\mathbf{v}). \end{aligned}$$

This leads to:

$$\begin{aligned}&\displaystyle a(\mathbf{u},\mathbf{v})- \frac{1}{2}\int _{{\Gamma }^1_C}{\sigma }_n^1(\mathbf{u}^1)(v_n^1+v_n^2\circ \Pi ^1)\mathrm{d}{\Gamma }-\frac{1}{2}\int _{{\Gamma }^2_C}{\sigma }_n^2(\mathbf{u}^2) (v_n^2+v_n^1\circ \Pi ^2)\mathrm{d}{\Gamma }\\&\quad \displaystyle -\frac{1}{2}\int _{{\Gamma }^1_C}{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot (\mathbf{v}_t^1-\mathbf{v}_t^2\circ \Pi ^1)\mathrm{d}{\Gamma }-\frac{1}{2}\int _{{\Gamma }^2_C}{{\varvec{\sigma }}}_t^2(\mathbf{u}^2) \cdot (\mathbf{v}_t^2-\mathbf{v}_t^1\circ \Pi ^2)\mathrm{d}{\Gamma }= L(\mathbf{v}). \end{aligned}$$
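The change-of-variables property used in the derivation above, $\int _{{\Gamma }^1_C}J^1f\,\mathrm{d}{\Gamma }=\int _{{\Gamma }^2_C}f\circ \Pi ^2\,\mathrm{d}{\Gamma }$, can be checked numerically on a simple geometry. The sketch below assumes, for illustration only, two concentric quarter-circle arcs of radii `r1` and `r2`, the radial projection as $\Pi ^1$, and an arbitrary smooth function `f`; each arc is parametrized by its own arc length.

```python
import math

def midpoint_rule(func, a, b, n):
    """Composite midpoint quadrature of func on [a, b] with n cells."""
    h = (b - a) / n
    return h * sum(func(a + (k + 0.5) * h) for k in range(n))

r1, r2 = 1.0, 1.5               # assumed radii of Gamma_C^1 and Gamma_C^2
J1 = r2 / r1                    # Jacobian determinant of the radial projection Pi^1
length1 = r1 * math.pi / 2.0    # arc length of Gamma_C^1
length2 = r2 * math.pi / 2.0    # arc length of Gamma_C^2

def f(s1):
    """Arbitrary sample function on Gamma_C^1, in the arc-length coordinate s1."""
    return math.cos(s1 / r1)

def pi2(s2):
    """Pi^2 = (Pi^1)^{-1} expressed in arc-length coordinates."""
    return s2 * r1 / r2

lhs = midpoint_rule(lambda s1: J1 * f(s1), 0.0, length1, 200)
rhs = midpoint_rule(lambda s2: f(pi2(s2)), 0.0, length2, 300)
```

Both quadratures approximate the same value, $r_2$ for this choice of `f`, although they are computed on different surfaces with different numbers of cells.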

Using the following splittings, valid for any $\theta \in \mathbb {R}$,

$$\begin{aligned} {\left\{ \begin{array}{ll} v_n^1+v_n^2\circ \Pi ^1=( v_n^1+v_n^2\circ \Pi ^1 -\theta \gamma ^1 {\sigma }_n^1(\mathbf{v}^1)) + \theta \gamma ^1 {\sigma }_n^1(\mathbf{v}^1)\\ v_n^2 +v_n^1\circ \Pi ^2 =(v_n^2+v_n^1\circ \Pi ^2 -\theta \gamma ^2 {\sigma }_n^2(\mathbf{v}^2)) +\theta \gamma ^2 {\sigma }_n^2(\mathbf{v}^2) \end{array}\right. }\\ {\left\{ \begin{array}{ll} \displaystyle \mathbf{v}_t^1 - \mathbf{v}_t^2\circ \Pi ^1=( \mathbf{v}_t^1-\mathbf{v}_t^2\circ \Pi ^1 -\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{v}^1)) + \theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{v}^1)\\ \displaystyle \mathbf{v}_t^2 - \mathbf{v}_t^1\circ \Pi ^2=( \mathbf{v}_t^2-\mathbf{v}_t^1\circ \Pi ^2 -\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)) + \theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{v}^2), \end{array}\right. } \end{aligned}$$

we obtain:

$$\begin{aligned}&a(\mathbf{u},\mathbf{v}) - \frac{1}{2}\int _{{\Gamma }^1_C}\theta \gamma ^1 {\sigma }_n^1(\mathbf{u}^1){\sigma }_n^1(\mathbf{v}^1)\mathrm{d}{\Gamma }- \frac{1}{2}\int _{{\Gamma }^2_C}\theta \gamma ^2 {\sigma }_n^2(\mathbf{u}^2) {\sigma }_n^2(\mathbf{v}^2)\mathrm{d}{\Gamma }\displaystyle \nonumber \\&\quad - \frac{1}{2}\int _{{\Gamma }^1_C}\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot {{\varvec{\sigma }}}_t^1(\mathbf{v}^1)\mathrm{d}{\Gamma }- \frac{1}{2}\int _{{\Gamma }^2_C}\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)\mathrm{d}{\Gamma }\nonumber \\&\quad \displaystyle - \frac{1}{2}\int _{{\Gamma }^1_C}{\sigma }_n^1(\mathbf{u}^1)(v_n^1+v_n^2\circ \Pi ^1-\theta \gamma ^1 {\sigma }_n^1(\mathbf{v}^1))\mathrm{d}{\Gamma }\nonumber \\&\quad - \frac{1}{2}\int _{{\Gamma }^2_C}{\sigma }_n^2(\mathbf{u}^2) (v_n^2+v_n^1\circ \Pi ^2 -\theta \gamma ^2 {\sigma }_n^2(\mathbf{v}^2))\mathrm{d}{\Gamma }\nonumber \\&\quad \displaystyle -\frac{1}{2}\int _{{\Gamma }^1_C}{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot (\mathbf{v}_t^1-\mathbf{v}_t^2\circ \Pi ^1-\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{v}^1))\mathrm{d}{\Gamma }\displaystyle \nonumber \\&\quad - \frac{1}{2}\int _{{\Gamma }^2_C}{{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot \big (\mathbf{v}_t^2-\mathbf{v}_t^1\circ \Pi ^2 -\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)\big )\mathrm{d}{\Gamma }\nonumber \\&= L(\mathbf{v}). \end{aligned}$$

(11)

Let us define:

$$\begin{aligned} \begin{array}{ll} P_{n,\gamma ^i}^i(\mathbf{u}) = \llbracket u \rrbracket ^i_n - \gamma ^i {\sigma }^i_n(\mathbf{u}^i)-g_n^i, &{} \mathbf{P}_{t,\gamma ^i}^i(\mathbf{u}) = \llbracket \mathbf{u}\rrbracket ^i_t-\gamma ^i{{\varvec{\sigma }}}^i_t(\mathbf{u}^i), \\ P_{n,\theta \gamma ^i}^i(\mathbf{v})= \llbracket v \rrbracket ^i_n- \theta \gamma ^i {\sigma }_n^i(\mathbf{v}^i),&{} \mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{v})= \llbracket \mathbf{v}\rrbracket ^i_t - \theta \gamma ^i {{\varvec{\sigma }}}_t^i(\mathbf{v}^i) \end{array} \end{aligned}$$

(12)

and

$$\begin{aligned} \displaystyle A_{\theta }(\mathbf{u},\mathbf{v})= & {} a(\mathbf{u},\mathbf{v})- \frac{1}{2}\int _{{\Gamma }^1_C}\theta \gamma ^1 {\sigma }_n^1(\mathbf{u}^1){\sigma }_n^1(\mathbf{v}^1)\mathrm{d}{\Gamma }\\&- \frac{1}{2}\int _{\Gamma ^2_C}\theta \gamma ^2 {\sigma }_n^2(\mathbf{u}^2) {\sigma }_n^2(\mathbf{v}^2)\mathrm{d}{\Gamma }\\&\displaystyle - \frac{1}{2}\int _{{\Gamma }^1_C}\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot {{\varvec{\sigma }}}_t^1(\mathbf{v}^1)\mathrm{d}{\Gamma }\\&-\frac{1}{2}\int _{\Gamma ^2_C}\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)\mathrm{d}{\Gamma }\\ \displaystyle= & {} a(\mathbf{u},\mathbf{v})- \frac{1}{2}\int _{{\Gamma }^1_C}\theta \gamma ^1 {{\varvec{\sigma }}}^1(\mathbf{u}^1)\mathbf{n}^1\cdot {{\varvec{\sigma }}}^1(\mathbf{v}^1)\mathbf{n}^1\,\mathrm{d}{\Gamma }\\&\quad - \frac{1}{2}\int _{{\Gamma }^2_C}\theta \gamma ^2 {{\varvec{\sigma }}}^2(\mathbf{u}^2)\mathbf{n}^2\cdot {{\varvec{\sigma }}}^2(\mathbf{v}^2)\mathbf{n}^2\,\mathrm{d}{\Gamma }. \end{aligned}$$

Now we insert the expressions (7) of ${\sigma }^i_n(\mathbf{u}^i)$ and (8) of ${{\varvec{\sigma }}}^i_t(\mathbf{u}^i)$ in (11), and the variational problem can be formally written as follows:

$$\begin{aligned} {\left\{ \begin{array}{ll} &{}\text {Find a sufficiently regular } \mathbf{u}\in \mathbf{V}\text { such that for all sufficiently regular} \,\mathbf{v}\in \mathbf{V}, \\ &{}\displaystyle A_{\theta }(\mathbf{u},\mathbf{v})+ \frac{1}{2} \int _{{\Gamma }^1_C} \frac{1}{\gamma ^1} [ P^1_{n,\gamma ^1}(\mathbf{u})]_{+} P_{n,\theta \gamma ^1}^1(\mathbf{v})\mathrm{d}{\Gamma }\\ &{}\quad + \frac{1}{2} \int _{{\Gamma }^2_C} \frac{1}{\gamma ^2} [P^2_{n,\gamma ^2}(\mathbf{u})]_{+} P_{n,\theta \gamma ^2}^2(\mathbf{v})\mathrm{d}{\Gamma }\\ &{} \displaystyle + \frac{1}{2} \int _{{\Gamma }^1_C} \frac{1}{\gamma ^1} [ \mathbf{P}^1_{t,\gamma ^1}(\mathbf{u})]_{\gamma ^1 s^1}\cdot \mathbf{P}_{t,\theta \gamma ^1}^1(\mathbf{v})\mathrm{d}{\Gamma }+ \frac{1}{2} \int _{{\Gamma }^2_C} \frac{1}{\gamma ^2}[\mathbf{P}^2_{t,\gamma ^2}(\mathbf{u})]_{\gamma ^2 s^2}\\ &{}\quad \cdot \mathbf{P}_{t,\theta \gamma ^2}^2(\mathbf{v})\mathrm{d}{\Gamma }= L(\mathbf{v}). \end{array}\right. } \end{aligned}$$

(13)

Remark 1.4

In the frictionless contact case the formulation reads:

$$\begin{aligned} {\left\{ \begin{array}{ll} &{}\text {Find a sufficiently regular } \mathbf{u}\in \mathbf{V}\text { such that for all sufficiently regular} \, \mathbf{v}\in \mathbf{V}\\ &{}\displaystyle A_{\theta }(\mathbf{u},\mathbf{v})+ \frac{1}{2} \int _{{\Gamma }^1_C} \frac{1}{\gamma ^1} [ P^1_{n,\gamma ^1}(\mathbf{u})]_{+}P_{n,\theta \gamma ^1}^1(\mathbf{v})\mathrm{d}{\Gamma }\\ &{}\quad + \frac{1}{2} \int _{{\Gamma }^2_C} \frac{1}{\gamma ^2} [P^2_{n,\gamma ^2}(\mathbf{u})]_{+} P_{n,\theta \gamma ^2}^2(\mathbf{v}) \mathrm{d}{\Gamma }= L(\mathbf{v}). \end{array}\right. } \end{aligned}$$
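As an illustration of the reduced number of terms in the variant $\theta =0$, the definition of $A_{\theta }$ and (12) give $A_{0}(\mathbf{u},\mathbf{v})=a(\mathbf{u},\mathbf{v})$ and $P^i_{n,0}(\mathbf{v})=\llbracket v \rrbracket ^i_n$, so that the frictionless formulation above specializes to:

$$\begin{aligned} a(\mathbf{u},\mathbf{v})+ \frac{1}{2} \int _{{\Gamma }^1_C} \frac{1}{\gamma ^1} [ P^1_{n,\gamma ^1}(\mathbf{u})]_{+}\, \llbracket v \rrbracket ^1_n\,\mathrm{d}{\Gamma }+ \frac{1}{2} \int _{{\Gamma }^2_C} \frac{1}{\gamma ^2} [P^2_{n,\gamma ^2}(\mathbf{u})]_{+}\, \llbracket v \rrbracket ^2_n\,\mathrm{d}{\Gamma }= L(\mathbf{v}). \end{aligned}$$

In this variant the normal stress appears only through the solution-side quantity $P^i_{n,\gamma ^i}(\mathbf{u})$; no stress of the test function $\mathbf{v}$ is involved.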


2.3 Derivation of the method from a potential

In this section we show, through a formal demonstration, that the method derives from a potential in the frictional symmetric ($\theta =1$) case. Let us define the potential:

$$\begin{aligned} J(\mathbf{u})={{\varvec{\varepsilon }}}_{\Omega }(\mathbf{u})+\sum \limits _{i=1}^2 \left( {{\varvec{\varepsilon }}}_n^i(\mathbf{u})+{{\varvec{\varepsilon }}}_t^i(\mathbf{u})\right) {,} \end{aligned}$$

with:

$$\begin{aligned} \displaystyle {{\varvec{\varepsilon }}}_{\Omega }(\mathbf{u})= & {} \frac{1}{2} a(\mathbf{u},\mathbf{u})- \sum \limits _{i=1}^2 \Big ( \frac{1}{4}\int _{{\Gamma }^i_C} \gamma ^i ({\sigma }_n^i(\mathbf{u}^i))^2 \mathrm{d}{\Gamma }+\frac{1}{4}\int _{{\Gamma }^i_C} \gamma ^i \Vert {{\varvec{\sigma }}}_t^i(\mathbf{u}^i)\Vert ^2\mathrm{d}{\Gamma }\Big ) - L(\mathbf{u})\\= & {} \frac{1}{2}A_{1}(\mathbf{u},\mathbf{u})-L(\mathbf{u}){,}\\ \displaystyle {{\varvec{\varepsilon }}}_n^i(\mathbf{u})= & {} \frac{1}{4}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\left[ P^i_{n,\gamma ^i}(\mathbf{u})\right] _+^2 \mathrm{d}{\Gamma },\\ \displaystyle {{\varvec{\varepsilon }}}_t^i(\mathbf{u})= & {} \frac{1}{4}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\Vert \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})\Vert ^2 \mathrm{d}{\Gamma }- \frac{1}{4}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\Vert \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})- \left[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})\right] _{\gamma ^i s^i}\Vert ^2 \mathrm{d}{\Gamma }. \end{aligned}$$

We compute now the derivative of this potential. We have:

$$\begin{aligned} D{{\varvec{\varepsilon }}}_{\Omega }(\mathbf{u})[\mathbf{v}]= & {} A_{1}(\mathbf{u},\mathbf{v})-L(\mathbf{v}) \quad ({\textit{L}} \text { is linear and}\, A_{1}\, \text {is bilinear and symmetric), }\\ \displaystyle D{{\varvec{\varepsilon }}}_n^i(\mathbf{u})[\mathbf{v}]= & {} \frac{1}{2}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\left[ P^i_{n,\gamma ^i}(\mathbf{u})\right] _+D\left( [P^i_{n,\gamma ^i}(\mathbf{u})]_+ \right) [\mathbf{v}] \mathrm{d}{\Gamma }\\= & {} \frac{1}{2}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\left[ P^i_{n,\gamma ^i}(\mathbf{u})\right] _+H\big (P^i_{n,\gamma ^i}(\mathbf{u})\big ) D\big (P^i_{n,\gamma ^i}(\mathbf{u}) \big ) [\mathbf{v}] \mathrm{d}{\Gamma }, \end{aligned}$$

where H is the Heaviside step function. Using the equalities: $H({\varphi }(X))[{\varphi }(X)]_+=[{\varphi }(X)]_+$ and $D\big (P^i_{n,\gamma ^i}(\mathbf{u})\big )[\mathbf{v}]=P^i_{n,\gamma ^i}(\mathbf{v})$ (since $P^i_{n,\gamma ^i}$ is linear), we get:

$$\begin{aligned} \displaystyle D{{\varvec{\varepsilon }}}_n^i(\mathbf{u})[\mathbf{v}]=\frac{1}{2}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\left[ P^i_{n,\gamma ^i}(\mathbf{u})\right] _+P^i_{n,\gamma ^i}(\mathbf{v}) \mathrm{d}{\Gamma }. \end{aligned}$$

Finally:

$$\begin{aligned} \displaystyle D{{\varvec{\varepsilon }}}_t^i(\mathbf{u})[\mathbf{v}]= & {} \frac{1}{2}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})\cdot \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v})\mathrm{d}{\Gamma }\\&\quad - \frac{1}{2}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\left( \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})- \left[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})\right] _{\gamma ^i s^i}\right) \cdot \left( \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v})\right. \\&\quad -\left. D\big ([\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big )[\mathbf{v}] \right) \mathrm{d}{\Gamma }. \end{aligned}$$

$$\begin{aligned} {\left\{ \begin{array}{ll} \displaystyle \text {if } \Vert \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})\Vert \le \gamma ^is^i, \text {then } \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})- [\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}=0 \\ \displaystyle \text {if } \Vert \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})\Vert > \gamma ^is^i, \text {then } D\big ([\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i} \big )[\mathbf{v}]\text { is tangential to} \,\mathcal B(0,\gamma ^is^i)\, \text {and }\\ D\big ([\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big )[\mathbf{v}]\cdot (\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})- [\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}) =0. \end{array}\right. } \end{aligned}$$

So, in both cases we have:

$$\begin{aligned} \displaystyle D{{\varvec{\varepsilon }}}_t^i(\mathbf{u})[\mathbf{v}]= & {} \frac{1}{2}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}\left[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})\right] _{\gamma ^is^i}\cdot \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v})\mathrm{d}{\Gamma }\end{aligned}$$

So, if we consider the first-order optimality condition $DJ(\mathbf{u})[\mathbf{v}]=0$ $\forall \mathbf{v}\in \mathbf{V}$, we get:

$$\begin{aligned}&A_{1}(\mathbf{u},\mathbf{v})+\sum \limits _{i=1}^2 \Big (\frac{1}{2} \int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}[P^i_{n,\gamma ^i}(\mathbf{u})]_+P^i_{n,\gamma ^i}(\mathbf{v}) \mathrm{d}{\Gamma }\\&\quad +\frac{1}{2}\int _{{\Gamma }^i_C}\frac{1}{\gamma ^i}[\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^is^i}\cdot \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v})\mathrm{d}{\Gamma }\Big )= L(\mathbf{v}). \end{aligned}$$

This is exactly (13) when $\theta =1$.
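The two pointwise derivative identities used above can be checked numerically. The following is a minimal sketch, not part of the original analysis: it compares central finite differences of the densities $\frac{1}{2}[x]_+^2$ and $\frac{1}{2}\Vert \mathbf{x}\Vert ^2-\frac{1}{2}\Vert \mathbf{x}-[\mathbf{x}]_r\Vert ^2$ with $[x]_+$ and $[\mathbf{x}]_r$ respectively. The helper names (`pos`, `proj_ball`, `psi_n`, `psi_t`, `fd_grad`) are hypothetical, and the common factor $1/(2\gamma ^i)$ is dropped.

```python
import numpy as np

def pos(x):
    """Positive part [x]_+ = max(x, 0)."""
    return np.maximum(x, 0.0)

def proj_ball(x, r):
    """Projection [x]_r of a vector x onto the closed ball B(0, r)."""
    nx = np.linalg.norm(x)
    return x if nx <= r else (r / nx) * x

def psi_n(x):
    """Pointwise normal density (1/2) [x]_+^2 for a scalar x."""
    return 0.5 * pos(x) ** 2

def psi_t(x, r):
    """Pointwise tangential density (1/2)|x|^2 - (1/2)|x - [x]_r|^2."""
    d = x - proj_ball(x, r)
    return 0.5 * np.dot(x, x) - 0.5 * np.dot(d, d)

def fd_grad(f, x, eps=1e-6):
    """Central finite-difference gradient of a scalar function f at a 1-D point x."""
    x = np.asarray(x, dtype=float)
    g = np.zeros_like(x)
    for k in range(x.size):
        e = np.zeros_like(x)
        e[k] = eps
        g[k] = (f(x + e) - f(x - e)) / (2 * eps)
    return g

# Normal part: the derivative of (1/2)[x]_+^2 is [x]_+ (checked away from the kink x = 0).
dn_active = fd_grad(lambda z: psi_n(z[0]), np.array([0.7]))[0]
dn_inactive = fd_grad(lambda z: psi_n(z[0]), np.array([-0.3]))[0]

# Tangential part: the gradient is [x]_r, both inside (stick) and outside (slip) the ball.
x_stick, x_slip, r = np.array([0.3, -0.2]), np.array([3.0, 4.0]), 1.0
dt_stick = fd_grad(lambda z: psi_t(z, r), x_stick)
dt_slip = fd_grad(lambda z: psi_t(z, r), x_slip)
```

The test points are chosen away from $x=0$ and $\Vert \mathbf{x}\Vert =r$, where the densities are only once differentiable and finite differences are less informative.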

2.4 Strong–weak formulation equivalence

In this section, we establish the formal equivalence between (13) and (1)–(5). Since the construction of (13) is quite elaborate and consists in particular in splitting the contact terms into two parts, this step is necessary to ensure the coherence of the formulation.

Theorem 1.5

Let $\mathbf{u}=(\mathbf{u}^1,\mathbf{u}^2)$ be a sufficiently regular solution to the problem (13), then $\mathbf{u}$ solves the problem (1)–(5) for all $\theta \in \mathbb {R}$.

Proof

See “Appendix A” $\square $

2.5 Discretization of the variational formulation

Let $(\mathcal {T}^i_h)_{h>0}$ be a family of triangulations of the domain $\Omega ^i $ supposed regular and conformal to the subdivisions of the boundaries into $\Gamma ^i_D$, $\Gamma ^i_N$ and $\Gamma ^i_C$. We introduce

$$\begin{aligned} \mathbf{V}_ h= & {} \left( \mathbf{V}_h^1\times \mathbf{V}_h^2\right) \text {, with } \,\\ \mathbf{V}_h^i= & {} \Big \{\mathbf{v}^i_h \in \mathscr {C}^0(\overline{\Omega ^i}): \mathbf{v}^i_{h \vert T}\in (\mathbb {P}_k(T))^d, \forall T\in \mathcal {T}^i_h, \mathbf{v}_h^i= \mathbf{0} \text { on }\Gamma ^i_D\Big \}, \end{aligned}$$

the family of finite dimensional vector spaces indexed by h and coming from $\mathcal {T}^i_h$.

We consider in what follows that $\gamma ^i$ is a positive piecewise constant function on the contact interface $\Gamma ^i_C$ which satisfies

$$\begin{aligned} \gamma ^i_{|K^i \cap {\Gamma }^i_C} = \gamma _0 h_{K^i}, \end{aligned}$$

for every $K^i\in \mathcal {T}^i_h$ that has a non-empty intersection of dimension $d-1$ with $\Gamma ^i_C$, and where $\gamma _0$ is a given positive constant. Note that the value of $\gamma ^i$ on element intersections has no influence. This allows us to define a discrete counterpart of (13). Let us introduce for this purpose, with the same notation, the discrete linear operators:

$$\begin{aligned} \begin{array}{ll} P_{n,\gamma ^i}^i(\mathbf{u}_h)= \llbracket u_h \rrbracket ^i_n -g^i_n- \gamma ^i {\sigma }^i_n(\mathbf{u}_h^i), &{} \mathbf{P}_{t,\gamma ^i}^i(\mathbf{u}_h) = \llbracket \mathbf{u}_h \rrbracket ^i_t-\gamma ^i{{\varvec{\sigma }}}^i_t(\mathbf{u}_h^i),\\ P_{n,\theta \gamma ^i}^i(\mathbf{v}_h)=\llbracket v_h \rrbracket ^i_n - \theta \gamma ^i {\sigma }_n^i(\mathbf{v}_h^i),&{}\mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{v}_h)= \llbracket \mathbf{v}_h \rrbracket ^i_t - \theta \gamma ^i {{\varvec{\sigma }}}_t^i(\mathbf{v}_h^i). \end{array} \end{aligned}$$

(14)

Then the unbiased formulation of the two bodies contact in the discrete setting reads:

$$\begin{aligned} {\left\{ \begin{array}{ll} &{}\text {Find } \mathbf{u}_h \in \mathbf{V}_h \text { such that, for all}\, \mathbf{v}_h \in \mathbf{V}_h, \\ &{}\displaystyle A_{\theta }(\mathbf{u}_h,\mathbf{v}_h)\\ &{}\quad \displaystyle + \frac{1}{2} \int _{{\Gamma }^1_C} \frac{1}{\gamma ^1} P_{n,\theta \gamma ^1}^1(\mathbf{v}_h)[ P^1_{n,\gamma ^1}(\mathbf{u}_h)]_{+}\mathrm{d}\Gamma + \frac{1}{2} \int _{\Gamma ^2_C} \frac{1}{\gamma ^2} P_{n,\theta \gamma ^2}^2(\mathbf{v}_h) [P^2_{n,\gamma ^2}(\mathbf{u}_h)]_{+}\mathrm{d}\Gamma \\ &{}\quad \displaystyle + \frac{1}{2} \int _{\Gamma ^1_C} \frac{1}{\gamma ^1} \mathbf{P}_{t,\theta \gamma ^1}^1(\mathbf{v}_h)\cdot [ \mathbf{P}^1_{t,\gamma ^1}(\mathbf{u}_h)]_{\gamma ^1 s^1}\mathrm{d}\Gamma \\ &{}\quad \displaystyle + \frac{1}{2} \int _{\Gamma ^2_C} \frac{1}{\gamma ^2}\mathbf{P}_{t,\theta \gamma ^2}^2(\mathbf{v}_h)\cdot [\mathbf{P}^2_{t,\gamma ^2}(\mathbf{u}_h)]_{\gamma ^2 s^2}\mathrm{d}\Gamma = L(\mathbf{v}_h). \end{array}\right. } \end{aligned}$$

(15)
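To make the structure of the contact and friction terms in (15) concrete, here is a minimal sketch of how their integrands could be evaluated at a single quadrature point of $\Gamma ^i_C$. It is only an illustration under stated assumptions: the function names and argument lists are hypothetical, the jumps and stresses are taken as already-computed point values, and no finite element assembly is shown.

```python
import numpy as np

def nitsche_gamma(gamma0, h_K):
    """Nitsche parameter on a contact face of an element of size h_K: gamma = gamma0 * h_K."""
    return gamma0 * h_K

def proj_ball(x, r):
    """Projection [x]_r of a vector x onto the closed ball B(0, r)."""
    nx = np.linalg.norm(x)
    return x if nx <= r else (r / nx) * x

def contact_integrand(jump_u_n, sig_n_u, jump_v_n, sig_n_v, gap, gamma, theta):
    """Point value of (1/(2 gamma)) P_{n,theta gamma}(v) [P_{n,gamma}(u)]_+ ."""
    P_u = jump_u_n - gap - gamma * sig_n_u    # P_{n,gamma}(u), as in (14)
    P_v = jump_v_n - theta * gamma * sig_n_v  # P_{n,theta gamma}(v), as in (14)
    return 0.5 / gamma * P_v * max(P_u, 0.0)

def friction_integrand(jump_u_t, sig_t_u, jump_v_t, sig_t_v, s, gamma, theta):
    """Point value of (1/(2 gamma)) P_{t,theta gamma}(v) . [P_{t,gamma}(u)]_{gamma s} ."""
    P_u = jump_u_t - gamma * sig_t_u
    P_v = jump_v_t - theta * gamma * sig_t_v
    return 0.5 / gamma * float(np.dot(P_v, proj_ball(P_u, gamma * s)))

# Illustrative values only: gamma0 = 0.1 on an element of size h_K = 0.5.
gamma = nitsche_gamma(0.1, 0.5)
in_contact = contact_integrand(0.0, -4.0, 1.0, 0.0, 0.1, gamma, theta=1.0)
separated = contact_integrand(0.0, 0.0, 1.0, 0.0, 0.1, gamma, theta=1.0)
```

With a compressive normal stress the positive part is active and the term contributes; with zero stress and a positive gap it vanishes, which is how the formulation switches contact on and off without a master/slave distinction.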

Remark 1.6

Note that Nitsche’s method is not a standard penalty method, since it is consistent. In fact, Nitsche’s method is closer to the Barbosa & Hughes stabilization (see [27] and [6, Section 2.3]), so the Nitsche parameter $\gamma _0$ is in fact a stabilization parameter. As a result, making $\gamma _0$ tend to 0 does not necessarily increase precision, contrary to the standard penalty method (see also Figs. 16 and 17 in Sect. 3.5 for a numerical illustration). The parameter $\gamma _0$ must therefore be chosen just below a threshold value that ensures coercivity, so that the problem is well-posed, but not so small as to cause ill-conditioning. This threshold value depends on the variant ($\theta $).

3 Mathematical analysis of the method

A major difference between Nitsche’s method and standard penalty methods is consistency, which is established in Lemma 2.1. Using the same arguments as in [6], we prove the well-posedness and the optimal convergence of (15) as the mesh size h vanishes. To ensure well-posedness and convergence of the method, $\gamma _0$ must be sufficiently small when $\theta \ne -1$. This condition is not needed when $\theta =-1$, which is a major advantage of this variant.

3.1 Consistency

Similarly to Nitsche’s method for unilateral contact problems [6], our Nitsche-based formulation (15) is consistent:

Lemma 2.1

Suppose that the solution $\mathbf{u}$ of (1)–(5) lies in $(H^{\frac{3}{2}+\nu }(\Omega ^1))^d \times (H^{\frac{3}{2}+\nu }(\Omega ^2))^d$ with $\nu > 0$. Then $\mathbf{u}$ is also a solution to:

$$\begin{aligned}&\displaystyle A_{\theta }(\mathbf{u},\mathbf{v}_h)+ \frac{1}{2} \int _{\Gamma ^1_C} \frac{1}{\gamma ^1} P_{n,\theta \gamma ^1}^1(\mathbf{v}_h)[ P^1_{n,\gamma ^1}(\mathbf{u})]_{+}\mathrm{d}\Gamma \nonumber \\&\qquad + \frac{1}{2} \int _{\Gamma ^2_C} \frac{1}{\gamma ^2} P_{n,\theta \gamma ^2}^2(\mathbf{v}_h) [P^2_{n,\gamma ^2}(\mathbf{u})]_{+}\mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle + \frac{1}{2} \int _{\Gamma ^1_C} \frac{1}{\gamma ^1} \mathbf{P}_{t,\theta \gamma ^1}^1(\mathbf{v}_h)\cdot [ \mathbf{P}^1_{t,\gamma ^1} (\mathbf{u})]_{\gamma ^1 s^1}\mathrm{d}\Gamma \nonumber \\&\qquad + \frac{1}{2} \int _{\Gamma ^2_C} \frac{1}{\gamma ^2}\mathbf{P}_{t,\theta \gamma ^2}^2(\mathbf{v}_h)\cdot [\mathbf{P}^2_{t,\gamma ^2}(\mathbf{u})]_{\gamma ^2 s^2}\mathrm{d}\Gamma \nonumber \\&\quad = L(\mathbf{v}_h),\quad \forall \mathbf{v}_h \in \mathbf{V}_h. \end{aligned}$$

(16)

Proof

Let $\mathbf{u}$ be a solution of (1)–(5) and let $\mathbf{v}_h \in \mathbf{V}_h$. Since $\mathbf{u}^i \in (H^{\frac{3}{2}+\nu }(\Omega ^i))^d$, we have ${{\varvec{\sigma }}}^i(\mathbf{u}^i)\mathbf{n}^i \in (H^{\nu }(\Gamma _C^i))^d$, so $P^i_{n,\gamma ^i}(\mathbf{u})$ and $\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})$ are well-defined and belong to $L^2(\Gamma _C^i)$.

With Eqs. (1)–(4) and integration by parts, it holds:

$$\begin{aligned}&\mathbf{a}(\mathbf{u},\mathbf{v}_h)- \int _{\Gamma ^1_C}{\sigma }_n^1(\mathbf{u}^1)v_{h n}^1\mathrm{d}\Gamma - \int _{\Gamma ^2_C}{\sigma }_n^2(\mathbf{u}^2)v_{h n}^2\mathrm{d}\Gamma -\int _{\Gamma ^1_C}{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot \mathbf{v}_{h t}^1\mathrm{d}\Gamma \\&\quad - \int _{\Gamma ^2_C}{{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot \mathbf{v}_{h t}^2\mathrm{d}\Gamma =L(\mathbf{v}_h). \end{aligned}$$

We use now (5) to write:

$$\begin{aligned}&\mathbf{a}(\mathbf{u},\mathbf{v}_h)- \frac{1}{2}\int _{\Gamma ^1_C}{\sigma }_n^1(\mathbf{u}^1)(v_{h n}^1+v_{h n}^2\circ \Pi ^1)\mathrm{d}\Gamma -\frac{1}{2}\int _{\Gamma ^2_C}{\sigma }_n^2(\mathbf{u}^2) (v_{h n}^2+v_{h n}^1\circ \Pi ^2)\mathrm{d}\Gamma \\&\quad \displaystyle -\frac{1}{2}\int _{\Gamma ^1_C}{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot (\mathbf{v}_{h t}^1-\mathbf{v}_{h t}^2\circ \Pi ^1)\mathrm{d}\Gamma \\&\quad -\frac{1}{2}\int _{\Gamma ^2_C}{{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot (\mathbf{v}_{h t}^2-\mathbf{v}_{h t}^1\circ \Pi ^2)\mathrm{d}\Gamma = L(\mathbf{v}_h). \end{aligned}$$

For any $\theta \in \mathbb {R}$, we can write:

$$\begin{aligned} {\left\{ \begin{array}{ll} \displaystyle v_{h n}^1+v_{h n}^2\circ \Pi ^1=( v_{h n}^1+v_{h n}^2\circ \Pi ^1 -\theta \gamma ^1 {\sigma }_n^1(\mathbf{v}_h^1)) + \theta \gamma ^1 {\sigma }_n^1(\mathbf{v}_h^1)\\ \displaystyle v_{h n}^2+v_{h n}^1\circ \Pi ^2= ( v_{h n}^2+v_{h n}^1\circ \Pi ^2 -\theta \gamma ^2 {\sigma }_n^2(\mathbf{v}_h^2)) +\theta \gamma ^2{\sigma }_n^2(\mathbf{v}_h^2)\\ \end{array}\right. }\nonumber \\ {\left\{ \begin{array}{ll} \displaystyle \mathbf{v}_{h t}^1 - \mathbf{v}_{h t}^2\circ \Pi ^1=( \mathbf{v}_{h t}^1-\mathbf{v}_{h t}^2\circ \Pi ^1 -\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{v}_h^1)) + \theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{v}_h^1)\\ \displaystyle \mathbf{v}_{h t}^2 - \mathbf{v}_{h t}^1\circ \Pi ^2=( \mathbf{v}_{h t}^2-\mathbf{v}_{h t}^1\circ \Pi ^2 -\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{v}_h^2)) + \theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{v}_h^2). \end{array}\right. } \end{aligned}$$

(17)

Using (17), formulations (7) and (8) of the contact and friction conditions and the notations (12), we obtain (16). $\square $

Remark 2.2

The regularity assumption made in Lemma 2.1 is standard for Signorini contact. It was proved for an elliptic scalar problem in [22] and observed numerically for linear elasticity. In fact, the singularities that appear at contact/non-contact transitions generally allow us to expect a Sobolev regularity between 3/2 and 5/2.

3.2 Well-posedness

To prove well-posedness of our formulation, we first need the following discrete trace inequality.

Lemma 2.3

There exists $C > 0$, independent of the parameter $\gamma _0$ and of the mesh size h, such that:

$$\begin{aligned} \Vert {\gamma ^i}^\frac{1}{2}{{\varvec{\sigma }}}^i_t(\mathbf{v}^i_h)\Vert ^2_{0,\Gamma _C^i}\ +\Vert {\gamma ^i}^\frac{1}{2}{\sigma }^i_n(\mathbf{v}^i_h)\Vert ^2_{0,\Gamma _C^i}\ \le C\gamma _0\Vert \mathbf{v}^i_h\Vert ^2_{1,\Omega ^i}, \end{aligned}$$

(18)

for all $\mathbf{v}^i_h \in \mathbf{V}_{h}^i$.

Proof

The inequality (18) is obtained using a scaling argument as in [5, Lemma 3.2]. $\square $
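The scaling behind (18) can be illustrated on a single triangle. The sketch below is a simplified illustration, not the argument of [5]: it assumes piecewise affine elements ($k=1$), so that the gradient $\mathbf{g}$ of $\mathbf{v}_h$ is constant on $K$, and it works with the gradient rather than the full stress. Then $h_K\Vert \mathbf{g}\Vert ^2_{0,e}/\Vert \mathbf{g}\Vert ^2_{0,K}=h_K|e|/|K|$ for an edge $e$ of $K$, a purely geometric ratio that does not change when $K$ is rescaled. The function name `trace_ratio` is hypothetical.

```python
import numpy as np

def trace_ratio(vertices):
    """Return h_K * |e| / |K| for a triangle K, with e its first edge and h_K its longest edge."""
    p = np.asarray(vertices, dtype=float)
    edges = [np.linalg.norm(p[(k + 1) % 3] - p[k]) for k in range(3)]
    h_K = max(edges)
    a, b = p[1] - p[0], p[2] - p[0]
    area = 0.5 * abs(a[0] * b[1] - a[1] * b[0])
    return h_K * edges[0] / area

# Reference right triangle, rescaled by decreasing mesh sizes h.
ref = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0]])
ratios = [trace_ratio(h * ref) for h in (1.0, 0.1, 0.01)]
```

The ratio is the same for every $h$, which is why the weight $\gamma ^i=\gamma _0 h_{K^i}$ yields a constant $C$ in (18) that is independent of the mesh size on shape-regular meshes.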

We then show in Theorem 2.4 that the problem (15) is well-posed using an argument from [4] for M-type and pseudo-monotone operators. In the proof of the well-posedness, two cases are discussed: $\theta =1$ and $\theta \ne 1$.

Theorem 2.4

Suppose that $\gamma _0 > 0$ is sufficiently small or $\theta = -1$. Then Problem (15) admits a unique solution $\mathbf{u}_h$ in $\mathbf{V}_h$. When $\theta = -1$ we do not need the assumption of smallness of $\gamma _0$.

Proof

Using the Riesz representation theorem, we define a (non-linear) operator $\mathbf{B}:\mathbf{V}_h \rightarrow \mathbf{V}_h$, by means of the formula:

$$\begin{aligned} \begin{aligned} (\mathbf{B}\mathbf{u}_h,\mathbf{v}_h)_1&=\displaystyle A_{\theta }(\mathbf{u}_h,\mathbf{v}_h)+ \frac{1}{2} \int _{\Gamma ^1_C} \frac{1}{\gamma ^1} P_{n, \theta \gamma ^1}^1(\mathbf{v}_h)[ P^1_{n,\gamma ^1}(\mathbf{u}_h)]_{+}\mathrm{d}\Gamma \\&\quad \displaystyle + \frac{1}{2} \int _{\Gamma ^2_C} \frac{1}{\gamma ^2} P_{n,\theta \gamma ^2}^2(\mathbf{v}_h) [P^2_{n,\gamma ^2}(\mathbf{u}_h)]_{+}\mathrm{d}\Gamma \\&\quad + \frac{1}{2} \int _{\Gamma ^1_C} \frac{1}{\gamma ^1} \mathbf{P}_{t,\theta \gamma ^1}^1(\mathbf{v}_h)\cdot [ \mathbf{P}^1_{t,\gamma ^1}(\mathbf{u}_h)]_{\gamma ^1 s^1}\mathrm{d}\Gamma \\&\quad \displaystyle + \frac{1}{2} \int _{\Gamma ^2_C} \frac{1}{\gamma ^2}\mathbf{P}_{t,\theta \gamma ^2}^2(\mathbf{v}_h)\cdot [\mathbf{P}^2_{t,\gamma ^2}(\mathbf{u}_h)]_{\gamma ^2 s^2}\mathrm{d}\Gamma , \end{aligned} \end{aligned}$$

for all $\mathbf{u}_h,\mathbf{v}_h \in \mathbf{V}_h$, where $(\cdot ,\cdot )_1$ stands for the scalar product in $\mathbf{V}$ and the notations $P^i_{n,\gamma ^i}$, $\mathbf{P}^i_{t,\gamma ^i}$, $P_{n, \theta \gamma ^i}^i$ and $\mathbf{P}_{t,\theta \gamma ^i}^i$ are given by (12).

Note that Problem (15) is well-posed if and only if $\mathbf{B}$ is a one-to-one operator. Let $\mathbf{v}_h,\mathbf{w}_h \in \mathbf{V}_h$. Using the identities $P_{n, \theta \gamma ^i}^i(\cdot )=P^i_{n,\gamma ^i}(\cdot )+g^i_n+(1-\theta )\gamma ^i{\sigma }^i_n(\cdot )$ and $\mathbf{P}_{t,\theta \gamma ^i}^i(\cdot ) = \mathbf{P}^i_{t,\gamma ^i}(\cdot ) + (1-\theta )\gamma ^i{{\varvec{\sigma }}}^i_t(\cdot )$, we have:

$$\begin{aligned}&(\mathbf{B}\mathbf{v}_h-\mathbf{B}\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)_1 = \mathbf{a}(\mathbf{v}_h-\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)\\&\quad \displaystyle +\sum \limits _{i=1}^2 \Big (- \frac{\theta }{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}(\mathbf{v}_h^i-\mathbf{w}_h^i)\mathbf{n}\Vert _{0,\Gamma _C^i} ^2\\&\quad +\displaystyle \frac{1}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}P^i_{n,\gamma ^i}(\mathbf{v}_h-\mathbf{w}_h)\big ([P^i_{n,\gamma ^i}(\mathbf{v}_h)]_{+}-[ P^i_{n,\gamma ^i}(\mathbf{w}_h)]_{+}\big ) \mathrm{d}\Gamma \\&\quad \displaystyle +\frac{(1-\theta )}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \gamma ^i{\sigma }^i_n(\mathbf{v}_h^i-\mathbf{w}^i_h)\big ([ P^i_{n,\gamma ^i}(\mathbf{v}_h)]_{+}-[ P^i_{n,\gamma ^i}(\mathbf{w}_h)]_{+}\big ) \mathrm{d}\Gamma \\&\quad +\displaystyle \frac{1}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h-\mathbf{w}_h) \cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{w}_h)]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma \\&\quad \displaystyle + \frac{(1-\theta )}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \gamma ^i{{\varvec{\sigma }}}^i_t(\mathbf{v}_h^i-\mathbf{w}^i_h) \cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h)]_{\gamma ^i s^i}-[\mathbf{P}^i_{t,\gamma ^i}(\mathbf{w}_h)]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma \Big ). \end{aligned}$$

We use the Cauchy–Schwarz inequality and the properties (9) and (10) to get:

$$\begin{aligned}&\displaystyle (\mathbf{B}\mathbf{v}_h-\mathbf{B}\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)_1 \ge \mathbf{a}(\mathbf{v}_h-\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)\\&\quad +\sum \limits _{i=1}^2 \Big ( -\frac{\theta }{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}(\mathbf{v}_h^i-\mathbf{w}_h^i)\mathbf{n}\Vert _{0,\Gamma _C^i} ^2\\&\quad \displaystyle +\frac{1}{2}\Vert {\gamma ^i}^{-\frac{1}{2}}\big ([ P^i_{n,\gamma ^i}(\mathbf{v}_h)]_{+}-[P^i_{n,\gamma ^i}(\mathbf{w}_h)]_{+}\big ) \Vert _{0,\Gamma _C^i}^2\\&\quad +\frac{1}{2}\Vert {\gamma ^i}^{-\frac{1}{2}} \big ([\mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h)]_{\gamma ^i s^i}-[\mathbf{P}^i_{t,\gamma ^i}(\mathbf{w}_h)]_{\gamma ^i s^i}\big ) \Vert _{0,\Gamma _C^i}^2\\&\quad \displaystyle -\frac{|1-\theta |}{2}\Vert {\gamma ^i}^{-\frac{1}{2}}\big ([ P^i_{n,\gamma ^i}(\mathbf{v}_h)]_{+}-[P^i_{n,\gamma ^i}(\mathbf{w}_h)]_{+}\big ) \Vert _{0,\Gamma _C^i}\Vert {\gamma ^i}^\frac{1}{2}{\sigma }^i_n(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i} \\&\quad \displaystyle -\frac{|1-\theta |}{2}\Vert {\gamma ^i}^{-\frac{1}{2}}\big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h)]_{\gamma ^i s^i}-[\mathbf{P}^i_{t,\gamma ^i}(\mathbf{w}_h)]_{\gamma ^i s^i}\big ) \Vert _{0,\Gamma _C^i}\Vert {\gamma ^i}^\frac{1}{2}{{\varvec{\sigma }}}^i_t(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i}\Big ). \end{aligned}$$

If $\theta =1$, we use the coercivity of $a(\cdot ,\cdot )$ and the property (18) to get:

$$\begin{aligned}&\displaystyle (\mathbf{B}\mathbf{v}_h-\mathbf{B}\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)_1 \ge \mathbf{a}(\mathbf{v}_h-\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)- \sum \limits _{i=1}^2 \frac{1}{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{w}_h^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i} ^2\\&\quad \displaystyle \ge \mathbf{a}(\mathbf{v}_h-\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)- \sum \limits _{i=1}^2 \frac{1}{2}\Big (\Vert {\gamma ^i}^{\frac{1}{2}}{\sigma }^i_n(\mathbf{v}_h^i-\mathbf{w}_h^i)\Vert _{0,\Gamma _C^i} ^2\\&\qquad +\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i_t(\mathbf{v}_h^i-\mathbf{w}_h^i)\Vert _{0,\Gamma _C^i} ^2\Big )\\&\quad \displaystyle \ge C \Vert \mathbf{v}_h-\mathbf{w}_h\Vert ^2_1 \end{aligned}$$

when $\gamma _0$ is sufficiently small.

We suppose now that $\theta \ne 1$; let $\beta > 0$. Applying Young’s inequality yields:

$$\begin{aligned}&\displaystyle (\mathbf{B}\mathbf{v}_h-\mathbf{B}\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)_1 \ge \mathbf{a}(\mathbf{v}_h-\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)\\&\qquad +\, \sum \limits _{i=1}^2 \Big (- \frac{\theta }{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{w}_h^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i} ^2 \nonumber \\&\qquad \displaystyle +\,\frac{1}{2}\Vert {\gamma ^i}^{-\frac{1}{2}}\big ([P^i_{n,\gamma ^i}(\mathbf{v}_h)]_{+}-[ P^i_{n,\gamma ^i}(\mathbf{w}_h)]_{+}\big ) \Vert _{0,\Gamma _C^i}^2\nonumber \\&\qquad +\,\frac{1}{2}\Vert {\gamma ^i}^{-\frac{1}{2}}\big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{w}_h)]_{ \gamma ^i s^i}\big ) \Vert _{0,\Gamma _C^i}^2 \nonumber \\&\qquad \displaystyle - \frac{|1-\theta |}{4\beta }\Vert {\gamma ^i}^{-\frac{1}{2}}\big ([P^i_{n,\gamma ^i}(\mathbf{v}_h)]_{+}-[ P^i_{n,\gamma ^i}(\mathbf{w}_h)]_{+}\big ) \Vert _{0,\Gamma _C^i}^2 \nonumber \\&\qquad - \frac{|1-\theta |\beta }{4} \Vert {\gamma ^i}^\frac{1}{2}{\sigma }^i_n(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i}^2 \nonumber \\&\qquad \displaystyle - \frac{|1-\theta |}{4\beta }\Vert {\gamma ^i}^{-\frac{1}{2}}\big ( [\mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{w}_h)]_{\gamma ^i s^i}\big ) \Vert _{0,\Gamma _C^i}^2 \nonumber \\&\qquad - \frac{|1-\theta |\beta }{4} \Vert {\gamma ^i}^\frac{1}{2}{{\varvec{\sigma }}}^i_t(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i}^2\Big ) \nonumber \\&\quad \displaystyle = \mathbf{a}(\mathbf{v}_h-\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)+ \sum \limits _{i=1}^2 \Big (- \frac{1}{2}\Big ( \theta +\frac{|1-\theta |\beta }{2} \Big ) \Big (\Vert {\gamma ^i}^\frac{1}{2} {\sigma }^i_n(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i}^2 \nonumber \\&\qquad \displaystyle +\,\Vert {\gamma ^i}^\frac{1}{2} {{\varvec{\sigma }}}^i_t(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i}^2 \Big )\nonumber \\&\qquad + \frac{1}{2}\Big ( 1-\frac{|1-\theta |}{2\beta } \Big ) \Big (\Vert {\gamma ^i}^{-\frac{1}{2}}\big ([ P^i_{n,\gamma ^i}(\mathbf{v}_h)]_{+} -\,[ P^i_{n,\gamma ^i}(\mathbf{w}_h)]_{+}\big ) \Vert _{0,\Gamma _C^i}^2 \nonumber \\&\qquad \displaystyle + \,\Vert {\gamma ^i}^{-\frac{1}{2}}\big ( [\mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{w}_h)]_{\gamma ^i s^i}\big ) \Vert _{0,\Gamma _C^i}^2\Big )\Big ). \end{aligned}$$

Choosing $\displaystyle \beta = \frac{|1-\theta |}{2}$ and $\gamma _0$ sufficiently small we get:

$$\begin{aligned}&\displaystyle (\mathbf{B}\mathbf{v}_h-\mathbf{B}\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h)_1 \ge \mathbf{a}(\mathbf{v}_h-\mathbf{w}_h,\mathbf{v}_h-\mathbf{w}_h) \\&\qquad \displaystyle -\frac{(1+\theta )^2}{8} \sum \limits _{i=1}^2 \Big (\Vert {\gamma ^i}^\frac{1}{2}{\sigma }^i_n(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i}^2 +\Vert {\gamma ^i}^\frac{1}{2} {{\varvec{\sigma }}}^i_t(\mathbf{v}^i_h-\mathbf{w}^i_h)\Vert _{0,\Gamma _C^i}^2 \Big )\\&\quad \displaystyle \ge C\Vert \mathbf{v}_h-\mathbf{w}_h\Vert _1^2. \end{aligned}$$

Note that, when $\theta = -1$ we do not need the assumption of smallness of $\gamma _0$.
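The effect of the choice $\beta =|1-\theta |/2$ on the two coefficients of the preceding estimate can be verified with exact rational arithmetic. This is a small sanity check rather than part of the proof; the function name `coefficients` and the sample values of $\theta $ are illustrative assumptions.

```python
from fractions import Fraction

def coefficients(theta, beta):
    """Return (c_sigma, c_proj) for given theta and beta.

    c_sigma = (1/2)(theta + |1-theta| beta / 2) multiplies (with a minus sign)
    the gamma-weighted stress norms; c_proj = (1/2)(1 - |1-theta| / (2 beta))
    multiplies the norms of the differences of projected terms.
    """
    d = abs(1 - theta)
    c_sigma = Fraction(1, 2) * (theta + d * beta / 2)
    c_proj = Fraction(1, 2) * (1 - d / (2 * beta))
    return c_sigma, c_proj

results = {}
for theta in (Fraction(-1), Fraction(0), Fraction(2)):
    beta = abs(1 - theta) / 2  # the choice made in the proof (theta != 1)
    results[theta] = coefficients(theta, beta)
```

For each sampled $\theta $ the projection coefficient vanishes and the stress coefficient equals $(1+\theta )^2/8$; the latter is zero for $\theta =-1$, which is why no smallness condition on $\gamma _0$ is needed in that case.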

Let us now show that $\mathbf{B}$ is hemicontinuous. Since $\mathbf{V}_h$ is a vector space, it is sufficient to show that:

$$\begin{aligned}&{\varphi }: [0,1] \rightarrow \mathbb {R}\\&\quad t \mapsto (\mathbf{B}(\mathbf{v}_h-t\mathbf{w}_h),\mathbf{w}_h)_1 \end{aligned}$$

is a continuous real function for all $ \mathbf{v}_h,\mathbf{w}_h \in \mathbf{V}_h$. Let $t,s \in [0,1]$, we compute:

$$\begin{aligned}&|{\varphi }(t)-{\varphi }(s)|\\&\quad = \Big |(\mathbf{B}(\mathbf{v}_h-t\mathbf{w}_h)-\mathbf{B}(\mathbf{v}_h-s\mathbf{w}_h),\mathbf{w}_h)_1\Big |\\&\quad =\displaystyle \Big | A_{\theta }((s-t)\mathbf{w}_h,\mathbf{w}_h)+ \sum \limits _{i=1}^2 \Big (\frac{1}{2} \int _{\Gamma ^i_C} \frac{1}{\gamma ^i} P_{n,\theta \gamma ^i}^i(\mathbf{w}_h)\big ([ P^i_{n \gamma ^i}(\mathbf{v}_h-t\mathbf{w}_h)]_{+}\\&\qquad -\, [ P^i_{n \gamma ^i}(\mathbf{v}_h-s\mathbf{w}_h)]_{+}\big )\mathrm{d}\Gamma \\&\qquad \displaystyle + \,\frac{1}{2} \int _{\Gamma ^i_C} \frac{1}{\gamma ^i} \mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{w}_h) \big ([ \mathbf{P}^i_{t \gamma ^i}(\mathbf{v}_h-t\mathbf{w}_h)]_{\gamma ^i s^i}\\&\qquad - \,[ \mathbf{P}^i_{t \gamma ^i}(\mathbf{v}_h-s\mathbf{w}_h)]_{\gamma ^i s^i}\big )\mathrm{d}\Gamma \Big )\Big |\\&\quad \displaystyle \le |s-t|A_{\theta }(\mathbf{w}_h,\mathbf{w}_h)\\&\qquad + \,\sum \limits _{i=1}^2 \Big ( \frac{1}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}|P_{n,\theta \gamma ^i}^i(\mathbf{w}_h)|\Big |[ P^i_{n \gamma ^i }(\mathbf{v}_h-t\mathbf{w}_h)]_{+}- [ P^i_{n \gamma ^i }(\mathbf{v}_h-s\mathbf{w}_h)]_{+}\Big |\mathrm{d}\Gamma \\&\qquad \displaystyle + \frac{1}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}\Vert \mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{w}_h)\Vert \Big \Vert [\mathbf{P}^i_{t \gamma ^i }(\mathbf{v}_h-t\mathbf{w}_h)]_{\gamma ^i s^i}- [\mathbf{P}^i_{t \gamma ^i}(\mathbf{v}_h-s\mathbf{w}_h)]_{\gamma ^i s^i}\Big \Vert \mathrm{d}\Gamma \Big ). \end{aligned}$$

We use the bounds $|[a]_+ - [b]_+| \le |a- b|$ for all $a,b \in \mathbb {R}$ and $\big \Vert [\mathbf{a}]_{\gamma ^i s^i} - [\mathbf{b}]_{\gamma ^i s^i}\big \Vert \le \Vert \mathbf{a}- \mathbf{b}\Vert $ for all $\mathbf{a},\mathbf{b}\in \mathbb {R}^{d-1}$ to deduce that:

$$\begin{aligned} \begin{aligned}&\displaystyle \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}|P_{n,\theta \gamma ^i}^i(\mathbf{w}_h)|\Big |[ P^i_{n \gamma ^i}(\mathbf{v}_h-t\mathbf{w}_h)]_{+}- [ P^i_{n \gamma ^i}(\mathbf{v}_h-s\mathbf{w}_h)]_{+}\Big |\mathrm{d}\Gamma \\&\qquad \displaystyle + \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}\Vert \mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{w}_h)\Vert \Big \Vert [ \mathbf{P}^i_{t \gamma ^i}(\mathbf{v}_h-t\mathbf{w}_h)]_{\gamma ^i s^i}- [ \mathbf{P}^i_{t \gamma ^i}(\mathbf{v}_h-s\mathbf{w}_h)]_{\gamma ^i s^i}\Big \Vert \mathrm{d}\Gamma \\&\quad \displaystyle \le \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}|P_{n,\theta \gamma ^i}^i(\mathbf{w}_h)|\Big | P^i_{n \gamma ^i}(\mathbf{v}_h-t\mathbf{w}_h)- P^i_{n \gamma ^i}(\mathbf{v}_h-s\mathbf{w}_h)\Big |\mathrm{d}\Gamma \\&\qquad \displaystyle + \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}\Vert \mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{w}_h)\Vert \Big \Vert \mathbf{P}^i_{t \gamma ^i}(\mathbf{v}_h-t\mathbf{w}_h)- \mathbf{P}^i_{t \gamma ^i}(\mathbf{v}_h-s\mathbf{w}_h)\Big \Vert \mathrm{d}\Gamma \\&\quad \displaystyle \le |s-t| \Big (\int _{\Gamma ^i_C}\frac{1}{\gamma ^i}|P_{n,\theta \gamma ^i}^i(\mathbf{w}_h)|| P^i_{n \gamma ^i}(\mathbf{w}_h)|\mathrm{d}\Gamma \\&\qquad +\int _{\Gamma ^i_C}\frac{1}{\gamma ^i}\Vert \mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{w}_h)\Vert \Vert \mathbf{P}^i_{t \gamma ^i}(\mathbf{w}_h)\Vert \mathrm{d}\Gamma \Big ). \end{aligned} \end{aligned}$$

It results that:

$$\begin{aligned} \displaystyle |{\varphi }(t)-{\varphi }(s)|\le & {} |s-t|\Big (A_{\theta }(\mathbf{w}_h,\mathbf{w}_h)+\sum \limits _{i=1}^2 \Big (\int _{\Gamma ^i_C}\frac{1}{2\gamma ^i}|P_{n,\theta \gamma ^i}^i(\mathbf{w}_h)|| P^i_{n \gamma ^i}(\mathbf{w}_h)|\mathrm{d}\Gamma \\&\displaystyle +\int _{\Gamma ^i_C}\frac{1}{2\gamma ^i}\Vert \mathbf{P}_{t,\theta \gamma ^i}^i(\mathbf{w}_h)\Vert \Vert \mathbf{P}^i_{t \gamma ^i}(\mathbf{w}_h)\Vert \mathrm{d}\Gamma \Big )\Big ). \end{aligned}$$

This means that ${\varphi }$ is Lipschitz continuous, so that $\mathbf{B}$ is hemicontinuous. We finally apply Corollary 15 (p. 126) of [4] to conclude that $\mathbf{B}$ is a one-to-one operator. $\square $
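The two non-expansiveness bounds used in the hemicontinuity argument, for the positive part and for the projection onto a ball, can be checked on random samples. The sketch below is illustrative only: the sample sizes, the radius and the helper names `pos` and `proj_ball` are assumptions made for the example.

```python
import numpy as np

def pos(x):
    """Positive part [x]_+ = max(x, 0), applied entrywise."""
    return np.maximum(x, 0.0)

def proj_ball(x, r):
    """Projection [x]_r of a vector x onto the closed ball B(0, r)."""
    nx = np.linalg.norm(x)
    return x if nx <= r else (r / nx) * x

rng = np.random.default_rng(0)
tol = 1e-12  # guards against rounding in the comparisons

# Scalar bound: |[a]_+ - [b]_+| <= |a - b|.
a, b = rng.normal(size=1000), rng.normal(size=1000)
scalar_ok = bool(np.all(np.abs(pos(a) - pos(b)) <= np.abs(a - b) + tol))

# Vector bound: ||[a]_r - [b]_r|| <= ||a - b||, here in R^2 with radius r = 1.
A, B = 3.0 * rng.normal(size=(1000, 2)), 3.0 * rng.normal(size=(1000, 2))
vector_ok = all(
    np.linalg.norm(proj_ball(x, 1.0) - proj_ball(y, 1.0)) <= np.linalg.norm(x - y) + tol
    for x, y in zip(A, B)
)
```

Both maps are projections onto closed convex sets (the half-line $[0,+\infty )$ and the ball $\mathcal B(0,\gamma ^i s^i)$), so the bounds hold for every pair of points and not only for the sampled ones.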

3.3 A priori error analysis

Our Nitsche-based method (15) converges in an optimal way as the mesh parameter h vanishes. This is proved in Theorem 2.6, where we provide an estimate of the displacement error in the $H^1$-norm and of the contact error in the $L^2({\Gamma }_C^i)$-norm. We first establish the following abstract error estimate.

Theorem 2.5

Suppose that $\mathbf{u}$ is a solution to (1)–(5) and belongs to $(H^{\frac{3}{2}+\nu }(\Omega ^1))^d \times (H^{\frac{3}{2}+\nu }(\Omega ^2))^d$ with $\nu >0$.

1. Suppose that $\gamma _0$ is sufficiently small. Then the solution $\mathbf{u}_h$ to the discrete problem (15) satisfies the following error estimate:
$$\begin{aligned} \begin{aligned}&\displaystyle \sum \limits _{i=1}^2 \Big (\Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i} +\frac{1}{2}\Vert {\gamma ^i}^{\frac{1}{2}} \big ({\sigma }^i_n(\mathbf{u}^i)+\frac{1}{\gamma ^i}[P^i_{n,\gamma ^i}(\mathbf{u}_h)]_+\big )\Vert ^2_{0,\Gamma _C^i} \\&\qquad \displaystyle +\frac{1}{2}\Vert {\gamma ^i}^{\frac{1}{2}} \big ({{\varvec{\sigma }}}^i_t(\mathbf{u}^i)+\frac{1}{\gamma ^i}[\mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}\big )\Vert ^2_{0,\Gamma _C^i}\Big )\\&\quad \displaystyle \le C\inf _{\mathbf{v}_h \in \mathbf{V}_h} \Big ( \sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert ^2_{1,\Omega ^i}+ \frac{1}{2}\Vert {\gamma ^i}^{-\frac{1}{2}}(\mathbf{u}^i-\mathbf{v}_h^i) \Vert ^2_{0,\Gamma _C^i}\\&\qquad + \frac{1}{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}(\mathbf{u}^i-\mathbf{v}_h^i)\mathbf{n}^i \Vert ^2_{0,\Gamma _C^i}\Big ), \end{aligned} \end{aligned}$$
(19)
where $C>0$ is a constant independent of h, $\mathbf{u}$ and $\gamma _0$.
2. If $\theta =-1$, then for all $\gamma _0>0$ the solution $\mathbf{u}_h$ to the problem (15) satisfies the estimate (19) with $C>0$ a constant independent of h and $\mathbf{u}$, but possibly dependent on $\gamma _0$.

Proof

Let $\mathbf{v}_h\in \mathbf{V}_h$. Using the coercivity and the continuity of the form $a(\cdot ,\cdot )$ as well as Young’s inequality, we obtain:

$$\begin{aligned}&\alpha \sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i}\le a(\mathbf{u}-\mathbf{u}_h,\mathbf{u}-\mathbf{u}_h) \\&\quad =a(\mathbf{u}-\mathbf{u}_h,\mathbf{u}-\mathbf{v}_h)+a(\mathbf{u}-\mathbf{u}_h,\mathbf{v}_h-\mathbf{u}_h) \\&\quad \le C\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert _{1,\Omega ^i} \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert _{1,\Omega ^i}+a(\mathbf{u}-\mathbf{u}_h,\mathbf{v}_h-\mathbf{u}_h)\\&\quad \le \frac{\alpha }{2}\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i}\\&\qquad +\frac{C^2}{2\alpha }\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert ^2_{1,\Omega ^i}\\&\qquad +\,a(\mathbf{u},\mathbf{v}_h-\mathbf{u}_h)-a(\mathbf{u}_h,\mathbf{v}_h-\mathbf{u}_h). \end{aligned}$$

Therefore, we get:

$$\begin{aligned} \frac{\alpha }{2}\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i} \le \frac{C^2}{2\alpha }\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert ^2_{1,\Omega ^i}+a(\mathbf{u},\mathbf{v}_h-\mathbf{u}_h)-a(\mathbf{u}_h,\mathbf{v}_h-\mathbf{u}_h). \end{aligned}$$

Since $\mathbf{u}$ solves (1)–(5) and $\mathbf{u}_h$ solves (15), using Lemma 2.1 yields:

$$\begin{aligned}&\displaystyle \frac{\alpha }{2}\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i} \le \frac{C^2}{2\alpha }\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert ^2_{1,\Omega ^i} \nonumber \\&\qquad +\sum \limits _{i=1}^2 \Big (-\frac{\theta }{2}\int _{\Gamma ^i_C} \gamma ^i {{\varvec{\sigma }}}^i(\mathbf{u}_h^i-\mathbf{u}^i)\mathbf{n}^i\cdot {{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle +\frac{1}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \mathbf{P}^i_{t,\theta \gamma ^i}(\mathbf{v}_h-\mathbf{u}_h)\cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle +\frac{1}{2} \int _{\Gamma ^i_C}\frac{1}{\gamma ^i}P^i_{n,\theta \gamma ^i}(\mathbf{v}_h-\mathbf{u}_h) \big ([P^i_{n,\gamma ^i}(\mathbf{u}_h)]_{+}-[ P^i_{n,\gamma ^i}(\mathbf{u})]_{+}\big ) \mathrm{d}\Gamma \Big ). \end{aligned}$$

(20)

Let $\beta _1 >0$. The first integral term in (20) is bounded, using Cauchy–Schwarz and Young’s inequalities, as follows:

$$\begin{aligned}&\displaystyle -\frac{\theta }{2}\int _{\Gamma ^i_C} \gamma ^i {{\varvec{\sigma }}}^i(\mathbf{u}_h^i-\mathbf{u}^i)\mathbf{n}^i\cdot {{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\mathrm{d}\Gamma \nonumber \\&\quad \displaystyle =\frac{\theta }{2}\int _{\Gamma ^i_C} \gamma ^i {{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\cdot {{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\mathrm{d}\Gamma \nonumber \\&\qquad -\frac{\theta }{2}\int _{\Gamma ^i_C} \gamma ^i {{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}^i)\mathbf{n}^i\cdot {{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\mathrm{d}\Gamma \nonumber \\&\quad \displaystyle \le \frac{\theta }{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}\nonumber \\&\quad +\frac{|\theta |}{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i}\nonumber \\&\quad \displaystyle \le \frac{\beta _1\theta ^2}{4}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i}^2\nonumber \\&\quad + \frac{1}{2}\big (\theta +\frac{1}{2\beta _1}\big )\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}. \end{aligned}$$

(21)
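
For the reader's convenience, the last step of (21) can be spelled out. The shorthand $A=\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i}$ and $B=\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i}$ is introduced here only for this remark. Young's inequality $ab\le \frac{\beta _1}{2}a^2+\frac{1}{2\beta _1}b^2$, applied with $a=|\theta |A$ and $b=B$, gives:

$$\begin{aligned} \frac{\theta }{2}B^2+\frac{|\theta |}{2}AB \le \frac{\theta }{2}B^2+\frac{1}{2}\Big (\frac{\beta _1\theta ^2}{2}A^2+\frac{1}{2\beta _1}B^2\Big ) =\frac{\beta _1\theta ^2}{4}A^2+\frac{1}{2}\Big (\theta +\frac{1}{2\beta _1}\Big )B^2. \end{aligned}$$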

For the second integral term in (20), we can write:

$$\begin{aligned}&\displaystyle \int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \mathbf{P}^i_{t,\theta \gamma ^i}(\mathbf{v}_h-\mathbf{u}_h)\cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma \\&\quad =\displaystyle \ \int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h-\mathbf{u})\cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma \\&\qquad \displaystyle +\int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}-\mathbf{u}_h)\cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma \\&\qquad +\displaystyle \int _{\Gamma ^i_C}(1-\theta ){{\varvec{\sigma }}}_t^i(\mathbf{v}^i_h-\mathbf{u}^i_h)\cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma . \end{aligned}$$

Using the bound (10) and applying the Cauchy–Schwarz and Young inequalities twice, we obtain for $\beta _2>0$ and $\beta _3>0$:

$$\begin{aligned}&\displaystyle \int _{\Gamma ^i_C}\frac{1}{\gamma ^i} \mathbf{P}^i_{t,\theta \gamma ^i}(\mathbf{v}_h-\mathbf{u}_h)\cdot \big ([ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}-[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u})]_{\gamma ^i s^i}\big ) \mathrm{d}\Gamma \nonumber \\&\quad \displaystyle \le \frac{1}{2\beta _2}\Big \Vert {\gamma ^i}^{\frac{1}{2}}\Big ({{\varvec{\sigma }}}^i_t(\mathbf{u}^i) +\frac{1}{\gamma ^i}[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}\Big )\Big \Vert ^2_{0,\Gamma _C^i}\nonumber \\&\qquad + \frac{\beta _2}{2}\Vert {\gamma ^i}^{-\frac{1}{2}}[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h-\mathbf{u})]_{\gamma ^i s^i}\Vert ^2_{0,\Gamma _C^i} \nonumber \\&\qquad \displaystyle -\Big \Vert {\gamma ^i}^{\frac{1}{2}}\Big ({{\varvec{\sigma }}}^i_t(\mathbf{u}^i)+\frac{1}{\gamma ^i}[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}\Big )\Big \Vert ^2_{0,{\Gamma }_C^i}\nonumber \\&\qquad +\frac{|1-\theta |}{2\beta _3}\Big \Vert {\gamma ^i}^{\frac{1}{2}}\Big ({{\varvec{\sigma }}}^i_t(\mathbf{u}^i)\nonumber \\&\qquad +\frac{1}{\gamma ^i}[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i}\Big )\Big \Vert ^2_{0,\Gamma _C^i} \nonumber \\&\qquad \displaystyle +\frac{|1-\theta |\beta _3}{2}\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i_t(\mathbf{v}_h^i-\mathbf{u}_h^i)\Vert ^2_{0,\Gamma _C^i}. \end{aligned}$$

(22)

In a similar way, we can upper bound the third integral term of (20). Noting that:

$$\begin{aligned}&\displaystyle \Vert {\gamma ^i}^{-\frac{1}{2}}[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{v}_h-\mathbf{u})]_{\gamma ^i s^i}\Vert ^2_{0,\Gamma _C^i}+ \Vert {\gamma ^i}^{-\frac{1}{2}}[ P^i_{n,\gamma ^i}(\mathbf{v}_h-\mathbf{u})]_{+}\Vert ^2_{0,\Gamma _C^i}\nonumber \\&\quad \le 2\Vert {\gamma ^i}^{-\frac{1}{2}}(\llbracket u-v_h \rrbracket _n^i +\llbracket \mathbf{u}-\mathbf{v}_h \rrbracket _t^i )\Vert ^2_{0,\Gamma _C^i}+2\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{u}^i-\mathbf{v}^i_h)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}\nonumber \\&\quad \displaystyle \le 2 \sum \limits _{i=1}^2 \Big (\Vert {\gamma ^i}^{-\frac{1}{2}}( \mathbf{u}^i-\mathbf{v}^i_h )\Vert ^2_{0,\Gamma _C^i}\Big )+2\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{u}^i-\mathbf{v}^i_h)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}, \end{aligned}$$

(23)

and using estimates (21) and (22) in (20), we obtain:

$$\begin{aligned}&\displaystyle \frac{\alpha }{2}\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i} \le \frac{C^2}{2\alpha }\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert ^2_{1,\Omega ^i} \nonumber \\&\qquad \displaystyle + \frac{1}{2}\sum \limits _{i=1}^2 \Big ((\frac{\beta _1\theta ^2}{2}+\beta _2)\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{u}^i-\mathbf{v}^i_h)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}+2\beta _2\Vert {\gamma ^i}^{-\frac{1}{2}}(\mathbf{u}^i-\mathbf{v}^i_h)\Vert ^2_{0,\Gamma _C^i} \nonumber \\&\qquad \displaystyle +\big (-1+\frac{1}{2\beta _2}+\frac{|1-\theta |}{2\beta _3}\big )\nonumber \\&\qquad \big (\Vert {\gamma ^i}^{\frac{1}{2}}({{\varvec{\sigma }}}^i_t(\mathbf{u}^i)\! +\!\frac{1}{\gamma ^i}[ \mathbf{P}^i_{t,{\gamma ^i}}(\mathbf{u}_h)]_{\gamma ^i s^i})\Vert ^2_{0,\Gamma _C^i} \!+\!\Vert {\gamma ^i}^{\frac{1}{2}}({\sigma }^i_n(\mathbf{u}^i)\!+\!\frac{1}{\gamma ^i}[ P^i_{n,\gamma ^i}(\mathbf{u}_h)]_{+})\Vert ^2_{0,\Gamma _C^i}\big ) \nonumber \\&\qquad \displaystyle +\big (\frac{1}{2\beta _1}+\theta +\frac{|1-\theta |\beta _3}{2}\big )\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i} \Big ). \end{aligned}$$

(24)

We now use estimate (18) to obtain:

$$\begin{aligned} \Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i} \le C\gamma _0^{\frac{1}{2}}\Vert \mathbf{v}_h^i-\mathbf{u}_h^i\Vert ^2_{1,\Omega ^i}\le C\gamma _0^{\frac{1}{2}}(\Vert \mathbf{v}_h^i-\mathbf{u}^i\Vert ^2_{1,\Omega ^i}+\Vert \mathbf{u}_h^i-\mathbf{u}^i\Vert ^2_{1,\Omega ^i}) \end{aligned}$$

(25)

For a fixed $\theta \in \mathbb {R}$ we choose $\beta _2$ and $\beta _3$ large enough that:

$$\begin{aligned} -1+\frac{1}{2\beta _2}+\frac{|1-\theta |}{2\beta _3} < -\frac{1}{2} \end{aligned}$$

Choosing $\gamma _0$ small enough in (25) and putting the estimate in (24), we establish the first statement of the theorem.

We consider now the case $\theta =-1$ in which (24) becomes:

$$\begin{aligned}&\displaystyle \frac{\alpha }{2}\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i} \le \frac{C^2}{2\alpha }\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert ^2_{1,\Omega ^i} \\&\quad \displaystyle + \frac{1}{2}\sum \limits _{i=1}^2 \Big ((\frac{\beta _1}{2}+\beta _2)\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{u}^i-\mathbf{v}^i_h)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}+2\beta _2\Vert {\gamma ^i}^{-\frac{1}{2}}(\mathbf{u}^i-\mathbf{v}^i_h)\Vert ^2_{0,\Gamma _C^i}\\&\quad \displaystyle +\big (-1+\frac{1}{2\beta _2}+\frac{1}{\beta _3}\big )\\&\quad \big (\Vert {\gamma ^i}^{\frac{1}{2}}({{\varvec{\sigma }}}^i_t(\mathbf{u}^i)\!+\!\frac{1}{\gamma ^i}[ \mathbf{P}^i_{t,\gamma ^i}(\mathbf{u}_h)]_{\gamma ^i s^i})\Vert ^2_{0,\Gamma _C^i} \!+\!\Vert {\gamma ^i}^{\frac{1}{2}}({\sigma }^i_n(\mathbf{u}^i)\!+\!\frac{1}{{\gamma ^i}}[ P^i_{n,{\gamma ^i}}(\mathbf{u}_h)]_{+})\Vert ^2_{0,\Gamma _C^i}\big )\\&\quad \displaystyle +\big (\frac{1}{2\beta _1}-1+\beta _3\big )\big (\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}\big )\Big ). \end{aligned}$$

Let $\eta >0$ be given, and set $\beta _1=\frac{1}{2\eta }$, $\beta _2=1+\frac{1}{\eta }$ and $\beta _3=1+\eta $. We then arrive at:

$$\begin{aligned}&\displaystyle \frac{\alpha }{2}\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i} \le \frac{C^2}{2\alpha }\sum \limits _{i=1}^2 \Vert \mathbf{u}^i-\mathbf{v}_h^i \Vert ^2_{1,\Omega ^i} \\&\quad \displaystyle + \frac{1}{2}\sum \limits _{i=1}^2 \Big ((\frac{5}{4\eta }+1)\Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{u}^i-\mathbf{v}^i_h)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}+2\frac{1+\eta }{\eta }\Vert {\gamma ^i}^{-\frac{1}{2}}(\mathbf{u}^i-\mathbf{v}^i_h)\Vert ^2_{0,\Gamma _C^i}\\&\quad \displaystyle -\frac{\eta }{2(1+\eta )}\big (\Vert {\gamma ^i}^{\frac{1}{2}}({{\varvec{\sigma }}}^i_t(\mathbf{u}^i)+\frac{1}{{\gamma ^i}}[ \mathbf{P}^i_{t,{\gamma ^i}}(\mathbf{u}_h)]_{{\gamma ^i} s^i})\Vert ^2_{0,\Gamma _C^i}\\&\quad +\Vert {\gamma ^i}^{\frac{1}{2}}({\sigma }^i_n(\mathbf{u}^i)+\frac{1}{{\gamma ^i}}[ P^i_{n,{\gamma ^i}}(\mathbf{u}_h)]_{+})\Vert ^2_{0,\Gamma _C^i}\big )\\&\quad \displaystyle +2\eta \Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}^i(\mathbf{v}_h^i-\mathbf{u}_h^i)\mathbf{n}^i\Vert ^2_{0,\Gamma _C^i}\Big ) \end{aligned}$$
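
The coefficients in the last estimate follow from substituting the chosen $\beta _1$, $\beta _2$, $\beta _3$ into the $\theta =-1$ form of (24). The following minimal sketch checks this arithmetic exactly with rational numbers; the function names and dictionary keys are illustrative only and are not part of the original analysis.

```python
from fractions import Fraction


def theta_minus_one_coefficients(eta):
    """Coefficients of (24) for theta = -1 with beta_1, beta_2, beta_3 chosen as in the text."""
    eta = Fraction(eta)
    beta1 = 1 / (2 * eta)
    beta2 = 1 + 1 / eta
    beta3 = 1 + eta
    return {
        # multiplies ||gamma^{1/2} sigma(u - v_h) n||^2
        "stress_interp": beta1 / 2 + beta2,
        # multiplies ||gamma^{-1/2} (u - v_h)||^2
        "jump_interp": 2 * beta2,
        # multiplies the Nitsche contact and friction terms
        "contact": -1 + 1 / (2 * beta2) + 1 / beta3,
        # multiplies ||gamma^{1/2} sigma(v_h - u_h) n||^2
        "discrete_stress": 1 / (2 * beta1) - 1 + beta3,
    }


def closed_forms(eta):
    """Closed-form coefficients as they appear in the estimate following the choice of beta's."""
    eta = Fraction(eta)
    return {
        "stress_interp": 5 / (4 * eta) + 1,
        "jump_interp": 2 * (1 + eta) / eta,
        "contact": -eta / (2 * (1 + eta)),
        "discrete_stress": 2 * eta,
    }


# The two sets of coefficients agree for every sampled eta > 0.
for sample in (Fraction(1, 10), Fraction(1, 2), 1, 3):
    assert theta_minus_one_coefficients(sample) == closed_forms(sample)
```

Note that the coefficient of the contact terms is negative for every $\eta >0$, which is what allows these terms to be moved to the left-hand side.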

Set $\displaystyle \eta =\frac{\alpha }{16C^2\gamma _0}$, where C is the constant in (25) to conclude the proof of the theorem. $\square $

Theorem 2.6

Suppose that $\mathbf{u}=(\mathbf{u}^1,\mathbf{u}^2)$ is a solution to problem (1–5) and belongs to $(H^{\frac{3}{2}+\nu }(\Omega ^1))^d \times (H^{\frac{3}{2}+\nu }(\Omega ^2))^d$ with $\displaystyle 0<\nu \le \frac{1}{2}$ if $k=1$ and $0< \nu \le 1$ if $k=2$ (where $k$ is the degree of the finite element method). If $\theta =-1$ or $\gamma _0$ is sufficiently small, the solution $\mathbf{u}_h$ to problem (15) satisfies the following estimate:

$$\begin{aligned} \begin{array}{lcr} \displaystyle \sum \limits _{i=1}^2 \Big (\Vert \mathbf{u}^i-\mathbf{u}_h^i \Vert ^2_{1,\Omega ^i} +\frac{1}{2}\Vert {\gamma ^i}^{\frac{1}{2}} \big ({\sigma }^i_n(\mathbf{u}^i) +\frac{1}{{\gamma ^i}}[P^i_{n,{\gamma ^i}}(\mathbf{u}_h)]_+\big )\Vert ^2_{0,\Gamma _C^i} \\ \quad \displaystyle + \frac{1}{2}\Vert {\gamma ^i}^{\frac{1}{2}} \big ({{\varvec{\sigma }}}^i_t(\mathbf{u}^i)+\frac{1}{{\gamma ^i}}[\mathbf{P}^i_{t,{\gamma ^i}}(\mathbf{u}_h)]_{{\gamma ^i} s^i}\big )\Vert ^2_{0,\Gamma _C^i}\Big ) \displaystyle \le C h^{1+2\nu } \sum \limits _{i=1}^2 \Vert \mathbf{u}^i\Vert ^2_{\frac{3}{2}+\nu ,\Omega ^i} \end{array} \end{aligned}$$

(26)

where C is a constant independent of $\mathbf{u}$ and h.

Proof

To establish (26) we need to bound the terms on the right-hand side of estimate (19). We choose $\mathbf{v}_h^i=\mathcal {I}^i_h \mathbf{u}^i$ where $\mathcal {I}^i_h$ stands for the Lagrange interpolation operator mapping onto $\mathbf{V}_ h^i$. The estimate of the Lagrange interpolation error in the $H^1$-norm on a domain is classical (see, e.g. [3, 9] and [10]):

$$\begin{aligned} \Vert \mathbf{u}^i-\mathcal {I}^i_h\mathbf{u}^i\Vert _{1,\Omega ^i}\le Ch^{\frac{1}{2}+\nu }\Vert \mathbf{u}^i\Vert _{\frac{3}{2}+\nu ,\Omega ^i} \end{aligned}$$

(27)

for $-\frac{1}{2}<\nu \le k-\frac{1}{2}$.

Let $E\subset \Gamma ^i_C$ be an edge of a triangle $K\in T^i_h$. We have:

$$\begin{aligned} \Vert {\gamma ^i}^{-\frac{1}{2}}(\mathbf{u}^i-\mathcal {I}^i_h\mathbf{u}^i)\Vert _{0,E}\le Ch^{\frac{1}{2}+\nu }\Vert \mathbf{u}^i\Vert _{1+\nu ,E} \end{aligned}$$

Summing over all the edges $E$ and using the trace theorem yields:

$$\begin{aligned} \Vert {\gamma ^i}^{-\frac{1}{2}}(\mathbf{u}^i-\mathcal {I}^i_h\mathbf{u}^i)\Vert _{0,\Gamma _C^i}\le Ch^{\frac{1}{2}+\nu }\Vert \mathbf{u}^i\Vert _{1+\nu ,\Gamma _C^i}\le C h^{\frac{1}{2}+\nu } \Vert \mathbf{u}^i\Vert _{ \frac{3}{2}+\nu ,\Omega ^i} \end{aligned}$$

(28)

From Appendix A of [8] (see also [12]), we get the following estimate:

$$\begin{aligned} \Vert {\gamma ^i}^{\frac{1}{2}}{{\varvec{\sigma }}}(\mathbf{u}^i-\mathcal {I}^i_h\mathbf{u}^i)\mathbf{n}^i\Vert _{0,\Gamma _C^i}\le C h^{\frac{1}{2}+\nu } \Vert \mathbf{u}^i\Vert _{\frac{3}{2}+\nu ,\Omega ^i} \end{aligned}$$

(29)

By inserting (27), (28) and (29) into (19) we get (26). $\square $

4 Numerical experiments

In this section, we test the Nitsche unbiased method (15) for two/three-dimensional contact between two elastic bodies $\Omega ^1$ and $\Omega ^2$. The first body is a disk/sphere and the second is a rectangle/rectangular cuboid. This situation is not strictly a Hertz type contact problem because $\Omega ^2$ is bounded.

The tests are performed with $P_1$ and $P_2 $ Lagrange finite elements. The finite element library Getfem++ is used. The discrete contact problem is solved by using a generalized Newton method. Further details on generalized Newton’s method applied to contact problems can be found for instance in [25] and the references therein. The accuracy of the method is discussed for the different cases with respect to the finite element used, the mesh size, and the value of the parameters $\theta $ and $\gamma _0$. We perform experiments with a frictionless contact to compare the results of the formulation with other ones using Nitsche’s method (given mainly in [8, 11]). Moreover, we present the convergence curves for frictional contact in Figs. 11 and 12.

The numerical tests in two dimensions (resp. three dimensions) are performed on a domain $\Omega =]-0.5, 0.5[^2$ (resp. $\Omega =]-0.5, 0.5[^3$) containing the two bodies $\Omega ^1$ and $\Omega ^2$. The first body is a disk of radius 0.25 and center (0,0) (resp. a sphere of radius 0.25 and center (0,0,0)), and the second is a rectangle $]-0.5,0.5[\times ]-0.5,-0.25[$ (resp. a cuboid $\Omega ^2=]-0.5,0.5[^2\times ]-0.5,-0.25[$). The contact surface $\Gamma _C^1$ is the lower semicircle and $\Gamma _C^2$ is the top surface of $\Omega ^2$ (i.e. $\Gamma ^1_C =\{ \mathbf{x}\in \partial \Omega ^1; x_2 \le 0\}$ and $ \Gamma ^2_C =\{ \mathbf{x}\in \partial \Omega ^2; x_2 = -0.25\}$ (resp. $\Gamma ^1_C =\{ \mathbf{x}\in \partial \Omega ^1; x_3 \le 0\}$ and $ \Gamma ^2_C =\{ \mathbf{x}\in \partial \Omega ^2; x_3 = -0.25\}$)). A Dirichlet condition is prescribed at the bottom of the rectangle (resp. cuboid). Since no Dirichlet condition is applied on $\Omega ^1$, the problem is only semi-coercive. To overcome the non-definiteness coming from the free rigid motions, the horizontal displacement is prescribed to be zero at the two points of coordinates (0,0) and (0,0.1) (resp. (0,0,0) and (0,0,0.1)), which blocks the horizontal translation and the rigid rotation. The projector $\Pi ^1$ is defined from $\Gamma _C^1$ to $\Gamma _C^2$ in the vertical direction. All remaining parts of the boundaries are considered traction free. For simplicity, we consider a dimensionless configuration with Lamé coefficients $\lambda =1$ and $\mu =1$ and a volume density of vertical force $f_v=-0.25$.
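
To make the two-dimensional geometry concrete, the following minimal sketch evaluates the vertical projection from the lower semicircle onto the top of the rectangle and the resulting initial vertical distance between the two surfaces. The function names are hypothetical, and measuring the distance along the vertical direction is an assumption made here for illustration; it is not necessarily how the gap is evaluated in the actual computations.

```python
import math

R = 0.25       # radius of the disk, centred at the origin (from the text)
Y_TOP = -0.25  # Gamma_C^2 is the top of the rectangle, x_2 = -0.25 (from the text)


def lower_semicircle_point(x1):
    """Point of Gamma_C^1 (lower semicircle of the disk) with abscissa x1."""
    if abs(x1) > R:
        raise ValueError("x1 must lie in [-R, R]")
    return (x1, -math.sqrt(R * R - x1 * x1))


def vertical_projection(point):
    """Illustrative Pi^1: project a point of Gamma_C^1 vertically onto Gamma_C^2."""
    return (point[0], Y_TOP)


def initial_vertical_gap(x1):
    """Vertical distance between the two contact surfaces before loading (assumed definition)."""
    p = lower_semicircle_point(x1)
    q = vertical_projection(p)
    return p[1] - q[1]
```

With these definitions the two bodies touch only at the lowest point of the disk (zero gap at $x_1=0$), and the gap grows to 0.25 at the ends of the semicircle.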

The expression of the exact solution being unknown, the convergence is studied with respect to a reference solution computed with a $P_2$ element on a very fine mesh for $\theta =-1$ (see Figs. 2 and 3).

To show the quality of the approximation method we plot in Fig. 4 the contact stress profile on the second boundary and we compare it to Hertz’s solution. The diagrams in Fig. 4 correspond to the pressure profiles for the reference fine mesh with quadratic elements. The vertical green arrows correspond to values of the contact pressure field at quadrature points. The blue solid line represents the analytically calculated Hertz pressure profile. The left diagram corresponds to the two-dimensional case and the right one shows the pressure obtained at quadrature points of the elements crossing the plane $y=0$ in the three-dimensional case.
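
As a rough indication of what such an analytical profile looks like in two dimensions, the sketch below evaluates the classical Hertz formulas for a cylinder pressed against a flat body. Several ingredients are assumptions that the text does not state: plane strain, two identical materials, and a load per unit length equal to $|f_v|$ times the area of the disk. The setting is also not strictly of Hertz type, as noted above, so this is only an illustration of the formulas, not a reproduction of Fig. 4.

```python
import math


def hertz_line_contact(load_per_length, radius, e_star):
    """Contact half-width a and peak pressure p0 for classical Hertz line contact."""
    a = math.sqrt(4.0 * load_per_length * radius / (math.pi * e_star))
    p0 = 2.0 * load_per_length / (math.pi * a)
    return a, p0


def hertz_pressure(x, a, p0):
    """Semi-elliptical pressure profile; zero outside the contact zone |x| < a."""
    if abs(x) >= a:
        return 0.0
    return p0 * math.sqrt(1.0 - (x / a) ** 2)


# Material data from the text: Lame coefficients lambda = mu = 1.
lam, mu = 1.0, 1.0
young = mu * (3.0 * lam + 2.0 * mu) / (lam + mu)  # Young's modulus
poisson = lam / (2.0 * (lam + mu))                # Poisson's ratio

# Assumption: plane strain and identical materials, so 1/E* = 2 (1 - nu^2) / E.
e_star = young / (2.0 * (1.0 - poisson ** 2))

radius = 0.25
# Assumption: the load transmitted through the contact is |f_v| times the disk area.
load = 0.25 * math.pi * radius ** 2

a, p0 = hertz_line_contact(load, radius, e_star)
```

A simple consistency check is that the semi-elliptical profile integrates to the applied load, i.e. $p_0\pi a/2$ equals the load per unit length.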

4.1 Convergence in the two dimensional frictionless case

We perform a numerical convergence study of the three variants $\theta = 1$, $\theta = 0$ and $\theta =-1$ for a fixed parameter $\gamma _0 = \frac{1}{100}$ (chosen small in order to obtain convergence in all three cases) and friction coefficients $s^1=s ^2= 0$. In each case we plot the relative error, in percent, of the displacement in the $H^1$-norm on the two bodies, and the relative error of the Nitsche contact condition in the $L^2$-norm on ${\Gamma }_C^1$ and ${\Gamma }_C^2$. The latter is defined by:

$$\begin{aligned} \frac{\Vert {\gamma ^i}^{\frac{1}{2}} \big ({\sigma }^i_n(\mathbf{u}^{h i}_{ref})+\frac{1}{{\gamma ^i}}[P^i_{n,\gamma ^i}(\mathbf{u}_h)]_+\big )\Vert _{0,\Gamma _C^i}}{\Vert {\gamma ^i}^{\frac{1}{2}}{\sigma }^i_n(\mathbf{u}^{h i}_{ref})\Vert _{0,\Gamma _C^i}}, \text { where}\,\, \mathbf{u}^{h i}_{ref}\, \text {is the reference solution on}\, \Omega ^i. \end{aligned}$$

In Figs. 5, 6 and 7 the curves of relative error in percent for Lagrange $P_1$ finite elements are plotted. The convergence rate in the $H^1$-norm is about 1 for the three values of $\theta $, which is optimal in this case according to Theorem 2.6. In Figs. 8, 9 and 10 the same experiments are reported for Lagrange $P_2$ finite elements. The convergence rate for the three cases is about 1.5, which corresponds to optimality as well.
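
Convergence rates such as those quoted above are usually read off as slopes in a log-log plot of the error against the mesh size. A minimal sketch of that computation follows; the data are synthetic, manufactured to follow $e=Ch^p$ exactly, and are not the values behind the figures.

```python
import math


def observed_rates(mesh_sizes, errors):
    """Observed convergence order between consecutive mesh refinements."""
    if len(mesh_sizes) != len(errors) or len(errors) < 2:
        raise ValueError("need at least two (h, error) pairs of equal length")
    return [
        math.log(errors[k] / errors[k + 1]) / math.log(mesh_sizes[k] / mesh_sizes[k + 1])
        for k in range(len(errors) - 1)
    ]


# Synthetic data only: errors manufactured as 0.5 * h**p, not measured results.
h = [2.0 ** -k for k in range(3, 7)]
p1_like = [0.5 * hi ** 1.0 for hi in h]  # mimics the rate expected for k = 1
p2_like = [0.5 * hi ** 1.5 for hi in h]  # mimics the rate expected for k = 2

rates_p1 = observed_rates(h, p1_like)
rates_p2 = observed_rates(h, p2_like)
```

The expected orders come from the exponent $\frac{1}{2}+\nu $ in Theorem 2.6 with the largest admissible $\nu $: order 1 for $k=1$ and order 1.5 for $k=2$.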

4.2 Convergence in 2D frictional contact case

We also report the convergence curves for frictional contact (Tresca friction) with a friction coefficient $s^1= 0.1$, using the method $\theta = -1$ with Nitsche parameter $\gamma _0 = \frac{1}{100}$. The frictional contact curves are presented for $P_1$ and $P_2$ Lagrange elements in Figs. 11 and 12. Similar curves are obtained with other values of $\theta $. We mention here that this numerical validation is the first one for Nitsche’s method with frictional contact since in [5] no numerical study was performed. This validation confirms optimal convergence, with a convergence rate close to that of the frictionless case.
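
The Tresca condition enters the formulation through the bracket $[\cdot ]_{\gamma ^i s^i}$. Assuming, as is usual in this family of methods, that this bracket denotes the orthogonal projection onto the closed ball of radius $\gamma ^i s^i$ centred at the origin, the pointwise relation ${{\varvec{\sigma }}}_t=-\frac{1}{\gamma }[\llbracket \mathbf{u}\rrbracket _t-\gamma {{\varvec{\sigma }}}_t]_{\gamma s}$ can be sketched as follows. The function names are hypothetical and the sketch works on plain tuples at a single point.

```python
import math


def project_onto_ball(q, radius):
    """Orthogonal projection of the vector q onto the closed ball of given radius centred at 0."""
    norm = math.sqrt(sum(c * c for c in q))
    if norm <= radius:
        return tuple(q)
    scale = radius / norm
    return tuple(scale * c for c in q)


def tresca_tangential_stress(jump_t, sigma_t, gamma, s):
    """Evaluate -(1/gamma) [jump_t - gamma * sigma_t]_{gamma * s} at one point."""
    q = tuple(j - gamma * st for j, st in zip(jump_t, sigma_t))
    projected = project_onto_ball(q, gamma * s)
    return tuple(-p / gamma for p in projected)
```

Whatever the arguments, the returned tangential stress has norm at most $s$, which is the Tresca threshold; when there is no tangential jump and the stress is below the threshold (stick), the relation returns the stress unchanged.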

4.3 Convergence in the three dimensional case

The three-dimensional tests are similar to the two-dimensional ones. The error curves with $\theta =-1$ and $P_1$ Lagrange elements are presented in Fig. 13. Very similar conclusions can be drawn compared with the two-dimensional case.

As expected, optimal convergence is obtained in the $H^1$-norm and in the $L^2({\Gamma }_C)$-norm for all methods, in good agreement with Theorem 2.6.

4.4 Comparison with other methods

To compare the proposed method with existing ones, we plot the convergence curves of our test case together with those of the biased Nitsche formulation and the augmented Lagrangian method [8, 16]; see Figs. 14 and 15.

The curves are exactly the same for $P_1$ elements and very similar for $P_2$ ones, and the convergence rate of the unbiased Nitsche method equals that of the other formulations. We note that, for the different values of $\theta $, Nitsche’s method (biased and unbiased) and the augmented Lagrangian method generally converge in a similar number of iterations of the Newton algorithm.

4.5 Influence of the Nitsche parameter

The influence of $\gamma _0$ on the $H^1$-norm of the error for $P_2$ elements is plotted in Fig. 16 in the frictionless case and in Fig. 17 with a friction coefficient $s^1=0.1$. It is remarkable that the error curves for the smallest value of $\gamma _0$ are nearly the same for the three values of $\theta $.

The variant $\theta = 1$ is the most influenced by the value of $\gamma _0$. It converges only for $\gamma _0$ very small ($\le 10^{-1}$). The method for $\theta = 0$ gives a much larger window of choice for $\gamma _0$, though it has to remain small to keep a good solution. In agreement with the theoretical result of Theorem 2.6, the influence of $\gamma _0$ on the method $\theta =-1$ is limited.

So the choice of $\gamma _0$ depends on the considered version. We can always choose $\theta =-1$ to ensure stability and convergence independently of $\gamma _0$. In this case we lose symmetry and we need to introduce ${{\varvec{\sigma }}}(\mathbf{v}_h)$ into the weak formulation. The version $\theta =1$ preserves symmetry; however, it requires $\gamma _0$ to be rather small. The version $\theta =0$ can be seen as a good compromise since it is the simplest and it remains stable and converges optimally even for moderate values of $\gamma _0$. A strategy to guarantee the coercivity of the problem, and hence optimal convergence, is of course to consider a sufficiently small $\gamma _0$. However, the price to pay is an ill-conditioned discrete problem. The study presented in [25] shows that Newton’s method has serious difficulty converging when $\gamma _0$ is very small because the nonlinear discrete system (15) becomes very stiff in this case.

5 Conclusion

A theoretical and numerical study of Nitsche’s method was carried out for the Signorini problem in [6, 8]. These analyses demonstrate the performance of this type of formulation for contact between an elastic body and a rigid support. In this work we adapted Nitsche’s method to the contact problem between two elastic bodies and proposed an unbiased method that could be directly applicable to multi-body contact and self-contact. The method was analysed and we proved its consistency, well-posedness and optimal convergence. For the numerical study, the accuracy of the method was discussed for the Hertz problem with different types of finite elements, for variations of the mesh size and of the values of the parameters $\theta $ and $\gamma _0$. Frictionless and frictional situations have been considered, as well as two- and three-dimensional cases. The theoretical results are, generally, confirmed by the numerical tests, especially the optimal convergence and the influence of the parameter $\gamma _0$. Since the results in the small strain case are promising, forthcoming studies will address non-linear materials in the large deformation framework. In this case, our goal is to provide a construction of the method similar to the linear case and the corresponding numerical study.

In addition, solvers other than semi-smooth Newton could be considered for improved computational efficiency. For instance, highly efficient multigrid methods have been designed for mortar-type discretizations of contact problems in [29]. The adaptation of multigrid techniques to Nitsche’s discretization of contact is still an open issue and could be considered as a perspective (see [12] for multigrid with Nitsche for interface problems).

Notes

http://getfem.org/.

References

1. Becker, R., Hansbo, P., Stenberg, R.: A finite element method for domain decomposition with non-matching grids. Math. Model. Numer. Anal. 37, 209–225 (2003)
2. Ben Belgacem, F., Hild, P., Laborde, P.: Extension of the mortar finite element method to a variational inequality modeling unilateral contact. Math. Models Methods Appl. Sci. 9, 287–303 (1999)
3. Brenner, S.-C., Scott, L.-R.: The Mathematical Theory of Finite Element Methods. Texts in Applied Mathematics, vol. 15. Springer, New York (2007)
4. Brezis, H.: Équations et inéquations non linéaires dans les espaces vectoriels en dualité. Ann. Inst. Fourier (Grenoble) 18, 115–175 (1968)
5. Chouly, F.: An adaptation of Nitsche’s method to the Tresca friction problem. J. Math. Anal. Appl. 411, 329–339 (2014)
6. Chouly, F., Hild, P.: Nitsche-based method for unilateral contact problems: numerical analysis. SIAM J. Numer. Anal. 51, 1295–1307 (2013)
7. Chouly, F., Hild, P.: On convergence of the penalty method for unilateral contact problems. Appl. Numer. Math. 65, 27–40 (2013)
8. Chouly, F., Hild, P., Renard, Y.: Symmetric and non-symmetric variants of Nitsche’s method for contact problems in elasticity: theory and numerical experiments. Math. Comput. 84, 1089–1112 (2015)
9. Dupont, T., Scott, R.: Polynomial approximation of functions in Sobolev spaces. Math. Comput. 34, 441–463 (1980)
10. Ern, A., Guermond, J.-L.: Theory and Practice of Finite Elements. Applied Mathematical Sciences, vol. 159. Springer, New York (2004)
11. Fabre, M., Pousin, J., Renard, Y.: A fictitious domain method for frictionless contact problems in elasticity using Nitsche’s method. SMAI J. Comp. Math. 2, 19–50 (2016)
12. Fritz, A., Hüeber, S., Wohlmuth, B.: A comparison of mortar and Nitsche techniques for linear elasticity. Calcolo 41, 115–137 (2004)
13. Hansbo, A., Hansbo, P.: A finite element method for the simulation of strong and weak discontinuities in solid mechanics. Comput. Methods Appl. Mech. Eng. 193, 3523–3540 (2004)
14. Haslinger, J., Hlaváček, I., Nečas, J.: Numerical methods for unilateral problems in solid mechanics. In: Ciarlet, P.G., Lions, J.L. (eds.) Handbook of Numerical Analysis. Elsevier, London (1996)
15. Heintz, P., Hansbo, P.: Stabilized Lagrange multiplier methods for bilateral elastic contact with friction. Comput. Methods Appl. Mech. Eng. 195, 4323–4333 (2006)
16. Hild, P., Renard, Y.: A stabilized Lagrange multiplier method for the finite element approximation of contact problems in elastostatics. Numer. Math. 115, 101–129 (2010)
17. Kikuchi, N., Oden, J.T.: Contact Problems in Elasticity: A Study of Variational Inequalities and Finite Element Methods. Society for Industrial and Applied Mathematics (SIAM), Philadelphia (1988)
18. Laursen, T.: Formulation and treatment of frictional contact problems using finite elements. PhD thesis, Stanford Univ., CA (1992)
19. Laursen, T.: Computational Contact and Impact Mechanics. Springer, Berlin (2002)
20. Laursen, T., Simo, J.: A continuum-based finite element formulation for the implicit solution of multibody, large deformation frictional contact problems. Int. J. Numer. Meth. Eng. 36, 3451–3485 (1993)
21. McDevitt, T.W., Laursen, T.A.: A mortar-finite element formulation for frictional contact problems. Int. J. Numer. Methods Eng. 48, 1525–1547 (2000)
22. Moussaoui, M., Khodja, K.: Régularité des solutions d’un problème mêlé Dirichlet-Signorini dans un domaine polygonal plan. Commun. Partial Differ. Equ. 17, 805–826 (1992)
23. Nitsche, J.: Über ein Variationsprinzip zur Lösung von Dirichlet-Problemen bei Verwendung von Teilräumen, die keinen Randbedingungen unterworfen sind. Abh. Math. Semin. Univ. Hamb. 36, 9–15 (1971)
24. Popp, A., Wohlmuth, B.I., Gee, M.W., Wall, W.A.: Dual quadratic mortar finite element methods for 3D finite deformation contact. SIAM J. Sci. Comput. 34, B421–B446 (2012)
25. Renard, Y.: Generalized Newton’s methods for the approximation and resolution of frictional contact problems in elasticity. Comp. Methods Appl. Mech. Eng. 256, 38–55 (2013)
26. Sauer, R.A., De Lorenzis, L.: An unbiased computational contact formulation for 3D friction. Int. J. Numer. Meth. Eng. 101, 251–280 (2015)
27. Stenberg, R.: On some techniques for approximating boundary conditions in the finite element method. J. Comput. Appl. Math. 63, 139–148 (1995)
28. Wohlmuth, B.: Variationally consistent discretization schemes and numerical algorithms for contact problems. Acta Numer. 20, 569–734 (2011)
29. Wohlmuth, B.I., Krause, R.H.: Monotone multigrid methods on nonmatching grids for nonlinear multibody contact problems. SIAM J. Sci. Comput. 25, 324–347 (2003)


Acknowledgements

We would like to sincerely thank the company “Manufacture Française des Pneumatiques Michelin” for the financial and technical support. We thank, as well, Région Franche-Comté for partial funding (Convention Région 2015C-4991 “Modèles mathématiques et méthodes numériques pour l’élasticité non-linéaire”).

Author information

Authors and Affiliations

Laboratoire de Mathématiques de Besançon - UMR CNRS 6623, Université de Franche Comté, 16 Route de Gray, 25030, Besançon Cedex, France
Franz Chouly
CNRS, INSA-Lyon, LaMCoS UMR5259, Université de Lyon, 69621, Villeurbanne, France
Rabii Mlika
CNRS, INSA-Lyon, ICJ UMR5208, LaMCoS UMR5259, Université de Lyon, 69621, Villeurbanne, France
Yves Renard


Corresponding author

Correspondence to Rabii Mlika.

Appendix

Strong–weak formulation equivalence

Let $\mathbf{u}=(\mathbf{u}^1,\mathbf{u}^2)$ be a sufficiently regular solution to the problem (13). Using the definitions of $A_{\theta }, P^i_{\gamma ^i}(\mathbf{u}) $ and $P_{\theta \gamma ^i}^i(\mathbf{v})$, we obtain:

$$\begin{aligned}&\displaystyle \mathbf{a}(\mathbf{u},\mathbf{v})- \frac{1}{2}\int _{\Gamma ^1_C}\theta \gamma ^1 {\sigma }_n^1(\mathbf{u}^1){\sigma }_n^1(\mathbf{v}^1)\mathrm{d}\Gamma - \frac{1}{2}\int _{\Gamma ^2_C}\theta \gamma ^2 {\sigma }_n^2(\mathbf{u}^2) {\sigma }_n^2(\mathbf{v}^2)\mathrm{d}\Gamma \\&\qquad - \frac{1}{2}\int _{\Gamma ^1_C}\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot {{\varvec{\sigma }}}_t^1(\mathbf{v}^1)\mathrm{d}\Gamma \\&\qquad \displaystyle -\frac{1}{2}\int _{\Gamma ^2_C}\theta \gamma ^2{{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)\mathrm{d}\Gamma +\frac{1}{2}\int _{\Gamma ^1_C}\frac{1}{\gamma ^1}[\llbracket u \rrbracket _n^1-g^1_n - \gamma ^1 {\sigma }^1_n(\mathbf{u}^1)]_{+}\\&(v_n^1+v_n^2\circ \Pi ^1-\theta \gamma ^1 {\sigma }_n^1(\mathbf{v}^1))\mathrm{d}\Gamma \\&\qquad \displaystyle +\frac{1}{2}\int _{\Gamma ^2_C}\frac{1}{\gamma ^2}[\llbracket u \rrbracket ^2_n-g^2_n - \gamma ^2 {\sigma }^2_n(\mathbf{u}^2)]_{+} (v_n^2+v_n^1\circ \Pi ^2 -\theta \gamma ^2 {\sigma }_n^2(\mathbf{v}^2))\mathrm{d}\Gamma \\&\qquad \displaystyle + \frac{1}{2}\int _{\Gamma ^1_C}\frac{1}{\gamma ^1}[\llbracket \mathbf{u}\rrbracket ^1_t - \gamma ^1 {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)]_{\gamma ^1 s^1}\cdot (\mathbf{v}_t^1-\mathbf{v}_t^2\circ \Pi ^1-\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{v}^1))\mathrm{d}\Gamma \\&\qquad \displaystyle +\frac{1}{2}\int _{\Gamma ^2_C}\frac{1}{\gamma ^2}[\llbracket \mathbf{u}\rrbracket ^2_t - \gamma ^2 {{\varvec{\sigma }}}^2_t(\mathbf{u}^2)]_{\gamma ^2 s^2}\cdot \big (\mathbf{v}_t^2-\mathbf{v}_t^1\circ \Pi ^2 -\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)\big )\mathrm{d}\Gamma = L(\mathbf{v}). \end{aligned}$$

Using Green’s formula we can write

$$\begin{aligned} \mathbf{a}(\mathbf{u},\mathbf{v})&\displaystyle =-\int _{\Omega ^1}{} \mathbf{div}{{\varvec{\sigma }}}^1(\mathbf{u}^1)\cdot \mathbf{v}^1 \mathrm{d}\Omega -\int _{\Omega ^2}{} \mathbf{div}{{\varvec{\sigma }}}^2(\mathbf{u}^2)\cdot \mathbf{v}^2 \mathrm{d}\Omega \\&\displaystyle + \int _{\partial \Omega ^1} {{\varvec{\sigma }}}^1(\mathbf{u}^1)\mathbf{n}^1\cdot \mathbf{v}^1 \mathrm{d}\Gamma + \int _{\partial \Omega ^2} {{\varvec{\sigma }}}^2(\mathbf{u}^2)\mathbf{n}^2\cdot \mathbf{v}^2 \mathrm{d}\Gamma . \end{aligned}$$

If we take $\mathbf{v}=(\mathbf{v}^1,\mathbf {0})$ with $\mathbf{v}^1=\mathbf {0}$ on $\partial \Omega ^1$, we obtain:

$$\begin{aligned} -\int _{\Omega ^1}{} \mathbf{div}{{\varvec{\sigma }}}^1(\mathbf{u}^1)\cdot \mathbf{v}^1 \mathrm{d}\Omega =\int _{\Omega ^1}\mathbf{f}^1\cdot \mathbf{v}^1\mathrm{d}\Omega \quad \forall \mathbf{v}^1, \end{aligned}$$

which yields (1a) for $i=1$. In the same way we establish (1a) for $i=2$.

To establish (2), (3), (4) and (5), we consider a displacement field $\mathbf{v}$ that vanishes on the boundary except on the contact surfaces where $\mathbf{v}=(\mathbf{v}^1,\mathbf{v}^2)$. Then (13) and (1a) give

$$\begin{aligned}&\displaystyle \int _{\Gamma _C^1} {\sigma }_n^1(\mathbf{u}^1)v_n^1 \mathrm{d}\Gamma +\int _{\Gamma _C^1} {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot \mathbf{v}_t^1 \mathrm{d}\Gamma +\int _{\Gamma _C^2} {\sigma }_n^2(\mathbf{u}^2)v_n^2 \mathrm{d}\Gamma +\int _{\Gamma _C^2} {{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot \mathbf{v}_t^2 \mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle - \frac{1}{2}\int _{\Gamma ^1_C}\theta \gamma ^1 {\sigma }_n^1(\mathbf{u}^1){\sigma }_n^1(\mathbf{v}^1)\mathrm{d}\Gamma - \frac{1}{2}\int _{\Gamma ^2_C}\theta \gamma ^2 {\sigma }_n^2(\mathbf{u}^2) {\sigma }_n^2(\mathbf{v}^2)\mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle - \frac{1}{2}\int _{\Gamma ^1_C}\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot {{\varvec{\sigma }}}_t^1(\mathbf{v}^1)\mathrm{d}\Gamma -\frac{1}{2}\int _{\Gamma ^2_C}\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{u}^2)\cdot {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)\mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle +\frac{1}{2}\int _{\Gamma ^1_C}\frac{1}{\gamma ^1}[\llbracket u \rrbracket ^1_n-g^1_n - \gamma ^1 {\sigma }^1_n(\mathbf{u}^1)]_{+}(v_n^1+v_n^2\circ \Pi ^1 -\theta \gamma ^1 {\sigma }_n^1(\mathbf{v}^1))\mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle +\frac{1}{2}\int _{\Gamma ^2_C}\frac{1}{\gamma ^2}[\llbracket u \rrbracket ^2_n-g^2_n - \gamma ^2 {\sigma }^2_n(\mathbf{u}^2)]_{+} (v_n^2+v_n^1\circ \Pi ^2 -\theta \gamma ^2 {\sigma }_n^2(\mathbf{v}^2))\mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle + \frac{1}{2}\int _{\Gamma ^1_C}\frac{1}{\gamma ^1}[\llbracket \mathbf{u}\rrbracket ^1_t - \gamma ^1 {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)]_{\gamma ^1 s^1}\cdot (\mathbf{v}_t^1-\mathbf{v}_t^2\circ \Pi ^1 -\theta \gamma ^1 {{\varvec{\sigma }}}_t^1(\mathbf{v}^1))\mathrm{d}\Gamma \nonumber \\&\qquad \displaystyle +\frac{1}{2}\int _{\Gamma ^2_C}\frac{1}{\gamma ^2}[\llbracket \mathbf{u}\rrbracket ^2_t - \gamma ^2 {{\varvec{\sigma }}}^2_t(\mathbf{u}^2)]_{\gamma ^2 s^2}\cdot \big (\mathbf{v}_t^2-\mathbf{v}_t^1\circ \Pi ^2 -\theta \gamma ^2 {{\varvec{\sigma }}}_t^2(\mathbf{v}^2)\big )\mathrm{d}\Gamma =0. \end{aligned}$$

(30)

We need to discuss two cases: $\theta \ne 0$ and $\theta = 0$.

Case 1 $\theta \ne 0$: In (30), let us consider $\mathbf{v}=(\mathbf{v}^1,\mathbf{v}^2)$ such that:

$$\begin{aligned} {\left\{ \begin{array}{ll} \mathbf{v}^1=\mathbf{{0}} \ \text {and} \ {{\varvec{\sigma }}}_t^1(\mathbf{v}^1)=\mathbf{{0}},\ {\sigma }_n^1(\mathbf{v}^1)\ne 0 &{} \text {on} \ \Gamma _C^1 \text { and}\\ \mathbf{v}^2=\mathbf{{0}} \ \text {and} \ {{\varvec{\sigma }}}^2(\mathbf{v}^2)\mathbf{n}^2=\mathbf{{0}} &{} \text {on} \ \Gamma _C^2, \end{array}\right. } \end{aligned}$$

(31)

so,

$$\begin{aligned} \displaystyle \frac{\theta }{2}\int _{\Gamma ^1_C}\Big ( [\llbracket u \rrbracket ^1_n-g^1_n - \gamma ^1 {\sigma }^1_n(\mathbf{u}^1)]_{+} + \gamma ^1{\sigma }_n^1(\mathbf{u}^1) \Big ){\sigma }_n^1(\mathbf{v}^1)\mathrm{d}\Gamma =0 \quad \forall \mathbf{v}\text { satisfying}\,(31). \end{aligned}$$

Then:

$$\begin{aligned} {\sigma }_n^1(\mathbf{u}^1)=-\frac{1}{\gamma ^1}[\llbracket u \rrbracket ^1_n-g_n^1 - \gamma ^1 {\sigma }^1_n(\mathbf{u}^1)]_{+}, \end{aligned}$$

which implies (2). Arguing in the same way we obtain (3) and the friction conditions (4).
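
For completeness, the step from the last identity to the contact conditions can be spelled out by a short case analysis. This is only a sketch: the shorthand $X=\llbracket u \rrbracket ^1_n-g^1_n$, $\sigma ={\sigma }_n^1(\mathbf{u}^1)$, $\gamma =\gamma ^1$ is introduced here, and (2) is assumed to consist of the conditions $X\le 0$, $\sigma \le 0$ and $\sigma X=0$.

$$\begin{aligned}&X-\gamma \sigma \le 0: \quad \sigma =-\frac{1}{\gamma }[X-\gamma \sigma ]_{+}=0, \ \text {hence } X\le 0;\\&X-\gamma \sigma >0: \quad \sigma =-\frac{1}{\gamma }(X-\gamma \sigma )=-\frac{X}{\gamma }+\sigma , \ \text {hence } X=0 \text { and } \sigma <0. \end{aligned}$$

In both cases $X\le 0$, $\sigma \le 0$ and $\sigma X=0$ hold.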

Remark A.1

A field $\mathbf{v}$ satisfying (31) can be built as follows. Let $\mathbf{s}(\mathbf{x})$ be the curvilinear coordinate system on the boundary ${\Gamma }_C$ and $d(\mathbf{x})$ the signed distance to ${\Gamma }_C$. Then, for a given vector field $\mathbf{g}$ of $\mathbb {R}^d$, the field $\mathbf{u}(\mathbf{x})= B^{-1}(\mathbf{s}(\mathbf{x}))\mathbf{g}(\mathbf{s}(\mathbf{x}))d(\mathbf{x})$ satisfies $\mathbf{u}(\mathbf{x})=\mathbf{0}$ and $ {{\varvec{\sigma }}}(\mathbf{u})\mathbf{n}=\mathbf{g}$ on ${\Gamma }_C$, with $B_{il}= A_{ijkl}n_kn_j$, $A$ being the elasticity tensor.
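
The boundary values claimed in Remark A.1 can be checked by a direct computation. The following is a sketch under stated assumptions: the shorthand $\mathbf{w}=B^{-1}\mathbf{g}$ is introduced here, $d$ is assumed to be oriented so that $\nabla d=\mathbf{n}$ on ${\Gamma }_C$, and $A$ is assumed to have the usual minor symmetry $A_{ijkl}=A_{ijlk}$.

$$\begin{aligned} \partial _l u_k&= \partial _l\big (w_k\circ \mathbf{s}\big )\, d + (w_k\circ \mathbf{s})\, \partial _l d = w_k n_l \quad \text {on } {\Gamma }_C \ (\text {where } d=0),\\ \big ({{\varvec{\sigma }}}(\mathbf{u})\mathbf{n}\big )_i&= A_{ijkl}\, \partial _l u_k\, n_j = A_{ijkl}\, w_k n_l n_j = A_{ijkl}\, n_k n_j\, w_l = B_{il}w_l = g_i. \end{aligned}$$

Since $d=0$ on ${\Gamma }_C$, the field $\mathbf{u}$ itself vanishes there, which gives the first property.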

To obtain the second Newton law, we insert the Nitsche form of (2) and (3) into (30) and choose $\mathbf{v}_t=\mathbf{0}$, ${{\varvec{\sigma }}}_t(\mathbf{v})=\mathbf{0}$ and $v_n^2=-v_n^1\circ \Pi ^2$:

$$\begin{aligned} \displaystyle \int _{\Gamma _C^1} {\sigma }_n^1(\mathbf{u}^1)v_n^1 \mathrm{d}\Gamma -\int _{\Gamma _C^2} {\sigma }_n^2(\mathbf{u}^2)v_n^1\circ \Pi ^2 \mathrm{d}\Gamma =0 \qquad \forall v_n^1. \end{aligned}$$

Then:

$$\begin{aligned} \int _{\Gamma _C^1} [{\sigma }_n^1(\mathbf{u}^1)- J^1{\sigma }_n^2(\mathbf{u}^2\circ \Pi ^1)]v_n^1\mathrm{d}\Gamma =0 \qquad \forall v_n^1. \end{aligned}$$

For $v_n^1=v_n^2=0$ and $\mathbf{v}_t^2=\mathbf{v}_t^1\circ \Pi ^2$, and using (4) in (30), we similarly have

$$\begin{aligned} \int _{\Gamma _C^1} [{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)+J^1{{\varvec{\sigma }}}_t^2(\mathbf{u}^2\circ \Pi ^1)]\cdot \mathbf{v}_t^1\mathrm{d}\Gamma =0 \qquad \forall \mathbf{v}_t^1, \end{aligned}$$

and we have (5).

Case 2 $\theta =0$: Let us take $\mathbf{v}_t^1=\mathbf{v}_t^2=\mathbf{{0}}$ and $v_n^2=-v_n^1\circ \Pi ^2$, $v_n^1=-v_n^2\circ \Pi ^1$, then (30) reads:

$$\begin{aligned} \int _{\Gamma _C^1} [{\sigma }_n^1(\mathbf{u}^1)- J^1{\sigma }_n^2(\mathbf{u}^2\circ \Pi ^1)]v_n^1\mathrm{d}\Gamma =0 \qquad \forall v_n^1. \end{aligned}$$

Let us take, now $v_n^1=v_n^2=0$ and $\mathbf{v}_t^2=\mathbf{v}_t^1\circ \Pi ^2$, $\mathbf{v}_t^1=\mathbf{v}_t^2\circ \Pi ^1$, then (30) reads:

$$\begin{aligned} \int _{\Gamma _C^1} [{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)+ J^1{{\varvec{\sigma }}}_t^2(\mathbf{u}^2\circ \Pi ^1)]\cdot \mathbf{v}_t^1\mathrm{d}\Gamma =0 \qquad \forall \mathbf{v}_t^1, \end{aligned}$$

and we have (5).

Let $\mathbf{v}^2=0$ on $\Gamma _C^2$. Taking $\mathbf{v}_t^1=0$, we get:

$$\begin{aligned}&\int _{\Gamma _C^1} \Big [{\sigma }_n^1(\mathbf{u}^1) + \frac{1}{2\gamma ^1} [\llbracket u \rrbracket ^1_n-g^1_n - \gamma ^1 {\sigma }^1_n(\mathbf{u}^1)]_{+} \\&\quad + J^1 \frac{1}{2\gamma ^2}[\llbracket u \rrbracket ^2_n\circ \Pi ^1-g^2_n\circ \Pi ^1- \gamma ^2 {\sigma }^2_n(\mathbf{u}^2\circ \Pi ^1)]_{+}\Big ]v_n^1\mathrm{d}\Gamma =0 \qquad \forall v_n^1. \end{aligned}$$

Then, identifying $\llbracket u \rrbracket ^2_n\circ \Pi ^1$ with $\llbracket u \rrbracket ^1_n$ and $g^2_n\circ \Pi ^1$ with $g^1_n$:

$$\begin{aligned} {\sigma }_n^1(\mathbf{u}^1) =- \frac{1}{2} \Big [ \frac{1}{\gamma ^1}[\llbracket u \rrbracket ^1_n-g^1_n - \gamma ^1 {\sigma }^1_n(\mathbf{u}^1)]_{+} + \frac{J^1}{\gamma ^2}[\llbracket u \rrbracket ^1_n-g^1_n - \gamma ^2 {\sigma }^2_n(\mathbf{u}^2\circ \Pi ^1)]_{+}\Big ]. \end{aligned}$$

Since $J^1>0$, ${\sigma }_n^1(\mathbf{u}^1)\le 0$ and so we obtain (2b). The second Newton law (5) yields:

$$\begin{aligned} {\sigma }_n^1(\mathbf{u}^1)=- \frac{1}{2} \Big [\Big [\frac{1}{\gamma ^1}(\llbracket u \rrbracket ^1_n-g^1_n)- {\sigma }^1_n(\mathbf{u}^1)\Big ]_{+} + \Big [\frac{J^1}{\gamma ^2}(\llbracket u \rrbracket ^1_n-g^1_n)- {\sigma }^1_n(\mathbf{u}^1)\Big ]_{+}\Big ]. \end{aligned}$$

(32)

We discuss both cases:

If ${\sigma }_n^1(\mathbf{u}^1) =0$:

$$\begin{aligned} \frac{1}{2}(\frac{1}{\gamma ^1}+\frac{J^1}{\gamma ^2})[\llbracket u \rrbracket ^1_n- g^1_n]_{+}=0 \text { then } \llbracket u \rrbracket ^1_n \le g^1_n. \end{aligned}$$

If ${\sigma }_n^1(\mathbf{u}^1) <0$:

$$\begin{aligned} \frac{1}{\gamma ^1}(\llbracket u \rrbracket ^1_n-g^1_n)-{\sigma }^1_n(\mathbf{u}^1)> 0 \qquad \text {or} \qquad \frac{J^1}{\gamma ^2}(\llbracket u \rrbracket ^1_n-g^1_n) - {\sigma }^1_n(\mathbf{u}^1)> 0 \qquad \text {or both}. \end{aligned}$$

1. If we suppose first that $\displaystyle \frac{1}{\gamma ^1}(\llbracket u \rrbracket ^1_n-g^1_n) -{\sigma }^1_n(\mathbf{u}^1)> 0$ and $\displaystyle \frac{J^1}{\gamma ^2}(\llbracket u \rrbracket ^1_n-g^1_n) - {\sigma }^1_n(\mathbf{u}^1)> 0$, Eq. (32) reads:
$$\begin{aligned} {\sigma }_n^1(\mathbf{u}^1)= - \frac{1}{2} \Big [ \Big (\frac{1}{\gamma ^1}+\frac{J^1}{\gamma ^2}\Big )(\llbracket u \rrbracket ^1_n-g^1_n) - 2{\sigma }_n^1(\mathbf{u}^1)\Big ]\qquad \text { then } \qquad \llbracket u \rrbracket ^1_n =g^1_n. \end{aligned}$$
2. If now only $ \displaystyle \frac{1}{\gamma ^1}(\llbracket u \rrbracket ^1_n-g^1_n) -{\sigma }^1_n(\mathbf{u}^1)> 0$ holds, while $\displaystyle \frac{J^1}{\gamma ^2}(\llbracket u \rrbracket ^1_n-g^1_n) - {\sigma }^1_n(\mathbf{u}^1)\le 0$, Eq. (32) becomes:
$$\begin{aligned} {\sigma }_n^1(\mathbf{u}^1)= & {} - \frac{1}{2\gamma ^1}(\llbracket u \rrbracket ^1_n-g^1_n)+\frac{1}{2}{\sigma }^1_n(\mathbf{u}^1).\\ \text {So } {\sigma }_n^1(\mathbf{u}^1)= & {} - \frac{1}{\gamma ^1}(\llbracket u \rrbracket ^1_n -g^1_n). \end{aligned}$$
Then, since ${\sigma }_n^1(\mathbf{u}^1)<0$, we have $ \llbracket u \rrbracket ^1_n > g^1_n$. But in this case,
$$\begin{aligned} \displaystyle \frac{J^1}{\gamma ^2}(\llbracket u \rrbracket ^1_n-g^1_n) - {\sigma }^1_n(\mathbf{u}^1)>0, \end{aligned}$$
which contradicts the assumption $\displaystyle \frac{J^1}{\gamma ^2}(\llbracket u \rrbracket ^1_n-g^1_n) - {\sigma }^1_n(\mathbf{u}^1)\le 0$. So this case cannot occur. In a similar way we get a contradiction when only $\displaystyle \frac{J^1}{\gamma ^2}(\llbracket u \rrbracket ^1_n-g^1_n) - {\sigma }^1_n(\mathbf{u}^1) >0$ holds.

To conclude, we establish that: if ${\sigma }_n^1(\mathbf{u}^1) =0$, $\llbracket u \rrbracket ^1_n \le g^1_n $ and if ${\sigma }_n^1(\mathbf{u}^1) <0$, $\llbracket u \rrbracket ^1_n = g^1_n$; and this is equivalent to (2a) and (2c).
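
The case analysis above can be cross-checked on a finite grid with exact rational arithmetic. The sketch below is not part of the original proof: the function names, the grid and the parameter values are hypothetical, `X` stands for $\llbracket u \rrbracket ^1_n-g^1_n$, and (2) is assumed to read $X\le 0$, $\sigma \le 0$, $\sigma X=0$.

```python
from fractions import Fraction
from itertools import product


def pos(x):
    """Positive part [x]_+ of a rational number."""
    return x if x > 0 else Fraction(0)


def residual_32(sigma, X, gamma1, gamma2, J):
    """Left-hand side minus right-hand side of (32); zero iff (32) holds."""
    half = Fraction(1, 2)
    return sigma + half * (pos(X / gamma1 - sigma) + pos(J * X / gamma2 - sigma))


def contact_conditions(sigma, X):
    """Assumed form of (2): X <= 0, sigma <= 0 and sigma * X == 0."""
    return X <= 0 and sigma <= 0 and sigma * X == 0


def check_equivalence():
    """Compare (32) with the contact conditions on a small rational grid."""
    grid = [Fraction(k, 4) for k in range(-8, 9)]        # values in [-2, 2]
    params = [Fraction(1, 2), Fraction(1), Fraction(3)]  # gamma1, gamma2, J > 0
    for sigma, X in product(grid, grid):
        for gamma1, gamma2, J in product(params, repeat=3):
            holds_32 = residual_32(sigma, X, gamma1, gamma2, J) == 0
            if holds_32 != contact_conditions(sigma, X):
                return False
    return True
```

Calling `check_equivalence()` only samples a few values, so it illustrates the argument rather than replacing it.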

We suppose, now, that $v_n^1=0$ and $\mathbf{v}^2=\mathbf{{0}}$. We get:

$$\begin{aligned}&\int _{\Gamma _C^1} \Big [{{\varvec{\sigma }}}_t^1(\mathbf{u}^1) + \frac{1}{2\gamma ^1} [\llbracket \mathbf{u}\rrbracket ^1_t - \gamma ^1 {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)]_{\gamma ^1 s^1} - \frac{J^1}{2\gamma ^2}[\llbracket \mathbf{u}\rrbracket ^2_t\circ \Pi ^1\\&\qquad - \gamma ^2 {{\varvec{\sigma }}}^2_t(\mathbf{u}^2\circ \Pi ^1)]_{\gamma ^2 s^2}\Big ]\cdot \mathbf{v}_t^1\mathrm{d}\Gamma =0 \qquad \forall \mathbf{v}_t^1. \end{aligned}$$

Then, using the property: $\forall \gamma >0$, $ \displaystyle [x]_{\gamma s}= \gamma \big [\frac{x}{\gamma }\big ]_s$, it yields:

$$\begin{aligned} {{\varvec{\sigma }}}_t^1(\mathbf{u}^1) + \frac{1}{2}\Big [\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Big ]_{s^1} - \frac{J^1}{2}\Big [\frac{\llbracket \mathbf{u}\rrbracket ^2_t\circ \Pi ^1}{\gamma ^2} - {{\varvec{\sigma }}}^2_t(\mathbf{u}^2\circ \Pi ^1)\Big ]_{ s^2}=0. \end{aligned}$$
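
The scaling property $[x]_{\gamma s}= \gamma [x/\gamma ]_s$ used in this step can be checked numerically. The following is a minimal sketch, assuming that $[\cdot ]_s$ denotes the projection onto the closed ball of radius $s$ centred at the origin; the function names and the random sampling are illustrative choices, not taken from the paper.

```python
import math
import random


def proj_ball(x, s):
    """Project the 2D vector x onto the closed ball of radius s (assumed [x]_s)."""
    norm = math.hypot(x[0], x[1])
    if norm <= s:
        return (x[0], x[1])
    return (s * x[0] / norm, s * x[1] / norm)


def check_scaling(trials=1000, seed=0):
    """Test [x]_{gamma s} == gamma [x / gamma]_s on random samples."""
    rng = random.Random(seed)
    for _ in range(trials):
        x = (rng.uniform(-5.0, 5.0), rng.uniform(-5.0, 5.0))
        gamma = rng.uniform(0.1, 10.0)
        s = rng.uniform(0.1, 5.0)
        lhs = proj_ball(x, gamma * s)
        scaled = proj_ball((x[0] / gamma, x[1] / gamma), s)
        rhs = (gamma * scaled[0], gamma * scaled[1])
        if not all(math.isclose(a, b, rel_tol=1e-9, abs_tol=1e-12)
                   for a, b in zip(lhs, rhs)):
            return False
    return True
```

The identity holds because dividing $x$ by $\gamma$ and the radius by $\gamma$ leaves the direction of $x$ and the test $\Vert x\Vert \le \gamma s$ unchanged.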

We use the Newton law (5) and the condition (6) to obtain:

$$\begin{aligned} {{\varvec{\sigma }}}_t^1(\mathbf{u}^1) + \frac{1}{2}\Big [\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Big ]_{ s^1} + \frac{1}{2}\Big [J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Big ]_{ s^1}=0. \end{aligned}$$

(33)

1. If $\displaystyle \Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert < s^1 $ and $\displaystyle \Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert < s^1$, both projections in (33) reduce to the identity, so that $\displaystyle \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{2}\Big (\frac{1}{\gamma ^1}+\frac{J^1}{\gamma ^2}\Big ) =0$ and hence $\llbracket \mathbf{u}\rrbracket ^1_t =0$. In this case we obtain ${{\varvec{\sigma }}}^1_t(\mathbf{u}^1)=\Big [{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Big ]_{ s^1}$, and so $\Vert {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert < s^1$.
2. If $\displaystyle \Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert \ge s^1 $ and $\displaystyle \Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert \ge s^1$:
$$\begin{aligned} {{\varvec{\sigma }}}_t^1(\mathbf{u}^1) + \frac{s^1}{2} \frac{ \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) }{\Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert } + \frac{s^1}{2}\frac{J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)}{\Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert }=0. \end{aligned}$$
(34)
Equation (34) shows that ${{\varvec{\sigma }}}^1_t(\mathbf{u}^1)$ and $\llbracket \mathbf{u}\rrbracket ^1_t$ are collinear.

So: ${\left\{ \begin{array}{ll} &{}\displaystyle \frac{ \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) }{\Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert } = \frac{J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)}{\Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert },\\ \text {or}\\ &{}\displaystyle \frac{ \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) }{\Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert } = - \frac{J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)}{\Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert }(*), \end{array}\right. }$

and we obtain, from (34): ${\left\{ \begin{array}{ll} &{}{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) =- s^1\frac{ \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) }{\Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert } = -\frac{1}{\gamma ^1}[\llbracket \mathbf{u}\rrbracket ^1_t - \gamma ^1 {{\varvec{\sigma }}}^1_t(\mathbf{u}^1)]_{\gamma ^1 s^1},\\ &{} \text { and this is equivalent to}\, (4).\\ \text {or}\\ &{}{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) = 0 \text { which is impossible in } (*). \end{array}\right. }$
3. If now $\displaystyle \Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert < s^1 $ and $\displaystyle \Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert \ge s^1$:
$$\begin{aligned} {{\varvec{\sigma }}}_t^1(\mathbf{u}^1) + \frac{1}{2} \big (\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \big ) + \frac{s^1}{2}\frac{J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)}{\Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1)\Vert }=0. \end{aligned}$$
Projecting on $\frac{{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)}{\Vert {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\Vert }$ and setting $a=\Vert {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\Vert $; $b=\displaystyle \frac{{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot \llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1\Vert {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\Vert }$,

we get: ${\left\{ \begin{array}{ll} &{}|b-a|<s^1 \text { and }|bJ^1\frac{\gamma ^1}{\gamma ^2}-a|\ge s^1\\ \text {and}\\ &{}(b+a)+\epsilon s^1=0;\text { where } \epsilon =\mathrm {sign}( bJ^1\frac{\gamma ^1}{\gamma ^2}-a) = \pm 1. \end{array}\right. }$

Let $\epsilon =+1$; so $ a= -b-s^1$ and we obtain: ${\left\{ \begin{array}{ll} &{} b-a= 2b+ s^1 \text { and } |b-a|<s^1 \\ \text {and}\\ &{}bJ^1\frac{\gamma ^1}{\gamma ^2}-a=(J^1\frac{\gamma ^1}{\gamma ^2}+1)b+s^1 \text { and }bJ^1\frac{\gamma ^1}{\gamma ^2}-a \ge s^1. \end{array}\right. }$

So: ${\left\{ \begin{array}{ll} &{} -s^1< b<0\\ \text {and}\\ &{}(J^1\frac{\gamma ^1}{\gamma ^2}+1)b\ge 0 \end{array}\right. }$ which is absurd.

Let $\epsilon =-1$; so $a=-b+s^1$ and we obtain: ${\left\{ \begin{array}{ll} &{} b-a= 2b- s^1 \text { and } |b-a|<s^1 \\ \text {and}\\ &{}bJ^1\frac{\gamma ^1}{\gamma ^2}-a=(J^1\frac{\gamma ^1}{\gamma ^2}+1)b-s^1 \text { and }bJ^1\frac{\gamma ^1}{\gamma ^2}-a \le -s^1, \end{array}\right. }$

so: ${\left\{ \begin{array}{ll} &{} 0< b< s^1\\ \text {and}\\ &{}(J^1\frac{\gamma ^1}{\gamma ^2}+1)b \le 0 \end{array}\right. }$ which is absurd.
4. If $\displaystyle \Vert \frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^1} - {{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert \ge s^1 $ and $\displaystyle \Vert J^1\frac{\llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2} -{{\varvec{\sigma }}}^1_t(\mathbf{u}^1) \Vert < s^1$:

We argue in the same way, setting $a=\Vert {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\Vert $; $b=J^1\displaystyle \frac{{{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\cdot \llbracket \mathbf{u}\rrbracket ^1_t}{\gamma ^2\Vert {{\varvec{\sigma }}}_t^1(\mathbf{u}^1)\Vert }$.

Thus, we establish the friction condition (4) for $i=1$. In the same way, when supposing $\mathbf{v}^1=\mathbf{0}$, we get (2a)–(2b)–(2c) and (4) for $i=2$.
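
The four cases treated for (33) can be illustrated in a scalar setting, where $[x]_s$ becomes the clipping of $x$ to $[-s,s]$. The sketch below uses exact rational arithmetic on a small grid; the function names, the grid and the parameter values are hypothetical, and the friction law (4) is assumed to take the form $|\sigma |\le s$ when the tangential jump $u$ vanishes and $\sigma =-s\,\mathrm {sign}(u)$ otherwise.

```python
from fractions import Fraction
from itertools import product


def clip(x, s):
    """Scalar version of [x]_s: projection of x onto the interval [-s, s]."""
    return max(-s, min(s, x))


def residual_33(sigma, u, s, gamma1, gamma2, J):
    """Left-hand side of the scalar analogue of (33); zero iff (33) holds."""
    half = Fraction(1, 2)
    return (sigma
            + half * clip(u / gamma1 - sigma, s)
            + half * clip(J * u / gamma2 - sigma, s))


def friction_law(sigma, u, s):
    """Assumed scalar form of (4): stick if u == 0, slip otherwise."""
    if u == 0:
        return abs(sigma) <= s
    return sigma == (-s if u > 0 else s)


def check_friction_equivalence():
    """Compare (33) with the friction law on a small rational grid."""
    grid = [Fraction(k, 4) for k in range(-8, 9)]        # values in [-2, 2]
    params = [Fraction(1, 2), Fraction(1), Fraction(3)]  # gamma1, gamma2, J > 0
    thresholds = [Fraction(1, 2), Fraction(1)]           # friction threshold s
    for sigma, u, s in product(grid, grid, thresholds):
        for gamma1, gamma2, J in product(params, repeat=3):
            holds_33 = residual_33(sigma, u, s, gamma1, gamma2, J) == 0
            if holds_33 != friction_law(sigma, u, s):
                return False
    return True
```

As with any sampled check, `check_friction_equivalence()` covers only the scalar case and a handful of parameter values; the vector-valued argument is the one given in the proof.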

About this article

Cite this article

Chouly, F., Mlika, R. & Renard, Y. An unbiased Nitsche’s approximation of the frictional contact between two elastic structures. Numer. Math. 139, 593–631 (2018). https://doi.org/10.1007/s00211-018-0950-x

Received: 30 November 2015
Revised: 21 April 2017
Published: 17 February 2018
Issue Date: July 2018
DOI: https://doi.org/10.1007/s00211-018-0950-x
