Abstract
We consider problems of static equilibrium in which the primary unknown is the stress field and the solutions maximize a complementary energy subject to equilibrium constraints. A necessary and sufficient condition for the sequential lower-semicontinuity of such functionals is symmetric \(\mathrm{div}\)-quasiconvexity; a special case of Fonseca and Müller’s \(\mathcal {A}\)-quasiconvexity with \(\mathcal {A}= \mathrm{div}\) acting on \(\mathbb {R}^{n\times n}_\mathrm {sym}\). We specifically consider the example of the static problem of plastic limit analysis and seek to characterize its relaxation in the non-standard case of a non-convex elastic domain. We show that the symmetric \(\mathrm{div}\)-quasiconvex envelope of the elastic domain can be characterized explicitly for isotropic materials whose elastic domain depends on pressure p and Mises effective shear stress q. The envelope then follows from a rank-2 hull construction in the (p, q)-plane. Remarkably, owing to the equilibrium constraint, the relaxed elastic domain can still be strongly non-convex, which shows that convexity of the elastic domain is not a requirement for existence in plasticity.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
We consider problems of static equilibrium in which the primary unknown is the stress field and the solutions minimize a complementary energy subject to equilibrium constraints. Such problems arise, for example, in the limit analysis of solids at collapse, which is characterized by continuing deformations, or yielding, at constant applied loads [10]. In a geometrically linear framework, the elastic strains and the stress remain constant during collapse. Therefore, the plastic strain rate coincides with the total strain rate and is compatible. In addition, the stress is constrained to be in equilibrium and take values in the elastic domain K, which, for ideal plasticity and in the absence of hardening, is a fixed subset of \(\mathbb {R}^{n\times n}_\mathrm {sym}\). Static theory then aims to minimize over all possible velocities \(v:\Omega \rightarrow \mathbb {R}^n\) compatible with the boundary data \(g:\partial \Omega \rightarrow \mathbb {R}^n\), and maximize over all possible stress fields \(\sigma :\Omega \rightarrow K\) in equilibrium, the plastic dissipation
Natural spaces of functions are \(\sigma \in L^\infty (\Omega ;\mathbb {R}^{n\times n}_\mathrm {sym})\) with \(\sigma \in K\) almost everywhere and \(v\in W^{1,p}(\Omega ;\mathbb {R}^n)\) with \(v=g_D\) on \(\partial \Omega \) in the sense of traces. If the elastic domain K is convex, then the mathematical analysis of the problem is straightforward. Thus, the supremum of (1.1) with respect to \(\sigma \) can be taken locally, and the resulting dissipation functional
can then be minimized over all admissible v. In (1.2), \(\psi (\xi ):=\sup _{\sigma \in K} \sigma \cdot \xi \) is the dissipation potential. Thus, for convex K the classical kinematic problem of limit analysis is recovered. The functional (1.2) is itself convex and, for compact K, coercive, whence existence of minimizers follows by the direct method of the calculus of variations.
However, the elastic domain K of some notable materials is not convex. An illustrative example is silica glass. Indeed, Meade and Jeanloz [11] made measurements of the shear strength of amorphous silica at pressures up to 81 GPa at room temperature and showed that the strength initially decreases sharply as the material is compressed to denser structures of higher coordination and then rises again (Fig. 1a) resulting in a strongly non-convex elastic domain in the pressure-shear stress plane. Several authors [13, 17] have performed molecular dynamics calculations of amorphous solids deforming in pressure-shear and have found that the resulting deformation field forms distinctive patterns to accommodate permanent macroscopic deformations; see Fig. 1b. Remarkably, whereas convex limit analysis is standard [10], the case of non-convex elastic domains does not appear to have been studied.
More generally, we may consider static problems where the material response is expressed as
in terms of a complementary energy function \(\chi \). The functional of interest is then the complementary energy
to be minimized subject to the equilibrium constraints
where \(\sigma : \Omega \rightarrow \mathbb {R}^{{n}\times {n}}\) is a local stress field, \(b : \Omega \rightarrow \mathbb {R}^{n}\) are body forces and \(h : \Gamma _N \rightarrow \mathbb {R}^{n}\) applied tractions over the Neumann boundary \(\Gamma _N\subseteq \partial \Omega \). If \(\chi \) is non-convex, the question of relaxation again becomes non-standard and it may be expected to result in the development of microstructure in the form of rapidly oscillatory stress fields.
A powerful mathematical tool for elucidating such questions is furnished by \(\mathcal {A}\)-quasiconvexity, introduced by Fonseca and Müller [5] as a necessary and sufficient condition for the sequential lower-semicontinuity of functionals of the form
where \(f : \Omega \times \mathbb {R}^m \times \mathbb {R}^d \rightarrow [0,+\infty )\) is a normal integrand, \(\Omega \subseteq \mathbb {R}^n\) open and bounded, and v must satisfy the differential constraint
Here,
and \(A^{(i)} \in \mathrm{Lin}(\mathbb {R}^l; \mathbb {R}^d)\) is a constant rank partial differential operator. Specifically, \(f(x, u, \cdot )\) is \(\mathcal {A}\)-quasiconvex if
for all \(v \in \mathbb {R}^d\) and all \(w \in C^\infty (Q; \mathbb {R}^d)\) such that \(\mathcal {A}w = 0\) and w is Q-periodic, with \(Q = (0,1)^n\). In particular, with \(\mathcal {A}= \mathrm{curl}\), \(\mathcal {A}\)-quasiconvexity reduces to Morrey’s notion of quasiconvexity. In the context of the static problem (1.4) and (1.5), we may identify the state field v with \(\sigma \) and the operative differential operator \(\mathcal {A}\) with \(\mathrm{div}\). The pertinent notion of quasiconvexity is, therefore, \(\mathrm{div}\)-quasiconvexity, acting on fields of symmetric \(n\times n\) matrices. Whereas for kinematic problems of the energy-minimization type there is a well-developed theory of relaxation relating to \(\mathrm{curl}\)-quasiconvexity, the relaxation of static problems of the form (1.4) and (1.5), relating instead to \(\mathrm{div}\)-quasiconvexity, has been less extensively studied.
In this paper, we develop a theory of symmetric \(\mathrm{div}\)-quasiconvex relaxation for static problems. For definiteness, we confine attention to the static problem of limit analysis [10]
Here, \(K \subseteq \mathbb {R}^{n\times n}_{\mathrm{sym}}\) is the elastic domain, which we assume to be compact, and
where \(g_D\in L^1(\partial \Omega ;\mathbb {R}^n)\) gives the boundary data. The domain \(\Omega \) is assumed to be a bounded Lipschitz domain. The stress field \(\sigma \) is a divergence-free field, which takes values in symmetric matrices. This symmetry sets the present setting apart from previous applications of \(\mathrm {div\,}\)-quasiconvexity, also denoted \(\mathcal {S}\)-quasiconvexity or soleinoidal–quasiconvexity, which have focused on the characterization of the \(\mathrm {div\,}\)-quasiconvex hull of a 3-point set in relation with the three-well problem in linear elasticity [7, 15, 16] and on the Born-Infeld equations [12]. We call the present setting symmetric \(\mathrm {div\,}\)-quasiconvexity.
In Section 2, we show how the concept of symmetric \(\mathrm {div}\)-quasiconvexity fits within the framework of \(\mathcal {A}\)-quasiconvexity and discuss the relevant properties of symmetric \(\mathrm {div}\)-quasiconvex functions, which mainly follow directly from [5]. We also present in Lemma 2.7 an important example of a nonconvex symmetric \(\mathrm {div}\)-quasiconvex function. Section 3 deals with \(\mathrm {div}\)-quasiconvexity for sets and their hulls, in the context of relaxation theory. An important result, announced in [17, Th. 1 and Th. 2], is Theorem 3.3, which shows that the variational problem (1.10) has a solution if K is symmetric \(\mathrm {div}\)-quasiconvex. We then discuss, in particular, the definition of the symmetric \(\mathrm {div}\)-quasiconvex hull of a set K, which in principle depends on the growth of the class of test functions employed. However, we show that all \(p\in (1,\infty )\) give equivalent definitions, Theorem 3.6. Finally, Section 4 deals with the important case of sets K that can be characterized in terms of the first two stress invariants alone and show how their symmetric \(\mathrm {div}\)-quasiconvex hulls can be explicitly characterized. We recall that this elastic domain representation is the basis for a broad range of pressure-dependent plasticity models, including the Mohr–Coulomb model of sands ([10] and references therein), the Cam-Clay model of soils ( [19] and references therein), the Drucker-Prager model of pressure-dependent metal plasticity ([10] and references therein) and Gurson’s model of porous metal plasticity [8].
2 Symmetric div-quasiconvex Functions
We start by giving the basic definitions and recalling the main results from [5], specializing them to the case of interest here.
Definition 2.1
A Borel-measurable, locally bounded function \(f:\mathbb {R}^{n\times n}_\mathrm {sym}\rightarrow \mathbb {R}\) is symmetric \(\mathrm {div\,}\)-quasiconvex if, for all \(\varphi \in C^\infty _\mathrm {per}((0,1)^n;\mathbb {R}^{n\times n}_\mathrm {sym})\) which obey \(\mathrm {div\,}\varphi =0\) everywhere,
For \(\xi \in \mathbb {R}^{n\times n}_\mathrm {sym}\), the symmetric \(\mathrm {div}\)-quasiconvex envelope of \(f:\mathbb {R}^{n\times n}_\mathrm {sym}\rightarrow \mathbb {R}\) is defined as
We recall that \(C^\infty _\mathrm {per}((0,1)^n)\) is the set of \(\varphi \in C^\infty (\mathbb {R}^n)\) such that \(\varphi (x+e_i) = \varphi (x)\) for \(i=1,\dots ,n\).
Remark 2.2
From the definition it follows that, if f, g are symmetric \(\mathrm {div}\)-quasiconvex, then so are \(\max \{f,g\}\) and \(f+\lambda g\), for any \(\lambda \in [0,\infty )\). Furthermore, all convex functions are symmetric \(\mathrm {div}\)-quasiconvex.
For a generic first-order differential operator of the form given in (1.8) and a wavevector \(w\in \mathbb {R}^n\setminus \{0\}\), the linear operator \({\mathbb {A}}(w)\in \mathrm {Lin}(\mathbb {R}^m;\mathbb {R}^n)\) is defined as
The general theory of \(\mathcal {A}\)-quasiconvexity requires that \({\mathbb {A}}\) be constant rank, in the sense that \(\mathop {\mathrm {rank}}{\mathbb {A}}\) does not depend on w (as long as \(w\ne 0\)). We first show that this condition holds in the present case and compute the characteristic cone. We recall that the characteristic cone is the union of the sets where \({\mathbb {A}}(w)\) vanishes, for \(w\ne 0\), and that symmetric \(\mathrm {div}\)-quasiconvex functions are convex in the directions of the characteristic cone.
Lemma 2.3
The condition of being divergence-free is constant rank on symmetric \(n\times n\) matrices. The characteristic cone consists of all non-invertible matrices and spans \(\mathbb {R}^{n\times n}_\mathrm {sym}\).
Proof
Let \(J:\mathbb {R}^{n(n+1)/2}\rightarrow \mathbb {R}^{n\times n}_\mathrm {sym}\) be a linear bijection which maps \(\{e_1\dots e_{n(n+1)/2}\}\) to \(\{e_i\odot e_j\}_{1\leqq i\leqq j\leqq n}\). We recall that \((a\odot b)_{ij}:=\frac{1}{2} (a_ib_j+a_jb_i)\). We define the differential operator \(\mathcal {A}^\mathrm {s-div\,}\) on \(C^\infty (\Omega ;\mathbb {R}^{n(n+1)/2})\) as \(\mathcal {A}^\mathrm {s-div\,}\varphi := \mathrm {div\,}(J\varphi )\). The corresponding linear operator \({\mathbb {A}}^\mathrm {s-div\,}(w)\in \mathrm {Lin}(\mathbb {R}^{n(n+1)/2};\mathbb {R}^n)\), for \(w\in \mathbb {R}^n\), is defined by its action on a vector \(\xi \in \mathbb {R}^{n(n+1)/2}\),
which can be written as \({\mathbb {A}}^\mathrm {s-div\,}(w)\xi = (J\xi )w\).
For example, for \(n=2\),
and
We now show that the operator \({\mathbb {A}}^\mathrm {s-div\,}(w)\) is surjective for every \(w\in S^{n-1}\). Indeed, fix any vector \(v\in \mathbb {R}^n\) and let \(F^{v,w}\in \mathbb {R}^{n\times n}_\mathrm {sym}\) be such that \(F^{v,w}w=v\) (for example, let \(F^{v,w}=v\otimes w + w\otimes v - (v\cdot w) w\otimes w\)). Then, choose \(\xi :=J^{-1}(F^{v,w})\) to obtain \({\mathbb {A}}^\mathrm {s-div\,}(w)J^{-1}(F^{v,w}) = F^{v,w}w=v\). Therefore, \({\mathbb {A}}^\mathrm {s-div\,}(w)\) has rank n for all \(w\ne 0\), and the constant-rank condition holds.
The characteristic cone, first introduced by Murat and Tartar [14, 21], is defined as
In the present context, the cone \(\Lambda \) may be identified (via the mapping J) with the set of non-invertible matrices,
\(\square \)
The next three results are essentially special cases of more general assertions that hold within the framework of \(\mathcal {A}\)-quasiconvexity in [5]. For convenience, we restate here the statements that are needed in the following:
Lemma 2.4
Let f be symmetric \(\mathrm {div}\)-quasiconvex. Then, it is convex along all non-invertible directions, in the sense that \(f(\lambda A + (1-\lambda )B)\leqq \lambda f(A)+(1-\lambda ) f(B)\) whenever \(\lambda \in [0,1]\), \(A,B\in \mathbb {R}^{n\times n}_\mathrm {sym}\), \(\det (A-B)=0\). Furthermore, all such f are locally Lipschitz continuous.
Proof
If f is upper semicontinuous, then the assertion follows directly from [5, Prop. 3.4] using Lemma 2.3. Here, we give a direct proof without assuming upper semicontinuity.
We first assume that there is a vector \(\nu \in \mathbb {Q}^{n}\setminus \{0\}\) such that \((A-B)\nu =0\). We let \(h:\mathbb {R}\rightarrow \{0,1\}\) be one-periodic, with \(h(t)=0\) for \(t\in (0,\lambda )\) and \(h(t)=1\) for \(t\in (\lambda ,1)\). We choose \(M\in \mathbb {N}\) such that \(M\nu \in \mathbb {Z}^n\) and define \(u(x):=A+(B-A) h(Mx\cdot \nu )\). From \(Me_i\cdot \nu =M\nu _i\in \mathbb {Z}\), we deduce that \(u(x+e_i)=u(x)\) for all i. Furthermore, \(\mathrm {div\,}u=0\) in the sense of distributions, \(|\{u=A\}\cap (0,1)^n|=\lambda \), and \(|\{u=B\}\cap (0,1)^n|=1-\lambda \), which implies \(\int _{(0,1)^n} u \, \mathrm{d}x = \lambda A + (1-\lambda ) B\).
Let \(\theta _\varepsilon \in C^\infty _c(B_\varepsilon )\) be a mollifier. Then, \(u*\theta _\varepsilon \in C^\infty _\mathrm {per}((0,1)^n;\mathbb {R}^{n\times n}_\mathrm {sym})\) and, therefore, by (2.1), we obtain
Since f is locally bounded, u is bounded and \(|\{u*\theta _\varepsilon \ne u\}\cap (0,1)^n|\rightarrow 0\). Taking the limit \(\varepsilon \rightarrow 0\), we deduce that
whenever A and B are such that \((A-B)\nu =0\) for some \(\nu \in \mathbb {Q}^n\). In particular, f is separately convex and finite-valued, hence locally Lipschitz continuous.
Consider now any two matrices A, B and a vector \(w\in S^{n-1}\) such that \((A-B)w=0\). We choose \(\nu _j\in \mathbb {Q}^n\) such that \(\nu _j\rightarrow w\), which implies \((A-B)\nu _j\rightarrow 0\). Let now \(B_j:=B+(A-B)\nu _j\otimes \nu _j/|\nu _j|^2\). Then, \((A-B_j)\nu _j=0\), hence \(f(\lambda A+(1-\lambda )B_j)\leqq \lambda f(A)+(1-\lambda ) f(B_j)\). Taking \(j\rightarrow \infty \), by continuity of f we conclude the proof. \(\quad \square \)
Lemma 2.5
-
(i)
Let f be symmetric \(\mathrm {div}\)-quasiconvex, \(u_j\overset{*}{\rightharpoonup }u\) weakly in \(L^\infty (\Omega ;\mathbb {R}^{n\times n}_\mathrm {sym})\), \(\mathrm {div\,}u_j=0\) in the sense of distributions. Then,
$$\begin{aligned} \int _\Omega f(u(x))\mathrm{d}x\leqq \liminf _{j\rightarrow \infty } \int _\Omega f(u_j(x))\mathrm{d}x. \end{aligned}$$(2.11) -
(ii)
Let f be symmetric \(\mathrm {div}\)-quasiconvex, \(f(\xi )\leqq c(|\xi |^p+1)\) for some \(p\in [1,\infty )\), \(u_j{\rightharpoonup }u\) weakly in \(L^p(\Omega ;\mathbb {R}^{n\times n}_\mathrm {sym})\), \(\mathrm {div\,}u_j=0\) in the sense of distributions. Then,
$$\begin{aligned} \int _\Omega f(u(x))\mathrm{d}x\leqq \liminf _{j\rightarrow \infty } \int _\Omega f(u_j(x))\mathrm{d}x. \end{aligned}$$(2.12)
Proof
Lemma 2.4 shows that f is continuous. The result follows then immediately from [5, Th. 3.7] using Lemma 2.3. \(\quad \square \)
Lemma 2.6
Let \(f\in C^0(\mathbb {R}^{n\times n}_\mathrm {sym};[0,\infty ))\). Then, \(\mathcal {Q}_{\mathrm {sdqc}}f\) is symmetric \(\mathrm {div}\)-quasiconvex.
Proof
Follows from [5, Prop. 3.4]. \(\quad \square \)
We now recall an important example of a nontrivial symmetric \(\mathrm {div}\)-quasiconvex function, due to Luc Tartar.
Lemma 2.7
(From [23]) The function \(f_\mathrm {T}:\mathbb {R}^{n\times n}_\mathrm {sym}\rightarrow \mathbb {R}\), \(f_\mathrm {T}(\sigma ):=(n-1)|\sigma |^2-(\mathop {\mathrm {Tr}}\sigma )^2\), is symmetric \(\mathrm {div\,}\)-quasiconvex.
For completeness, we provide a short proof of this result, which plays an important role in the explicit examples discussed in Section 4.
Proof
We first observe that, for any matrix \(A\in \mathbb {C}^{n\times n}\), we have
To verify this inequality, it suffices to write A in a basis in which only the first \(\mathop {\mathrm {rank}}A\) diagonal entries are nonzero and to use then on this set the basic inquality \(|\sum _i A_{ii}|^2\leqq (\mathop {\mathrm {rank}}A) \sum _i |A_{ii}|^2\). We now show that for any \(\varphi \in C^1_\mathrm {per}((0,1)^n;\mathbb {R}^{n\times n})\) with \(\mathrm {div\,}\varphi =0\) the functional \(I(\varphi ):=\int _{(0,1)^n} f_\mathrm {T}(\varphi (x))\mathrm{d}x\) is nonnegative. Indeed, letting \({\hat{\varphi }}_\lambda \) be the Fourier coefficients of \(\varphi \), by Plancharel’s theorem we have
where we have used (2.13) and the fact that \(\mathrm {div\,}\varphi =0\) implies \({\hat{\varphi }}_\lambda \lambda = 0\) and therefore \(\mathop {\mathrm {rank}}{\hat{\varphi }}_\lambda \leqq n-1\). Let now \(\varphi \) be as in the definition of \(\mathrm {div\,}\)-quasiconvexity, \(\xi :=\int _{(0,1)^n} \varphi \, \mathrm{d}x\). Since \(f_\mathrm {T}\) is quadratic and \(\varphi -\xi \) has average zero, expanding, we obtain
\(\square \)
We close this section with a brief discussion of the relation to \(\mathrm {div\,}\)-quasiconvexity. In particular, we show that symmetric \(\mathrm {div\,}\)-quasiconvexity is not equivalent to \(\mathrm {div\,}\)-quasiconvexity composed with projection to symmetric matrices. We recall that a Borel-measurable, locally bounded function \(f:\mathbb {R}^{m\times n}\rightarrow \mathbb {R}\) is \(\mathrm {div\,}\)-quasiconvex if, for every \(\varphi \in C^\infty _\mathrm {per}((0,1)^n;\mathbb {R}^{m\times n})\) such that \(\mathrm {div\,}\varphi =0\) everywhere,
Lemma 2.8
For a given function \(f:\mathbb {R}^{n\times n}_\mathrm {sym}\rightarrow \mathbb {R}\), we define \(\mathcal {S}f:\mathbb {R}^{n\times n}\rightarrow \mathbb {R}\) as \(\mathcal {S}f(\xi ) := f((\xi +\xi ^T)/2)\). If \(\mathcal {S}f\) is \(\mathrm {div\,}\)-quasiconvex, then f is symmetric \(\mathrm {div\,}\)-quasiconvex. However, there are symmetric \(\mathrm {div\,}\)-quasiconvex functions f such that the corresponding \(\mathcal {S}f\) is not \(\mathrm {div\,}\)-quasiconvex.
Proof
In order to prove that f is symmetric \(\mathrm {div\,}\)-quasiconvex, we pick \(\varphi \in C^\infty _\mathrm {per}((0,1)^n; \mathbb {R}^{n\times n}_\mathrm {sym})\) with \(\mathrm {div\,}\varphi =0\) and observe that
For the converse implication, we consider \(n=2\) and \(f(F)=\det (F)\), so that
We first check that f is symmetric \(\mathrm {div\,}\)-quasiconvex. Let \(\xi \in \mathbb {R}^{2\times 2}_\mathrm {sym}\), \(\varphi \in C^\infty _\mathrm {per}([0,1]^2;\mathbb {R}^{2\times 2}_\mathrm {sym})\) with \(\mathrm {div\,}\varphi =0\) and \(\int _{(0,1)^n} \varphi \mathrm{d}x=0\). Then, there is \(v\in C^\infty (\mathbb {R}^2;\mathbb {R}^2)\) with \(Dv={}^\perp \varphi ^{\perp }\), where by this compact notation we mean \(Dv=R\varphi R\), with \(R=e_1\otimes e_2-e_2\otimes e_1\). Since \(\varphi \) has average 0 and is periodic, we can choose \(v\in C^\infty _\mathrm {per}([0,1]^2;\mathbb {R}^2)\). In particular,
At the same time, the function \(\varphi (x):=e_1\otimes e_2 \sin (2\pi x_1)\) is \([0,1]^2\)-periodic, divergence-free, has average 0, and gives
\(\square \)
3 Symmetric \(\mathrm {div}\)-quasiconvex Sets and Hulls
3.1 Symmetric \(\mathrm {div}\)-quasiconvex Sets
In this section, we discuss symmetric \(\mathrm {div\,}\)-quasiconvexity of sets and their hulls. As in the case of quasiconvexity, there are different possible definitions of the hulls, depending on the growth that is assumed. For quasiconvexity, it has been shown that the p-quasiconvex hull of a compact set does not depend on the assumed growth p. The key technical ingredient is Zhang’s truncation Lemma, see [26]. In the present setting, we can only prove the corresponding result for \(1<p<\infty \), since the bounds on the potentials of the oscillatory fields are based on singular-integral estimates which only hold in that range, see Lemma 3.13 below. For clarity we give separate definitions for \(p\in [1,\infty ]\).
Definition 3.1
A compact set \(K \subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) is symmetric \(\mathrm {div}\)-quasiconvex if, for any \(\xi \in \mathbb {R}^{n\times n}_\mathrm {sym}\setminus K\), there is a symmetric \(\mathrm{div}\)-quasiconvex function \(g\in C^0(\mathbb {R}^{n\times n}_\mathrm {sym};[0,\infty ))\) such that \(g(\xi )>\max g(K)\).
A compact set \(K \subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) is p-symmetric \(\mathrm {div}\)-quasiconvex, with \(p\in [1,\infty )\), if the function g can be chosen to have p-growth, in the sense that \(g(\sigma )\leqq c(|\sigma |^p+1)\) for some \(c\in \mathbb {R}\) and all \(\sigma \in \mathbb {R}^{n\times n}_\mathrm {sym}\).
We remark that the function g can be chosen so that it vanishes on K by replacing it with \({\hat{g}}:=\max \{g-\max g(K),0)\}\).
It is clear that if K is p-symmetric \(\mathrm {div}\)-quasiconvex for some p then it is symmetric \(\mathrm {div}\)-quasiconvex. As in the case of quasiconvexity, the definition for non compact sets depends crucially on growth and many variants are possible. We do not discuss this case here.
Lemma 3.2
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact and symmetric \(\mathrm {div\,}\)-quasiconvex, \(E:=\{\sigma \in L^\infty (\Omega ;K): \mathrm {div\,}\sigma =0\}\). Then, E is closed with respect to weak-\(*\) convergence in \(L^\infty (\Omega ;\mathbb {R}^{n\times n}_\mathrm {sym})\).
Proof
Let \(\sigma _j\in E\) be such that \(\sigma _j\overset{*}{\rightharpoonup }\sigma \) in \(L^\infty (\Omega ;\mathbb {R}^{n\times n}_\mathrm {sym})\).
For any \(\xi \in \mathbb {R}^{n\times n}_\mathrm {sym}\setminus K\), there is a symmetric \(\mathrm {div}\)-quasiconvex function \(g_\xi \in C^0(\mathbb {R}^{n\times n}_\mathrm {sym};[0,\infty ))\) which vanishes on K and with \(g_\xi (\xi )>0\). By continuity, \(g_\xi >0\) on \(B_{r_\xi }(\xi )\), for some \(r_\xi >0\). The set \(\mathbb {R}^{n\times n}_\mathrm {sym}\setminus K\) can be covered by countably many such balls \(B_i\). Let \(g_i\) be the corresponding functions. It suffices to show that \(\{x: \sigma (x)\in B_i\}\) is a null set for any i.
By Lemma 2.5(i), recalling that \(\sigma _j\in K\) almost everywhere for all j, we obtain \(\int _\Omega g_i(\sigma )\mathrm{d}x\leqq \liminf _{j\rightarrow \infty } \int _\Omega g_i(\sigma _j)\mathrm{d}x=0\). This implies that \(g_i(\sigma (x))= 0\) almost everywhere. Since \(g_i>0\) on \(B_i\) we obtain that \(\{x: \sigma (x)\in B_i\}\) is a null set, which concludes the proof. \(\quad \square \)
We are now ready to prove our first main result, namely, an existence statement for static problems with symmetric \(\mathrm {div}\)-quasiconvex yield sets. We refer to the introduction for the formulation and the main definitions and recall in particular that \(g_D\in L^1(\partial \Omega ;\mathbb {R}^n)\) denotes the boundary data.
Theorem 3.3
If K is nonempty and symmetric \(\mathrm {div}\)-quasiconvex, then F is weakly upper semicontinuous and the problem defined in (1.10) and (1.11) has a solution \(\sigma _*\in L^\infty (\Omega ;K)\), which obeys \(\mathrm {div\,}\sigma _*=0\) in the sense of distributions.
Proof
We first prove that \(\sup F\in \mathbb {R}\).
Let \(\xi _0\in K\). Using the constant function \(\sigma =\xi _0\) gives
hence \(\sup F\ne -\infty \).
By the trace theorem for \(W^{1,1}\) (see for example [1, p. 168]), we can extend \(g_D\) to a function \(W^{1,1}(\Omega ;\mathbb {R}^n)\), which we shall also denote \(g_D\). For any \(\sigma \in L^\infty (\Omega ;K)\) we have
hence \(\sup F\ne +\infty \).
Next, we show that only fields \(\sigma \) that are divergence-free need be considered. If we assume additional regularity, then an integration by parts gives
which does not contain any derivative of v. In particular, the \(\inf \) is \(-\infty \) unless \(\mathrm {div\,}\sigma =0\) almost everywhere.
Consider now a generic \(\sigma \in L^\infty (\Omega ;\mathbb {R}^{n\times n}_\mathrm {sym})\). If \(\mathrm {div\,}\sigma \ne 0\) in the sense of distributions, then there is \(\theta \in C^\infty _c(\Omega ;\mathbb {R}^n)\) such that \(\int _\Omega \sigma \cdot D\theta \, \mathrm{d}x\ne 0\). We consider the one-parameter family of test functions \(v_t:=g_D+t\theta \) and obtain
which shows that \(F(\sigma )=-\infty \). Therefore, we can restrict attention to fields \(\sigma \) that are divergence-free in the sense of distributions.
Let \(\sigma _k\in L^\infty (\Omega ;K)\) be a maximizing sequence. By the preceding argument, \(\mathrm {div\,}\sigma _k=0\) in the sense of distributions. Since the sequence is bounded in \(L^\infty \), after extracting a subsequence it converges weak-\(*\) to some \(\sigma _*\), by the properties of distributions \(\mathrm {div\,}\sigma _*=0\). Lemma 3.2 implies that \(\sigma _*\in K\) almost everywhere. Hence, we only need to show that it is a maximizer. For any \(v\in W^{1,1}(\Omega ;\mathbb {R}^n)\) with \(v=g_D\) on the boundary we have
hence,
\(\square \)
3.2 Symmetric \(\mathrm {div\,}\)-quasiconvex Hulls
We now deal with the case that K is not symmetric \(\mathrm {div}\)-quasiconvex. Within the framework of relaxation theory, we begin by defining the symmetric \(\mathrm {div}\)-quasiconvex hull.
Definition 3.4
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact, \(p\in [1,\infty )\), \(f_p(\xi ):={\text {dist}}^p(\xi ,K)\). We define
and
Lemma 3.5
\(K^{(\infty )}\) is the smallest symmetric \(\mathrm {div}\)-quasiconvex compact set that contains K. \(K^{(p)}\) is the smallest p-symmetric \(\mathrm {div}\)-quasiconvex compact set that contains K.
As usual, the first assertion means that any symmetric \(\mathrm {div}\)-quasiconvex compact set that contains K also contains \(K^{(\infty )}\), and analogously for the second.
Proof
We start by \(K^{(p)}\). By Lemma 2.6 the function \(\mathcal {Q}_{\mathrm {sdqc}}f_p\) is symmetric \(\mathrm {div}\)-quasiconvex. From \(\mathcal {Q}_{\mathrm {sdqc}}f_p\leqq f_p\) it follows that \(\mathcal {Q}_{\mathrm {sdqc}}f_p\) has p-growth and that \(K\subseteq K^{(p)}\). If \(\xi \in \mathbb {R}^{n\times n}_\mathrm {sym}\setminus K^{(p)}\), then \(\mathcal {Q}_{\mathrm {sdqc}}f_p(\xi )>0=\max \mathcal {Q}_{\mathrm {sdqc}}f_p(K^{(p)})\). Therefore, \(K^{(p)}\) is p-symmetric \(\mathrm {div}\)-quasiconvex.
To show minimality, we consider a p-symmetric \(\mathrm {div}\)-quasiconvex compact set \({\tilde{K}}\) with \(K\subseteq {\tilde{K}}\) and show that \(K^{(p)}\subseteq {\tilde{K}}\). To this end, we fix a \(\xi \in K^{(p)}\) and a symmetric \(\mathrm {div}\)-quasiconvex function g with p growth and show that \(g(\xi )\leqq \max g(K)\leqq \max g({\tilde{K}})\). If this holds for any such function g, then necessarily \(\xi \in {\tilde{K}}\), which implies that \(K^{(p)}\subseteq {\tilde{K}}\) and concludes the proof.
It remains to show that \(g(\xi )\leqq \max g(K)\). Let \(\varepsilon >0\). Since g is continuous and \(f_p>0\) outside K, there is \(\delta >0\) such that \(g(\sigma )\leqq \max g(K)+\varepsilon \) for all \(\sigma \) with \(f_p(\sigma )\leqq \delta \). Using the fact that g has p-growth, we then obtain \(g\leqq \max g(K)+\varepsilon + C_\varepsilon f_p\) pointwise. By monotonicity of the symmetric \(\mathrm {div}\)-quasiconvex envelope, this gives \(g=\mathcal {Q}_{\mathrm {sdqc}}g \leqq \max g(K)+\varepsilon +C_\varepsilon \mathcal {Q}_{\mathrm {sdqc}}f_p\) pointwise and, therefore, \(g(\xi )\leqq \max g(K)+\varepsilon \). Since \(\varepsilon \) is arbitrary, this concludes the proof.
We now treat the \(p=\infty \) case. The fact that \(K\subseteq K^{(\infty )}\) is obvious. To show that \(K^{(\infty )}\) is symmetric \(\mathrm {div}\)-quasiconvex, we pick \(\xi \not \in K^{(\infty )}\). By the definition of \(K^{(\infty )}\), there is a symmetric \(\mathrm {div}\)-quasiconvex function g with \(g(\xi )>\max g(K)\). At the same time, for any \(\sigma \in K^{(\infty )}\) it follows that \(g(\sigma )\leqq \max g(K)\), which implies \(\max g(K^{(\infty )})=\max g(K)\). We conclude that \(g(\xi )>\max g(K^{(\infty )})\), which shows that \(K^{(\infty )}\) is symmetric \(\mathrm {div}\)-quasiconvex.
To show minimality, we assume that \({\tilde{K}}\) is symmetric \(\mathrm {div}\)-quasiconvex and \(K\subseteq {\tilde{K}}\). We wish to show that \(K^{(\infty )}\subseteq {\tilde{K}}\). To this end, we fix a \(\xi \in \mathbb {R}^{n\times n}_\mathrm {sym}\setminus {\tilde{K}}\) and choose a symmetric \(\mathrm {div}\)-quasiconvex function g with \(g(\xi )>\max g({\tilde{K}})\). From \(K\subseteq {\tilde{K}}\), we obtain \(\max g({\tilde{K}})\geqq \max g(K)\). Therefore, \(\xi \not \in K^{(\infty )}\). This implies \(K^{(\infty )}\subseteq {\tilde{K}}\) and concludes the proof. \(\quad \square \)
We proceed to show that \(K^{(p)}\) does not depend on p, as long as \(p\ne \infty \). One inclusion can easily be obtained from the definition. The other will be discussed in Section 3.3 below.
Theorem 3.6
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact, \(1<p<q<\infty \). Then, \(K^{(p)}=K^{(q)}\).
Proof
Follows from Lemmas 3.8 and 3.15 below. \(\quad \square \)
Definition 3.7
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact. For every \(p\in (1,\infty )\), we set \(K^{\mathrm {sdqc}}=K^{(p)}\). This is admissible by Theorem 3.6.
Lemma 3.8
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact. Then, \(K^{(q)}\subseteq K^{(p)}\) for any p, q with \(1\leqq p < q \leqq \infty \).
Proof
Assume first that \(q<\infty \). We write \(f_p(\xi ):={\text {dist}}^p(\xi ,K)\) and, analogously, \(f_q\). For all \(\delta >0\), we have
and, therefore,
Let now \(\xi \in K^{(q)}\), so that \(\mathcal {Q}_{\mathrm {sdqc}}f_q(\xi )=0\). The above inequality implies that \(\mathcal {Q}_{\mathrm {sdqc}}f_p(\xi )\leqq \delta ^p\) for any \(\delta >0\). We conclude that \(\mathcal {Q}_{\mathrm {sdqc}}f_p(\xi )=0\) and \(K^{(q)}\subseteq K^{(p)}\).
If, instead, \(q=\infty \), it suffices to observe that the function \(\mathcal {Q}_{\mathrm {sdqc}}f_p\) is symmetric \(\mathrm {div}\)-quasiconvex (Lemma 2.6). Therefore, it is one of the candidates in the definition of \(K^{(\infty )}\). Since \(\mathcal {Q}_{\mathrm {sdqc}}f_p=0\) on K, we obtain that, necessarily, \(\mathcal {Q}_{\mathrm {sdqc}}f_p=0\) on \(K^{(\infty )}\). Hence, \(K^{(\infty )}\subseteq K^{(p)}\). \(\quad \square \)
Remark 3.9
By analogy with the case of quasiconvexity, one might expect that \(K^{(p)}=K^{(\infty )}\) for every \(p\in [1,\infty )\) and every compact set K. This property holds in dimension \(n=2\), since \(\mathrm {div\,}\)-quasiconvexity is equivalent to quasiconvexity composed with a 90-degree rotation. We do not know if the statement is true in higher dimensions.
Lemma 3.10
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact, \(A\in \mathbb {R}^{n\times n}\) invertible, \(B\in \mathbb {R}^{n\times n}_\mathrm {sym}\). Then,
and
Proof
We shall prove below that
In order to derive the other inclusion, we then consider the set \({\tilde{K}}:=AKA^{T}+B\), so that \(K=A^{-1}({\tilde{K}}-B)A^{-T}\). Application of (3.13) to \({\tilde{K}}\) gives
Multiplying on the left by A and on the right by \(A^T\) yields
which, recalling the definition of \({\tilde{K}}\), is the desired second inclusion.
It remains to prove (3.13). We consider the set \(H:=AK^{\mathrm {sdqc}}A^T+B\). It is obvious that \(AKA^T+B\subseteq H\). If we can prove that H is p-symmetric \(\mathrm {div}\)-quasiconvex, then Lemma 3.5 implies \((AKA^T+B)^{\mathrm {sdqc}}\subseteq H\) and concludes the proof.
In order to show that H is p-symmetric \(\mathrm {div}\)-quasiconvex, we fix a symmetric matrix \({\hat{\sigma }}\not \in H\) and show that there is a symmetric \(\mathrm {div}\)-quasiconvex function f with p-growth such that \(f({\hat{\sigma }})>\max f(H)\). Theorem 3.6 shows that \(p\in (1,\infty )\) can be chosen arbitrarily. In the case of \(K^{(\infty )}\), the requirement of p-growth does not apply.
We define \(\sigma :=A^{-1}({\hat{\sigma }}-B)A^{-T}\), so that \({\hat{\sigma }}=A\sigma A^T+B\). The definitions of H and \({\hat{\sigma }}\) show that \(\sigma \not \in K^{\mathrm {sdqc}}\). Since \(K^{\mathrm {sdqc}}\) is p-symmetric \(\mathrm {div}\)-quasiconvex, there is a symmetric \(\mathrm {div}\)-quasiconvex function g with p-growth such that \(g(\sigma )>\max g(K^{\mathrm {sdqc}})\). We define \(f(\xi ):=g(A^{-1}(\xi -B) A^{-T})\), so that \(f({\hat{\sigma }})>\max f(H)\). Growth and continuity are automatically inherited from g.
To conclude the proof it remains to show that f is symmetric \(\mathrm {div}\)-quasiconvex. To this end, pick some \(\varphi \in C^\infty _\mathrm {per}((0,1)^n;\mathbb {R}^{n\times n}_\mathrm {sym})\) with \(\mathrm {div\,}\varphi =0\) and let \(\xi :=\int _{(0,1)^n} \varphi \,\mathrm{d}x\).
For some \(F\in \mathbb {R}^{n\times n}\) chosen below, we define \(\psi (x):=A^{-1}(\varphi (Fx)-B)A^{-T}\) and compute
and
Therefore,
We choose \(F:=A\), so that \(\sum _j A^{-1}_{j\beta } F_{\gamma j}={\text {Id}}_{\beta \gamma }\) and
Recalling the definitions of f and \(\psi \), we compute
The function \(\psi \) is \(A^{-1}(0,1)^n\)-periodic and has average \(A^{-1}(\xi -B)A^{-T}\). The maps \(u_j(x):=\psi (jx)\) are divergence-free and converge weakly in \(L^{\infty }(\mathbb {R}^n;\mathbb {R}^{n\times n}_\mathrm {sym})\) to their average, which is \(A^{-1}(\xi -B)A^{-T}\). The functions \(x\mapsto g(u_j(x))=g(\psi (jx))\) are equally periodic and converge weakly to their average, which is the last expression in the previous equation. Since g is symmetric \(\mathrm {div}\)-quasiconvex, recalling the lower semicontinuity (Lemma 2.5) we conclude
and recalling the definition of g and the previous computation this gives
Therefore, f is symmetric \(\mathrm {div}\)-quasiconvex. This concludes the proof. \(\quad \square \)
Lemma 3.11
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact. If \(A,B\in K^{\mathrm {sdqc}}\) and \(\mathop {\mathrm {rank}}(A-B)<n\) then \(\lambda A+(1-\lambda )B\in K^{\mathrm {sdqc}}\) for all \(\lambda \in [0,1]\). The corresponding assertion holds for \(K^{(\infty )}\).
Proof
The proof follows immediately from the definition and Lemma 2.4. Indeed, the assumption gives \(\mathcal {Q}_{\mathrm {sdqc}}f_p(A)=\mathcal {Q}_{\mathrm {sdqc}}f_p(B)=0\). Since \(\mathcal {Q}_{\mathrm {sdqc}}f_p\) is symmetric \(\mathrm {div}\)-quasiconvex, it is convex in the direction of \(B-A\), and \(\mathcal {Q}_{\mathrm {sdqc}}f_p(\lambda A+(1-\lambda )B)=0\).
In the case of \(K^{(\infty )}\), we consider any symmetric \(\mathrm {div}\)-quasiconvex function \(f\in C^0(\mathbb {R}^{n\times n}_\mathrm {sym};[0,\infty ))\), and deduce as above \(f(\lambda A+(1-\lambda )B)\leqq \lambda f(A)+(1-\lambda ) f(B)\leqq \max f(K^{(\infty )})\). By the definition of \(K^{(\infty )}\), we obtain \(\max f(K^{(\infty )})=\max f(K)\) and, therefore, \(f(\lambda A+(1-\lambda )B)\leqq \max f(K)\). \(\quad \square \)
In closing this section, we present an explicit example in which K consists of two matrices.
Lemma 3.12
Let \(K:=\{A,B\}\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\). If \(\mathop {\mathrm {rank}}(A-B)=n\), then \(K^{\mathrm {sdqc}}=K^{(\infty )}=K\). Otherwise, \(K^{\mathrm {sdqc}}=K^{(\infty )}=[A,B]\), where [A, B] is the segment with endpoints A and B.
Proof
The function \(f(\xi ):={\text {dist}}(\xi ,[A,B])\) is convex, hence symmetric \(\mathrm {div}\)-quasiconvex, therefore \(K^{\mathrm {sdqc}}\subseteq [A,B]\).
If \(\mathop {\mathrm {rank}}(B-A)<n\), Lemma 3.11 shows that \([A,B]\subseteq K^{(\infty )}\subseteq K^{\mathrm {sdqc}}\) and concludes the proof.
Assume now that \(\mathop {\mathrm {rank}}(B-A)=n\). By Lemma 3.10, it suffices to consider the case \(A={\text {Id}}\), \(B=-{\text {Id}}\) and we need only show that no matrix of the form \(t{\text {Id}}\), \(t\in (-1,1)\), belongs to \(K^{\mathrm {sdqc}}\). Let \(f(\xi ):=((n-1)|\xi |^2-(\mathop {\mathrm {Tr}}\xi )^2+n)_+\). Lemma 2.7 implies that f is symmetric \(\mathrm {div}\)-quasiconvex, and we verify that \(f({\text {Id}})=f(-{\text {Id}})=0\). However, \(f(t{\text {Id}})=n(1-t^2)>0\) for all \(t\in (-1,1)\), hence \(t{\text {Id}}\not \in K^{\mathrm {sdqc}}\). \(\square \)
3.3 Truncation of Symmetric Divergence-Free Fields
In the remainder of this Section, we prove that \(K^{(p)}\) does not depend on p, for \(p\in (1,\infty )\). This proof requires truncation and approximation of vector fields that satisfy differential constraints, which is made much easier by working with the corresponding potentials. Following [2], we introduce a stress potential \(\Theta \), which is related to the field \(\sigma \) by \(\sigma =\mathrm {div\,}\mathrm {div\,}\Theta \), in a sense we now make precise. Let \(\mathbb {R}^{n^4}_*\) be the set of \(\zeta \in \mathbb {R}^{n\times n\times n\times n}\) such that
For \(\Theta \in L^1_\mathrm {loc}(\mathbb {R}^n;\mathbb {R}^{n^4}_*)\) we define the distribution
We observe that, by (3.23), \(\mathrm {div\,}(\mathrm {div\,}\mathrm {div\,}\Theta )=0\) and \(\mathrm {div\,}\mathrm {div\,}\Theta =(\mathrm {div\,}\mathrm {div\,}\Theta )^T\). Therefore, every potential generates a divergence-free symmetric matrix field.
In order to construct potentials, we start from a fixed matrix \(M\in \mathbb {R}^{n\times n}_\mathrm {sym}\) and define \(\Theta ^M:\mathbb {R}^n\rightarrow \mathbb {R}^{n^4}_*\) as
A straightforward computation shows that \(\mathrm {div\,}\mathrm {div\,}\Theta ^M=M\), with \(|\Theta ^M|(x)\leqq 2|x|^2|M|\), \(|D\Theta ^M|(x)\leqq 4|x|\, |M|\), \(|D^2\Theta ^M|(x)\leqq 4|M|\) for all \(x\in \mathbb {R}^n\), \(n\geqq 2\). Working in Fourier space, this procedure can be generalized to any divergence-free symmetric matrix field.
Lemma 3.13
-
(i)
Let \(w\in C^\infty _\mathrm {per}((0,1)^n;\mathbb {R}^{n\times n}_\mathrm {sym})\) with \(\mathrm {div\,}w=0\) and \(\int _{(0,1)^n} w\, \mathrm{d}x=0\). Then, there is \(\Theta \in C^\infty _\mathrm {per}((0,1)^n;\mathbb {R}^{n^4}_*)\) such that \(\mathrm {div\,}\mathrm {div\,}\Theta =w\). The map \(w\mapsto \Theta \) is linear.
-
(ii)
Let \(w\in L^p((0,1)^n;\mathbb {R}^{n\times n}_\mathrm {sym})\) for some \(p\in (1,\infty )\), \(\mathrm {div\,}w=0\), \(\int _{(0,1)^n} w\, \mathrm{d}x=0\). Then, there is \(\Theta \in W^{2,p}_\mathrm {per}((0,1)^n;\mathbb {R}^{n^4}_*)\), with \(\Vert D^2\Theta \Vert _{p}\leqq c \Vert w\Vert _p\) and \(\mathrm {div\,}\mathrm {div\,}\Theta =w\). The map \(w\mapsto \Theta \) is linear and extends the map in (i).
-
(iii)
Let \(w=w_p+w_q\), with \(w_p\in L^p((0,1)^n;\mathbb {R}^{n\times n}_\mathrm {sym})\), \(w_q\in L^q((0,1)^n;\mathbb {R}^{n\times n}_\mathrm {sym})\) for some \(p,q\in (1,\infty )\), \(\mathrm {div\,}w=0\), \(\int _T w_p\, \mathrm{d}x=\int _T w_q\, \mathrm{d}x=0\). Then, there are \(\Theta _p\in W^{2,p}_\mathrm {per}((0,1)^n;\mathbb {R}^{n^4}_*)\), with \(\Vert D^2\Theta _p\Vert _{p}\leqq c \Vert w_p\Vert _p\), and \(\Theta _q\in W^{2,q}_\mathrm {per}((0,1)^n;\mathbb {R}^{n^4}_*)\), with \(\Vert D^2\Theta _q\Vert _{q}\leqq c \Vert w_q\Vert _q\), such that \(\mathrm {div\,}\mathrm {div\,}(\Theta _p+\Theta _q)=w\).
We stress that (iii) does not assert \(\mathrm {div\,}\mathrm {div\,}\Theta _p=w_p\).
Proof
(i): Let \({\hat{w}}:2\pi \mathbb {Z}^n\rightarrow \mathbb {R}^{n\times n}_\mathrm {sym}\) be the Fourier coefficients of w, so that
The assumptions on w imply \({\hat{w}}(0)=0\), \({\hat{w}}_{ij}={\hat{w}}_{ji}\) and \(\sum _j{\hat{w}}_{ij}\lambda _j=0\). We define, in analogy to (3.25), \({\hat{\Theta }}(0)=0\) and, for \(\lambda \in 2\pi \mathbb {Z}^n\setminus \{0\}\),
We easily verify that \({\hat{\Theta }}(\lambda )\in \mathbb {R}^{n^4}_*\) and \(\sum _{hk}\lambda _h\lambda _k {\hat{\Theta }}_{ijhk}(\lambda )={\hat{w}}_{ij}(\lambda )\) for all \(\lambda \). Since the decay of the coefficients \({\hat{\Theta }}\) is faster than the decay of the coefficients \({\hat{w}}\), the Fourier series
defines a smooth periodic function \(\Theta \in C^\infty _\mathrm {per}(T;\mathbb {R}^{n^4}_*)\) such that \(\mathrm {div\,}\mathrm {div\,}\Theta =w\).
(ii): Let \(T:C^\infty _\mathrm {per}(T;\mathbb {R}^{n\times n}_\mathrm {sym})\rightarrow C^\infty _\mathrm {per}(T;\mathbb {R}^{n^4}_*)\), \(w\mapsto Tw:=\Theta _w\), be the linear operator defined above. We consider the operator \(D^2T : C^\infty _\mathrm {per}(T;\mathbb {R}^{n\times n}_\mathrm {sym})\rightarrow C^\infty _\mathrm {per}(T;\mathbb {R}^{n^6})\), defined by \(w\mapsto D^2Tw:=D^2\Theta _w\). Its Fourier symbol is smooth on \(S^{n-1}\) and homogeneous of degree zero. By [5, Proposition 2.13] (which is based on [18, Ex. (iii), page 94] and [20, Cor. 3.16, p. 263]) the operator \(D^2T\) can be extended to a continuous operator from \(L^p\) to \(L^p\) for any \(p\in (1,\infty )\). By Poincaré, and using the fact that Tw and DTw have average zero, the estimate in \(W^{2,p}\) follows.
(iii): We define \(\Theta _p:=Tw_p\), \(\Theta _q:=Tw_q\). The estimates on the norm follow as for (ii). By linearity of the operator T, the differential condition holds as well. We remark that the \(L^p\) extension and the \(L^q\) extension of the operator defined on smooth functions coincide on \(L^p\cap L^q\). Therefore, we can use the symbol T for the operator defined on \(L^p\cup L^q\). \(\quad \square \)
A crucial element in subsequent steps is the following truncation result, which is a minor variant of those given in Section 6.6.2 of [3] and Prop. A.1 of [4] and is based on Zhang’s Lemma [26].
Lemma 3.14
Let \(u\in W^{2,p}_\mathrm {per}({(0,1)^n};V)\), \(M>0\), V a finite-dimensional vector space. Then, there is \(v\in W^{2,\infty }_\mathrm {per}({(0,1)^n};V)\) such that
- (i)
\(\displaystyle \Vert D^2v\Vert _{2,\infty }\leqq c M\);
- (ii)
\(\displaystyle |\{v\ne u\}| \leqq \frac{c}{M^p} \int _{|u|+|Du|+|D^2u|>M} |u|^p+|Du|^p+|D^2u|^p \mathrm{d}x\).
The constant depends only on n and V.
The above estimates immediately imply
Proof
After choosing a basis and working componentwise, we can assume \(V=\mathbb {R}\). We define \(h:=(u,Du,D^2u)\) and
Here and subsequently, . If \(E_M\) is a null set, then it suffices to take \(v=u\) and the proof is concluded. Otherwise, using the Vitali or the Besicovitch covering theorem it follows that the volume of \(E_M\) obeys (ii). We can further enlarge \(E_M\) by a null set and assume that all points of \({(0,1)^n}\setminus E_M\) are Lebesgue points of h.
For \(x\in {(0,1)^n}\setminus E_M\) and \(r\in (0,\sqrt{n})\), we define
From the definition of \(E_M\) we obtain \(0\leqq \eta _r \leqq 4M\) for all r and x and \(\eta _r\rightarrow 0\) pointwise on \((0,1)^n\setminus E_M\). Therefore, there is a set \({\tilde{E}}_M\) with \(|{\tilde{E}}_M|\leqq |E_M|\) such that \(\eta _{r}\rightarrow 0\) uniformly in \({(0,1)^n}\setminus E_M\setminus {\tilde{E}}_M\). We define \(S_M:=(0,1)^n\setminus E_M\setminus {\tilde{E}}_M\).
We have shown that there is \(\omega :(0,\infty )\rightarrow (0,4M]\) nondecreasing with \(\omega _r\rightarrow 0\) such that
Fix now \(x\in S_M\). By Poincaré’s inequality, for any \(r\in (0,\sqrt{n})\) there is \(A_r=A_r(x)\in \mathbb {R}^n\) such that
With x being a Lebesgue point of \(Du\), we have \(\lim _{r\rightarrow 0} A_r=Du(x)\). Comparing the above equation on the balls B(x, r) and B(x, r / 2) we obtain \(|A_r-A_{r/2}|\leqq cr\omega _r\), which (summing the geometric series \(A_{2^{-k}r}-A_{2^{k+1}r}\)) implies \(|A_r-Du(x)|\leqq c r\omega _r\) and
A second application of Poincaré’s inequality yields
for some \(b_r=b_r(x)\in \mathbb {R}\), and the same argument as above leads to
where \(P_x\) is the second-order Taylor polynomial of \(u\) centered at x.
For \(x,x'\in S_M\) and \(r=|x-x'|\), we have
Since the space of polynomials of degree two is finite dimensional, this is an estimate on the difference of the coefficients and also a uniform estimate on the difference of the two polynomials. The conclusion then follows from Whitney’s extension theorem. We remark that the standard construction in Whitney’s extension theorem, if given periodic inputs, produces periodic outputs, and that, if \(E_M\) is not a null set, this procedure actually produces a \(C^2\) function. \(\quad \square \)
We are finally in a position to prove the other inequality in Theorem 3.6. Specifically, we show the following:
Lemma 3.15
Let \(K\subseteq \mathbb {R}^{n\times n}_\mathrm {sym}\) be compact. Then, \(K^{(p)}\subseteq K^{(q)}\) for any p, q with \(1< p \leqq q < \infty \).
Proof
As usual, we define \(f_p(\sigma ):={\text {dist}}^p(\sigma ,K)\) and, analogously, \(f_q\). For brevity, we write \(T=(0,1)^n\). Pick \(\xi \in K^{(p)}\). Since \(\mathcal {Q}_{\mathrm {sdqc}}f_p(\xi )=0\), by the definition (2.2) there is a sequence of functions \(w_k\in C^\infty _\mathrm {per}(T;\mathbb {R}^{n\times n}_\mathrm {sym})\) with \(\mathrm {div\,}w_k=0\), \(\int _T w_k\, \mathrm{d}x=\xi \) and \(\int _T f_p(w_k(x),K)\, \mathrm{d}x\rightarrow 0\). We choose \(M>0\) such that \(K\subseteq B_{M-1}\) and \(|\xi |\leqq M-1\) and define
where \(\chi _{|w_k|< M}(x)=1\) if \(|w_k|(x)< M\) and 0 otherwise. Then, \(\Vert w_k^M\Vert _{L^{2q}}\leqq \Vert w_k^M\Vert _{L^\infty }\leqq M\). Since \(|\sigma |\geqq M\) implies \({\text {dist}}(\sigma ,K)\geqq 1\), we obtain
and, therefore, \(\Vert w_k^L\Vert _{L^p}\rightarrow 0\). Let \(\Theta _k^{M}\in W^{2,2q}_\mathrm {per}(T;\mathbb {R}^{n^4}_*)\) and \(\Theta _k^{L}\in W^{2,p}_\mathrm {per}(T;\mathbb {R}^{n^4}_*)\) be corresponding potentials obtained from \(w_k^M-\int _T w_k^M\, \mathrm{d}x\in L^{2q}\) and \(w_k^L-\int _T w_k^L\, \mathrm{d}x\in L^p\) using Lemma 3.13(iii) with the exponents 2q and p. In particular, this implies that \(w_k=\xi +\mathrm {div\,}\mathrm {div\,}(\Theta _k^M+\Theta _k^L)\), with
Let \(\Theta _k^T\in C^2(T;\mathbb {R}^{n^4}_*)\) be the truncation of \(\Theta _k^{L}\) obtained from Lemma 3.14, \(\Vert \Theta _k^T\Vert _{2,\infty }\leqq cM\). The above estimates show that \(\Vert \Theta _k^T\Vert _{2,p}\rightarrow 0\) and, therefore, \(\Vert \Theta _k^T\Vert _{2,2q}\rightarrow 0\). We define \(w_k^*:=\xi +\mathrm {div\,}\mathrm {div\,}(\Theta _k^M+\Theta _k^T)\in L^{2q}\). Then, \(w_k-w_k^*=\mathrm {div\,}\mathrm {div\,}(\Theta _k^L-\Theta _k^T)\rightarrow 0\) in \(L^p\).
We now proceed to prove that \(\int _T f_q(w_k^*)\mathrm{d}x\rightarrow 0\) as \(k\rightarrow \infty \). For every \(N>M\), we write
and treat the two terms separately. The second can be estimated as
It remains to estimate the first term. For fixed N, the function \(f_p\) is uniformly continuous on \(B_N\), so there is \(\delta _N>0\) such that \(|\sigma |<N\), \(|\sigma -\eta |<\delta _N\) imply \(f_p(\sigma )\leqq f_p(\eta )+1/N^q\). Therefore, for all \(\sigma ,\eta \in \mathbb {R}^{n\times n}_\mathrm {sym}\), we have
Setting \(\sigma =w_k^*(x)\), \(\eta =w_k(x)\), integrating, and recalling that \(w_k-w_k^*\rightarrow 0\) in \(L^p\) yields
From (3.41) to (3.44), we conclude that
for all \(N>M\) and, therefore, \(\int f_q(w_k^*) \mathrm{d}x\rightarrow 0\). Finally, by continuity and density we can replace \(w_k^*\) by a sequence of smooth functions with the same properties (using mollification preserves the differential constraint, periodicity and the average), and therefore \(\mathcal {Q}_{\mathrm {sdqc}}f_q(\xi )=0\). \(\quad \square \)
4 Explicit Relaxation for Yield Surfaces Depending on the First Two Invariants
4.1 General Setting and Main Results
In this section, we focus on the case of rotationally symmetric sets of strains in three dimensions. Lemma 3.10 implies that if \(K\subseteq \mathbb {R}^{3\times 3}_\mathrm {sym}\) is rotationally invariant, in the sense that \(Q^TKQ=K\) for any \(Q\in \mathrm {SO}(3)\), then also its symmetric \(\mathrm {div}\)-quasiconvex hull is rotationally invariant, in the sense that \(Q^TK^{\mathrm {sdqc}}Q=K^{\mathrm {sdqc}}\) for any \(Q\in \mathrm {SO}(3)\), and the same for \(K^{(\infty )}\). We consider here the situation where K is described by only two invariants, one corresponding to the pressure (the isotropic stress) and another to the deviatoric stress (a measure of the distance to diagonal matrices). We leave the case of generic rotationally invariant elastic domains for future work.
For \(\sigma \in \mathbb {R}^{3\times 3}_\mathrm {sym}\), we define the two variables
and denote \(\Phi :\mathbb {R}^{3\times 3}_\mathrm {sym}\rightarrow \mathbb {R}\times [0,\infty )\) at the mapping \(\Phi :=(p,q)\), so that
We remark that \(2q^2(\sigma )=|\sigma _D|^2\) where \(\sigma _D:=\sigma -p{\text {Id}}\) is the deviatoric part of \(\sigma \). For example, for any \((p_*,q_*)\in \mathbb {R}\times [0,\infty )\), the matrices
obey \(\Phi (\xi _0)=\Phi (\xi _1)=(p_*,q_*)\).
Here, we consider sets K that can be characterized by the values of these two invariants, in the sense that
We seek a characterization of \(K^{\mathrm {sdqc}}\) in the (p, q) plane, that is, we aim at characterizing the set
and the same for \(K^{(\infty )}\). An explicit expression is given in Theorem 4.1 below.
In some cases, we shall additionally show that \(K^{\mathrm {sdqc}}\) is fully characterized by the values of p and q, in the sense that \(\sigma \in K^{\mathrm {sdqc}}\) if and only if \((p(\sigma ),q(\sigma ))\in {\tilde{H}}\) for some \({\tilde{H}}\in \mathbb {R}\times [0,\infty )\), see Theorem 4.2 below. This is however not always true; see Lemma 4.12 for an example of where this representation fails.
Our results are restricted to the case in which the relevant set \({\tilde{H}}\) is connected. Connectedness of hulls is, in general, a very subtle issue related to the locality of the various convexity conditions. In the case of quasiconvexity, it relates to the compactness of sequences taking values in sets without rank-one connections, a question known as Tartar’s conjecture [22]. We recall that nonlocality of quasiconvexity was proven, in dimension 3 and above, by Kristensen [9] based on Šverák’s counterexample to the equivalence of rank-one convexity and quasiconvexity [24]. However, in dimension two the situation is different and positive results have been obtained by Šverák [25] and Faraco and Székelyhidi [6].
We begin by explaining the construction qualitatively and then present a proof of its correctness. In order to get started, we fix \(p_0\in \mathbb {R}\) and consider the rank-two line
Clearly, \(p(\xi _t)=p_0\) and \(q(\xi _t)=|t|\). In particular, if \((p_0, q_0)\in H\) then both \(\xi _{q_0}\) and \(\xi _{-q_0}\) belong to K and, with Lemma 3.11, we obtain \(\xi _t\in K\) for all \(t\in [-q_0, q_0]\). Based on this argument, we define the set
The set \(\Phi (K^{\mathrm {sdqc}})\) mentioned in (4.5) will then be characterized in Theorem 4.1 as a set \(H^\mathrm {rel}\) that we now show how to construct explicitly. Specifically, \(H^\mathrm {rel}\) is obtained from \({\hat{H}}\) by first taking the convex hull and then eliminating all points that can be separated from \(H^\mathrm {rel}\) by means of a translation of Tartar’s function, \(f(\sigma ) := 4q^2(\sigma )-3p^2(\sigma )\), which is symmetric \(\mathrm {div\,}\)-quasiconvex; see Lemma 4.3 below. We say that a point \(y_*=(p_*,q_*)\) can be separated from \({\hat{H}}\) if there is \(y_0=(p_0, q_0)\in \mathbb {R}\times [0,\infty )\) such that the function \(f_{y_0}(p,q):=4(q^2-q_0^2)-3 (p-p_0)^2\) obeys \(\max f_{y_0}(H)<f_{y_0}(y_*)\). Then, the set \(H^\mathrm {rel}\) is
We refer to Fig. 2 for an illustration.
Our main result is the following:
Theorem 4.1
Let \(H\subseteq \mathbb {R}\times [0,\infty )\) be a compact set, \(K:=\{\sigma \in \mathbb {R}^{3\times 3}_\mathrm {sym}: (p(\sigma ), q(\sigma ))\in H\}\). If the set \(H^\mathrm {rel}\) defined in (4.7–4.8) is connected, then \(\Phi (K^{\mathrm {sdqc}})= \Phi (K^{(\infty )})= H^\mathrm {rel}\).
Proof
The result follows from Lemmas 4.4 and 4.10 below, using the inclusion \(K^{(\infty )}\subseteq K^{\mathrm {sdqc}}\) that was proven in Lemma 3.8. \(\quad \square \)
With an additional condition on the tangent to the boundary of \(H^\mathrm {rel}\), we obtain a full characterization of the hull. The necessity of the condition on the tangent is proven in Lemma 4.12 below.
Theorem 4.2
Under the assumptions of Theorem 4.1, if additionally the tangent to \(\partial H^\mathrm {rel}\) belongs to \(\{e\in S^1: |e_2|\leqq \frac{\sqrt{3}}{4}|e_1|\}\) for any \(y_*\in \partial H^\mathrm {rel}\setminus {\hat{H}}\), then \(K^{\mathrm {sdqc}}=K^{(\infty )}=\{\sigma :\Phi (\sigma )\in H^\mathrm {rel}\}\).
Proof
The result follows from Lemma 4.4 and 4.11 below, using the inclusion \(K^{(\infty )}\subseteq K^{\mathrm {sdqc}}\) that is proven in Lemma 3.8. \(\quad \square \)
4.2 Outer Bound
The next two Lemmas contain the proof of the outer bound, i. e., the inclusion \(\Phi (K^{\mathrm {sdqc}})\subseteq H^\mathrm {rel}\).
Lemma 4.3
Let \(g:\mathbb {R}^{3\times 3}_\mathrm {sym}\rightarrow \mathbb {R}\) be defined by \(g(\xi ):=f_{y_0}(p(\xi ),q(\xi ))\), where \(f_{y_0}(p,q):=4(q^2-q_0^2)-3 (p-p_0)^2\) and \(y_0=(p_0, q_0)\in \mathbb {R}\times [0,\infty )\). Then, g is symmetric \(\mathrm {div\,}\)-quasiconvex.
Proof
By Lemma 2.7, we know that for the function \(f_\mathrm {T}:\mathbb {R}^{3\times 3}_\mathrm {sym}\rightarrow \mathbb {R}\),
is symmetric \(\mathrm {div\,}\)-quasiconvex. From
we obtain
Therefore, \(g(\xi )=f_\mathrm {T}(\xi -p_0{\text {Id}})-4q_0^2\) is symmetric \(\mathrm {div\,}\)-quasiconvex. \(\quad \square \)
Lemma 4.4
Under the assumptions of Theorem 4.1, \(\Phi (K^{\mathrm {sdqc}})\subseteq H^\mathrm {rel}\).
Proof
We pick a \(\sigma \in K^{\mathrm {sdqc}}\) and define \(y:=(p(\sigma ),q(\sigma ))\). We need to show that \(y\in H^\mathrm {rel}\).
If \(y\not \in {\hat{H}}^\mathrm {conv}\), then there is an affine function \(a:\mathbb {R}^2\rightarrow \mathbb {R}\) of the form \((p,q)\mapsto a(p,q)=bp+cq+d\) such that \(a(y)>0\) and \(a\leqq 0\) on \({\hat{H}}\).
We first show that we can assume \(c\geqq 0\). Indeed, if this were not the case, we could consider the new affine function \(a'(p,q):=bp+d\), which obeys \(a'(y)\geqq a(y)>0\). Let now \((p',q')\in {\hat{H}}\). By the definition of \({\hat{H}}\) we have \((p',0)\in {\hat{H}}\). By the definition of \(a'\) and the properties of a we obtain \(a'(p',q')=a(p',0)\leqq 0\). Therefore, we can assume \(c\geqq 0\), or, equivalently, that a is nondecreasing in its second argument.
The function \(g:\mathbb {R}^{3\times 3}_\mathrm {sym}\rightarrow \mathbb {R}\), \(g(\xi ):=a(p(\xi ),q(\xi ))\) is the composition of convex functions, with p linear, and a nondecreasing in the second argument. Therefore, g is convex, as can be easily verified:
In particular, \(g\leqq 0\) on K, \(g(\sigma )>0\) and g is convex. Hence, \(\sigma \) does not belong to the convex hull of K and neither does it belong to the symmetric \(\mathrm {div\,}\)-quasiconvex hull.
Assume now that \(y\in {\hat{H}}^\mathrm {conv}\setminus H^\mathrm {rel}\). Then, it is separated from \({\hat{H}}\) in the sense of (4.8). Let \(y_0=(p_0,q_0)\) be as in the definition of separation. By Lemma 4.3 the function \(\xi \mapsto f_{y_0}(p(\xi ), q(\xi ))=4(q(\xi )-q_0)^2-3(p(\xi )-p_0)^2\) is symmetric \(\mathrm {div\,}\)-quasiconvex and this implies \(\sigma \not \in K^{\mathrm {sdqc}}\). Therefore, \(\Phi (K^{\mathrm {sdqc}})\subseteq H^\mathrm {rel}\). \(\quad \square \)
4.3 Inner Bound
We now prove the inner bound. Specifically, we first show that for any \(y_*\in H^\mathrm {rel}\) there is a matrix \(\sigma \in K^{(\infty )}\) with \(\Phi (\sigma )=y_*\) (Lemma 4.10) and then that, if an additional condition on the slope of the boundary of \(H^\mathrm {rel}\) is fulfilled, any matrix \(\sigma \) with \(\Phi (\sigma )=y_*\) belongs to \(K^{(\infty )}\) (Lemma 4.11).
Our key result is a characterization of a family of rank-two curves in the (p, q) plane. We say that \(t\mapsto \gamma (t)\) is a rank-two curve if it is a reparametrization of \(s\mapsto \Phi (A+s(B-A))\) for some A, \(B\in \mathbb {R}^{3\times 3}_\mathrm {sym}\) with \(\mathop {\mathrm {rank}}(A-B)\leqq 2\). The curves we construct are at the same time level sets of symmetric \(\mathrm {div}\)-quasiconvex functions, either of the type used to separate points in the definition of \(H^\mathrm {rel}\) or (piecewise) affine. This allows us (see proof of Lemma 4.10) to show below that any point in \({\hat{H}}^\mathrm {conv}\) that cannot be separated from \({\hat{H}}\) can be constructed. This strategy is illustrated in Fig. 3.
Lemma 4.5
Let K, H and \({\hat{H}}\) be as above. Then, any \(\sigma _*\in \mathbb {R}^{3\times 3}_\mathrm {sym}\) with \((p(\sigma _*),q(\sigma _*))\in {\hat{H}}\) belongs to \(K^{(\infty )}\).
Proof
Let \(\sigma _*\in \mathbb {R}^{3\times 3}_\mathrm {sym}\) be such that \(p_*:=p(\sigma _*)\), \(q_*:=q(\sigma _*)\) obey \((p_*,q_*+a)\in H\) for some \(a>0\). We consider the rank-two line
This obeys \(\xi _0=\sigma _*\) and \(p(\xi _t)=p_*\) for all t. The map \(t\mapsto q(\xi _t)\) is continuous, equals \(q_*\) at \(t=0\) and diverges for \(t\mapsto \pm \infty \). Hence, there are \(t_-<0<t_+\) such that \(q(\xi _{t_\pm })=q_*+a\). In particular, \(\xi _{t_\pm }\in K\) and, therefore, (Lemma 3.11) \(\sigma _*=\xi _0\in K^{(\infty )}\). \(\quad \square \)
Lemma 4.6
Let \(y=(p_*,q_*)\in \mathbb {R}\times (0,\infty )\). Then, there is a continuous function \(\Gamma _y:S^1\times \mathbb {R}\rightarrow \mathbb {R}\times [0,\infty )\) such that for any \(e\in S^1\) the map \(t\mapsto \Gamma _y(e,t)\) is a rank-two curve parametrized by arc-length, with \(\Gamma _y(e,0)=y\), \(\partial _t \Gamma _y(e,0)=e\), and \(\Gamma _y(e,t)=\Gamma _y(-e,-t)\). The curves \(\Gamma _y(e,\cdot )\) are either of the form (4.14) or of the form (4.19).
Proof
For reasons that will become clear subsequently, we treat separately the two sets
We observe that both are closed, that their union is \(S^1\) and their intersection consists of the four points \((\pm \frac{2}{\sqrt{7}},\pm \frac{\sqrt{3}}{\sqrt{7}})\).
We start from \(S^1_+\). For \(p_0,a\in \mathbb {R}\), we consider the rank-two line
(see Fig. 4, left panel). We compute
Solving for t the first equation and inserting into the second, we obtain that the graph of \(t\mapsto (p(\xi _t), q(\xi _t))\) is the set
which we can be rewritten (recalling that \(q\geqq 0\)) as
Therefore, any line of the form \(\{q=\alpha |p-p_0|\}\) with \(|\alpha |\geqq \sqrt{3}/2\) is a rank-two line of the type given in (4.14). In turn, this means that we can define
where \(\Pi (p,q):=(p,|q|)\) denotes reflection onto the upper half-plane.
We now turn to \(S^1_-\). Let \((p_0,q_0)\in \mathbb {R}\times [0,\infty )\) and consider the rank-two line
As above, a simple computation shows that
We now consider the equation \((p(\xi _{t_*}), q(\xi _{t_*}))=(p_*, q_*)\). For every \(t_*\in [-\sqrt{3}q_*, \sqrt{3}q_*]\) there is a unique solution \((p_0, q_0)\in \mathbb {R}\times [0,\infty )\), namely,
We compute
Since we can choose \(t_*\) freely in \([-\sqrt{3}q_*, \sqrt{3}q_*]\), we conclude that for every \(e\in S^1_-\) there is a unique triplet \((p_0,q_0,t_*)\) such that the curve \(t\mapsto (p(\xi _t),q(\xi _t))\) passes through \(y=(p_*,q_*)\) at \(t=t_*\) with the tangent parallel to e. Indeed, this solution can be explicitly written as
It is clear that this solution and, hence, \(\xi _t\), depends continuously on e. We finally define \(\Gamma _y(e,t)\) for \(e\in S^1_-\) as the arc-length reparametrization of \(t\mapsto \xi _{t_*+t}\) or \(t\mapsto \xi _{t_*-t}\) depending on the sign of \(e_1\) (see Fig. 4, right panel).
It remains to check that this definition agrees with the previous one for the four points in \(S^1_-\cap S^1_+\). For these points, the formulas above give \(q_0=0\) and \(a=0\), so that the two definitions of \(\xi _t\) also coincide (with the same \(p_0\)). This concludes the proof. \(\quad \square \)
Lemma 4.7
Let \(y_*=(p_*,q_*)\) with \(q_*>0\), and assume that there are \(e\in S^1\) and \(t_-<0<t_+\) such that \(\Gamma _{y_*}(e,t_\pm )\in {\hat{H}}\), where \(\Gamma _{y_*}\) is the map constructed in Lemma 4.6. Then, \(y_*\in \Phi ( K^{(\infty )})\). If, additionally, \(|e_2|\leqq \frac{\sqrt{3} }{4} e_1\) then any matrix \(\sigma _*\in \mathbb {R}^{3\times 3}_\mathrm {sym}\) with \(\Phi (\sigma _*)=y_*\) belongs to \(K^{(\infty )}\).
Proof
In order to prove the first assertion we observe that, by Lemma 4.6, there is a rank-two line \(t\mapsto \xi _t\) such that \(\Phi (\xi _0)=y_*\) and \(\Gamma _{y_*}(e,\mathbb {R})\) is the graph of \(t\mapsto \Phi (\xi _t)\). In particular, there is \(s_-<0\) such that \((p,q)(\xi _{s_-})=\Gamma _{y_*}(e,t_-)\in {\hat{H}}\), which by Lemma 4.5 implies that \(\xi _{s_-}\in K^{(\infty )}\). Analogously for \(s_+\). By Lemma 3.11, we obtain \(\xi _0\in K^{(\infty )}\) and, therefore, \(y_*=\Phi (\xi _0)\in \Phi (K^{(\infty )})\).
We now turn to the second assertion. By Lemma 4.8 below, there is a rank-two line \(t\mapsto \xi _t\) with the same properties and, additionally, with \(\xi _0=\sigma _*\). The same argument then implies \(\sigma _*\in K^{(\infty )}\). \(\quad \square \)
Lemma 4.8
Let \(\sigma _*\in \mathbb {R}^{3\times 3}_\mathrm {sym}\). Let \(e\in S^1\) be such that \(|e_2|\leqq \frac{\sqrt{3}}{4} |e_1|\). Then, there is a rank-two line \(t\mapsto \xi _t\) through \(\xi _0=\sigma _*\) such that the curve \(t\mapsto (p(\xi _t),q(\xi _t))\) is an hyperbola of the type (4.20) which is parallel to e at \(t=0\).
Proof
Any rank-two line through \(\sigma _*\) has the form \(t\mapsto \xi _t:=\sigma _*+t B\), for some \(B\in \mathbb {R}^{3\times 3}_\mathrm {sym}\) with \(\det B=0\). Let a, b be the eigenvalues of B, and let e, f be a pair of orthonormal vectors such that \(B=ae\otimes e + bf\otimes f\). We let \(p_*:=p(\sigma _*)\), \(q_*:=q(\sigma _*)\) and compute
and
From (4.24), we obtain \(t=3(p(\xi _t)-p_*)/(a+b)\). Inserting in the previous expression leads to
(the case \(a+b=0\) is not relevant, since in this case \(t\mapsto p(\xi _t)\) is constant). The expression
can take any value in \([1/4,\infty )\) and the value 1 / 4 is taken if and only if \(a=b\). Therefore, the coefficient of the quadratic term \((p(\xi _t)-p_*)^2 \) can be the required value of 3 / 2 [see (4.20)] if and only if \(a=b\). We can scale to \(a=b=1\) and obtain
We are left with the task of choosing e and f. Let \(g:=e\wedge f\), so that (e, f, g) is an orthonormal basis of \(\mathbb {R}^3\). Then,
so that, after some rearrangement, the linear term takes the form
We conclude that the graph of \(t\mapsto (p(\xi _t),q(\xi _t))\) is the graph of the curve defined by
and its derivative at \(p_*\) is given by
It remains to show that we can choose B such that this quantity equals \(e_2/e_1\), which is a number in \([-\sqrt{3}/4,\sqrt{3}/4]\). To this end, we first show that the ordered eigenvalues \(\lambda _1\leqq \lambda _2\leqq \lambda _3\) of the matrix \(\sigma _D:=\sigma _*-p_*{\text {Id}}\) obey \(\lambda _1\leqq -q_*/\sqrt{3}\), \(\lambda _3\geqq q_*/\sqrt{3}\). Indeed, assume the former was not the case. If \(\lambda _2\leqq 0\), then \(\lambda _3<2 q_*/\sqrt{3}\) and \(\lambda _1^2+\lambda _2^2+\lambda _3^2<(1/3+1/3+4/3 )q_*^2=2q_*^2\), which is a contradiction. If, instead, \(\lambda _2\geqq 0\), then \(\lambda _2,\lambda _3\leqq q_*/\sqrt{3}\), with the same conclusion. The argument for \(\lambda _3\) is similar.
Therefore, the set \(\{g\cdot \sigma _D g : g\in S^2\}\) contains the interval \([-q_*/\sqrt{3},q_*/\sqrt{3}]\), and we can choose g (and hence e, f) such that \(p_*-g\cdot \sigma _* g=-g\cdot \sigma _D g=4q_*e_2/(3e_1)\in [-q_*/\sqrt{3},q_*/\sqrt{3}]\). \(\quad \square \)
Lemma 4.9
Let \(p_{\min }:=\min \{p: \exists q, (p,q)\in H\}\), \(p_{\max }:=\max \{p: \exists q, (p,q)\in H\}\) and
Assume \(H^\mathrm {rel}\) is connected. Then, \(A=B=C\).
We remark that the definition of A immediately implies \({\hat{H}}^\mathrm {conv}\subseteq A\times [0,\infty )\).
Proof
By convexity, we easily obtain \(B\subseteq A\) and \(C\subseteq A\). By the construction of \({\hat{H}}\), we have \(p_{\min }\in C\), \(p_{\max }\in C\). From the construction of \(H^\mathrm {rel}\), we see that \((p,q)\in H^\mathrm {rel}\) implies that the segment joining (p, q) with (p, 0) also belongs to \(H^\mathrm {rel}\). This proves that \(H^\mathrm {rel}\) is connected if and only if C is connected and that C is the orthogonal projection of \(H^\mathrm {rel}\) onto the \(q=0\) axis. In particular, we have \(A=C\).
It remains to show that \(A\subseteq B\). By Lemma 4.5, we have that \(p_{\min }\in B\) and \(p_{\max }\in B\). We define
and
We first show that \(D_+\cap D_-\subseteq B\). Indeed, let \(p_*\in D_+\cap D_-\) and let \(\sigma _*:=p_*{\text {Id}}\). By assumption, there are \((p_-,q_-), (p_+,q_+)\in H\) such that \(p_-\leqq p_*\leqq p_+\), \(q_-\geqq \gamma (p_*-p_-)\), \(q_+\geqq \gamma (p_+-p_*)\), where \(\gamma :=\frac{\sqrt{3}}{2}\). In particular, \((p_-, \gamma (p_*-p_-))\in {\hat{H}}\) and \((p_+, \gamma (p_+-p_*))\in {\hat{H}}\). We consider the rank-two line
and observe that there are \(t_-\leqq 0\leqq t_+\) such that \(p(\xi _{t_\pm })=p_\pm \), \(q(\xi _{t_\pm })=\gamma |p_\pm -p_*|\). Lemma 4.5 implies \(\xi _{t_\pm }\in K^{(\infty )}\) and, with Lemma 3.11, one then deduces \(\sigma _*=\xi _0\in K^{(\infty )}\).
We next show that \(A\subseteq D_+\cup D_-\) Indeed, if \(p_*\not \in D_+\cup D_-\) then \(q(\sigma )< \frac{\sqrt{3}}{2} |p(\sigma )-p_*|\) for any \(\sigma \in K\). Consider the function \(f(p,q):=4q^2-3(p-p_*)^2\). Then, \(f(p,q)<0=f(p_*,0)\) for all \((p,q)\in {\hat{H}}\), therefore \((p_*,0)\) is separated from \({\hat{H}}\) and does not belong to \(H^\mathrm {rel}\). This implies that \(p_*\not \in C=A\).
Up until now we have shown that
Assume that there is \(p_*\in A{\setminus }B\). Without loss of generality, assume \(p_*\in D_+\). Let \({\bar{p}} :=\min \{p\in B: p>p_*\}\). Since \(p_{\max }\in B\), the set is nonempty. Since B is closed, \(p_*<{\bar{p}}\). The sets \(D_+\) and \(D_-\) are compact, cover the interval \([p_*,{\bar{p}}]\) and are disjoint in \([p_*,{\bar{p}})\). Therefore, \([p_*,{\bar{p}}]\subseteq D_+\).
Let \(p'\in (p_*,{\bar{p}})\subseteq D_+\). If there was \(q'\geqq 0\) such that \((p',q')\in H\), then we would have \((p',0)\in {\hat{H}}\) and \(p'\in B\). Therefore, \([p_*,{\bar{p}})\times [0,\infty )\cap H=\emptyset \). For any \(p'\in (p_*,{\bar{p}})\), there is a point \(y=(p_-,q_-)\in H\) with \(p_-<p_*\), \(q_-\geqq \gamma (p'-p_*)\). Consider a sequence of such points, \(p'_j\rightarrow {\bar{p}}\). By compactness of H, the corresponding points \(y_j=(p^-_j, q^-_j)\) converge (after extracting a subsequence) to some \(y_0=(p_0,q_0)\in H\). Since \(p^-_j<p_*\) for all j and H is closed, we have \(p_0< p_*\).
We finally consider the rank-two line
Let \(t_0\) be such that \({\bar{p}} + \frac{2}{3} t_0=p_0\). The condition \({\bar{p}}\in B\) corresponds to \(\xi _0={\bar{p}} {\text {Id}}\in K^{(\infty )}\), the definition of \(y_0\) shows that \(\Phi (\xi _{t_0})\in {\hat{H}}\) and, with Lemma 4.5, we obtain \(\xi _{t_0}\in K^{(\infty )}\). Therefore, \(\xi _t\in K^{(\infty )}\) for all \(t\in [t_0,0]\).
Let now \(t_1\in (t_0,0)\) be such that \({\bar{p}} + \frac{2}{3} t_1=p_*\). After swapping coordinates, we see that the two matrices
belong to \(K^{(\infty )}\). Since \(\mathop {\mathrm {rank}}(\xi _A-\xi _B)=2\), so do all matrices in the segment joining them and, in particular,
Again, swapping coordinates, the same is true for
Since \(\mathop {\mathrm {rank}}(\xi _D-\xi _C)=2\) and \(p_*{\text {Id}}=\frac{1}{2}\xi _D+\frac{1}{2}\xi _C\), we obtain \(p_*{\text {Id}}\in K^{(\infty )}\). This implies \(p_*\in B\), a contradiction. Therefore, we conclude that \(A\subseteq B\). \(\quad \square \)
Lemma 4.10
Under the assumptions of Theorem 4.1, \(H^\mathrm {rel}\subseteq \Phi (K^{(\infty )})\).
Proof
We fix \(y_*=(p_*, q_*)\in H^\mathrm {rel}\). If \(q_*=0\), then, in the notation of Lemma 4.9, we have \(p_*\in C=B\) and therefore \(p_*{\text {Id}}\in K^{(\infty )}\). If \(y_*\in {\hat{H}}\), then the result follows from Lemma 4.5.
It remains to consider the case \(y_*\in H^\mathrm {rel}\setminus {\hat{H}}\) and \(q_*>0\). We consider the set of directions such that the rank-two line constructed in Lemma 4.6 intersects \({\hat{H}}_A:={\hat{H}}\cup A\times \{0\}\), where A is the set constructed in Lemma 4.9 and define
(this is illustrated in Fig. 3). By continuity of \(\Gamma _{y_*}\) and compactness of \({\hat{H}}_A\), it follows that \(D(y_*)\) is a closed subset of \(S^1\).
We now distinguish two cases. If there is \(e\in D(y_*)\cap -D(y_*)\), then there are \(t_-<0<t_+\) such that \(\Gamma _{y_*}(e,t_\pm )\in {\hat{H}}_A\) and Lemma 4.7 implies that \(y_*\in \Phi (K^{(\infty )})\).
If instead there is no such e, then \(D(y_*)\) and \(-D(y_*)\) are disjoint. Since they are both closed, and \(S^1\) is connected, they cannot cover \(S^1\). In particular, there is \(e\in S^1\) such that \(e,-e\not \in D(y_*)\).
In the notation of Lemma 4.6, if \(e\in S^1_+\) then the curve \(\Gamma _{y_*}(e,\mathbb {R})\) is the graph of \(q=b|p-p_0|\) for some \(b\geqq \sqrt{3}/2\), \(p_0\in \mathbb {R}\) such that \(q_*=b|p_*-p_0|\). Assume, for definiteness, that \(p_0>p_*\). The remaining case is identical up to a few signs.
This curve does not intersect \({\hat{H}}_A\) and, by the form of \({\hat{H}}_A\), this implies that \(q<b|p-p_0|\) for all \((p,q)\in {\hat{H}}_A\). In particular, \(p_0\not \in A\). Since A is an interval and \(p_*\in A\), we have that \(A\subseteq (-\infty ,p_0)\) and \(H^\mathrm {conv}\subseteq (-\infty ,p_0)\times [0,\infty )\). Hence, \(q<b(p-p_0)\) for all \((p,q)\in {\hat{H}}\) and, by convexity, \(q<b(p-p_0)\) for all \((p,q)\in {\hat{H}}^\mathrm {conv}\), but this contradicts the assumption \((p_*,q_*)\in H^\mathrm {rel}\).
The case \(e\in S^1_-\) is similar. The curve \(\Gamma _{y_*}(e,\mathbb {R})\) is of the type \(\{f_{y_1}(\cdot )=0\}\), for some \(y_1\). Then, \(f_{y_1}(y_*)=0\) but \(f_{y_1}<0\) on \({\hat{H}}\), so that \(y_*\) is separated from \({\hat{H}}^\mathrm {conv}\), contradicting the assumption that \(y_*\in H^\mathrm {rel}\). \(\quad \square \)
Lemma 4.11
Under the assumptions of Theorem 4.1, if additionally the tangent to \(\partial H^\mathrm {rel}\) belongs to \(\{e\in S^1: |e_2|\leqq \frac{\sqrt{3}}{4}|e_1|\}\) for any \(y_*\in \partial H^\mathrm {rel}\setminus {\hat{H}}\), then any \(\sigma \) with \(\Phi (\sigma )\in H^\mathrm {rel}\) belongs to \(K^{(\infty )}\).
In particular, the assumption implies that \(\partial H^\mathrm {rel}\) is differentiable (as a graph) at any point not belonging to \({\hat{H}}\), but does not require differentiability on \({\hat{H}}\).
Proof
The argument is similar to the proof of the previous Lemma. By construction of \(H^\mathrm {rel}\), there is a map \(\psi : A\rightarrow [0,\infty )\) such that
We first show that any \(\sigma _*\) such that \((p_*,q_*):=\Phi (\sigma _*)\in \partial H^\mathrm {rel}\) belongs to \(K^{(\infty )}\). We distinguish several cases. If \(q_*=0\), then \(p_*\in A\) and the claim follows from the equality \(A=B\) in Lemma 4.9. If \((p_*,q_*)\in {\hat{H}}\), then the claim follows from Lemma 4.5. It remains the case that \((p_*,q_*)\in {\hat{H}}^\mathrm {conv}\setminus {\hat{H}}\) and cannot be separated from \({\hat{H}}\).
At this point, we repeat the argument in Lemma 4.10. In particular, since \(y_*\in H^\mathrm {rel}\) we know that there is \(e\in S^1\) such that \(e\in D(y_*)\cap -D(y_*)\). This means that there are \(t_-<0<t_+\) such that \(\Gamma _{y_*}(e,t_\pm )\in {\hat{H}}\) and that \(\Gamma _{y_*}(e,t)\in H^\mathrm {rel}\) for all \(t\in [t_-,t_+]\). This implies that \(\Gamma _{y_*}(e,\cdot )\) is tangential to \(\partial H^\mathrm {rel}\) at \(t=0\) and, in particular, that e is tangential to \(\partial H^\mathrm {rel}\). We remark that e cannot be \((0,\pm 1)\), since in that case we would have \(y_*\in {\hat{H}}\), a case we have already dealt with.
Therefore, \(|e_2|\leqq \frac{\sqrt{3}}{4}|e_1|\), so that by Lemma 4.8 we obtain that \(\Phi (\xi _{s_\pm })\in {\hat{H}}\), which by Lemma 4.5 implies \(\xi _{s_\pm }\in K^{(\infty )}\). Therefore, \(\sigma _*=\xi _0\in K^{(\infty )}\).
This shows that for any \(p\in A\) and matrix \(\sigma \) with \(\Phi (\sigma )=(p,\psi (p))\) belongs to \(K^{(\infty )}\). The argument of Lemma 4.5 then concludes the proof. \(\quad \square \)
We finally show that \(H^\mathrm {rel}=\Phi (K^{(\infty )})\) does not imply \(K^{(\infty )}=\Phi ^{-1}(H^\mathrm {rel})\). We refer to Fig. 5 for an illustration.
Lemma 4.12
Let \(H:=\{(0,0),(1,\sqrt{3}/2)\}\), and define K as in (4.4). Then, \(H^\mathrm {rel}=\{(p,q): 0\leqq p\leqq 1, 0\leqq q \leqq \sqrt{3} p/2\}\), the matrix \(\sigma _*:=\mathrm {diag}(1,1/4,1/4)\) obeys \((p(\sigma _*), q(\sigma _*)) = (1/2,\sqrt{3}/4)\in H^\mathrm {rel}\), but \(\sigma _*\not \in K^{(\infty )}\subseteq K^{\mathrm {sdqc}}\).
Proof
The formula for \(H^\mathrm {rel}\) follows immediately from the definition in (4.7–4.8); the fact that \(\sigma _*\in H^\mathrm {rel}\) from the definition of p and q in (4.1). Lemma 3.8 shows that \(K^{(\infty )}\subseteq K^{\mathrm {sdqc}}\).
It remains to prove that \(\sigma \not \in K^{(\infty )}\). Since \(\mathop {\mathrm {rank}}\sigma _*=3\), Lemma 3.12 implies \(\{0, 2\sigma _*\}^{(\infty )}=\{0, 2\sigma _*\}\). Therefore, it suffices to show that \(\sigma _*\in K^{(\infty )}\) would imply \(\sigma _*\in \{0, 2\sigma _*\}^{(\infty )}\).
We first define \(h:\mathbb {R}^{3\times 3}_\mathrm {sym}\rightarrow \mathbb {R}\), \(h(\xi ):=2p(\xi )-\xi _{11}\) and observe that \(h(0) = h(\sigma _*) = h(2\sigma _*) = 0\). We fix any \(\xi \in K\setminus \{0\}\). Then, necessarily \(p(\xi )=1\) and \(q(\xi )=\sqrt{3}/2\). Recalling that \( 2q^2(\xi )=|\xi -p(\xi ){\text {Id}}|^2\) and \(\xi _{33}=3p(\xi )-\xi _{11}-\xi _{22}\), we compute that
and we conclude that \(\xi _{11}\leqq 2\), so that \(h(\xi )\geqq 0\). Furthermore, if \(h(\xi )=0\) then necessarily \(\xi _{11}=2\), so that equality holds throughout in (4.43). This, in turn, implies that \(\xi =2\sigma _*\). We have therefore proven that \(h\geqq 0\) on K, with \(\{h=0\}\cap K=\{0,2\sigma _*\}\).
We now assume \(\sigma _*\in K^{(\infty )}\), so that, for any \(g\in C^0(\mathbb {R}^{3\times 3}_\mathrm {sym};[0,\infty ))\) which is symmetric \(\mathrm {div}\)-quasiconvex, \(g(\sigma _*)\leqq \max g(K)\). In order to show that \(\sigma _*\in \{0, 2\sigma _*\}^{(\infty )}\), we fix a function \(f\in C^0(\mathbb {R}^{3\times 3}_\mathrm {sym};[0,\infty )\) which is symmetric \(\mathrm {div}\)-quasiconvex, and let \(\alpha :=\max \{f(0),f(2\sigma _*)\}\). We need to show that \(f(\sigma _*)\leqq \alpha \).
Fix \(\varepsilon >0\). By continuity there is \(\delta >0\) such that \(f\leqq \alpha +\varepsilon \) on \(B_\delta (2\sigma _*)\). Let \(M:=\max f(K)\geqq \alpha \), \(m:=\min h(K\setminus \{0\}\setminus B_\delta (2\sigma _*))>0\). We define
Then, \(g(0)=f(0)\leqq \alpha \), \(g\leqq \alpha +\varepsilon \) on \(K\cap B_\delta (2\sigma _*)\), \(g\leqq M-(M-\alpha )=\alpha \) on the rest of K, and g is continuous and symmetric \(\mathrm {div}\)-quasiconvex. The function \(g_+=\max \{g,0\}\in C^0(\mathbb {R}^{3\times 3}_\mathrm {sym};[0,\infty ))\) obeys \(\max g_+(K)\leqq \alpha +\varepsilon \). Since \(\sigma _*\in K^{(\infty )}\), we have \(f(\sigma _*)=g_+(\sigma _*)\leqq \alpha +\varepsilon \). However, \(\varepsilon \) was arbitrary, hence we conclude that \(f(\sigma _*)\leqq \max f(\{0,2\sigma _*\})\). Therefore, \(\sigma _*\in \{0,2\sigma _*\}^{\mathrm {sdqc}}\), as claimed, and the proof is concluded. \(\quad \square \)
4.4 Examples
We close by presenting two specific examples for which the symmetric \(\mathrm {div}\)-quasiconvex hull can be explicitly characterized.
Lemma 4.13
Let \(p_1,q_1>0\), with \(0<p_1<2q_1/\sqrt{3}\), and let \(H:=\{(-p_1,q_1),(p_1,q_1)\}\). Then,
and \(\Phi (K^{\mathrm {sdqc}})=H^\mathrm {rel}\). If, additionally, \(p_1\leqq q_1/\sqrt{3}\), then
We refer to Fig. 2 for an illustration.
Proof
We observe that \({\hat{H}}=\{-p_1,p_1\}\times [0,q_1]\) and \({\hat{H}}^\mathrm {conv}=[-p_1,p_1]\times [0, q_1]\).
Let \(W:=\{(p,q): -p_1\leqq p\leqq p_1, q^2-\frac{3}{4} p^2\leqq q_1^2-\frac{3}{4}p_1^2, q\geqq 0\}\) be the set in (4.45). We first show that \(H^\mathrm {rel}\subseteq W\). We define \(q_0:=\sqrt{q_1^2-\frac{3}{4} p_1^2}\) and consider the corresponding function \(f_{(0,q_0)}(p,q)=4(q^2-q_0^2)-3p^2 = 4(q^2-q_1^2)-3(p^2-p_1^2)\). Then, \(f_{(0,q_0)}\leqq 0\) on \({\hat{H}}\), and \(f_{(0,q_0)}>0\) on \({\hat{H}}^\mathrm {conv}\setminus W\). Recalling (4.8), we obtain \(H^\mathrm {rel}\subseteq W\).
To obtain the remaining inclusion, it suffices to show that we cannot separate any point of W from \({\hat{H}}\). We fix a point \((p,q)\in W\) and consider a generic pair \(y_0=(p_0,q_0)\in \mathbb {R}\times [0,\infty )\). The function \(f_{y_0}\) separates (p, q) from \({\hat{H}}\) if
which, expanding all squares, is the same as
From \((p,q)\in W\) we obtain \(|p|\leqq p_1\), which implies \(6pp_0\leqq 6 |p_1p_0|\), and \( 4q^2 - 3 p^2 \leqq 4q_1^2-3 p_1^2 \). Summing the two gives
which means that we cannot separate (p, q) from \({\hat{H}}\). Therefore, \(W\subseteq H^\mathrm {rel}\).
From the definition and the condition \(p_1<2q_1/\sqrt{3}\), we see that \(H^\mathrm {rel}\) is connected, so that the first assertion directly follows from Theorem 4.1.
To prove the second assertion we need only control the slope of the boundary. The vertical sides of \(H^\mathrm {rel}\) belong to \({\hat{H}}\). The slope of the hyperbola is maximal at the two extreme points, i. e., at \((\pm p_1, q_1)\). Differentiating \(q^2-\frac{3}{4}p^2=c\), we obtain \(q'q=\frac{3}{4} p'p\), which implies that \(|q'|/|p'|=\frac{3}{4} p_1/q_1\). If \(p_1\leqq q_1/\sqrt{3}\), this implies that the slope is not larger than \(\frac{\sqrt{3}}{4}\). The conclusion then follows from Lemma 4.2.
\(\square \)
Next, we consider a second example in which H consists of a half-circle of radius r centered in \(C:=(p_C,0)\) and a single point \(D:=(p_D,q_D)\):
There are several different cases, depending on the existence of one or two hyperbolas in the family considered above which contain the point D and are tangent to the circle. The boundaries between the different phases are vertical lines (corresponding to the construction of \({\hat{H}}\) from H) and lines with slope \(\pm \sqrt{3}/2\) (corresponding to the maximal slope of the hyperbolas, which is also the boundary between \(S^1_+\) and \(S^1_-\)). The phase diagram is sketched in Fig. 6. The critical points are \(X=(p_C-\frac{\sqrt{7}}{\sqrt{3}} r,0)\), \(Y=(p_C+\frac{\sqrt{7}}{\sqrt{3}} r,0)\) and \(Z=(p_C,\frac{\sqrt{7}}{\sqrt{4}} r)\). For definiteness, we focus on two representative regions.
Lemma 4.14
Let H be as in (4.50) with D in region I, defined as
Then, there is a unique \(y_0=(p_0,q_0)\in \mathbb {R}\times [0,\infty )\) such that the hyperbola \(\{q^2-q_0^2=\frac{3}{4}(p-p_0)^2\}\) contains \(D=(p_D,q_D)\) and is tangent to the circle with radius r centered in \(C=(p_C,0)\) in a point T. Furthermore,
If, instead, D is in region II, defined by
then \(H^\mathrm {rel}={\hat{H}}^\mathrm {conv}\).
Proof
The second case is straightforward. The boundary of \({\hat{H}}^\mathrm {conv}\) has slope at least \(\sqrt{3}/2\), hence there is no possibility to separate any point of it using the given hyperbolas. A sketch is shown in Fig. 8.
The first case, corresponding to region I in Fig. 6, requires a more detailed argument. We first have to show that there is a unique hyperbola of the type \(q^2-q_0^2=\frac{3}{4}(p-p_0)^2\) which contains D and is tangent to the half-circle. We refer to Fig. 7 for an illustration.
The condition that \(y_D\) belongs to the hyperbola translates into
The condition of being tangential means that the system
has a double solution. Note that these equations are both quadratic in p and linear in \(q^2\), hence the system is overall of second order in these two variables. Substituting \(q^2\) into the second equation leads to the condition that
has a double solution \(p_T\), which should satisfy \(p_T\in [p_C-r,p_C+r]\). This solution can be computed explicitly, but for proving the assertion existence suffices. To this end, we consider the family of curves \(\Gamma _D(e,\mathbb {R})\) constructed in Lemma 4.6 for \(|e_2|\leqq \frac{\sqrt{3}}{2}e_1\). The assumption (4.51) implies that \(\Gamma _D((2/\sqrt{7}, -\sqrt{3}/\sqrt{7}), [0,\infty ))\) intersects \(B_C(r)\), but \(\Gamma _D((2/\sqrt{7}, +\sqrt{3}/\sqrt{7}), [0,\infty ))\) does not (notice that both these curves are piecewise affine). By continuity there is \(e_*\) in the given interval such that \(\Gamma _D(e_*,\mathbb {R})\) is tangent to \(B_C(r)\). We denote by T the intersection of the two, and define \((q_0, p_0)\) so that \(\Gamma _D(e_*,\mathbb {R})\) is the set \(q^2-q_0^2=\frac{3}{4}(p-p_0)^2\) (see Fig. 7).
To conclude the proof, it suffices to show that no point of the given set can be separated by another hyperbola. To this end, it suffices to show that no other hyperbola of the given family can have two points in common with the given one. This follows from the fact that any solution to the system
obeys \(q_0^2-q_1^2=\frac{3}{4} (p_1^2-p_0^2-2pp_1-2pp_0)\), which is a linear equation in p and, therefore, has at most one solution. If p is unique, since \(q\geqq 0\), then obviously q is also unique. This concludes the proof. \(\quad \square \)
References
Ambrosio, L., Fusco, N., Pallara, D.: Functions of Bounded Variation and Free Discontinuity Problems, Mathematical Monographs. Oxford University Press, Oxford 2000
Conti, S., Müller, S., Ortiz, S.: Data-driven problems in elasticity. Arch. Rational Mech. Anal. 229(1), 79–123, 2018
Evans, L.C., Gariepy, R.F.: Measure Theory and Fine Properties of Functions. CRC Press, Boca Raton 1992
Friesecke, G., James, R., Müller, S.: A theorem on geometric rigidity and the derivation of nonlinear plate theory from three dimensional elasticity. Commun. Pure Appl. Math55, 1461–1506, 2002
Fonseca, I., Müller, S.: \({\cal{A}}\)-quasiconvexity, lower semicontinuity, and Young measures. SIAM J. Math. Anal. 30(6), 1355–1390, 1999
Faraco, D., Székelyhidi, L.: Tartar’s conjecture and localization of the quasiconvex hull in \({\mathbb{R}}^{2\times 2}\). Acta Math. 200(2), 279–305, 2008
Garroni, A., Nesi, V.: Rigidity and lack of rigidity for solenoidal matrix fields. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci. 460(2046), 1789–1806, 2004
Gurson, A.L.: Continuum theory of ductile rupture by void nucleation and growth: Part i–yield criteria and flow rules for porous ductile materials. J. Eng. Mater. Technol. 99, 2–15, 1977
Kristensen, J.: On the non-locality of quasiconvexity. Ann. Inst. H. Poincaré Anal. Non Linéaire16(1), 1–13, 1999
Lubliner, J.: Plasticity Theory. Macmillan, New York, London 1990
Meade, C., Jeanloz, R.: Effect of a coordination change on the strength of amorphous \(\text{ SiO }_2\). Science241(4869), 1072–1074, 1988
Müller, S., Palombaro, M.: On a differential inclusion related to the Born-Infeld equations. SIAM J. Math. Anal. 46(4), 2385–2403, 2014
Maloney, C.E., Robbins, M.O.: Evolution of displacements and strains in sheared amorphous solids. J. Phys. Conden. Matter20(24), 244128, 2008
Murat, F.: Compacité par compensation: condition necessaire et suffisante de continuite faible sous une hypothèse de rang constant. Ann. Sc. Norm. Super. Pisa, Cl. Sci., IV. Ser. 8, 69–102, 1981
Palombaro, M., Ponsiglione, M.: The three divergence free matrix fields problem. Asymptot. Anal. 40(1), 37–49, 2004
Palombaro, M., Smyshlyaev, V.P.: Relaxation of three solenoidal wells and characterization of extremal three-phase \(H\)-measures. Arch. Ration. Mech. Anal. 194(3), 775–722, 2009
Schill, W., Heyden, S., Conti, S., Ortiz, M.: The anomalous yield behavior of fused silica glass. J. Mech. Phys. Solids113, 105–125, 2018
Stein, E.M.: Singular Integrals and Differentiability Properties of Functions. Princeton University Press, Princeton 1970
Schofield, A.N., Wroth, C.P.: Critical State Soil Mechanics. McGraw-Hill, New York City 1968
Stein, E.M., Weiss, G.: Introduction to Fourier analysis on Euclidean spaces. Princeton University Press, Princeton, N.J., Princeton Mathematical Series, No. 32. 1971
Tartar, L.: Compensated compactness and applications to partial differential equations. Nonlinear analysis and mechanics: Heriot–Watt Symp., Vol. 4, Edinburgh 1979, Res. Notes Math. 39, 136–212, 1979
Tartar, L.: The compensated compactness method applied to systems of conservation laws. In Systems of nonlinear partial differential equations (Oxford, 1982), volume 111 of NATO Adv. Sci. Inst. Ser. C Math. Phys. Sci., pages 263–285. Reidel, Dordrecht, 1983
Tartar, L.: Estimations fines des coefficients homogénéisés. In Ennio De Giorgi colloquium (Paris, 1983), volume 125 of Res. Notes in Math., pp. 168–187. Pitman, Boston, MA, 1985
Šverák, V.: Rank-one convexity does not imply quasiconvexity. Proc. Roy. Soc. Edinburgh Sect. A120(1–2), 185–189, 1992
Šverák, V.: On Tartar’s conjecture. Ann. Inst. H. Poincaré Anal. Non Linéaire10(4), 405–412, 1993
Zhang, K.: A construction of quasiconvex functions with linear growth at infinity. Ann. Scuola. Norm. Sup. Pisa Cl. Sci. (4)19(3), 313–326, 1992
Acknowledgements
This work was partially supported by the Deutsche Forschungsgemeinschaft through the Sonderforschungsbereich 1060 “The mathematics of emergent effects”, project A5, and through the Hausdorff Center for Mathematics, GZ 2047/1, project-ID 390685813.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by The Editors
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Conti, S., Müller, S. & Ortiz, M. Symmetric Div-Quasiconvexity and the Relaxation of Static Problems. Arch Rational Mech Anal 235, 841–880 (2020). https://doi.org/10.1007/s00205-019-01433-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00205-019-01433-1