Abstract
The aim of this paper is the study of first-order stationary systems of PDEs of the form \(\sum _k A_k\partial _k U + KU=0\) with \(K\ngtr 0\) on \(\Omega = {\mathbb {R}}^d\) and \(\Omega \subset {\mathbb {R}}^d\) bounded. We prove that the classical assumption \(K>0\) is not necessary for the well-posedness of the system and is violated in the particular case of the first-order Poisson problem. In the case \(\Omega = {\mathbb {R}}^d,\) we use Fourier analysis for the existence and uniqueness of solutions. For \(\Omega \subset {\mathbb {R}}^d\) bounded, we use a complex analog of the Banach–Nečas–Babuška theorem to obtain the existence and uniqueness of a solution in a setting that encompasses both Friedrichs’ systems and the first order reduction of the Poisson problem. The techniques used to prove the classical inf-sup conditions are inspired by harmonic analysis arguments that are consistent with the case \(\Omega ={\mathbb {R}}^d.\) In order to illustrate our approach, we study in detail the reduction of the Poisson equation to a first-order system.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Let \(d,m\in {\mathbb {N}}^*.\) We consider systems of first-order PDEs
with unknown \(U\in \left( L^2({\mathbb {R}}^d) \right) ^m,\) where \(A_k, k=1,\ldots d,\) and K are \(m\times m\) real matrices and \(F\in \left( L^2({\mathbb {R}}^d) \right) ^m.\) The system (1) takes the conservative form \(\nabla \cdot {\mathbb {F}}(U)+KU=F\) with linear flux \({\mathbb {F}}(U)\vec {\xi }=A(\vec {\xi })U\) and jacobian matrix \(A(\vec {\xi })=\sum _{k=1}^d \xi _k A_k\) where \(\vec {\xi }=(\xi _1,\ldots ,\xi _d).\)
The classical theory of Friedrichs systems ([1, 2, section 5.2]) covers the case of symmetric systems with \(K>0,\) using a variational formulation and Lax-Milgram theorem. The coercivity required for the Lax-Milgram theorem is a consequence of the assumption \(K>0.\)
However, in several important cases, one has to consider systems (1) with \(K\ngtr 0.\) The first class of examples is conservation laws with source terms in the stationary regime. In the particular case of gas dynamics, taking into account a friction force [3], a Coriolis force [4] or a chemical reaction [5, chapter 2 section 5] yields \(K\ngtr 0.\)
The second class consists of the mixed formulation of stationary diffusion problems \(\nabla \cdot (D\vec {\nabla } u)=f.\) The prototypical example of a diffusion equation is the Poisson problem whose first-order reduction is given in Sect. 4. Finite volume schemes for stationary diffusion on unstructured meshes can have a very complex design (see for instance [6]). The authors however believe that the discretization of the mixed formulation of stationary diffusion on unstructured meshes will yield simpler schemes with smaller stencils and linear systems that are larger but with a better condition number \(({\mathcal {O}}(\frac{1}{h})\) instead of \({\mathcal {O}}(\frac{1}{h^2})).\)
The main objective of this paper is to lay the ground for the numerical analysis of finite volume methods for stationary first-order systems with \(K\ngtr 0.\) In this first account of our research, we emphasize the relations between the symbol A, the friction matrix K and the boundary condition operator \(P_{\partial \Omega }\) that yield a well-posed problem.
We quickly review in Sect. 2 the well-posedness of the problem (1) on \({\mathbb {R}}^d.\) We prove that \(K>0\) is not a necessary condition but only a particular case. There is even no need for K to be invertible as shown in Theorem 1 and Corollary 3. Then in Sect. 3 we investigate the case \(\Omega \subset {\mathbb {R}}^d\) bounded. We use a complex analog of the BNB theorem [2], instead of the Lax-Milgram theorem to obtain a general existence result (see Theorem 5 and Corollary 2). In Sect. 4 we investigate the first order reduction of the Poisson problem with Dirichlet boundary condition. An existence result is given in Theorem 7 as an application of the approach laid in Sect. 3.
2 The case \(\Omega ={\mathbb {R}}^d\)
In this section, the solutions are sought for in \(\left( L^2({\mathbb {R}}^d) \right) ^m\) and we give three theorems (Theorems 1, 2 and 6) that are straightforward applications of the Fourier transform on first order systems (1) as done in [7, section 3.1] for hyperbolic systems.
Existence of solutions to (1) is guaranteed provided \(i A(\vec {\xi })+K\) is an invertible matrix for almost every \(\xi \in {\mathbb {R}}^d\) and \((i A(\vec {\xi })+K)^{-1}\hat{F}\in \left( L^2({\mathbb {R}}^d) \right) ^m.\)
Theorem 1
(Existence for first order systems) Let \(m,d\in {\mathbb {N}}^*,\) \(A_1,\ldots , A_d,K\) be \(m\times m\) real matrices and \(F\in \left( L^2({\mathbb {R}}^d) \right) ^m\) such that \(i A(\vec {\xi })+K\) is an invertible matrix for almost every \(\xi \in {\mathbb {R}}^d\) and
Then (1) admits a unique solution \(U\in \left( L^2({\mathbb {R}}^d) \right) ^m.\)
Proof
Taking the Fourier transform, (1) is equivalent to \(\left( i A(\vec {\xi })+K\right) \hat{U}(\vec {\xi })=\hat{F}(\vec {\xi }),\) hence \({\hat{U}(\vec {\xi })}=\left( i A(\vec {\xi })+K\right) ^{-1}\hat{F} (\vec {\xi })\) should be in \(\left( L^2({\mathbb {R}}^d) \right) ^m\) for a unique solution to exist. \(\square \)
We propose to define elliptic systems as systems such that \(\det \left( i A(\vec {\xi })+K\right) \ne 0\) almost everywhere. This definition is different from both the Petrovski [8,9,10,11] and ADN-ellipticity definitions [8, 9, 12], since they do not involve zero order terms, i.e. the matrix K in our case. In the case of the first order Poisson system (Sect. 4), the quantity \(\det \left( i A(\vec {\xi })+K\right) \ne 0\) matches exactly the symbol of the scalar Poisson problem (see Eq. 46).
Let \(\sigma _{min}(M)\) denote the smallest singular value of a matrix M. The following theorem states that solutions to (1) exist provided the ratio between \(\hat{F}\) and \(\sigma _{min}(iA(\vec {\xi }) + K)^{-1}\) is bounded in \(L^2.\) This condition is similar to the condition (40) found when studying the Poisson problem on \({\mathbb {R}}^d.\)
Theorem 2
Let \(m,d\in {\mathbb {N}}^*.\)
Let \(A_1,\ldots , A_d,K\) be \(m\times m\) real matrices and \(F\in \left( L^2({\mathbb {R}}^d) \right) ^m\) such that
Then the system (1) admits a unique solution in \(\left( L^2({\mathbb {R}}^d) \right) ^m.\)
Proof
Since \(\left\| \left( i A(\vec {\xi })+K\right) ^{-1}\hat{F}\right\| \le \frac{\Vert \hat{F}\Vert }{\sigma _{\min }\left( iA(\vec {\xi }) + K \right) },\) (3) implies \(\left( i A(\vec {\xi })+K\right) ^{-1}\hat{F}\in \left( L^2({\mathbb {R}}^d) \right) ^m.\) Hence Theorem 1 yields the existence of a unique solution to (1) in \(\left( L^2({\mathbb {R}}^d) \right) ^m.\) \(\square \)
The following corollary shows that Theorem 2 generalizes the well-known results about Friedrichs’ systems (see [2, Section 5.2]) to the cases where \(K \ge 0.\)
Corollary 1
(Case of Friedrichs’ systems) Assume \(A_k, k=1,\ldots , d\) and K are symmetric matrices, and that \(K > 0.\) Then for any \(F\in \left( L^2({\mathbb {R}}^d) \right) ^m,\) the system (1) admits a unique solution U in \(\left( L^2({\mathbb {R}}^d) \right) ^m.\)
Proof
We prove that \(\sigma _{min}( iA(\vec {\xi })+K)^{-1}\in L^\infty ({\mathbb {R}}^d)\) and then apply Theorem 2.
Since \(A_k, k=1,\ldots , d\) and K are symmetric matrices, they are diagonalizable with real eigenvalues in an orthonormal basis of \({\mathbb {R}}^d\) and therefore
from which we deduce
Since
we deduce
and finally since \(K> 0\) implies \(\lambda _{min}(K)>0\) we deduce
Hence \(\sigma _{min}( iA(\vec {\xi })+K)^{-1}\in L^\infty ({\mathbb {R}}^d).\) As a consequence for any \(F\in \left( L^2({\mathbb {R}}^d) \right) ^m,\) \(\Vert ( iA(\vec {\xi })+K)^{-1}\hat{F}\Vert _2\le \sigma _{min}( iA(\vec {\xi })+K)^{-1}\Vert \hat{F}\Vert _2\in L^2({\mathbb {R}}^d)\) and Theorem 1 yields the existence of a unique solution U in \(\left( L^2({\mathbb {R}}^d) \right) ^m\) to the system (1). \(\square \)
Corollary 1 cannot be used for first-order systems with \(K\ngtr 0.\) An example is given with the first order reduction of the Poisson equation in Sect. 4.1, where the more general Theorem 1 is used instead of Corollary 1.
3 The case of a bounded \(\Omega \)
In this section, \(\Omega \) is assumed bounded with a Lipschitz boundary. As \(\Omega \ne {\mathbb {R}}^d,\) we can no longer make straightforward use of the Fourier transform to obtain a linear algebraic system. Also in this case of bounded \(\Omega ,\) boundary conditions are fundamental for solutions’ existence and uniqueness.
A particular feature of bounded domains \(\Omega \) is the existence of a Poincaré inequality, which allows for the coercivity of bilinear forms arising from weak formulations. In some simple cases such as the Poisson equation, Poincaré inequality enables the use of the Lax-Milgram theorem to derive the existence of a unique weak solution. However, in the case of first-order elliptic systems, existence and uniqueness are more tricky to prove. The classical theory [1, 13,14,15] of Friedrichs’ systems makes some assumptions such as \(K> 0\) that enable the use of the Lax–Milgram theorem to derive existence results (see for instance theorem 5.4 in [2]).
However, the assumption \(K> 0\) is a serious obstacle to the analysis of linear symmetric hyperbolic systems in stationary regimes since these do not necessarily include a non-zero friction operator K (see for instance conservation laws in [7]). Another interesting example is the reduction of the Poisson equation to a first-order system with \(K\ngtr 0\) (see Sect. 4). Giving up the assumption \(K>0\) yields a loss of coercivity and we can no longer use the Lax-Milgram theorem. Therefore, Theorem 5 and Corollary 2 use a complex analog of the more general Banach–Nečas–Babuška theorem (Sect. 3.5) to obtain the existence and uniqueness of a solution in a setting that encompasses both Friedrichs’ systems and the first order reduction of the Poisson problem. The techniques used to prove the classical inf-sup conditions are inspired by harmonic analysis arguments that are consistent with the case \(\Omega = {\mathbb {R}}^d.\)
We consider a first order system of \(m \in {\mathbb {N}}^{*}\) partial differential equations on \(\Omega \subset {\mathbb {R}}^d\) taking the form
where \(U \in \left( L^{2}(\Omega )\right) ^m \) is the unknown and \(F \in \left( L^{2}(\Omega )\right) ^m \) the source term. The function \(P_{\partial \Omega }\) is a symmetric projection matrix depending on \(\vec {\xi }_x,\) the outward normal vector to \(\partial \Omega \) at \(x \in \partial \Omega .\)
3.1 Trace operator
The assumption that \(\Omega \) is an open bounded domain with Lipschitz boundary \(\partial \Omega \) allows for the existence of a trace operator \( u \rightarrow u_{\mid \partial \Omega }\) for functions \(u \in H^1(\Omega ).\) The trace operator enables us to retrieve the boundary values of any \(u \in H^1(\Omega )\) (see comment 7 of chapter 9 in [16]) and to set boundary conditions.
Thanks to this operator, it is also possible to state the Green–Ostrogradski formula for \(f \in H^1(\Omega ,{\mathbb {C}}), \vec {g} \in \left( H^1(\Omega ,{\mathbb {C}})\right) ^d:\)
3.2 Boundary conditions
Unlike scalar elliptic equations, we cannot impose a global boundary condition \(U = U_b\) on \(\partial \Omega .\) Instead we can impose a condition on some components of U on some parts of \(\partial \Omega ,\) as for instance: \(u_1 = 0 \) on \(\partial \Omega _1 \subset \partial \Omega .\)
In order to be able to impose such conditions on the boundary we assume the existence of a symmetric matrix-valued function \(P_{\partial \Omega } : {\mathbb {R}}^d \rightarrow \mathcal {M}_{m}({\mathbb {R}})\) such that \(P_{\partial \Omega }\) is an orthogonal projector:
The boundary condition (5) can be equivalently formulated as \(U \in \ker P_{\partial \Omega }\) or \(U \in \textrm{Im}({\mathbb {I}} - P_{\partial \Omega }).\) We are thus led to seek a solution to (5) in the space
3.3 Weak formulation
We define the \({\mathbb {C}}\)-valued bilinear form a on \(H^1_{P_{\partial \Omega }}(\Omega , {\mathbb {C}}) \times \left( {L^2}(\Omega , {\mathbb {C}}) \right) ^m\) as
U and V are taken \({\mathbb {C}}\)-valued because harmonic analysis tools will be required later in our approach.
The weak formulation of problem (4), is to find \(U \in H^1_{P_{\partial \Omega }}(\Omega ,{\mathbb {C}})\) such that
We will then prove that the solution U is indeed in \(H^1_{P_{\partial \Omega }}(\Omega ,{\mathbb {R}})\) provided F is real valued (see Remark 2 later).
3.4 The first order operator \({\mathcal {W}}_{A,K}\)
In the sequel, we study the first order operator \(U \mapsto \sum _{k=1}^d A_k\partial _kU + K U\) by means of Fourier analysis. Since \(\Omega \) is bounded, \(L^2(\Omega )\subset L^1(\Omega )\) and we can use the expression of the \(L^1\)-Fourier transform on \({\mathbb {R}}^d\) to define \(\hat{U}\) by extending U by zero outside \(\Omega .\) We therefore define the Fourier transform \({\mathcal {F}}\) of a function \(U\in \left( L^2(\Omega )\right) ^m\) as
Similarly, the inverse Fourier transform is defined for \(V\in \left( L^2({\mathbb {R}}^d,{\mathbb {C}})\right) ^m\) as the limit of the Fourier transform on \(L^2\cap L^1\):
We start by defining the following operator
In order to find another expression for \({\mathcal {W}}_{A,K},\) we can use the Green–Ostrogradski formula (6) expressed as follows: for \(U \in \left( H^1(\Omega ,{\mathbb {R}})\right) ^m\) and \(v \in H^1(\Omega ,{\mathbb {R}})\)
with the matrix \( {\tilde{\textrm{F}}} = (A_1U, \ldots ,A_dU).\) For a given \(\vec {\xi } \in {\mathbb {R}}^d,\) we consider \(v_{\xi }(\vec {x}) =e^{-i\vec {x}\cdot \vec {\xi }}\) and obtain
As was the case with \(\Omega ={\mathbb {R}}^d,\) we see with Eq. (13) for a bounded set \(\Omega \subset {\mathbb {R}}^d\) that the Fourier transform of \(\sum _{k=1}^d A_k\partial _kU + K U\) displays the symbol \((iA(\vec {\xi }) + K)\hat{U}(\vec {\xi })\) of the operator \(\sum _{k=1}^d A_k\partial _kU + K U.\) However, unlike the case with \(\Omega ={\mathbb {R}}^d,\) we see with Eq. (13) an extra contribution arising from the boundary of \(\Omega .\)
We first give in Lemma 1 an extension of the Poincaré inequality to first-order systems, that will allow us to derive the first complex BNB condition (Lemma 4).
Lemma 1
(Generalised Poincaré inequality) Assume that \({\mathcal {W}}_{A,K}\) is injective. Then
Proof
The result follows from the open mapping theorem applied to the operator \({\mathcal {W}}_{A,K}.\) Indeed
-
\({\mathcal {W}}_{A,K}\) is a linear operator (see Eq. 12)
-
\({\mathcal {W}}_{A,K}\) is a bounded operator since Parseval’s identity yields
$$\begin{aligned} \Vert {\mathcal {W}}_{A,K}(U)\Vert _{(L^2({\mathbb {R}}^d))^m}= & {} \Vert {\mathcal {F}}^{-1}({\mathcal {W}}_{A,K}(U)) \Vert _{(L^2({\mathbb {R}}^d))^m}\nonumber \\= & {} \left\| \sum _{k=1}^d A_k\partial _kU + K U\right\| _{(L^2(\Omega ))^m}\nonumber \\\le & {} \sup \left\{ \vert \vert \vert K\vert \vert \vert , \vert \vert \vert A_1\vert \vert \vert ,\ldots , \vert \vert \vert A_d\vert \vert \vert \right\} \Vert U\Vert _{(H^1(\Omega ))^m}.\nonumber \\ \end{aligned}$$(14)
Therefore assuming that \({\mathcal {W}}_{A,K}\) is injective, \({\mathcal {W}}_{A,K}^{-1}\) is surjective from \({\mathcal {W}}_{A,K}((H^1(\Omega ))^m)\subset (L^2({\mathbb {R}}^d,{\mathbb {C}}))^m\) to \((H^1(\Omega ))^m.\) The open mapping theorem implies that \({\mathcal {W}}_{A,K}^{-1}\) is a continuous linear map, hence the result. \(\square \)
Remark 1
(Connection with Poincaré inequality) We called Lemma 1 “Generalised Poincaré inequality” because using Parseval equality it yields
We recover Poincaré inequality if \(m=d, K =0,\;\; A_k = \vec {e}_k \otimes \vec {e}_k\) where \(\vec {e}_k, k = 1,\ldots ,d\) are the vectors forming the canonical basis of \({\mathbb {R}}^d.\)
Note that the boundary conditions required for the classical Poincaré inequality are hidden behind the assumption that \({\mathcal {W}}_{A,K}\) be injective
3.5 The complex BNB theorem
Considering two Banach spaces V, W and a bilinear form \(a \in {\mathcal {L}}(V\times W',{\mathbb {R}}),\) the classical BNB conditions [2, Theorem 2.6]
are deduced from the characterisation of bijective Banach operators (see [2, Theorem A.43]):
Theorem 3
(Characterisation of bijective operators) Let V, W be two Banach spaces and \(A \in {\mathcal {L}}(V; W)\) a continuous linear operator.
A is bijective if and only if
-
1.
\(A^T : W'\rightarrow V'\) is injective and
-
2.
there exists \(\alpha > 0\) such that
$$\begin{aligned} \forall v\in V,\quad \Vert Av\Vert _W\ge \alpha \Vert v\Vert _V. \end{aligned}$$(17)
Then the classical BNB conditions are obtained under the assumption that W is reflexive and that \(A \in {\mathcal {L}}(V; W)\) is associated with a real-valued bilinear form \(a \in {\mathcal {L}}(V\times W',{\mathbb {R}})\) such that \(<Av,w>_{W,W'}=a(v,w)\) (see [2, Corollary A.46]).
The first BNB condition (15) is therefore a direct consequence of the condition (17) in the real case where \(a \in {\mathcal {L}}(V\times W',{\mathbb {R}}).\)
In the complex case, we must assume instead that \(A \in {\mathcal {L}}(V; W)\) is associated with a complex-valued bilinear form \(a \in {\mathcal {L}}(V\times W^\natural ,{\mathbb {C}})\) such that \(<Av,w>_{W,W^\natural }=a(v,w),\) where \(W^\natural \) is the space of complex-valued linear applications on W.
The analog of [2, Corollary A.46] in the complex case now follows.
Theorem 4
(The complex BNB conditions) We assume that W is reflexive and that \(A \in {\mathcal {L}}(V; W)\) is associated with a bilinear form \(a \in {\mathcal {L}}(V\times W^\natural ,{\mathbb {C}})\) such that \(\langle Av,w\rangle _{W,W^\natural }=a(v,w).\)
The following statements are equivalent :
-
1.
For all \(f\in W,\) there is a unique \(u \in V\) such that \(a(u, w)=\langle f,w\rangle _{W,W^\natural }\) for all \(w\in W^\natural .\)
-
2.
There is \(\alpha >0\) such that
$$\begin{aligned}&\inf _{v \in V} \sup _{ w \in W^\natural } \frac{\mid a(v,w) \mid }{\Vert v\Vert _V \Vert w\Vert _{W^\natural }}\ge \alpha , \end{aligned}$$(18)$$\begin{aligned}&\forall w \in W^\natural ,\quad (\forall v \in V,\; a(v,w) = 0) \Rightarrow w = 0. \end{aligned}$$(19)
The proof of Theorem 4 will require the following lemma.
Lemma 2
Let V be a normed vector space.
Lemma 2 is a complex analog of [2, Corollary A.17], which is a consequence of the Hahn-Banach theorem. Lemma 2 therefore relies on a complex version of the Hahn-Banach theorem that can be found in [17, Theorem 3.3]. A proof of Lemma 2 can indeed be found in the corollary following [17, Theorem 3.3].
Proof of Theorem 4
The first statement means that A is bijective, which thanks to Theorem 3 is equivalent to the two conditions
-
1.
$$\begin{aligned} \forall w \in W^\natural ,\quad (\forall v \in V,\; a(v,w) = 0) \Rightarrow w = 0, \end{aligned}$$(21)
which is identical to (19).
-
2.
$$\begin{aligned} \exists \alpha > 0,\forall v\in V,\quad \Vert Av\Vert _W \ge \alpha \Vert v\Vert _V, \end{aligned}$$(22)
which is equivalent thanks to Lemma 2 to
$$\begin{aligned} \exists \alpha > 0,\forall v\in V,\quad \sup _{w\in W^\natural } \frac{|\langle Av,w\rangle |_{W,W^\natural }}{\Vert w\Vert _{W^\natural }} \ge \alpha \Vert v\Vert _V. \end{aligned}$$(23)Taking the infimum over v in (23) yields the equivalence with (18).
\(\square \)
3.6 The main result
In the classical study of first-order Friedrichs systems, the assumption \(K>0\) yields the coercivity and the Lax–Milgram theorem [2, lemma 2.2] yields well-posedness (see assumption F2 in [2, theorem 5.7]). Unfortunately, when \(K\ngtr 0\) we do not have coercivity. We can prove the well-posedness thanks to a complex analog of the BNB theorem (Theorem 4 in Sect. 3.5), which has weaker assumptions than the Lax–Milgram theorem.
In order to use the complex BNB theorem (Theorem 4 in Sect. 3.5) and obtain the existence of a solution to the weak formulation (9), we need to prove the following two conditions:
Since we do not have the coercivity property \(a(U,U)\ge \gamma \Vert U\Vert {_{H_1}^2},\) we are going to find an injective function \({\mathcal {W}}(U)\) such that \(a(U,{\mathcal {W}}(U))\ge \gamma \Vert U\Vert {_{H_1}^2}.\) The operator \({\mathcal {W}}(U) ={\mathcal {F}}^{-1}({\mathcal {W}}_{A,K}(U))=\sum _{k=1}^d A_k\partial _kU + K U\) with \({\mathcal {W}}_{A,K}(U)\) defined in (13) is our candidate. The following lemma connects the bilinear form a to \({\mathcal {W}},\) and will play an important role in proving the first BNB condition (24).
Lemma 3
(Connecting a and \({\mathcal {W}}).\) For any \(U \in H^1_{P_{\partial \Omega }}(\Omega ),\) we define
Then \({\mathcal {W}}(U)\in \left( L^2(\Omega ,{\mathbb {C}})\right) ^m\) and we have
Proof
\({\mathcal {W}}(U)\in \left( L^2(\Omega ,{\mathbb {C}})\right) ^m\) is a consequence of \(U\in \left( H^1(\Omega ,{\mathbb {C}})\right) ^m.\)
From the definition of a (Eq. 8) we have
and the result follows from Parseval’s identity:
\(\square \)
Thanks to Lemmas 1 and 3 we have the following lemma on the first complex BNB condition.
Lemma 4
(First complex BNB condition) Assume that \({\mathcal {W}}_{A,K}\) is injective on \(H^1_{P_{\partial \Omega }}(\Omega ).\) Then the first complex BNB condition (24) is true :
Proof
Starting from
Lemma 3 and the Parseval’s identity yield
Now the Generalised Poincaré identity (Lemma 1) yields the result. \(\square \)
The following theorem proves that the invertibility of \(iA(\vec {\xi }) + K\) yields the second complex BNB condition (25).
Lemma 5
(Second complex BNB condition) Assume that \(iA(\vec {\xi }) + K\) is invertible for almost every \(\vec {\xi }.\) Then the second complex BNB condition (25) is true :
Proof
Let \(V \in \left( {L^2}(\Omega , {\mathbb {C}})\right) ^m\) such that
We define
Since \({e^{i\vec {x}\cdot \vec {\xi }}W_V(\vec {\xi })} \notin H^1_{P_{\partial \Omega }},\) we cannot write “\(a({e^{i\vec {x}\cdot \vec {\xi }}W_V(\vec {\xi })}, V) = 0 = \Vert W_V\Vert ^{2}\)” to deduce \(V=0.\) Hence we are first going to approximate \({e^{i\vec {x}\cdot \vec {\xi }}W_V(\vec {\xi })}\) with functions in \(\left( {\mathcal {C}}^{\infty }_K(\Omega )\right) ^m \subset H^1_{P_{\partial \Omega }},\) (where \({\mathcal {C}}_K^{\infty }\) is the space of smooth functions with compact support in \(\Omega )\), and then pass to the limit.
\({\mathcal {C}}^{\infty }_K(\Omega )\) is dense in \(H^1(\Omega )\) (see [16, Corollary 9.8]). For any \(\vec {\xi } \in {\mathbb {R}}^m,\) let a sequence \(\Phi _n \in \left( {\mathcal {C}}^{\infty }_K(\Omega )\right) ^m, n\in {\mathbb {N}}\) such that \(\Phi _n(\cdot ,\vec {\xi }) \rightarrow e^{i\vec {x}\cdot \vec {\xi }}\) in \(H^1_{P_{\partial \Omega }}.\) The function \(\Phi _n(\cdot ,\vec {\xi })W_V(\vec {\xi })\) is in \(H^1_{P_{\partial \Omega }}\) so we can compute
Passing to the limit, since \(\Phi _n(\cdot ,\vec {\xi }) \rightarrow e^{i\vec {x}\cdot \vec {\xi }}\) in \(H^1_{P_{\partial \Omega }}\) we obtain
Since \(a(U,V) = 0,\) we deduce
Hence if \(iA(\vec {\xi }) + K \) is invertible for almost every \(\vec {\xi },\) so is \({}^t(-iA(\vec {\xi }) + K) ={}^t\overline{(iA(\vec {\xi }) + K)},\) and we can deduce that \(V = 0\) almost everywhere. \(\square \)
We can now prove our main result Theorem 5.
Theorem 5
(Well-posedness of first-order systems) Let \(d,m\in {\mathbb {N}}^*.\)
Let \(A_k, k=1,\ldots d,\) \(P_{\partial \Omega },\) and K be \(m\times m\) real matrices. Let \(\Omega \) be a bounded open Lipschitz subset of \({\mathbb {R}}^d.\) Assume that
-
\({\mathcal {W}}_{A,K}\) (Eqs. (12) and (13)) is injective on \(H^1_{P_{\partial \Omega }}(\Omega ).\)
-
\(iA(\vec {\xi }) + K\) is invertible for almost every \(\vec {\xi }.\)
Then there exists a unique solution \(U \in H^1_{P_{\partial \Omega }}(\Omega ,{\mathbb {C}})\) to the variational problem (9) and
Proof
The theorem is a direct consequence of the complex BNB theorem (Theorem 4 in Sect. 3.5) with \(W=H^1_{P_{\partial \Omega }}(\Omega ,{\mathbb {C}}), V= \left( H^1(\Omega , {\mathbb {C}})\right) ^m.\) The constant \(\gamma \) is the inverse of the inf-sup constant from the first complex BNB condition. In our case, it follows from the open mapping theorem (see Lemma 1). \(\square \)
Remark 2
(On the real character of the solutions) Theorems 5 and 7 give the existence of a complex-valued function \(U\in \left( H^1(\Omega , {\mathbb {C}}) \right) ^m.\) Their proofs do not require real matrices \(A_k,k=1,\ldots ,d,\) \(P_{\partial \Omega },\) K nor a real vector F and could be extended to complex coefficient matrices. However the assumption of real matrices \(A_k,k=1,\ldots ,d,\) \(P_{\partial \Omega },\) K and a real vector F implies \(U\in \left( H^1(\Omega ,{\mathbb {R}})\right) ^m\) since otherwise \(\bar{U}\) would be another distinct solution.
In the sequel, we investigate the connections between the matrices A, K and \(P_{\partial \Omega }\) in order to satisfy the first complex BNB condition (24), and deduce the well-posedness of the system. For simplicity, we consider only symmetric systems. Since we can always multiply Eq. (4) by sign(K), we assume in the following corollaries that \(K \ge 0.\)
Corollary 2
(Boundary compatibility condition, symmetric case) Let \(\Omega \) be a bounded set with a Lipschitz boundary, and \(A_k, k = 1,\ldots , d,\) K, \(P_{\partial \Omega }\) be symmetric matrices with \(K \ge 0.\)
Assume that
-
\(iA(\vec {\xi }) + K\) is invertible for almost every \(\vec {\xi } \in {\mathbb {R}}^d\)
-
$$\begin{aligned} \forall \vec {\xi } \in {\mathbb {R}}^d, \;\left( {\mathbb {I}}_m - P_{\partial \Omega }(\vec {\xi }) \right) A(\vec {\xi }) \left( {\mathbb {I}}_m - P_{\partial \Omega }(\vec {\xi })\right) =0 \end{aligned}$$(29)
-
$$\begin{aligned} \forall \vec {\xi } \in {\mathbb {R}}^d, \;\ker K \cap \ker P_{\partial \Omega }(\vec {\xi }) =\{ 0 \} . \end{aligned}$$(30)
Then there exists a unique solution \(U \in H^1_{P_{\partial \Omega }}(\Omega ,{\mathbb {R}})\) to the variational problem (9) and
Proof
Let \(U \in \ker {\mathcal {W}}_{A,K}.\) We are going to prove that \(U=0\) and deduce that \({\mathcal {W}}_{A,K}\) is injective. From Eq. (13) we have
Hence taking the inner product with \(\hat{U}(\vec {\xi })\) yields for a.e. \(\vec {\xi } \in {\mathbb {R}}^{d},\)
after integration with respect to \(\vec {\xi }\) on the ball \(B(0,n),\, n\in {\mathbb {N}}\), we obtain
and Fubini theorem yields
Taking the limit \(n\rightarrow \infty ,\) since \(U\in H^1(\Omega )\) we find
We recall that we seek U in the space \(H^1_{P_{\partial \Omega }}(\Omega ,{\mathbb {R}})\) and as such, the boundary condition (5) is equivalent to \(U_{\mid \partial \Omega } \in \textrm{Im}({\mathbb {I}} - P_{\partial \Omega })\) \((P_{\partial \Omega }\) is a projector). As a consequence, the assumption (29) yields \(\;^t\bar{U}(\vec {x})A(\vec {\xi }_{\vec {x}})U(\vec {x})=0\) on \(\partial \Omega .\)
Hence
Now since we assumed that \(A(\vec {\xi })\) and K are symmetric matrices, \(\;^t\bar{\hat{U}}A\hat{U} \in {\mathbb {R}}\) and \({}^t\bar{\hat{U}}K\hat{U}\in {\mathbb {R}}.\) (34) thus implies
Hence since \(K \ge 0\) we have
which in turn implies that the \(L^2\)-Fourier transform of \(\hat{U}\) is in the kernel of K:
We recall that \(\hat{U}\) does not necessarily belong to \(L^1,\) hence we cannot use the \(L^1\) formula for the Fourier transform. Instead, the \(L^2\) Fourier transform is obtained as a limit using the density of \(L^1\cap L^2\) in \(L^2.\)
Now the boundary condition is \(U(x) \in \ker P_{\partial \Omega }(\vec {\xi }_x)\) for x on the boundary so
Using the boundary assumption (30), U(x) is therefore 0 on the boundary. Hence the right-hand side of Eq. (32) vanishes and
Now from the first assumption, \(iA(\vec {\xi }) + K\) is invertible and we conclude that \(U = 0.\)
Hence \({\mathcal {W}}_{A,K}\) is injective and from Theorem 5 we deduce the existence of a unique solution \(U \in H^1_{P_{\partial \Omega }}(\Omega ,{\mathbb {C}}).\) Now since the matrices \(A_k, k = 1,\ldots , d,\) K, \(P_{\partial \Omega }\) and the vector F are real, \(\bar{U}\) is also solution of (9). Uniqueness yields \(\bar{U}=U\) and therefore U is real valued. \(\square \)
4 The first order reduction of the Poisson equation
In this section, we illustrate the reduction of the second-order Poisson equation to a first-order system. This first-order reduction yields a symmetric first-order system with \(K\ge 0\) but \(K\ngtr 0,\) and therefore classical results on Friedrichs systems do not apply. We start with \(\Omega ={\mathbb {R}}^d\) in Sect. 4.1 and then on a bounded \(\Omega \) endowed with a boundary condition in Sect. 4.2.
Let \(\Omega \) be an open subset of \({\mathbb {R}}^d.\) The first-order reduction of the Poisson problem
amounts to defining \(\vec {v} =\vec {\nabla } u\) and solving the first-order symmetric system that takes the form (1)
We introduce the following symmetric matrices, unknown and right-hand side vectors
4.1 The case \(\Omega = {\mathbb {R}}^d\)
In this case the solution is sought for in \(\left( L^2({\mathbb {R}}^d)\right) ^{d + 1}\) using Fourier transform. As \(\partial {\mathbb {R}}^d=\emptyset ,\) there is no need to impose boundary conditions. We cannot use Corollary 1 for the well-posedness of the system (36) since \(K\ngtr 0.\) Theorem 1 should be used instead.
Corollary 3
(Existence for first order Poisson system—\(\Omega ={\mathbb {R}}^d\)) Let \(d\in {\mathbb {N}}^*,\) \(f\in L^2({\mathbb {R}}^d)\) such that
Let the matrices \(A_{poisson}(\vec {\xi }), K_{poisson}\) and \(F_{poisson}(\vec {\xi })\) defined in (37). Then the system (36) admits a unique solution \((u,\vec {v})\in \left( L^2({\mathbb {R}}^d)\right) ^{d+1}.\)
Proof
Since \(\left( i A_{Poisson}(\vec {\xi })+K_{Poisson}\right) ^{-1} \hat{F}_{Poisson}(\vec {\xi }) = \frac{\hat{f}(\vec {\xi })}{\Vert \vec {\xi }\Vert ^2} \left( \begin{array}{c} 1 \\ -i\vec {\xi } \end{array}\right) ,\) according to Theorem 1, the assumptions (39) yield the existence of a unique solution in \(L^2({\mathbb {R}}^d)\times \left( L^2({\mathbb {R}}^d)\right) ^d\) (actually \(H^1({\mathbb {R}}^d)\times \left( L^2({\mathbb {R}}^d)\right) ^d).\) \(\square \)
The assumption (39) on f in Corollary 3 is to be compared with assumption (40) when the original second order problem is solved in \(L^2.\)
Theorem 6
(Existence for the scalar Poisson equation) Let \(d\in {\mathbb {N}}^*,\) \(f\in L^2({\mathbb {R}}^d)\) such that
Then there exists a unique \(u\in L^2({\mathbb {R}}^d)\) such that \(-\Delta u =f\) in the weak (distributional) sense.
Theorem 3 has more stringent assumptions than Theorem 6. Indeed the first order reduction require that \(v=\vec {\nabla } u\in \left( L^2({\mathbb {R}}^d)\right) ^d,\) hence the solution u given in Corollary 3 is actually in \(H^1({\mathbb {R}}^d).\) The first order reduction is not able to represent very weak solutions \(u\notin H^1({\mathbb {R}}^d).\) However, this should not be a serious issue since \(H^1({\mathbb {R}}^d)\) is the usual solution space in classical variational formulations.
4.2 The case of bounded domain \(\Omega \subset {\mathbb {R}}^d\)
We consider the Poisson equation (35) with the Dirichlet boundary condition
The Dirichlet boundary condition \(u=0\) can be enforced on the first order system (36) as \(P_{\partial \Omega }U=0\) using a constant Dirichlet boundary operator defined as follows
The following theorem gives an existence result in \(H^1_{P_{\partial \Omega }}\) for the first order reduction of the Poisson equation (36–37) with the Dirichlet boundary condition (42). It is based on the variational formulation (9) and the use of the complex BNB theorem (Theorem 4 in Sect. 3.5) through Corollary 2.
Theorem 7
(Existence for the first order Poisson—\(\Omega \) bounded) Let \(d,m\in {\mathbb {N}}^*,\) let \(A_{Poisson},K_{Poisson}\) be \(m\times m\) real matrices defined in (37) and boundary operator \(P^{Dirichlet}_{\partial \Omega }\) defined in (42). Let \(\Omega \) be a bounded open Lipschitz subset of \({\mathbb {R}}^d.\)
Then there exists a unique weak solution \(U \in H^1_{P^{Dirichlet}_{\partial \Omega }}(\Omega ,{\mathbb {R}})\) to the variational problem (9) and
Proof
We check that the assumptions of Corollary 2 are satisfied to deduce the theorem.
For the first condition, we need to prove that \(iA_{Poisson}(\vec {\xi }) + K_{Poisson}\) is invertible. We compute the determinant using the cofactor expansion along the second column:
By an induction argument we obtain
Hence we obtain for a.e. \(\vec {\xi }\in {\mathbb {R}}^d,\, \det (iA_{Poisson}(\vec {\xi }) + K_{Poisson}) \ne 0,\) thus the first condition of Corollary 2 is met.
We now check the second condition (Eq. 29) using the definition of the Dirichlet boundary operator (42)
Hence the second compatibility condition in Corollary 2 is true.
We now check the third condition (Eq. 30): \(\forall \vec {\xi } \in {\mathbb {R}}^d,\)
hence \(\ker K \cap \ker P_{\partial \Omega }^{Dirichlet}(\vec {\xi }) = \{0\}\) and the third condition in Corollary 2 is satisfied. \(\square \)
5 Conclusion and perspectives
First-order systems with \(K\ngtr 0\) are of particular interest in many applications. We studied the stationary first-order systems for \(\Omega = {\mathbb {R}}^d\) and \(\Omega \subset {\mathbb {R}}^d\) bounded using tools derived from harmonic and functional analysis. In the case \(\Omega = {\mathbb {R}}^d,\) Fourier analysis (Sect. 2) showed that the assumption \(K > 0\) is not necessary but only sufficient for the existence and uniqueness of solutions. On bounded domains (Sect. 3), we therefore, looked for an alternative approach to the classical Friedrichs’ theory that assumes \(K>0.\) We used a complex analog of the more general Banach–Nečas–Babuška theorem (Sect. 3.5) to obtain the existence and uniqueness of a solution in a setting that encompasses both Friedrichs’ systems and the first order reduction of the Poisson problem on \(\Omega \subset {\mathbb {R}}^d\) (Theorem 5). Indeed as shown in Sect. 4, the assumption \(K>0\) is violated by first-order reduction of the Poisson problem, where the kernel of K is not trivial. We however obtained an existence result (Theorem 7) thanks to our new approach. This paper lays the ground for the numerical analysis of stationary hyperbolic problems where we expect to use discrete analogs of our new approach to design convergent numerical approximations.
Data availability
Data availability is not applicable to this article as no new data were created or analysed in this study.
References
Friedrichs, K.O.: Symmetric positive linear differential equations. Commun. Pure Appl. Math. 11, 333–418 (1958)
Ern, A., Guermond, J.-L.: Theory and Practice of Finite Elements, vol. 159. Springer, New York (2004)
Bouchut, F., Ounaissa, H., Perthame, B.: Upwinding of the source term at interfaces for Euler equations with high friction. Comput. Math. Appl. 53(3–4), 361–375 (2007)
Audusse, E., Minh, H.D., Omnes, P., Penel, Y.: Analysis of a modified Godunov scheme for the linear wave equation with Coriolis source term on cartesian meshes. J. Comput. Phys. 373, 91–129 (2018)
Godlewski, E., Raviart, P.A.: Numerical Approximation of Hyperbolic Systems of Conservation Laws. Applied Mathematical Sciences, vol. 118. Springer, New York (1996)
Eymard, R., Gallouët, T., Herbin, R.: Discretization of heterogeneous and anisotropic diffusion problems on general nonconforming meshes sushi: a scheme using stabilization and hybrid interfaces. IMA J. Numer. Anal. 30(4), 1009–1043 (2010)
Serre, D.: Systems of Conservation Laws 1: Hyperbolicity, Entropies, Shock Waves. Cambridge University Press, Cambridge (1999)
Bochev, P.B., Gunzburger, M.D.: Least-Squares Finite Element Methods. Springer, Berlin (2006)
Cosner, C.: On the definition of ellipticity for systems of partial differential equations. J. Math. Anal. Appl. 158(1), 80–93 (1991)
Rus, I.A.: Maximum principles for elliptic systems and the problem of the minimum matrix norm of a characteristic matrix, revisited. Stud. Univ. Babes-Bolyai Math. 58(2), 199–211 (2013)
Krupchyk, K., Seiler, W.M., Tuomela, J.: Overdetermined elliptic systems. Found. Comput. Math. 6(3), 309–351 (2006)
Agmon, S., Douglis, A., Nirenberg, L.: Estimates near the boundary for solutions of elliptic partial differential equations satisfying general boundary conditions II. Commun. Pure Appl. Math. 17(1), 35–92 (1964)
Ern, A., Guermond, J.-L., Caplain, G.: An intrinsic criterion for the bijectivity of Hilbert operators related to Friedrich’ systems. Commun. Partial Differ. Equ. 32(2), 317–341 (2007)
Ern, A., Guermond, J.-L.: Discontinuous Galerkin methods for Friedrichs’ systems. I. General theory. SIAM J. Numer. Anal. 44(2), 753–778 (2006)
Antonić, N., Burazin, K., Crnjac, I., Erceg, M.: Complex Friedrichs systems and applications. J. Math. Phys. 58(10), 101508 (2017)
Brezis, H.: Functional Analysis, Sobolev Spaces and PDEs. Universitext. Springer, Berlin (2010)
Rudin, W.: Functional Analysis. International Series in Pure and Applied Mathematics, vol. 8. McGraw-Hill Science/Engineering/Math., New York (1991)
Acknowledgements
All authors whose names appear on the submission: made substantial contributions to the conception of the work; drafted the work and revised it critically for important intellectual content; approved the version to be published; and agrees to be accountable for all aspects of the work in ensuring that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. We also declare that this paper is not published and is not under consideration for publication elsewhere.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Ndjinga, M., Ngwamou, S.K. On first order elliptic systems. J Elliptic Parabol Equ 10, 711–730 (2024). https://doi.org/10.1007/s41808-024-00274-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41808-024-00274-8