Abstract
Whereas many different models exist in mathematics and physics for the ground states of non-relativistic crystals, the relativistic case has been much less studied, and we are not aware of any mathematical result on a fully relativistic treatment of crystals. In this paper, we introduce a mean-field relativistic energy for crystals in terms of periodic density matrices. This model is inspired both from a recent definition of the Dirac–Fock ground state for atoms and molecules, due to one of us, and from the non-relativistic Hartree–Fock model for crystals. We prove the existence of a ground state when the number of electrons per cell is not too large.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
For solids with heavy atoms, relativistic shifts may affect the bonding properties and the optical properties. It is shown in [37] that the yellow color of gold is a result of relativistic effects. Furthermore, by studying the relativistic band structure in solids, it is shown in [11, 12] that the relativistic shifts of the 5d bands relative to the \(s-p\) bands in gold change the main interband edge by more than 1 eV.
A natural way to build quantum models for the crystal phase is to consider the so-called thermodynamic limit of quantum molecular models. Roughly speaking it consists in considering a finite but large piece of an (infinite and neutral) crystal. The thermodynamic law predicts that the ground state energy of the obtained large neutral molecule is proportional to the volume of this finite piece (which turns out to be also proportional to the total number of particles composing the molecule). The energy for the whole crystal is then identified with the limit—if it exists—of the energy per unit volume (or equivalently per particle) of the large molecule when the size of the considered piece goes to infinity. This method was applied successfully by different authors for several well-known models from quantum chemistry [6, 8, 9, 32]—see also [7] for a review—but always for non-relativistic crystals.
The Dirac–Fock model (DF) was introduced in atomic physics by Swirles [41] in 1935. It is widely used in relativistic quantum chemistry, and gives numerical results on atoms and molecules in excellent agreement with experimental data [13, 20, 30]. Its relation with QED was investigated by Mittleman [35]. Mittleman’s approach was studied mathematically in [2,3,4, 18, 34]. To our knowledge the Dirac-Fock model has not been extended to crystals: there exist fully relativistic treatments of crystals in the physics literature, but they use the Kohn–Sham approach (see [15, 28] and the references therein).
The first rigorous existence results for the atomic and molecular Dirac–Fock equations were obtained in [16, 36]. Compared to the non-relativistic models, the situation is different: the existence of bound states has only been proved when the total number of protons does not exceed 124, for the physical value \(\alpha \approx 1/137\) of the fine structure constant. Moreover, the Dirac–Fock energy functional is strongly indefinite and the notion of ground state has to be handled very carefully [16]. These difficulties exclude a thermodynamic limit approach to derive the Dirac–Fock model for crystals.
In [17] it was shown that certain solutions of the (relativistic) Dirac–Fock equations converge towards the energy-minimizing solutions of the (non-relativistic) Hartree–Fock equations when the speed of light tends to infinity. This validates a posteriori the notions of ground state solutions and ground state energy for the Dirac–Fock equations. In the approach of [17], the multi-electronic state is modeled by a Slater determinant of mono-electronic wavefunctions. On the other hand, Huber and Siedentop use a density matrix formulation and a fixed-point iteration to define and construct ground states of the Dirac–Fock model [27]. Unfortunately their assumptions do not cover the physical value of \(\alpha \). Recently, in [39] one of us gave a new definition and an existence proof for the ground state of the Dirac–Fock model in atoms and molecules, under assumptions covering the physical value of \(\alpha \), thanks to a density matrix formulation and a retraction technique combined with a minimization principle. Inspired by this work and by the analysis of the periodic Hartree–Fock model due to Le Bris, Lions, and one of us [8], we propose a definition for the ground state of the Dirac–Fock model for crystals which is a relativistic analogue of Lieb’s variational principle for the Hartree–Fock model [1, 31], and we prove the existence of minimizers. In addition, we show that these minimizers solve a self-consistent equation. Our method can be used to calculate the ground state of neutral crystals with at most 17 electrons per cell. However, some estimates used in this paper are not optimal, and we strongly believe that this limiting bound can be improved.
The minimization problem under consideration in this paper combines several difficulties related to compactness issues. The Dirac operator, hence the Dirac–Fock energy functional, is not bounded from below and the kinetic energy term is of the same order as the Coulomb-type potential energy terms, a standard feature of Coulomb–Dirac–Fock type models. Our proof of existence of minimizers for crystals is neither a straightforward adaptation of the one for atoms and molecules in [39] nor of the one for crystals in Hartree–Fock theory in [8]: a major issue arises from the compactness of the density matrices and of the self-consistent operators in the momentum variable \(\xi \), resulting from the Bloch decomposition of the space. Compactness in the momentum variable is crucial to deal with the (non-linear) exchange term in the DF periodic functional and with the nonlinear constraint ensuring that the electrons lie in the positive spectral subspace of the self-consistent periodic Dirac–Fock operator. Our results rely on a careful analysis of the periodic exchange potential. In passing, we have corrected some wrong estimates on the exchange term in [8] and improved the regularity results therein (see “Appendix B”). Furthermore, we provide an asymptotically optimal constant for the Hardy inequality associated with the periodic Coulomb potential that is new in the literature, as far as we know.
In addition, compared with existing results for crystals’ ground state energy, such as the Hartree–Fock model [8], we provide a new method to prove the existence of minimizers for crystals: based on the spectral analysis of the self-consistent periodic DF operator, we build minimizing sequences that feature both a uniform dependence with respect to the momentum \(\xi \) and a better regularity in the space variables, and we rely on it to improve the relative compactness of subsequences in the periodic energy space.
Before ending this section, let us mention the Bogoliubov–Dirac–Fock (BDF) model proposed by Chaix and Iracane in [10] as an alternative to Dirac–Fock for heavy atoms and molecules, and later studied in a series of mathematical works (see the review paper [25] and references therein, see also the more recent works [21, 22, 24]). Compared with DF, the BDF model has several advantages: the corresponding energy is bounded from below, so the notion of ground state becomes straightforward; vacuum polarization effects are taken into account; the derivation of the model as a mean-field approximation of no-photon QED is more convincing. However the mathematical definition of the BDF energy involves a rather complex functional framework as well as an ultraviolet regularization, and a renormalization procedure is needed to interpret the equations. Thus, in the present work we restrict ourselves to the conceptually simpler DF model, and the study of relativistic crystals in the BDF approximation is left for future research.
2 General Setting of the Model and Main Result
2.1 Preliminaries and Functional Framework
Throughout the paper, we choose units for which \(m=c=\hbar =1\), where m is the mass of the electron, c the speed of light and \(\hbar \) the Planck constant. For the sake of simplicity, we only consider the case of a cubic crystal with a single point-like nucleus per unit cell, which is located at the center of the cell. The reader should however keep in mind that the general case could be handled as well. Let \(\ell >0\) denote the length of the elementary cell \(Q_\ell =(-\frac{\ell }{2},\frac{\ell }{2}]^3\). The nuclei with positive charge z are treated as classical particles with infinite mass that are located at each point of the lattice \(\ell \,{\mathbb {Z}}^3\). The electrons are treated quantum mechanically through a periodic density matrix. The electronic density is modeled by a \(Q_\ell \)-periodic function whose \(L^1\) norm over the elementary cell equals the “number of electrons” q (the electrons’ charge per cell is equal to \(-q\)). When \(q=z\), electrical neutrality per cell is ensured.
In this periodic setting, the \(Q_\ell \)-periodic Coulomb potential \(G_\ell \) resulting from a distribution of point particles of charge 1 that are periodically located at the centers of the cubic cells of the lattice is defined, up to a constant, by
By convention, we choose \(G_\ell \) such that
The function \(G_\ell \) is actually the Green function of the periodic Laplace operator on \(Q_\ell \). The Fourier series of \(G_\ell \) can be written as
Under the convention (2.2), the periodic Coulomb potential changes sign, but is bounded from below (see Lemma A.1 in "Appendix A").
Remark 2.1
The size \(\ell \) of the unit cell does not play a specific role here. It is however involved in the study of the Hardy-type inequalities for the periodic Coulomb potential (see Sect. 4.1). When \(\ell \) goes to infinity, one expects to recover the Dirac–Fock model for atoms.
The free Dirac operator is defined by \(D^0=-i\sum _{{\textrm{r}}=1}^3\alpha _{\textrm{r}}\partial _{\textrm{r}}+\beta \), with \(4\times 4\) complex matrices \(\alpha _1,\alpha _2,\alpha _3\) and \(\beta \), whose standard forms are \( \beta =\begin{pmatrix} \mathbb {1}_2 &{} 0 \\ 0 &{} -\mathbb {1}_2 \end{pmatrix}\), \(\alpha _{\textrm{r}}=\begin{pmatrix} 0&{}\sigma _{\textrm{r}}\\ \sigma _{\textrm{r}}&{}0 \end{pmatrix}\) where \(\mathbb {1}_2\) is the \(2\times 2\) identity matrix and the \(\sigma _{\textrm{r}}\)’s, for \({\textrm{r}}\in \{1,2,3\}\), are the well-known \(2\times 2\) Pauli matrices \( \sigma _1=\begin{pmatrix} 0&{}1\\ 1&{}0 \end{pmatrix},\, \sigma _2=\begin{pmatrix} 0&{}-i\\ i&{}0 \end{pmatrix}\), \(\sigma _3=\begin{pmatrix} 1&{}0\\ 0&{}-1 \end{pmatrix}.\)
The operator \(D^0\) acts on \(4-\)spinors; that is, on functions from \({\mathbb {R}}^3\) to \({\mathbb {C}}^4\). It is self-adjoint in \(L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\), with domain \(H^1({\mathbb {R}}^3; {\mathbb {C}}^4)\) and form domain \(H^{1/2}({\mathbb {R}}^3;{\mathbb {C}}^4)\) (denoted by \(L^2\), \(H^1\) and \(H^{1/2}\) in the following, when there is no ambiguity). Its spectrum is \(\sigma (D^0)=(-\infty ,-1]\cup [+1,+\infty )\). Following the notation in [16, 36], we denote by \(\Lambda ^+\) and \(\Lambda ^-=\mathbb {1}_{L^2}-\Lambda ^+\) respectively the two orthogonal projectors on \(L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\) corresponding to the positive and negative eigenspaces of \(D^0\); that is
According to the Floquet theory [38], the underlying Hilbert space \(L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\) is unitarily equivalent to \(L^2(Q_\ell ^*)\bigotimes L^2(Q_\ell ;{\mathbb {C}}^4)\), where \(Q_\ell ^*=[-\frac{\pi }{\ell },\frac{\pi }{\ell })^3\) is the so-called reciprocal cell of the lattice, with volume \(|Q_\ell ^*|=(2\pi )^3/\ell ^3\) (in the physics literature \(Q_\ell ^*\) is known as the first Brillouin zone). The Floquet unitary transform \(U:L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\rightarrow L^2(Q_\ell ^*)\bigotimes L^2(Q_\ell ;{\mathbb {C}}^4) \) is given by
for every \(\xi \in Q^*_\ell \) and \(\phi \) in \(L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\). For every \(\xi \in Q^*_\ell \), the function \((U \phi )_\xi \) belongs to the space
which will be denoted by \(L^2_\xi \) in the sequel. Functions \(\psi \) of this form are called Bloch waves or \(Q_\ell \)-quasi-periodic functions with quasi-momentum \(\xi \in Q^*_\ell \). They satisfy
For any function \(\phi _\xi \in L^2_\xi \), using the definition of Fourier series expansion for \(Q_\ell \)-periodic functions, we write
with coefficients
The Hilbert space \(L^2_\xi \) is endowed with the norm
Here, and in the whole paper, we use the same notation \(|\cdot |\) for the canonical Euclidean norm in \({\mathbb {R}}^n\), \({\mathbb {C}}^n\) or \( {\mathcal {M}}_n({\mathbb {C}})\). When applied to self-adjoint operators, |T| means the absolute value of T.
For every real number s, we also define
endowed with the norm
To simplify the notation, we simply write here and below \(H_\xi ^s\) when there is no ambiguity.
Operators \({\mathcal {L}}\) on \(L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\) that commute with the translations of \(\ell \,{\mathbb {Z}}^3\) can be decomposed accordingly into a direct integral of operators \({\mathcal {L}}_\xi \) acting on \(L^2_\xi \) and defined by
(see [38] for more details). We use the notation , with the shorthand , to refer to this decomposition. In particular, for the free Dirac operator \(D^0\) we have
where the \(D_\xi \)’s are self-adjoint operators on \(L^2_\xi \) with domains \(H_\xi ^1\) and form-domains \(H_\xi ^{1/2}\). Note that \(D_\xi ^{\,2}=1-\Delta _\xi \), where . For every function \(\phi _\xi \in H^{1}_\xi \), the operator \(D_\xi \) is also defined by
In particular,
For every \(\xi \in Q_\ell ^*\), the positive spectrum of \(D_\xi \) is composed of a non-decreasing sequence of real eigenvalues \((d^+_j(\xi ))_{j\ge 1}\) counted with multiplicity. Each function \(\xi \mapsto d_j^+(\xi )\) is continuous and \(Q_\ell ^*\)-periodic, and one has \(d_j^+(Q_\ell ^*)\in [c_*(j),c^*(j)]\) with
Note that
In the same manner, the negative spectrum of \(D_\xi \) is composed of the non-increasing sequence of real eigenvalues \(d^-_j(\xi )=-d_j^+(\xi )\). Finally, one has
As in the Hartree–Fock model for crystals [8], the electrons will be modeled by an operator on \(L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\), called the one-particle density matrix, that reflects their periodic distribution in the nuclei lattice.
We now introduce various functional spaces for linear operators on \(L^2(Q_\ell ;{\mathbb {C}}^4)\) and for operators on \(L^2({\mathbb {R}}^3;{\mathbb {C}}^4)\) that commute with translations. Let \({\mathcal {B}}\left( E \right) \) be the set of bounded operators from a Banach space E to itself. We use the shorthand \({\mathcal {B}}(L^2_\xi )\) for \({\mathcal {B}}(L^2_\xi (Q_\ell ;{\mathbb {C}}^4))\). The space of bounded operators on which commute with the translations of \(\ell {\mathbb {Z}}^3\) is denoted by Y. It is isomorphic to \(L^\infty (Q_\ell ^*;{\mathcal {B}}(L^2_\xi ))\). Moreover, for every ,
(see [38, Theorem XIII.83]). For \(s\in [1,\infty )\) and \(\xi \in Q_\ell ^*\), we define
endowed with the norm
We denote by \({\mathfrak {S}}_\infty (\xi )\) the space of compact operators on \(L^2_\xi \), endowed with the norm inherited from \(\Vert \cdot \Vert _{{\mathcal {B}}(L^2_\xi )}\). Similarly, for \(t\in [1,+\infty ]\), we define
endowed with the norm
and
In particular \({\mathfrak {S}}_{\infty ,\infty }=L^\infty (Q_\ell ^*;{\mathfrak {S}}_\infty (L^2_\xi ))\subset Y\) is endowed with the norm of Y.
In the sequel of this paper, we work with periodic one-particle density matrices belonging to subspaces \({\mathfrak {S}}_{1,p}\) of \({\mathfrak {S}}_{1,1}\), for \(1\le p\le +\infty \). On such spaces, we can define the trace per unit cell as
Here, \({\textrm{Tr}}_{L^2_\xi }\) means the usual trace of operators on the Hilbert space \(L^2_\xi (Q_\ell ;{\mathbb {C}}^4)\). It coincides with the trace of the operator with kernel \({\textrm{Tr}}_4(h_\xi (\cdot ,\cdot ))\) on \(L^2_\xi (Q_\ell ;{\mathbb {C}})\) with \({\textrm{Tr}}_4\) standing for the trace of a \(4\times 4\) matrix. The \(\;{{\widetilde{\quad }}}\) reminds us that \(\gamma \) is not trace-class on \(L^2({\mathbb {R}}^3)\).
The trace per unit cell allows to define duality pairings between spaces \({\mathfrak {S}}_{s,t}\) using the classical duality properties in Schatten’s spaces [40]. More precisely, if \((s,s')\) and \((t,t')\) are in \([1,+\infty ]^2\) with \(1/s+1/s'=1\) and \(1/t+1/t'=1\), then one can define a duality pairing \(\langle \cdot ,\cdot \rangle \) between \({\mathfrak {S}}_{s,t}\) and \({\mathfrak {S}}_{s',t'}\) as follows. For \(h\in {\mathfrak {S}}_{s,t}\) and \(h'\in {\mathfrak {S}}_{s',t'}\), the product \(hh'\) is in \({\mathfrak {S}}_{1,1}\) and one sets
One has
We also define
endowed with the norm
and
endowed with the norm
For any two functional spaces A and B the norm of the intersected space is defined by
For future convenience, we use the notation \(X(\xi )\) for \(X^1(\xi )\). We also set \(X:= X_1^1\) and
We endow Z with the norm inherited from \(X\cap Y\), that is, we take
With this norm, Z is a Banach space. The functional spaces \({\mathfrak {S}}_{1,1}\), X, Y and Z will play an essential role in the whole paper, while the functional space \({\mathfrak {S}}_{1,\infty }\) and its subspace \(X^2_\infty \) are mainly used in Sect. 6. In addition, we will also use the functional space \({\mathfrak {S}}_{\infty ,1}\) in Sect. 6 since \({\mathfrak {S}}_{1,\infty }\) is its dual space.
Definition 2.2
(Periodic one-particle density matrices) We denote by \(\Gamma \) the following set of \(Q_\ell \)-periodic one-particle density matrices:
Remark 2.3
For \(\gamma \in \Gamma \) and for almost every \(\xi \) in \(Q_\ell ^*\), the operator \(\gamma _\xi \) is compact on \(L^2_\xi \) and admits a complete set of eigenfunctions \((u_n(\xi ,\cdot ))_{n\ge 1}\) in \(L^2_\xi \) (actually lying in \(H^{1/2}_\xi \)), corresponding to a non-decreasing sequence of eigenvalues \(0\le \mu _n(\xi )\le 1\) (counted with their multiplicity). This is expressed as
where \(|u \rangle \,\langle u|\) denotes the projector onto the vector space spanned by the function u. Equivalently, for almost every \(\xi \) in \(Q_\ell ^*\) and for any \((x,y)\in {\mathbb {R}}^3\times {\mathbb {R}}^3\), the Hilbert–Schmidt kernel writes
In the above equation, \(u_n(\xi ,\cdot )\) is a column vector with four coefficients and the superscript \(^*\) refers to transposition composed with complex conjugation of the coefficients. Thus, \(\gamma _\xi (x,y)\) is a \(4\times 4\) complex matrix, and for every function \(\varphi \in L^2_\xi \),
By definition of the trace of an operator,
Definition 2.4
(Integral kernel and electronic density) Let \(\gamma \) belong to \(\Gamma \). Then we can define in a unique way an integral kernel \(\gamma (\cdot ,\cdot )\in L^2(Q_\ell \times {\mathbb {R}}^3) \cap L^2({\mathbb {R}}^3\times Q_\ell )\) with \(\gamma (\cdot +k,\cdot +k)=\gamma (\cdot ,\cdot )\) for any \(k\in {\mathbb {Z}}^3\) and a \(Q_\ell \)-periodic density \(\rho _\gamma \) associated to \(\gamma \) by
and
The function \(\rho _\gamma \) is non-negative and belongs to \(L^1(Q_\ell ;{\mathbb {R}})\). Indeed, using the decomposition (2.15), we have
and
In the physical setting we are interested in, the value of the above integral is the number of electrons per cell.
By the Cauchy–Schwarz inequality, it is easily checked that
Note that, when h is a \(Q_\ell \)-periodic trace-class operator but is not necessarily a positive operator, we still may define \(\rho _h\) with the help of (2.17), but (2.19) becomes \(|h(x,y)|^2\le \rho _{|h|}(x)\rho _{|h|}(y)\) where \(|h|=\sqrt{h^*h}\).
We can now introduce the periodic Dirac–Fock functional.
2.2 The Periodic Dirac–Fock Model
For \(\gamma \in Z\), we define the periodic Dirac–Fock functional
This functional is well-defined on Z (see Remark 4.9 below). In the above definition of the energy functional, the so-called fine structure constant \(\alpha \) is a dimensionless positive constant (the physical value is approximately 1/137). Note that \(D_\xi \gamma _\xi \) is not a trace-class operator, so \({\textrm{Tr}}_{L^2_\xi }[D_\xi \gamma _\xi ]\) is not really a trace, it is just a notation for the rigorous mathematical object \({\textrm{Tr}}_{L^2_\xi }[\vert D_\xi \vert ^{1/2}\gamma _\xi \vert D_\xi \vert ^{1/2}\textrm{sign}(D_\xi )]\). We will make this abuse of notation throughout the paper.
The last term in (2.20) is called the “exchange term ”. The potential \(W_\ell ^\infty \) that enters its definition is given by
(see [8] for a formal derivation of the exchange term from its analogue for molecules). It is \(Q_\ell ^*\)-periodic with respect to \(\eta \) and quasi-periodic with quasi-momentum \(\eta \) with respect to x. For every \(\gamma \in Z\), we now define the mean-field periodic Dirac operator
where
Here,
and
(In (2.22) we keep the notation \(\cdot *\cdot \) for the convolution of periodic functions on \(Q_\ell \).)
Let us explain the relation between \({\mathcal {E}}^{DF}\) and \(D_{\gamma }\). The periodic DF energy may be rewritten
It is smooth on Z, and its differential at \(\gamma \in Z\) is the linear form
We introduce the following set of periodic density matrices:
and
Here q is a positive real number. The elements of \( \Gamma _{q}\) (resp. \(\Gamma _{\le q}\)) are Dirac–Fock density matrices with particle number per unit cell equal to q (resp. at most q).
Our goal is to define the ground state despite the fact that the energy functional \({\mathcal {E}}^{DF}\) is strongly indefinite on \(\Gamma _{\le q}\), due to the unboundedness of the Dirac operator \(D^0\).
2.3 Ground State Energy and Main Result
We follow Dirac’s interpretation of the negative energy states of Dirac–Fock models: Such states are supposed to be occupied by virtual electrons that form the Dirac sea. Therefore, by the Pauli exclusion principle, the states of physical electrons are orthogonal to all the negative energy states. The ground- energy and state should thus be defined on the positive spectral subspaces of the corresponding Dirac–Fock operator. Let
Note that by definition \(P_{0,\xi }^{\pm }=\mathbb {1}_{{\mathbb {R}}_{\pm }}(D_\xi -\alpha zG_\ell )\). We define the set
and the ground state energy
We need the following assumption.
Assumption 2.5
Let \(q^+:=\max \{q;1\}\) and \(\kappa :=\alpha \,\big (C_G z+C_{EE}'q^+\big )\). We also introduce the positive constants \(e_0:=(1-\kappa )^{-1}c^*(\lceil q\rceil )\) and \(a:=\frac{\alpha }{2}C_{EE}\,(1-\kappa )^{-1/2}\lambda _0^{-1/2}\) (well-defined if \(\kappa <1\)). Here we have used the standard notation \(\lceil q\rceil :=\min \{m\in {\mathbb {N}}\;\vert \;m\ge q\}\) and \(c^*(\cdot )\) is given by formula (2.9).
We demand that
-
1.
\(\kappa <1-\frac{\alpha }{2} C_{EE}q^+\);
-
2.
\(2a\,\sqrt{\max \{(1-\kappa -\frac{\alpha }{2} C_{EE}q^+)^{-1}e_0 q;1\}q^+}<1\).
The positive constants \(C_G\), \(C_{EE}\), \(C_{EE}'\) and \(\lambda _0\) are defined respectively in Lemmas 4.1, 4.7 and 4.11 below.
Our main result is
Theorem 2.6
(Existence of a ground state) When \(\alpha \), q, z and \(\ell \) satisfy Assumption 2.5, there exists \(\gamma _*\in \Gamma _{q}^+\) such that
In addition, \(\gamma _*\) solves the following nonlinear self-consistent equation
where \(0\le \delta \le \mathbb {1}_{\{\nu \}}(D_\gamma )\) and \(\lambda _0\le \nu \le e_0\), with \(e_0\) being defined in Assumption 2.5, and \(\lambda _0\ge 1-\kappa >0\) in Lemma 4.11.
Remark 2.7
(Projectors) According to [1, 19, 31] any ground state of the Hartree–Fock model (both for the molecules and crystals) is a projector. However we do not know whether the ground states of Dirac–Fock model are projectors in general.
Remark 2.8
In Solid State Physics, the length of the unit cell is about a few Ångströms. In our system of units, \(\hbar =m=c=1\), thus \(\alpha \approx \frac{1}{137}\) and \(\ell \approx 1000\). Under the condition \(q=z\) for electrical neutrality, Assumption 2.5 is satisfied for \(q\le 17\). The proof is detailed in “Appendix D”. Our estimates are far from optimal: The ideas of this paper are expected to apply to higher values of q.
3 Sketch of Proof of Theorem 2.6
We are convinced that the constraint set \(\Gamma ^+_q\) is not convex, and we are not able to prove that it is closed for the weak-\(^*\) topology of Z. This is the source of considerable difficulties. Mimicking [39], we shall use a retraction technique as for the Dirac–Fock model for atoms and molecules. This imposes to search the ground state in the set \(\Gamma _{\le q}^+\) defined by
However, under the above constraint, the minimizers may not be situated in \(\Gamma _{q}^+\). To overcome this problem, we next subtract a penalization term \(\epsilon _{P} \,{\widetilde{{\textrm{Tr}}}}_{L^2}(\gamma )\) from the DF energy functional, for some parameter \(\epsilon _{P}>0\) to be chosen later, and first study a minimization problem for the penalized functional with relaxed constraint. We introduce the infimum
If this infimum is attained at some \(\gamma _*\in \Gamma _{\le q}^+\), \(\gamma _*\) will be called a minimizer for \(J_{\le q}\). We are going to see that for a suitably chosen value of \(\epsilon _P\), \(J_{\le q}\) is attained and that every minimizer for \(J_{\le q}\) lies in \(\Gamma _{q}^+\), thus is a minimizer for \(I_q\).
For the study of the penalized problem \(J_{\le q}\), we need an analogue of Assumption 2.5.
Assumption 3.1
Let \(q^+=\max \{q;1\}\), \(\kappa :=\alpha \,(C_Gz+C'_{EE} q^+)\) and \(a:=\frac{\alpha }{2}C_{EE}(1-\kappa )^{-1/2}\lambda _0^{-1/2}\) (well-defined if \(\kappa <1\)). We assume that
-
1.
\(\kappa <1-\frac{\alpha }{2} C_{EE}q^+\) ;
-
2.
\(2a\,\sqrt{\max \{(1-\kappa -\frac{\alpha }{2} C_{EE}q^+)^{-1}\epsilon _{P}\,q;1\}q^+}<1\).
The relation between assumptions 2.5 and 3.1 is given by the following lemma:
Lemma 3.2
(Choice of \(\epsilon _P\)) Assume that Assumption 2.5 on q and z holds. Then, there is a constant \(\epsilon _{P}>e_0\) such that Assumption 3.1 is satisfied.
Proof
One just needs to take \(\epsilon _{P}=e_0+\varepsilon \) with \(\varepsilon \) positive and small enough. \(\square \)
We now state an existence result.
Theorem 3.3
(Existence of a minimizer for the penalized problem) We suppose that Assumption 3.1 on \(q,z,\epsilon _P\) holds and recall the notation \(e_0:=(1-\kappa )^{-1} c^*(\lceil q \rceil )\). If \(\epsilon _{P}>e_0\), then there exists \(\gamma _*\in \Gamma _{\le q}^+\) such that
In addition, and \(\gamma _*\) solves the following nonlinear self-consistent equation
where \(0\le \delta \le \mathbb {1}_{\{\nu \}}(D_\gamma )\) and \(\nu \in [\lambda _0,e_0]\) is the Lagrange multiplier due to the charge constraint \( {\textrm{Tr}}_{L^2 }(\gamma )\le q\).
Theorem 2.6 is a direct consequence of Lemma 3.2 and Theorem 3.3. Indeed, if Assumption 2.5 on q, z holds, Lemma 3.2 guarantees the existence of \(\epsilon _{P}\) such that the assumptions of Theorem 3.3 are satisfied. Then this theorem provides a minimizer \(\gamma _*\) for \(J_{\le q}\) which lies in \(\Gamma _q^+\), hence \(J_{\le q}+\epsilon _P q={\mathcal E}^{DF}(\gamma _*)\ge I_q\). On the other hand, for each \(\gamma \in \Gamma _q^+\) one has the inequality \(\mathcal {\mathcal E}^{DF}(\gamma )\ge J_{\le q}+\epsilon _P q\), so \(I_q\ge J_{\le q}+\epsilon _P q\). As a consequence, we get \({\mathcal E}^{DF}(\gamma _*)=J_{\le q}+\epsilon _P q=I_q\) which is the same as (2.26). Moreover \(\gamma _*\) satisfies (3.3) which is the same as (2.27).
Therefore, in the sequel of this paper we focus on the proof of Theorem 3.3. Before going further, we explain the difficulties we face and the strategy we adopt to solve them, by comparing with the Hartree–Fock case [8]. The method used in [8] is based on some properties of the Schrödinger operator \(-\Delta \):
-
1.
This operator is non-negative. Hence the Hartree–Fock model for crystals is well-defined and the kinetic energy is weakly lower semi-continuous w.r.t. the density matrix;
-
2.
The exchange potential \(W_{\ell }^\infty \) is rather easily controlled by the Schrödinger operator \(-\Delta \).
In [8], these properties allow to deduce bounds on the minimizing sequence of density matrices w.r.t. the \(\xi \), x and y variables, and to pass to the limit in the different terms of the energy functional, in particular in the exchange term which is the most intricate one. In the proof, the strong convergence of the density matrix kernels plays an important role. In addition, the charge constraint in the periodic Hartree–Fock model is linear with respect to the density, and there is no possible loss of charge in passing to the limit.
In the Dirac–Fock model for crystals, two additional difficulties occur. First of all, the Dirac operator does not control the potential energy terms, which are of the same order. Secondly, the convergence of the nonlinear constraint requires stronger compactness properties of the sequence of density matrices with respect to the \(\xi \) variable (of course, this second difficulty does not exist for the Dirac–Fock model of atoms and molecules, since in that case the \(\xi \) variable is absent). Therefore the proof of existence of minimizers in the periodic Hartree–Fock setting cannot be applied mutatis mutandis. The functional space Z is natural to give a sense to the energy functional and to the constraints, but the weak convergence of minimizing sequences in Z is not sufficient to deal with the exchange term and the non-linear constraints. The whole paper (except Sect. 5 about the retraction) is devoted to solving the difficulties arising from the \(\xi \) variable.
Strategy for the proof of Theorem 3.3. Our strategy rather relies on the spectral analysis of the periodic Dirac–Fock operator, which is new, to our knowledge, for the proof of existence of minimizers in the periodic case. Thanks to this spectral analysis, in Lemma 4.15 together with Lemma 5.1 (see also Remark 4.16), we can prove that every minimizer for \(J_{\le q}\) actually lies in \({\mathfrak {S}}_{1,\infty }\), and is situated in \( {\overline{{\textbf{B}}}}\), where
and \(j_1\) is an integer defined in Sect. 4.2.
The key point in the proof of the existence of minimizers for \(J_{\le q}\) is that for any minimizing sequence \((\gamma _n)\) for \(J_{\le q}\), we are able to construct another minimizing sequence \(({\widetilde{\gamma }}_n)\) satisfying the same regularity estimate as the minimizers, that is, \( {\widetilde{\gamma }}_n\in {\overline{{\textbf{B}}}}\). This is the content of Lemma 6.2. This estimate helps considerably to solve the problem of passing to the limit in the constraint .
Organization of the paper. The next sections are devoted to the proof of Theorem 3.3. Our paper is organized as follows.
In Sect. 4, we collect some fundamental estimates on the potentials \(G_\ell \) and \(W_\ell ^\infty \), that ensure in particular that the DF periodic energy functional is well-defined and smooth on Z. In Sect. 4.2, we study the spectral properties of the Dirac–Fock operators \(D_{\gamma ,\xi }\) for every \(\xi \in Q_\ell ^*\). Relying on them, we study in Sect. 4.3 the properties of minimizers for a linearized Dirac–Fock energy. Finally, we collect the first estimates on minimizing sequences for \(J_{\le q}\).
In Sect. 5, we study the Euler-Lagrange equation associated to \(J_{\le q}\). We conclude that each minimizer for \(J_{\le q}\) is in \(\Gamma _{q}^+\) and solves a self-consistent equation (it is a ground state of its own mean-field Hamiltonian) and that minimizing sequences are approximate solutions. In Hartree–Fock type models for molecules [33] or crystals [19], it is a standard fact that the approximate minimizers are also approximate ground states of their mean-field Hamiltonian, and the proof relies on the convexity of the constraint set. However, in the Dirac–Fock model (both for molecules and crystals), the constraint set \( \Gamma ^+_{\le q}\) does not seem to be convex. By using a retraction technique, a similar difficulty was recently overcome by one of us for the Dirac–Fock model of molecules [39]. Adapting the method of [39], we define a set \({\mathcal {V}}\) which is relatively open in \(\Gamma _{\le q}\) for the norm of Z, and we build a regular map \(\theta :\overline{{\mathcal {V}}}\rightarrow \overline{{\mathcal {V}}}\cap \Gamma _{\le q}^+\) such that \(\theta (\gamma )=\gamma \,,\;\forall \gamma \in \overline{{\mathcal {V}}}\cap \Gamma _{\le q}^+\). Here, \(\overline{{\mathcal {V}}}\) is the closure of \({\mathcal {V}}\) in Z. Under our assumptions, there exist minimizing sequences for \(J_{\le q}\) lying in \({\mathcal {V}}\cap \Gamma _{\le q}^+\), hence the equality
which allows us to prove that the terms of minimizing sequences are approximate ground states of their mean-field Hamiltonian.
Then, in Sect. 6, we build modified minimizing sequences lying in \({\overline{{\textbf{B}}}}\). Finally, we prove the convergence of such sequences to a minimizer for \(J_{\le q}\), and this ends the proof of Theorem 3.3.
Assumption 2.5 involves optimal constants in Hardy-type inequalities introduced in Sect. 4.1. Therefore, in “Appendices A–C”, we prove Lemma 4.1, Lemma 4.5 and Lemma 4.7 respectively. Finally, in “Appendix D”, we calculate the maximum number of electrons per cell allowed by the model, relying on approximate values of the constants obtained in “Appendices A–C”.
4 Fundamental Estimates
In this section, we give Hardy-type inequalities for the periodic Coulomb potential and provide estimates on the interaction potential between electrons in crystals. Then we study the spectrum of the periodic self-consistent Dirac–Fock operators. Finally, using this spectral analysis, we derive properties of the minimizers for a linearized problem, and a priori bounds on minimizing sequences for \(J_{\le q}\).
4.1 Hardy-type Estimates on the Periodic Coulomb Potential
First of all, and this is a major difference with the usual Coulomb potential \(\frac{1}{|x|}\) in \({\mathbb {R}}^3\), the periodic Coulomb potential \(G_\ell \) may not be positive, since it is defined up to constant, but it is bounded from below (see Lemma A.1 in "Appendix A"). Nevertheless, the operator of convolution with \(G_\ell \) is positive on \(L^2(Q_\ell )\) in virtue of (2.3). Moreover, we have the following Hardy-type estimates concerning the operator of multiplication by the periodic potential \(G_\ell \).
Lemma 4.1
(Hardy-type inequalities for the periodic Coulomb potential) There exist positive constants \(C_H=C_H(\ell )>0\) that only depends on \(\ell \) and such that
in the sense of operators on \(L^2(Q_\ell ^*)\bigotimes L^2(Q_\ell ;{\mathbb {C}}^4)\).
Moreover, there exists a positive constant \(C_G=C_G(\ell )\) with \(C_G\ge C_H\) that only depends on \(\ell \) and such that
Remark 4.2
In (4.1), the inequality \(A\le B\) is equivalent to: For almost every \(\xi \in Q_\ell ^*\), \(A_\xi \le B_\xi \) in the sense of operators on \(L^2_\xi \).
Remark 4.3
The constant \(C_G(\ell )\) is estimated in (A.4) in “Appendix A” below. While it is far from optimal when \(\ell \) is small, it converges to 2 when \(\ell \) goes to infinity; that is, to the value of the optimal constant for the Coulomb potential on the whole space. By interpolation,
Therefore, (4.1) holds with \(C_H\) being replaced by \(C_G\). However, \(C_H\) is expected to converge to \(\pi /2\) as \(\ell \) goes to infinity; that is, to the best constant in the Kato–Herbst inequality on the whole space [26, 29].
A by-product of Lemma 4.1 is the following.
Corollary 4.4
(Estimates on the direct term) For any \(\gamma \in X\), we have
and
Proof
For every \(x\in {\mathbb {R}}^3\) and \(\gamma \in X\), we have
Indeed, the bound (4.1) in Lemma 4.1 yields
uniformly in x.
We now turn to the proof of (4.5). For every \(\xi \in Q_\ell ^*\) and \(\varphi _\xi \) in \(L^2_\xi \), we have
In (4.6), we have used the bound (4.2) of Lemma 4.1 and the obvious fact that it remains true for \(G_\ell (\cdot -x)\) for any \(x\in {\mathbb {R}}^3\). \(\square \)
Now, we consider the operator which enters the definition of the exchange term. The operators \(W_{\gamma ,\xi }\) have been defined in Formula (2.23), which involves the integral kernel \(W_\ell ^\infty \) given in (2.20). We can separate the singularities of \(W_\ell ^\infty \) with respect to \(\eta \in 2Q_\ell ^*\) and \(x\in 2Q_\ell \) as follows
with
and
where \(|k|_\infty :=\max \{|k_1|,|k_2|,|k_3|\}\). It is easy to see that the singularity of \(W_{<m,\ell }^{\infty }\) behaves like \(\frac{1}{|\eta |^2}\). We will show in “Appendix B” that the singularity of \(W_{\ge m,\ell }^{\infty }\) behaves like \(\frac{1}{|x|}\) or equivalently \(G_\ell (x)\), and we will obtain the following estimates on the operator \(W_\gamma \).
Lemma 4.5
(Estimates on \(W_{\gamma }\)) If \(\gamma \in Z\), then \(W_\gamma \in Y\) and there exist positive constants \(C_W=C_W(\ell )\), \(C_W'=C_W'(\ell )\), \(C_W''=C_W''(\ell )\) that only depend on \(\ell \), such that
Remark 4.6
The constants \(C_W\), \(C_W'\) and \(C_{W}''\) are estimated in (B.24) in "Appendix B".
Gathering together Lemmas 4.1, 4.5 and Corollary 4.4 we can get some rough estimates on the self-consistent potential \(V_{\gamma ,\xi }\) defined in (2.21). In “Appendix C” we obtain much better estimates by a careful study of the structure of the operator
Lemma 4.7
(Estimates on \(V_{\gamma }\)) There exist positive constants \(C_{EE}=C_{EE}(\ell )>0\) and \(C_{EE}'=C_{EE}'(\ell )>0\) that only depend on \(\ell \) and such that, for every \(\gamma \in Z\),
and
For every \(\xi \in Q_\ell ^*\) and any \(\psi _\xi \in H^{1/2}_\xi \),
Furthermore, if \(\gamma \ge 0\), for any \(\psi \in L^2_\xi \),
Remark 4.8
The constants \(C_{EE}\), \(C_{EE}'\) and \(C_{EE}''\) are estimated in (C.7), (C.5) and (C.8) of “Appendix C” respectively.
Remark 4.9
Using Lemmas 4.1 and 4.7, it is easily checked that \(Z \ni \gamma \mapsto {\mathcal {E}}(\gamma )\) is well-defined.
4.2 Spectral Properties of the Mean-Field Dirac–Fock Operator
Recall that \(\kappa :=\alpha \,\big (C_G z+C_{EE}'q^+\big )\). We start with the following.
Lemma 4.10
Let \(\gamma \in Z\). We assume that \(C_G z+C_{EE}'\Vert \gamma \Vert _{{\mathfrak {S}}_{1,1}\cap Y}<1/\alpha \), then \(D_{\gamma ,\xi }\) is a self-adjoint operator on \(L^2_\xi \) with domain \(H^1_\xi \) and form-domain \(H^{1/2}_\xi \). In addition, it holds that
and
In particular, if \(\gamma \in \Gamma _{\le q}\) and \(\kappa <1\), we have
Proof
Recall \(q^+=\max \{1,q\}\). By Lemmas 4.1 and 4.7, we obtain
In particular, \(D_{\gamma }\) is self-adjoint on by the Rellich–Kato theorem if \(C_G z+C_{EE}'\Vert \gamma \Vert _{{\mathfrak {S}}_{1,1}\cap Y} < 1/\alpha \) (see [38, Theorem XIII-85]). Let now \(\xi \in Q_\ell ^*\) and \(u_\xi \in H^1_\xi (Q_\ell )\). We have
which implies (4.15). On the other hand,
Hence,
which implies (4.16). Since \(\gamma \in \Gamma _{\le q}\), \(\Vert \gamma \Vert _{{\mathfrak {S}}_{1,1}\cap Y}\le q^+\). Thus (4.19) and (4.20) together give (4.17). This concludes the proof. \(\square \)
As a consequence of (4.20), we deduce that the spectrum of \(D_{\gamma }\) (and of any \(D_{\gamma ,\xi }\)) is included in \({\mathbb {R}}\setminus [-1+\kappa ;1-\kappa ]\). In order to allow for as many electrons as possible per cell, we need a more accurate estimate on the bottom of \(\sigma (|D_{\gamma }|)\).
Lemma 4.11
(Further properties of the bottom of the spectrum of \(|D_\gamma |\)) Let \(\gamma \in \Gamma _{\le q}\). Then
with \(\lambda _0:=1-\alpha \max \{C_H z+C_{EE}''q^+; \frac{C_0}{\ell }z+C_{EE}q^+\}\), the constant \(C_0\) being defined in Lemma A.1 in “Appendix A” below.
Proof
Let \(\psi ^+_\xi =\Lambda ^+_{\xi }\psi _\xi \) and \(\psi ^-_\xi =\Lambda ^-_{\xi }\psi _\xi \). Notice that \(D_{\gamma ,\xi }=D_{\xi }-\alpha zG_\ell +\alpha V_{\gamma ,\xi }\) and \(V_{\gamma ,\xi }\) satisfies (4.13) and (4.14). Now, combining with (A.1) we have
and
We get
\(\square \)
Further spectral properties of the self-consistent operator \(D_\gamma \) are collected in the following.
Lemma 4.12
(Properties of positive eigenvalues of \(D_{\gamma ,\xi }\)) Assume that \(\kappa <1\) and let \(\gamma \in \Gamma _{\le q}\). We denote by \(\lambda _j(\xi )\), for \(j\ge 1\), the j-th positive eigenvalue (counted with multiplicity) of the mean-field operator \(D_{\gamma ,\xi }\). Then \(\lambda _j(\xi )\) is situated in the interval \([c_*(j)(1-\kappa ),c^*(j)(1-\kappa )^{-1}]\) where \(c^*(j)\) and \(c_*(j)\) are the constants of Formula (2.9).
In addition, every eigenfunction \(u_{j,\xi }\) associated to \(\lambda _j(\xi )\) lies in \(H^1_\xi \) and satisfies
Proof
We rely on a variational characterization of eigenvalues in spectral gaps (see [14] and references therein). Let
and
One has \(\Lambda _\xi ^\pm H^1_\xi \subset H^1_\xi \), that is, the domain \(H^1_\xi \) of the self-adjoint operator \(D_{\gamma ,\xi }\) satisfies Condition (H1) of [14]. To each integer \(j\ge 0\) we associate the min-max level
Let \(u_\xi \in (V\bigoplus \Lambda _\xi ^-H_{\xi }^{1})\setminus \{0\}\). We write \(u_\xi =u_\xi ^++u_\xi ^-\) with
By definition of \(\Lambda _\xi ^\pm \),
Therefore,
We observe that, for \(j\ge 1\),
By (4.23), (4.2) in Lemma 4.1 and (4.12) in Lemma 4.7, for any \(u^+_\xi \in \Lambda _\xi ^+ H_\xi ^{1}\),
Thus,
In particular, \({\hat{\lambda }}_1(\xi )\ge (1-\kappa )c_*(1)>0\). On the other hand, \( (D_{\gamma ,\xi }\,u^-_\xi ,u^-_\xi )\le -(1-\kappa )(\vert D_{\xi }\vert \,u^-_\xi ,u^-_\xi )\le 0\) for every \(u^-_\xi \in \Lambda _\xi ^- H^{1}_\xi \), whenever \(\kappa <1\). This implies that \({\hat{\lambda }}_0(\xi )\le 0\), so Conditions (H2) and (H3) of [14] are satisfied, and we can conclude that
hence the lower bound \(\lambda _j(\xi )\ge (1-\kappa )c_*(j)\).
For the upper bound, we proceed as follows. Equations (4.18) and (4.23) together yield
by Young’s inequality. Moreover, \(\Vert u^+_\xi \Vert _{L^2_\xi }\le \Vert u_\xi \Vert _{L^2_\xi }\), so, recalling (4.22) and (4.24) we see that
Finally, using (4.20) in Lemma 4.10, we obtain
hence (4.21). The lemma is thus proved. \(\square \)
Recall that where \(j_1\) is an integer that has not been defined yet. In the rest of this paper, assuming that \(\kappa <1\) and recalling our notation \(e_0:=(1-\kappa )^{-1}c^*(\lceil q \rceil )\,\), we take
This integer is well-defined, since \(\lim _{j\rightarrow \infty }c_*(j)=+\infty \). We also introduce the energy level
By construction, one has \(0<e_0<e_1\) an \(j_1\ge \lceil q \rceil \). Moreover, Lemma 4.12 has an immediate consequence, which will be very useful in the sequel.
Corollary 4.13
Assuming that \(\kappa <1\), with the above notation, for every \(\gamma \) in \(\Gamma _{\le q}\):
-
The projector \(\mathbb {1}_{[0,e_0]}(D_{\gamma ,\xi })\) has rank at least \(\lceil q \rceil \) for a.e. \(\xi \in Q_\ell ^*\).
-
The projector \(\mathbb {1}_{[0,e_1)}(D_{\gamma ,\xi })\) has rank at most \(j_1\) for a.e. \(\xi \in Q_\ell ^*\).
-
If \(\gamma '\in Z\) and \(0\le \gamma '\le \mathbb {1}_{[0,e_1)}(D_{\gamma })\), then \(\gamma '\in {\overline{{\textbf{B}}}}\).
We end this subsection with the following proposition.
Proposition 4.14
Assume that \(\kappa <1\). Let \(\gamma ,\gamma '\in \Gamma _{\le q}\) such that
with \(\nu >0\). Then,
Proof
By Lemma 4.10, we have
Since \(\gamma '\in \Gamma _{\le q}\), we obtain
Then \(\Vert \gamma '\Vert _X\le (1-\kappa )^{-1}q\,\nu \). We deduce the desired bound since \(\Vert \gamma '\Vert _Y\le 1\). \(\square \)
4.3 Properties of the Minimizers for a Linearized Problem
The following lemma will be used in the next sections.
Lemma 4.15
Let \(g\in \Gamma _{\le q}\) be given, and assume \(\kappa <1\). Then for each \(\epsilon _P>0\), the minimization problem
admits a minimizer. Every minimizer \({\hat{g}}\) is of the form , with for some \(\nu \in (0,\min (\epsilon _{P},e_0)]\), and one has \({\hat{g}}\in {\overline{{\textbf{B}}}}\).
If \(\epsilon _{P}> e_0\), the set of minimizers is independent of \(\epsilon _{P}\), and every minimizer satisfies \({\widetilde{{\textrm{Tr}}}}_{L^2}({\hat{g}})=q\) and \(\nu \ge \lambda _0\).
Proof
The proof is inspired of [5]. For any \(\xi \in Q_\ell ^*\) we can choose an orthonormal eigenbasis \(\{\psi _j(\xi ,\cdot )\}_{j\ge 1}\) of \(D_{g,\xi }P^+_{g,\xi }\), such that
with \(\lambda _j(\xi )\ge 0\). According to Lemma 4.12, for every \(\xi \in Q_\ell ^*\) and for every \(j\ge 1\)
Let us introduce as in [5, 19] the non-decreasing function
By Lemma 4.12, for \(0\le s<(1-\kappa )c_*(\lceil q\rceil )\) one has \(C(t)\le \lceil q\rceil -1 < q\). On the other hand, \(C(e_0)\ge \lceil q \rceil \ge q\). Thus, there exists \(\nu _1\) with
such that
Equivalently,
and
Therefore, there exists , such that the density matrix
satisfies
We first consider the case \(\nu _1< \epsilon _{P}\). For any \(\gamma \in \Gamma _{\le q}\) with \(\gamma =P_g^+\gamma P_g^+\), we write
On the other hand, we have
since \(0\le \gamma \le \mathbb {1}_{L^2({\mathbb {R}}^3)}\). Thus \({\widetilde{g}}\) is a minimizer. Conversely, if \({\hat{g}}\) is a minimizer, then it must be of the form with , \(\,\nu =\nu _1\in [\lambda _0,\min \{\epsilon _P,e_0\}]\) and \({\widetilde{{\textrm{Tr}}}}_{L^2}({\hat{g}})=q\) since all inequalities in (4.29) and (4.30) above have to be equalities for \(g={\hat{g}}\).
For the case \(\epsilon _{P}\le \nu _1\), we prove that is a minimizer with \({\widetilde{{\textrm{Tr}}}}_{L^2}(g')\le q\), thanks to a modified version of (4.30) with \(\nu _1\) (resp. \({\widetilde{g}}\)) being replaced by \(\epsilon _P\) (resp. \(g'\)). As in the previous case, every minimizer \({\hat{g}}\) satisfies , with . Note that in the case \(\epsilon _p\le \nu _1\), the inequality \({\widetilde{{\textrm{Tr}}}}_{L^2}({\hat{g}})\le q\) automatically holds for any such \({\hat{g}}\).
In both cases, thanks to (4.27) we have \(\nu =\min \{\nu _1,\epsilon _P\}\le e_0<e_1\), hence \(0\le {\hat{g}}\le \mathbb {1}_{[0,e_1)}(D_{g})\,\). Thus, Corollary 4.13 implies that \({\hat{g}}\in {\overline{{\textbf{B}}}}\). \(\square \)
Remark 4.16
Actually, it follows from Corollary 4.13 that for every minimizer \({\hat{g}}\) and a.e. \(\xi \in Q_\ell ^*\), one has \( \textrm{Rank}({\hat{g}}_{\xi })\le j_1\).
4.4 First Properties of Minimizing Sequences for \(J_{\le q}\)
We introduce the sublevel set
Note that the operator 0 belongs to \(\Gamma _{\le q}^+\) and satisfies \({\mathcal {E}}^{DF}(0)-\epsilon _{P}{\widetilde{{\textrm{Tr}}}}_{L^2}(0)=0\). Thus,
and from now on we will only consider minimizing sequences for \(J_{\le q}\) lying in the sublevel set \({\mathcal {S}}\). These sequences satisfy a priori estimates gathered in the following lemma.
Lemma 4.17
(Boundedness of \({\mathcal {S}}\)) Assume that \(\kappa <1\). If \(\kappa <1-\frac{\alpha }{2} C_{EE}q^+\), then, for every \(\gamma \in {\mathcal {S}}\),
and
Proof
As \(D_{\gamma }\gamma =|D_{\gamma }|\gamma \) for any \(\gamma \in \Gamma _{\le q}^+\), we get, by (4.13) and (4.17),
Hence, for any \(\gamma \in {\mathcal {S}}\),
Whenever \(1-\kappa -\frac{\alpha }{2} C_{EE}q^+>0\), (4.33) holds since \(\Vert \gamma \Vert _Y\le 1\).
The estimate (4.34) follows from Hölder’s inequality and the fact that \(\gamma \ge 0\); namely,
\(\square \)
5 Approximation by a Linearized Problem
The aim of this section is to show the link between a minimizing sequence \((\gamma _n)_{n\ge 1}\) in \({\mathcal {S}}\) and the linearized Dirac–Fock problem introduced in Lemma 4.15.
Proposition 5.1
(Link with the linearized problem) Under Assumption 3.1, let \((\gamma _{n})_{n\ge 1}\in {\mathcal {S}}^{{\mathbb {N}}^*}\) be a minimizing sequence for \(J_{\le q}\). Then, as n goes to infinity,
This property is used in Lemma 6.2 below to build a new minimizing sequence with further regularity, and it is also used at the end of Sect. 6 to show some properties of the minimizers for \(J_{\le q}\).
As mentioned at the end of Sect. 3, the main difficulty in the proof of Proposition 5.1 is to deal with the nonconvex constraint set \(\Gamma _{\le q}^+\). To do so, we adapt to our setting a retraction technique introduced in [39]. We are going to build an open subset \({\mathcal {U}}\) of Z stable under the continuous map \(T:\,\gamma \mapsto P^+_{\gamma }\gamma P^+_{\gamma }\) and such that the sequence \(\big (T^p\big )_{p\ge 1}\) converges uniformly on \(\overline{{\mathcal {U}}}\) to a surjective map \(\theta :\,\overline{{\mathcal {U}}}\rightarrow \overline{{\mathcal {U}}}\cap \textrm{Fix}(T)\). Here, \(\overline{{\mathcal {U}}}\) is the closure of \({\mathcal U}\) in Z and \(\textrm{Fix}(T)\) is the set of fixed points of T. The map \(\theta \) will be uniformly continuous and such that \(\theta \circ \theta =\theta \). Following a classical terminology we call it a retraction of \(\overline{{\mathcal {U}}}\) onto \(\overline{{\mathcal {U}}}\cap \textrm{Fix}(T)\). The restriction of \(\theta \) to \({{\mathcal {U}}}\) will be of class \(C^1\), the differential map \(d\theta \) being itself uniformly continuous and bounded from \({{\mathcal {U}}}\) to the space \({{\mathcal {B}}}(Z)\) of bounded linear operators on Z.
Then we will consider the subset \({{\mathcal {V}}}:={{\mathcal {U}}}\cap \Gamma ^+_{\le q}\), which is relatively open in \(\Gamma ^+_{\le q}\) for the topology of Z. We will see that \(\theta (\overline{\mathcal V})\subset \overline{{\mathcal {V}}}\cap \Gamma _{\le q}^+\) and \(\overline{{\mathcal {V}}}\cap \textrm{Fix}(\theta )=\overline{\mathcal V}\cap \Gamma _{\le q}^+\), so \(\theta \) may be considered as a retraction of \(\overline{{\mathcal {V}}}\) onto \(\overline{\mathcal V}\cap \Gamma ^+_{\le q}\). Under our assumptions, we will prove the inclusion \({{\mathcal {S}}}\subset {{\mathcal {V}}}\) which implies, in combination with (4.32), the equality
It will even turn out that \({{\mathcal {U}}}\) is a uniform neighborhood of \({{\mathcal {S}}}\) in Z, that is,
for some positive constant \(\rho \). This property, combined with a formula for the differential of \(\theta \), will allow us to prove Proposition 5.1.
Before giving the definition of \({\mathcal {U}}\), we take \(r>0\) very small and we introduce the following set, as in [39]:
Then analogously to Lemmas 4.10 and 4.11, we have for any \(\gamma \in \Gamma _{\le q,r}\),
and
where \(\kappa _r:=\alpha \,\big (C_Gz+C'_{EE}(q^++2r)\big )\) and
Definition 5.2
(Admissible set for the retraction) Assume that \(\kappa _r<1\). Let \(a_r:=\frac{\alpha }{2}C_{EE}\,(1-\kappa _r)^{-1/2}\lambda _{0,r}^{-1/2}\). Given \(0<\tau <\frac{1}{2a_r}\), let \(M:=\max \left( \frac{2+a_r(q^++r)}{2};\frac{1}{1-2a_r \tau }\right) \). We then define
We have the following result:
Proposition 5.3
(Existence and differentiability of the retraction) Take \(\kappa _r, a_r, \tau , {\mathcal {U}}\) as in Definition 5.2. Let \(k:=2a_r\tau \) and \({\mathcal {V}}:={\mathcal {U}}\cap \Gamma _{\le q}^+ \). Then the sequence of iterated maps \((T^p)_{p\ge 1}\) converges uniformly on \(\overline{{\mathcal {U}}}\) to a limit \(\theta \) with \(\theta (\overline{{\mathcal {U}}})= \text {Fix}(T)\cap \overline{{\mathcal {U}}}\), \(\theta (\overline{{\mathcal {V}}})= \Gamma _{\le q}^+\cap \overline{{\mathcal {V}}}\) and \(\theta \circ \theta =\theta \). We have the estimate
Moreover \(\theta \in C^{1,\text {unif}}({\mathcal {U}},Z)\) and \(d\theta (T^p)\) converges uniformly to \(d\theta \) on \({\mathcal {U}}\).
In this way we obtain a continuous retraction \(\theta \) of \(\overline{{\mathcal {U}}}\) onto \(\Gamma _{\le q}^+\cap \overline{{\mathcal {U}}}\) whose restriction to \({\mathcal {U}}\) is of class \(C^{1,\text {unif}}\). This map and its differential are bounded and uniformly continuous on \({\mathcal {U}}\).
For any \(\gamma \in \text {Fix}(T)\cap {\mathcal {U}}\) and \(\xi \in Q_\ell ^*\), the linear operator \(h\mapsto d\theta _{\xi }(\gamma ) h\) satisfies
where , according to the Floquet-Bloch decomposition. In other words, the splitting \(L^2_{\xi }=P_{\gamma ,\xi }^+L^2_{\xi }\oplus P_{\gamma ,\xi }^-L^2_{\xi }\) gives a block decomposition of \(d\theta _{\xi }(\gamma ) h\) of the form
The proof of Proposition 5.3 is postponed to the end of this section.
To apply Proposition 5.3 to the proof of Proposition 5.1, we need to find \(\tau \in \left( 0,\frac{1}{2a_r}\right) \) such that \({\mathcal {U}}\) is a uniform neighborhood of \({\mathcal {S}}\). From Lemma 4.17 and the definition of \({{\mathcal {U}}}\), we can observe that if
then there is \(\rho >0\) such that for every \(\gamma \in {\mathcal {S}}\), one has the inclusion \(B_{Z}(\gamma ,\rho )\subset {\mathcal {U}}\). Thus, we have the following.
Lemma 5.4
Assume that \(\kappa <1-\frac{\alpha }{2} C_{EE}q^+\), and let \(a_r\) be as above. Assume in addition that
Then there exist \(\tau \in (0,\frac{1}{2a_r})\) and \(\rho >0\) such that \({\mathcal {S}}+B_{Z}(0,\rho )\subset {\mathcal {U}}\).
We are now in a position to prove the main result of this section.
Proof of Proposition 5.1 (as a consequence of Proposition 5.3)
Under Assumption 3.1, we may choose \(r>0\) so small that the assumptions of Lemma 5.4 hold true. Then we may take \(\tau \in (0,\frac{1}{2a_r})\) and \(\rho >0\) satisfying the conclusion of this lemma. To prove (5.1), we argue by contradiction. Otherwise, there would be an \(\epsilon _0 >0\) such that, for n large enough,
By Lemma 4.15, there exists an operator \({\widehat{\gamma }}_n\in \Gamma _{\le q}\) which solves the following minimization problem:
From Lemma 4.15 and Proposition 4.14, \(\Vert {\widehat{\gamma }}_n\Vert _{Z}\) is uniformly bounded. So according to Corollary 5.4, there is \(\sigma >0\) such that for any n large enough and any \(s\in [0,\sigma ]\), \((1-s)\gamma _{n}+s{\widehat{\gamma }}_n\in \Gamma _{\le q}\cap B_{Z}(\gamma _n,\rho ) \subset {\mathcal {V}}\). Then, from Proposition 5.3, the function \(f_n: [0,\sigma ]\ni s \mapsto ({\mathcal {E}}^{DF}-\epsilon _P{\widetilde{{\textrm{Tr}}}}_{L^2})(\theta [(1-s)\gamma _n+s{\widehat{\gamma }}_n])\) is of class \(C^1\), and the sequence of derivatives \((f'_n)\) is equicontinuous on \([0,\sigma ]\). From (5.5), we infer
So there is \(0<s_0<\sigma \) independent of n such that for any \(s\in [0,s_0]\) we have \(f_n'(s)\le -\frac{\epsilon _0}{4}\). Hence,
Recalling that \(({\mathcal {E}}^{DF}-\epsilon _{P}{\widetilde{{\textrm{Tr}}}}_{L^2})(\gamma _{n})\) converges to \( J_{\le q}\), we conclude that \(({\mathcal {E}}^{DF}-\epsilon _P{\widetilde{{\textrm{Tr}}}}_{L^2})(\theta [(1-s_0)\gamma _n+s_0{\widehat{\gamma }}_n])<J_{\le q}\) when n is large enough. This contradicts the definition of \(J_{\le q}\), since \(\theta [(1-s)\gamma _{n}+s{\widehat{\gamma }}_n]\in \Gamma _{\le q}^+\). Hence the proposition. \(\square \)
It remains to prove Proposition 5.3, but before that, we need some preliminary results. Recall that \(P_{0}^+=\mathbb {1}_{{\mathbb {R}}^+}(D^0-\alpha zG_\ell )\). We have the following lemma.
Lemma 5.5
Take \(\kappa _r, a_r\) as in Definition 5.2 and consider the map
that is, with \( Q_\xi (\gamma ):=P^+_{\gamma ,\xi }-P^+_{0,\xi }\).
Then Q is in \(C^{1,\text {lip}}\left( \Gamma _{\le q,r}\,;\,|D^0|^{-1/2}\,Y\right) \) and we have the estimates
and
where K is a positive constant depending only on \(\kappa _r\) which remains bounded when \(\kappa _r\) stays away from 1.
Remark 5.6
In (5.6), if \(\gamma \in \Gamma _{\le q}\) one can replace \(a_r\) by the constant a introduced in Assumption 2.5, since \(\lim _{r\rightarrow 0} a_r=a\).
Proof of Lemma 5.5
By Lemma 4.10, for every \(\xi \in Q_\ell ^*\) and for all \(\gamma \in \Gamma _{\le q,r}\), \(D_{\gamma ,\xi }\) is a self-adjoint operator, and 0 is in its resolvent set. Then by Taylor’s formula [29, Chapter VI.5, Lemma 5.6] or [23], we have
and, by the second resolvent identity,
Hence, for every \(h\in Z\), we deduce from (5.8) and the second resolvent formula again, that
In addition, for any \(u_{\xi }\in L^2_{\xi }(Q_\ell )\) and any \(\gamma \in \Gamma _{\le q,r}\), we have
We infer from (5.3) that
Thus, gathering (5.9), (5.10), (5.11) with Lemma 4.7, for any \(\phi _\xi ,\psi _\xi \in L^2_\xi \), we may write
hence we obtain (5.6); namely,
For the second inequality, we write
Proceeding as above, we get (5.7). The fact that \(Q\in C^{1,\text {lip}}\left( \Gamma _{\le q,r}\,;\,|D^0|^{-1/2}\,Y\right) \) follows from (5.6) and (5.7). \(\square \)
Lemma 5.7
Take \(\kappa _r, a_r\) as in Definition 5.2. Then the map \(T:\,\gamma \rightarrow P^+_{\gamma }\gamma P^+_\gamma \) is well-defined and of class \({\mathcal {C}}^{1,1}\) on \( \Gamma _{\le q,r}\) with values in \(\Gamma _{\le q,r}\subset Z\). Moreover, for any \(\gamma \in \Gamma _{\le q,r}\),
Moreover, there are two positive constants \(C_{\kappa ,r},\; L_{\kappa ,r}\) such that
and
Proof
If \(\gamma \in \Gamma _{\le q,r}\), there is \(\gamma _0\in \Gamma _{\le q,r}\) such that \(\Vert \gamma -\gamma _0\Vert _{{\mathfrak {S}}_{1,1}\cap Y}<r\). Then \(P^+_{\gamma }\gamma _0 P^+_{\gamma }\in \Gamma _{\le q}\), \(T(\gamma )\in Z\) and
so \(T(\gamma )\in \Gamma _{\le q,r}\).
Let \(\gamma ,\gamma '\in \Gamma _{\le q,r}\). Then \(P^+_{\gamma }-P^+_{\gamma '}\) can be written as
From (5.6),
For the estimate (5.13), we write
Then
We have
and
Notice that \(\Vert T(\gamma )\Vert _{{\mathfrak {S}}_{1,1}\cap Y}\le \Vert \gamma \Vert _{{\mathfrak {S}}_{1,1}\cap Y}\le q^++r\). Gathering together these estimates, we obtain (5.13).
We now turn to the proof of (5.14) and (5.15). From Lemma 5.5, T is in \(C^1(\Gamma _{\le q,r})\) with
Notice that for any \(\gamma \in \Gamma _{\le q,r}\),
Thus, using (5.6), one finds a constant \(C_{\kappa ,r}\) such that for any \(h\in Z\),
so (5.14) is proved. Finally, for the term \(dT(\gamma ')-dT(\gamma )\), we have
Proceeding in the same way as for (5.14), we can get (5.15). \(\square \)
We now show that \({\mathcal {U}}\) and T satisfy all the assumptions in [39, Proposition 2.2] in the Banach space Z.
Proposition 5.8
Let \(\kappa _r,\,a_r,\,\tau ,\,{{\mathcal {U}}}\) be as in Definition 5.2. Then T is in \(C^0(\overline{{\mathcal {U}}})\cap C^{1,\text {lip}}({\mathcal {U}},Z)\). Moreover \(T({\mathcal {U}})\subset {\mathcal {U}}\) and the following estimates are satisfied:
and
with \(k:=2a_r\tau <1\).
Proof
For any \(\gamma \in {\mathcal {U}}\), we have
and
As a result, as \(M\ge \frac{2+a_r(q^++r)}{2}\), (5.13) implies that
Moreover, using the inequality \(M\ge \frac{1}{1-2 a_r\tau }\,\), we get
so \(T(\gamma )\in {\mathcal {U}}\).
The fact that \(\sup _{\gamma \in {{\mathcal {U}}_r}}\Vert dT(\gamma )\Vert _{{\mathcal {B}}(Z)}<\infty \) and that dT is Lipschitz continuous on \({\mathcal {U}}\) follows from (5.14) and (5.15). Besides, for \(\gamma \in {\mathcal {U}}\), we have \(\Vert T(\gamma )-\gamma \Vert _{Z}<\frac{\tau }{M}\). This ends the proof of Proposition 5.8. \(\square \)
We can now prove Proposition 5.3, which implies Proposition 5.1, as we have already seen.
Proof of Proposition 5.3
By Proposition 5.8, we may apply [39, Proposition 2.2] to our map T and our open set \({\mathcal {U}}\) in the Banach space Z. This allows us to construct \(\theta \in C^0(\overline{{\mathcal {U}}},Z)\cap C^{1,\text {unif}}({\mathcal {U}},Z)\) with the properties \(\theta (\overline{{\mathcal {U}}})= \text {Fix}(T)\cap \overline{{\mathcal {U}}}\), \(\theta \circ \theta =\theta \) and the convergence estimate (5.4). By our definition of T, we have \(T(\Gamma _{\le q})\subset \Gamma _{\le q}\) and \(\Gamma ^+_{\le q}=\text {Fix}(T)\cap \Gamma _{\le q}\), hence the additional property \(\theta (\overline{{\mathcal {V}}})= \Gamma _{\le q}^+\cap \overline{{\mathcal {V}}}\). The proof of (5.5) is exactly the same as in [39, Theorem 2.10]. This ends the proof of Proposition 5.3. \(\square \)
6 Proof of Theorem 3.3
Throughout this section, we assume that Assumption 3.1 is satisfied and that \(\epsilon _P>e_0\). Let \((\gamma _n)_{n\ge 1}\) be a minimizing sequence for \(J_{\le q}\) lying in \({\mathcal {S}}\). According to Lemma 4.17, this sequence is uniformly bounded in Z. We split \((\gamma _n)_{n\ge 1}\) into two parts: \(({\widetilde{\gamma }}_n)_{n\ge 1}\) and \((\gamma _n-{\widetilde{\gamma }}_n)_{n\ge 1}\) where, for each n,
where \(e_1\) has been defined in Formula (4.26). Thanks to Corollary 4.13, for almost every \(\xi \in Q_\ell ^*\) the rank of \(p_{n,\xi }\), and therefore of \({\widetilde{\gamma }}_{n,\xi }\), is at most \(j_1\), so that \( {\widetilde{\gamma }}_n\in {\overline{{\textbf{B}}}}\).
Actually, we prove in Lemma 6.1 that, for each \(n\ge 1\), \({\widetilde{\gamma }}_n\in X^2_\infty \) whereas \(\gamma _n\in X\); roughly speaking, we reach a \(L^\infty (Q^*_\ell ; H^1_\xi (Q_\ell ))\) regularity instead of a \(L^2(Q^*_\ell ; H_\xi ^{1/2}(Q_\ell ))\) regularity for the associated eigenfunctions. Moreover \({\widetilde{\gamma }}_n\) is close to \(\gamma _n\) in X (Lemma 6.2), so \(({\widetilde{\gamma }}_n)_{n\ge 1}\) is a modified minimizing sequence with higher regularity than \((\gamma _n)_{n\ge 1}\).
The structure of the proof of Theorem 3.3 is as follows. In Sect. 6.1, we will show that \(\Vert \gamma _n-{\widetilde{\gamma }}_n\Vert _X\rightarrow 0\) when n goes to infinity. In Sect. 6.2, we first study the convergence of the kernels of \((W_{{\widetilde{\gamma }}_n,\xi })_{n\ge 1}\). Then we deduce the strong convergence of \((V_{{\widetilde{\gamma }}_n,\xi })_{n\ge 1}\). As a result, \(\Vert P^+_{\gamma _*}-P^+_{{\widetilde{\gamma }}_n}\Vert _Y\rightarrow 0\). On the other hand, for any \(\gamma \in \Gamma _{\le q}\), we also have \(\Vert (P^+_{\gamma _n}-P^+_{{\widetilde{\gamma }}_n})\gamma \Vert _{{\mathfrak {S}}_{1,1}}\le C\Vert {\widetilde{\gamma }}_n-\gamma _n\Vert _{X}^{1/2}\Vert \gamma \Vert _{Z}\rightarrow 0\). Hence in Sect. 6.3, we can pass to the limit in the energy and in the constraints.
6.1 Decomposition of Minimizing Sequences
We start with some regularity bounds on \({\widetilde{\gamma }}_n\) that will be needed in the sequel.
Lemma 6.1
The sequence \(({\widetilde{\gamma }}_n)_{n\ge 1}\) and the sequence of kernels \(({\widetilde{\gamma }}_{n,\xi }(\cdot ,\cdot ))_{n\ge 1}\) are uniformly bounded in \(X_\infty ^2\) and in \(L^\infty (Q_\ell ^*;H^1(Q_\ell \times Q_\ell ))\), respectively. More precisely, we have, for every \(n\ge 1\) and for almost every \(\xi \) in \(Q_\ell ^*\),
and
Proof
We first prove that \(\Vert p_{n}\Vert _{X^2_\infty }\) is bounded. Let \((u_{n,j}(\xi ))_{j\ge 1}\) be the normalized eigenfunctions of the operator \(D_{\gamma _n,\xi }\) with the corresponding eigenvalues \(\lambda _{n,j}(\xi )\) counted with multiplicity. Hence,
with \(\delta _{n,j}(\xi )=1\) if \(0\le \lambda _{n,j}(\xi )< e_1\) and \(\delta _{n,j}(\xi )=0\) otherwise.
By Corollary 4.13, we know that . By (4.21), for any eigenfunction \(u_{n,j}(\xi )\), we have \(\delta _{n,j}(\xi )\Vert u_{n,j}(\xi )\Vert ^2_{H^1_\xi (Q_\ell )} \le (1-\kappa )^{-4}c^*(j_1)^2\), for every \(\xi \in Q_\ell ^*\). Now,
Hence,
Since \(p_{n}=p_{n}^{\,2}\), \({\widetilde{\gamma }}_{n}=p_{n}{\widetilde{\gamma }}_{n}p_{n}\) and \(0\le {\widetilde{\gamma }}_n\le \mathbb {1}_{L^2({\mathbb {R}}^3)}\), we have
In terms of kernels, this implies that
the same holding for \(|D_{\xi ,y}|{\widetilde{\gamma }}_{n,\xi }(\cdot ,\cdot )\). Thus, \({\widetilde{\gamma }}_{n,\xi }(\cdot ,\cdot )\in L^\infty (Q_\ell ^*;H^1(Q_\ell \times Q_\ell ))\) and (6.3) holds. \(\square \)
We begin the proof of Theorem 3.3 by showing the following result as in the case of molecules [39, Lemma 3.4].
Lemma 6.2
Under Assumption 3.1, whenever \(\epsilon _{P}> e_0\), for any minimizing sequence \((\gamma _n)_{n\ge 1}\) of (3.1) in \(\Gamma _{\le q}^+\) we have
Proof
According to Proposition 5.1, any minimizing sequence \((\gamma _n)_{n\ge 1}\) in \(\Gamma _{\le q}^+\) satisfies (5.1); namely
By Lemma 4.15, for every n, minimizers of the above minimization problem are of the form for some \(\nu _n\in [\lambda _0,e_0]\) and some such that \({\widetilde{{\textrm{Tr}}}}_{L^2}(\gamma '_n)=q\). In particular, \(\limsup _{n\rightarrow +\infty }\nu _n<\epsilon _P\), for every n. We define
We can write \(p_n=\pi _n'+\pi _n''\) and we observe that \(\gamma _{n}'=\pi ''_n\gamma _{n}'\pi ''_n\). Proceeding as for (4.29) and (4.30), and since \(\gamma _n\in \Gamma _{\le q}^+\) we have
We observe that the four terms in the right-hand side of the above equation are non-negative whereas, from Proposition 5.1, their sum goes to 0 as n goes to infinity. Therefore,
since \(\liminf _{n\rightarrow +\infty }(\epsilon _P-\nu _n)\ge \epsilon _P-e_0>0\). But \(\pi _{n}(D_{\gamma _n}-\nu _n)\pi _{n}\ge (e_1-\nu _n)\pi _{n}\ge \big (e_1-e_0)\,\pi _n\) and \(\pi _{n}(D_{\gamma _n}-\nu _n)\pi _{n}\ge \pi _{n}(|D_{\gamma _n}|-e_0)\pi _{n}\). Thus, taking a convex combination of these two estimates leads to
Hence
and the right-hand side goes to 0 by (4.17). It remains to study the limit of \(h_{n}:=\pi _n\gamma _n p_n\) as n goes to infinity. Since \((\gamma _n)^2\le \gamma _n\), we have
Hence
In other words, \(\Vert |D_{\gamma _n}|^{1/2}h_{n}\Vert _{{\mathfrak {S}}_{2,2}}\rightarrow 0\). Taking any operator A in Y, by the Cauchy–Schwarz inequality,
We have already seen that \(p_{n,\xi }\) has rank at most \(j_1\). Therefore,
so we deduce immediately from (6.4) that
since A is arbitrary. Hence, thanks to (4.15), \(\Vert h_n\Vert _X=\left\| |D^0|^{1/2}h_n|D^0|^{1/2}\right\| _{{\mathfrak {S}}_{1,1}}\rightarrow 0\). Finally, we obtain that \(\Vert \gamma _n-{\widetilde{\gamma }}_n\Vert _X\le \Vert \pi _n \gamma _n\pi _n\Vert _X+2\,\Vert h_n\Vert _X\rightarrow 0\). \(\square \)
By Lemma 6.1, up to the extraction of a subsequence, there is \(\gamma _*\) in \(X^2_\infty \cap Y\), such that
since \(X^2_\infty \) is a subspace of \({\mathfrak {S}}_{1,\infty }\) which is the dual space of \({\mathfrak {S}}_{\infty ,1}\) and Y is the dual space of \({\mathfrak {S}}_{1,1}\). We immediately get the following.
Lemma 6.3
(Strong convergence of the density) The sequence \(\rho _{{\widetilde{\gamma }}_n}^{1/2}\) converges strongly to \(\rho _{\gamma _*}^{1/2}\) in \(H^{s}(Q_\ell )\) with \(0\le s<1\), thus in \(L^p(Q_\ell )\) for every \(1\le p <6\). In particular, whenever \(\epsilon _{P}> e_0\), we have \(\int _{Q_\ell }\rho _{\gamma _*}\,dx=q\).
Proof
The proof of the strong convergence of \(\rho _{{\widetilde{\gamma }}_n}^{1/2}\) to \(\rho _{\gamma _*}^{1/2}\) in \(L^p(Q_\ell )\) for every \(1\le p<6\) is the same as in [8, p. 730] and relies on the fact that \({\widetilde{\gamma }}_n\in X^2_\infty \) (see the proofs of [8, Eqs. (4.51) and (4.55)]). When \(\epsilon _P>e_0\), by Lemma 6.2, \({\widetilde{\gamma }}_n-\gamma _n\) converges to 0 in X, thus in \({\mathfrak {S}}_{1,1}\), whereas \({\widetilde{{\textrm{Tr}}}}_{L^2}[\gamma _n]\) converges to q. Thus, \({\widetilde{{\textrm{Tr}}}}_{L2}[\gamma _*]=q\). \(\square \)
6.2 Convergence of \(\left( P^+_{\gamma _n}\right) _{n\ge 1}\)
We now study the differences between \(P^+_{\gamma _n}\) and \(P^+_{{\widetilde{\gamma }}_n}\), and between \(P^+_{{\widetilde{\gamma }}_n}\) and \(P^+_{\gamma _*}\) separately. We do not know whether \(\gamma _n-{\widetilde{\gamma }}_n\) goes to 0 in Y and we do not even know whether \({\widetilde{\gamma }}_n-\gamma _*\) goes to 0 in \({\mathfrak {S}}_{1,1}\), so we cannot rely on the continuity of the map Q introduced in Lemma 5.5. The proof is therefore more involved than in [39].
6.2.1 Convergence of \(\left( P^+_{{\widetilde{\gamma }}_n}-P^+_{\gamma _*}\right) _{n\ge 1}\)
The main result of this section is Corollary 6.8 which states that the sequence \(\left( P^+_{{\widetilde{\gamma }}_n}-P^+_{\gamma _*}\right) _{n\ge 1}\) converges strongly in Y.
Recall that
where for \(\xi \in Q_\ell ^*\) and \(x,\, y\in Q_\ell \), the kernels of \(W_{\ge m,\gamma ,\xi }\) and \(W_{< m,\gamma ,\xi }\) are respectively
and
We first prove the following.
Lemma 6.4
The two sequences of operators \(\big (W_{<m,{\widetilde{\gamma }}_n}\big )_n\) and \(\big (W_{\ge m,{\widetilde{\gamma }}_n}\big )_n\) are bounded in \({\mathfrak {S}}_{2,\infty }\). Thus, up to the extraction of a subsequence, we may assume that
Note that saying that the operator belongs to \( {\mathfrak {S}}_{2,p}\) with \(1\le p\le +\infty \) is equivalent to saying that the family of kernels \(\xi \mapsto A_\xi (\cdot ,\cdot )\) is in \(L^p(Q_\ell ^*;L^2(Q_\ell \times Q_\ell ))\).
Proof
The first claim follows from (B.23) and (B.17) in “Appendix B”, thanks to (6.2). Let . From (B.23), we know that \(W_{<m,g}\in {\mathfrak {S}}_{2,1}\). Then, by (6.5), and since \({\mathfrak {S}}_{2,1}^*={\mathfrak {S}}_{2,\infty }\), we get
The argument is similar for \(W_{\ge m, {\widetilde{\gamma }}_n}\). Let . Notice that by (B.17), \(W_{\ge m, {\widetilde{\gamma }}_n}\in {\mathfrak {S}}_{2,1}\). Then we write
with \(|D^0|{\widetilde{\gamma }}_n|D^0|\) converging to \(|D^0|\gamma _*|D^0|\) for the \(*\)-weak topology of \({\mathfrak {S}}_{1,\infty }\) thanks to (6.5). Therefore, it remains to show that the operator \(|D^0|^{-1}W_{\ge m, g}|D^0|^{-1}\) belongs to \({\mathfrak {S}}_{\infty ,1}\) whenever \(g\in {\mathfrak {S}}_{2,1}\). Actually, we prove that \(W_{\ge m,g}|D^0|^{-1}\in Y\), which gives \(|D^0|^{-1}W_{\ge m,g}|D^0|^{-1}\in {\mathfrak {S}}_{\infty ,\infty }\) since \(\Vert |D^0|^{-1}\Vert _{{\mathfrak {S}}_{\infty ,\infty }}\le 1\). We conclude since \({\mathfrak {S}}_{\infty ,\infty }\subset {\mathfrak {S}}_{\infty ,1} \). We use Proposition B.1 on \(W_{\ge m,\ell }^\infty \) and focus on the singularity introduced by the potential \(G_\ell \), the difference being easy to deal with. For every \(\xi \in Q_\ell ^*\) and every \(\psi _\xi \) and \(\phi _\xi \) in \(L^2_\xi (Q_\ell )\) with \(\Vert \psi _\xi \Vert _{L^2_\xi }=1\), we have
thanks to (4.2). Hence the result. \(\square \)
We are now proving stronger convergence results in (6.8), by improving the bounds on the kernels of \(W_{{\widetilde{\gamma }}_n,\xi }.\)
Lemma 6.5
(Convergence of the sequence \((W_{\ge m,{\widetilde{\gamma }}_n})_{n\ge 1}\)) The sequence of kernels \(W_{\ge m,{\widetilde{\gamma }}_n,\xi }\) is bounded in \(W^{1,\infty }(Q_\ell ^*;L^2(Q_\ell \times Q_\ell ))\). Therefore, up to the extraction of a subsequence, we have
Proof
Let us start with the boundedness of the sequence. We already proved in Lemma 6.4 that \((W_{\ge m,{\widetilde{\gamma }}_n})_n\) is bounded in \({\mathfrak {S}}_{2,\infty }\) which follows from (B.17). Let us now check that the sequence of norms \(\Vert \nabla _\xi W_{\ge m,{\widetilde{\gamma }}_n,\xi }(\cdot ,\cdot )\Vert _{L^\infty (Q_\ell ^*;L^2(Q_\ell \times Q_\ell ))}\) is also bounded. Thanks to (B.5), for every \(\xi \in Q_\ell ^*\),
and we conclude with the help of (6.2). Therefore, the kernels \(|D_\xi |^{-1/2}W_{\ge m,{\widetilde{\gamma }}_n,\xi }|D_\xi |^{-1/2}(\cdot ,\cdot )\) are bounded in \(W^{1,\infty }(Q_\ell ^*;H^{1/2}(Q_\ell \times Q_\ell ))\). Thus, according to the Rellich–Kondrachov and the Arzelà–Ascoli theorems, up to the extraction of a subsequence,
which yields \( \,\Vert |D^0|^{-1/2}W_{\ge m,{\widetilde{\gamma }}_n-\gamma _*}|D^0|^{-1/2}\Vert _{{\mathfrak {S}}_{2,\infty }}\rightarrow 0, \) whence the lemma. \(\square \)
Lemma 6.6
(Convergence of the sequence \((W_{<m,{\widetilde{\gamma }}_n})_{n\ge 1}\)) The sequence of kernels \(W_{< m,{\widetilde{\gamma }}_n,\xi }\) is bounded in \(C^{0,\mu }(Q_\ell ^*;H^1_\xi (Q_\ell \times Q_\ell ))\). In particular, up to the extraction of a subsequence,
Proof
Let us first show the uniform boundedness of \(W_{< m,{\widetilde{\gamma }}_n,\xi }\) in \(C^{0,\mu }(Q_\ell ^*;H^1(Q_\ell \times Q_\ell ))\). It is based on Lemma 6.1, particularly Eq. (6.3). Recall that
Thus,
For each term on the right-hand side, we have
As \(\eta ,\xi '\in Q_\ell ^*\), according to (6.3) we get
where here and below C is a positive constant which depends only on m and \(\ell \). By the Hölder continuity of the function \(\eta \mapsto \int _{Q_\ell ^*}\frac{d\eta '}{|\eta -\eta '|^2}\), there is a \(0<\mu <1\) such that
For the last term, note that
and \(|\nabla _z (e^{-i\eta \cdot z}- e^{-i\eta '\cdot z})|\le C|\eta -\eta '|\). We get
We finally get that there is \(\mu \in (0,1)\) such that
Finally, thanks to the Arzelà–Ascoli theorem and the Rellich–Kondrachov theorem, the sequence of kernels converges strongly in \(L^\infty (Q_\ell ^*;L^2(Q_\ell \times Q_\ell ))\), up to the extraction of a subsequence, and the limit is the operator \(W_{< m,\gamma _*}\) thanks to (6.8). This concludes the proof of the lemma. \(\square \)
Then we have the following.
Lemma 6.7
(Strong convergence of the electron–electron interaction) As n goes to infinity, we have
Proof
As \(V_{\gamma }=G_\ell *\rho _{\gamma }-W_{\gamma }\), we have
Notice that, from Lemma 6.3, we infer \(\rho _{{\widetilde{\gamma }}_n}\rightarrow \rho _{\gamma _*}\) in \(L^2(Q_\ell )\). This, together with the fact that \(G_\ell \in L^2(Q_\ell )\), yield
Then, using \(|D^0|^{-1}\le 1\), we infer
We consider now the second term that we split into two parts. We already proved in Lemma 6.5 that \(\Vert |D^0|^{-1/2}W_{\ge m,{{\widetilde{\gamma }}_n}-{\gamma _*}}|D^0|^{-1/2}\Vert _{{\mathfrak {S}}_{2,\infty }}\rightarrow 0\), which implies the strong convergence in Y. The proof for the other term is even simpler, since
for \(\Vert |D^0|^{-1/2}\Vert _Y\le 1\). We conclude by Lemma 6.6. Finally, we infer
The lemma follows gathering together (6.9) and (6.10). \(\square \)
As a result, we have the following.
Corollary 6.8
(Strong convergence of the spectral projectors) As n goes to infinity, we have
Proof
For any \(\phi _\xi \) and \(\psi _\xi \in L^2_\xi \), by (5.8) and the second resolvent identity, we obtain
in virtue of (4.16). Therefore,
The right-hand side goes to 0 by Lemma 6.7. \(\square \)
6.2.2 Relationship Between \(P^+_{{\widetilde{\gamma }}_n}\) and \(P^+_{\gamma _n}\)
It remains to analyze the behaviour of \(P^+_{\gamma _n}-P^+_{{\widetilde{\gamma }}_n}\), as n goes to infinity. Our main result is the following.
Lemma 6.9
(Relationship between \(P^+_{{\widetilde{\gamma }}_n}\) and \(P^+_{\gamma _n}\)) For every \(\gamma \in Z\) and for some \(C>0\) independent of n and Z, it holds that
Proof
We first observe that \(\Vert G* (\rho _{\gamma _n}-\rho _{{\widetilde{\gamma }}_n})\Vert _{Y}\rightarrow 0\), thanks to Lemma 6.2 and (4.4) in Lemma 4.4. Next, thanks to the bound (B.10), the sequence of operators \((W_{\ge m, \gamma _n-{\widetilde{\gamma }}_n})_n\) converges strongly to 0 in Y. Therefore, proceeding as for (5.12), we have
with the right-hand side going to 0. We now focus on the term involving \(W_{<m,\gamma _n-{\widetilde{\gamma }}_n}\). For every \(\gamma \in Z\), proceeding as for (6.11), we have
thanks to (B.23) and in virtue of \(\Vert \gamma _n-{\widetilde{\gamma }}_n\Vert _Y\le 2\). \(\square \)
6.3 Existence and Properties of Minimizers for \(J_{\le q}\)
The existence of minimizers for \(J_{\le q}\) now follows by passing to the limit in the constraint and in the energy. The proof is separated into the following two lemmas.
Lemma 6.10
The limit \(\gamma _*\) lies in \(\Gamma _{q}^+\).
Proof
As
we get
and
Thus, \(\gamma _*\in \Gamma \). Since \(\epsilon _P>e_0\), \({\widetilde{{\textrm{Tr}}}}_{L^2}[\gamma _*]=q\) thanks to Lemma 6.3.
To complete the proof, it remains to show that \(P_{\gamma _*}^+\gamma _{*}=\gamma _{*}\). From Lemma 6.9 and since \(\Vert {\widetilde{\gamma }}_n-\gamma _n\Vert _X\rightarrow 0\) (Lemma 6.2), we first prove that
Indeed, since \(P_{\gamma _n}^+\gamma _n=\gamma _n\), we have
using also \(\Vert P_{\gamma _n}^+\Vert _Y\le 1\). Then the right-hand side goes to 0. Next, by Corollary 6.8,
Let \(g\in Y\). Let us show that
Notice that
The first two terms in the right-hand side goes to 0 thanks to (6.12) and (6.13). For the last two terms, we use the weak-* convergence of \({\widetilde{\gamma }}_n\) to \(\gamma _*\) in \(X^2_\infty \) and the fact that \(|D^0|^{-1}g|D^0|^{-1}\) and \(|D^0|^{-1}P_{\gamma _*}^+g|D^0|^{-1}\) both lie en \({\mathfrak {S}}_{\infty ,1}\). Hence \(\gamma _*\in \Gamma _{ q}^+\). This concludes the proof. \(\square \)
Lemma 6.11
We have
Therefore, \(\gamma _*\) is a minimizer of \(J_{\le q}\).
Proof
For the kinetic energy term, we write
Since \(\Vert \gamma _n-{\widetilde{\gamma }}_n\Vert _X\rightarrow 0\) thanks to Lemma 6.2, \({\widetilde{{\textrm{Tr}}}}_{L^2}[D^0(\gamma _{n}-{\widetilde{\gamma }}_{n})]\rightarrow 0\). On the other hand, by definition of the weak-\(^*\) convergence in \(X^2_\infty \), \(|D^0|({\widetilde{\gamma }}_{n}-\gamma _*)|D^0|\) converges to 0 for the weak-\(^*\) topology in \({\mathfrak {S}}_{1,\infty }\). As \(|D^0|^{-1}D^0|D^0|^{-1}\in {\mathfrak {S}}_{\infty ,1}\), we obtain that
hence
In addition, thanks again to Lemma 6.3,
The proof for the attractive potential is similar. We start with
The second term goes to 0 as n goes to infinity since \(\rho _{{\widetilde{\gamma }}_n}\) converges to \(\rho _{\gamma _*}\) in \(L^2(Q_\ell )\) and \(G\in L^2(Q_\ell )\) (Lemma 6.3). For the first term, we use the fact that
and we conclude by Lemma 6.2.
For the repulsive potential, using the fact that \({\widetilde{{\textrm{Tr}}}}_{L^2}[V_{{\widetilde{\gamma }}}\,\gamma ']={\widetilde{{\textrm{Tr}}}}_{L^2}[V_{\gamma '}\,{{\widetilde{\gamma }}}]\) whenever \(\gamma \) and \(\gamma '\) are in Z, we have
using the bound (4.11) in Lemma 4.7. Finally, the right-hand side in the above string of inequalities goes to 0, according to Lemmas 6.2 and 6.7. The lemma follows. \(\square \)
Since \(\gamma _*\) is a minimizer of \(J_{\le q}\), we may apply Proposition 5.1 to the constant minimizing sequence equal to \(\gamma _*\), and we obtain that
We may then apply Lemma 4.15 to conclude that with \(0\le \delta \le \mathbb {1}_{\{\nu \}}(D_{\gamma _*})\) for some \(\nu \in [\lambda _0,e_0]\) that is independent of \(\epsilon _P\). This ends the proof of Theorem 3.3. Thus, Theorem 2.6 holds true.
Data Availability
Data sharing is not applicable to this article as it has no associated data.
References
Bach, V.: Error bound for the Hartree–Fock energy of atoms and molecules. Commun. Math. Phys. 147(3), 527–548, 1992
Barbaroux, J.-M., Esteban, M.J., Séré, É.: Some connections between Dirac–Fock and electron-positron Hartree–Fock. Ann. Henri Poincaré 6(1), 85–102, 2005
Barbaroux, J.-M., Farkas, W., Helffer, B., Siedentop, H.: On the Hartree–Fock equations of the electron-positron field. Commun. Math. Phys. 255(1), 131–159, 2005
Barbaroux, J.-M., Helffer, B., Siedentop, H.: Remarks on the Mittleman max–min variational method for the electron-positron field. J. Phys. A Math. Gen. 39(1), 85–98, 2006
Cancés, É., Deleurence, A., Lewin, M.: A new approach to the modeling of local defects in crystals: the reduced Hartree–Fock case. Commun. Math. Phys. 281(1), 129–177, 2008
Catto, I., Le Bris, C., Lions, P.-L.: The Mathematical Theory of Thermodynamic Limits: Thomas–Fermi Type Models. Oxford Mathematical Monographs, p. xiv+277. The Clarendon Press, New York (1998)
Catto, I., Le Bris, C., Lions, P.-L.: Recent mathematical results on the quantum modeling of crystals. In: Mathematical Models and Methods for Ab Initio Quantum Chemistry, vol. 74. Lecture Notes in Chem, pp. 95–119. Springer, Berlin (2000)
Catto, I., Le Bris, C., Lions, P.-L.: On the thermodynamic limit for Hartree–Fock type models. Ann. Inst. Henri Poincaré C Anal. Non Linéaire 18(6), 687–760, 2001
Catto, I., Le Bris, C., Lions, P.-L.: On some periodic Hartree-type models for crystals. Ann. Inst. Henri Poincaré C Anal. Non Linéaire 19(2), 143–190, 2002
Chaix, P., Iracane, D.: From quantum electrodynamics to mean field theory: I. The Bogoliubov–Dirac–Fock formalism. J. Phys. B 22, 3791–3814, 1989
Christensen, N.E.: Relativistic band structure calculations. Int. J. Quantum Chem. 25(1), 233–261, 1984
Christensen, N.E., Seraphin, B.O.: Relativistic band calculation and the optical properties of gold. Phys. Rev. B 4, 3321–3344, 1971
Desclaux, J.-P.: Relativistic Dirac–Fock expectation values for atoms with Z = 1 to Z = 120. Atomic Data Nucl. Data Tables 12(4), 311–406, 1973
Dolbeault, J., Esteban, M.J., Séré, É.: Distinguished self-adjoint extension and eigenvalues of operators with gaps. Application to Dirac–Coulomb operators. J. Spectr. Theory 13(2), 491–524, 2023
Eschrig, H., Richter, M., Opahle, I.: Chapter 12—Relativistic solid state calculations. In: Schwerdtfeger, P. (ed.) Relativistic Electronic Structure Theory, vol. 14. Theoretical and Computational Chemistry, pp. 723–776. Elsevier (2004)
Esteban, M.J., Séré, É.: Solutions of the Dirac–Fock equations for atoms and molecules. Commun. Math. Phys. 203(3), 499–530, 1999
Esteban, M.J., Séré, É.: Nonrelativistic limit of the Dirac–Fock equations. Ann. Henri Poincaré 2(5), 941–961, 2001
Esteban, M. J., Séré, É.: A max–min principle for the ground state of the Dirac–Fock functional. In: Mathematical Results in Quantum Mechanics (Taxco, 2001). Vol. 307. Contemp. Math, pp. 135–141. American Mathematical Society, Providence (2002)
Ghimenti, M., Lewin, M.: Properties of periodic Hartree–Fock minimizers. Calc. Var. Partial Differ. Equ. 35(1), 39–56, 2009
Grant, I.P.: Relativistic calculation of atomic structures. Adv. Phys. 19(82), 747–811, 1970
Gravejat, P., Lewin, M., Séré, É.: Ground state and charge renormalization in a nonlinear model of relativistic atoms. Commun. Math. Phys. 286(1), 179–215, 2009
Gravejat, P., Lewin, M., Séré, É.: Renormalization and asymptotic expansion of Dirac’s polarized vacuum. Commun. Math. Phys. 306(1), 1–33, 2011
Griesemer, M., Lewis, R.T., Siedentop, H.: A minimax principle for eigenvalues in spectral gaps: Dirac operators with Coulomb potentials. Doc. Math. 4, 275–283, 1999
Hainzl, C., Lewin, M., Séré, É.: Existence of atoms and molecules in the mean-field approximation of no-photon quantum electrodynamics. Arch. Ration. Mech. Anal. 192(3), 453–499, 2009
Hainzl, C., Lewin, M., Séré, É., Solovej, J.P.: Minimization method for relativistic electrons in a mean-field approximation of quantum electrodynamics. Phys. Rev. A 76, 052104, 2007
Herbst, I.W.: Spectral theory of the operator \((p^2+ m^2)1/2 - Ze^2/r\). Commun. Math. Phys. 53(3), 285–294, 1977
Huber, M., Siedentop, H.: Solutions of the Dirac–Fock equations and the energy of the electron-positron field. Arch. Ration. Mech. Anal. 184(1), 1–22, 2007
Kadek, M., Repisky, M., Ruud, K.: All-electron fully relativistic Kohn–Sham theory for solids based on the Dirac–Coulomb Hamiltonian and Gaussian-type functions. Phys. Rev. B 99, 205103, 2019
Kato, T.: Perturbation Theory for Linear Operators. Classics in Mathematics. Reprint of the 1980 edition, p. xxii+619. Springer, Berlin (1995)
Kim, Y.K.: Relativistic self-consistent-field theory for closed-shell atoms. Phys. Rev. 154, 17–39, 1967
Lieb, E.H.: Variational principle for many-fermion systems. Phys. Rev. Lett. 46(7), 457–459, 1981
Lieb, E.H., Simon, B.: The Thomas–Fermi theory of atoms, molecules and solids. Adv. Math. 23(1), 22–116, 1977
Lions, P.-L.: Solutions of Hartree–Fock equations for Coulomb systems. Commun. Math. Phys. 109(1), 33–97, 1987
Meng, L.: A rigorous justification of the Mittleman’s approach to the Dirac–Fock model. Calc. Var. Partial Differ. Equ. 63(2), 39, 2024
Mittleman, M.H.: Theory of relativistic effects on atoms: configuration-space Hamiltonian. Phys. Rev. A 24, 1167–1175, 1981
Paturel, É.: Solutions of the Dirac–Fock equations without projector. Ann. Henri Poincaré 1(6), 1123–1157, 2000
Pyykko, P., Desclaux, J.-P.: Relativity and the periodic system of elements. Acc. Chem. Res. 12(8), 276–281, 1979
Reed, M., Simon, B.: Methods of Modern Mathematical Physics. IV. Analysis of Operators, p. xv+396. Academic Press, New York (1978). (Harcourt Brace Jovanovich, Publishers).
Séré, É.: A new definition of the Dirac–Fock ground state. Commun. Math. Phys. 404(3), 1275–1307, 2023
Simon, B.: Trace Ideals and their Applications, vol. 120. American Mathematical Society, Providence (2005)
Swirles, B.: The relativistic self-consistent field. Proc. R. Soc. Lond. Ser. A - Math. Phys. Sci. 152(877), 625–649, 1935
Acknowledgements
The authors are grateful to the referees for a careful reading of the manuscript and for remarks and suggestions that greatly improved the quality of this paper. L.M acknowledges support from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation program (Grant agreement No. 810367).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Communicated by C. Le Bris.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Proof of Lemma 4.1
It suffices to prove (4.2). By interpolation, we can choose \(C_H=C_G\).
To deal with (4.2), the idea is to decompose the potential \(G_\ell \) on \(Q_\ell \) into two parts, namely \(\frac{1}{|x|}\) and \(G_\ell -\frac{1}{|x|}\). The first term can be treated as the Hardy inequality on \(Q_\ell \), whereas the second is bounded. We begin with the second term and prove the following.
Lemma A.1
There is a constant \(C_0>0\) independent of \(\ell \) such that
This implies that, for any \(x\in {\mathbb {R}}^3\),
In particular, we have
Proof
As \(G_1(x)=\ell \,G_\ell (\ell x)\), it suffices to consider the case \(\ell =1\). Let \(f(x)=G_1(x)-\frac{1}{|x|}\). Equation (2.1) yields
Let B(z, R) be a ball of center z and radius R chosen such that \(\big (\bigcup _{z\in Q_1}B(z,R)\big )\cap ({\mathbb {Z}}^3{\setminus }\{0\})=\emptyset \). Obviously, we can assume \(0< R<1/2\). By the divergence theorem, for \(0\le r\le R\) and \(z\in Q_1\), we obtain
with \({\mathbb {S}}^2\) denoting the unit sphere. On the one hand, for any \(z\in Q_1\),
where the first equation holds since
Therefore, integrating (A.2) with respect to r,
Since \(\int _{B(z,R)}f(x)\,dx=\int _0^R r^2\left( \int _{{\mathbb {S}}^2}f(z+r\omega )\,d\omega \right) \,dr\), integration over [0, R] leads to
On the other hand,
Using (2.2) and the periodicity of \(G_1\), we also have
Thus,
Furthermore,
Therefore, the bound holds for \(\ell =1\) and any \(0<R<\frac{1}{2}\) with
\(\square \)
We now consider the Hardy inequality on \(Q_\ell \) for the potential \(\frac{1}{|x|}\).
Lemma A.2
Let \(u\in H^1(Q_\ell )\), then
Proof
We start with the relationship:
Thus,
By the divergence theorem for \(\int _{Q_\ell }\nabla \cdot (\frac{|u|^2 x}{|x|^2})\,dx\), we obtain
where \(\vec {n}\) is the outward pointing unit normal at each point on the boundary \(\partial Q_\ell \). To end this proof, it suffices to estimate \(\int _{\partial Q_\ell }\frac{\vec {n}\cdot x|u|^2}{|x|^2}\).
Let
As \(|\vec {n}\cdot x|=\frac{\ell }{2}\) and \(|x|\ge \frac{\ell }{2}\) for any \(x\in \partial Q_\ell \), we have
Let \(x_1^{(0)}\in (-\frac{\ell }{2},\frac{\ell }{2}]\) such that
Then we have
As
we get
Inserting this into (A.3), we can conclude that
As a result, by the Cauchy–Schwarz inequality
\(\square \)
Actually, Lemma A.2 implies that for any \(\xi \in Q_\ell ^*\),
Thus combining Lemmas A.1 and A.2, we know that
with
We now turn to the estimates on the operator \(W_{\gamma }\).
Proof of Lemma 4.5
We first study the properties of \(W_\ell ^\infty \), then we prove Lemma 4.5. In passing, we correct wrong estimates in [8]. Indeed, contrary to what was claimed, the function \(W_{\infty }(\eta , x)-e^{-i \eta \cdot x} G(x)-4 \pi \frac{e^{-i \eta \cdot x}}{|\eta |^2}\) introduced in [8, p. 745, Eq. (5.3)] is not harmonic in \((1+\epsilon )\,Q_\ell \) with respect to the x-variable. Therefore, some arguments need to be modified.
1.1 Properties of \(W_\ell ^\infty \)
Recall that \(W_\ell ^\infty =W_{\ge m,\ell }^\infty +W_{<m,\ell }^\infty \) is given by (4.7). We are going to prove the Hardy type inequalities for \(W_{\ge m,\ell }^\infty \). A natural idea is to compare it with the potential \(G_\ell \).
Proposition B.1
(Singularities for the potential \(W_{\ge m,\ell }^\infty \)) For every \(m\ge 2\), there exists a positive constant \(C_{\ge m}\) such that, for any \(\ell >0\), we have
with
Proof
The proof is similar to Lemma A.1. Notice that
We therefore take \(\ell =1\). Observe, from (2.20), that
Let \(f(\eta ,x)=W_{\ge m,1}^\infty (\eta ,x)-G_1(x)\). Then
Let B(z, R) be a ball of center z and radius R chosen such that \(\big (\bigcup _{z\in Q_1}B(z,R)\big )\cap ({\mathbb {Z}}^3{\setminus }\{0\})=\emptyset \). Obviously, we can assume \(0< R<1/2\). Analogous to (A.2), for \(0\le r\le R\) and \(z\in Q_1\) we obtain
On the one hand, for any \(z\in Q_1\),
where the first equality holds, since
Therefore, integrating (B.2) with respect to r,
Then integration over [0, R] leads to
On the other hand,
Furthermore, according to the quasi-periodicity of \(W_{\ge m,1}\) with respect to \(z\in {\mathbb {R}}^3\), for any \(\eta \in 2Q_1^*\),
Therefore, the bound (B.1) holds for \(\ell =1\) with
for any \(0<R<\frac{1}{2}\). The corresponding result for any \(\ell >0\) follows immediately by a scaling argument. \(\square \)
We can immediately conclude from Lemma 4.1 and Proposition B.1 the following.
Corollary B.2
(Hardy-type inequalities for the potential \(W_{\ge m,\ell }^\infty \)) For \(m\ge 2\), we have
and
We also have the following estimate on \(W_{\ge m,\ell }^\infty \).
Lemma B.3
Let \(m\ge 2\). There is a positive constant \(C=C(\ell ,m)\) such that
Proof
Take \(\ell =1\) for simplicity. Note that
from which we obtain
for any \(\eta \in 2Q_\ell ^*\) and \(x\in Q_\ell \). Following the proof of Lemma B.1, we know
The corresponding result for any \(\ell >0\) follows immediately by a scaling argument as for Lemma B.1. \(\square \)
1.1.1 Estimates for the Exchange Term
We consider now the exchange term. Let \(\psi _\xi \in H^{1/2}_\xi \). As
we only need to study the inner product \((W_{\gamma ,\xi }\psi _\xi ,\phi _\xi )\). For \(m\ge 2\), \(\eta \in {\mathbb {R}}^3\) and \(z\in {\mathbb {R}}^3\),
For the term that carries all singularities in the x variable (i.e., \(W_{\ge m,\ell }^\infty \)), we use the decomposition (2.14) and Corollary B.2. Let \(\gamma \in X\) with \(\gamma ^*=\gamma \). For any \(\xi \in Q_\ell ^*\) we have
with \(\left( v_{n}(\xi ,\cdot ),v_{m}(\xi ,\cdot )\right) _{L^2_\xi }=\delta _{m,n}\) and . Hence
with \(u_n(\xi ,\cdot )=|D_\xi |^{-1/2}v_n(\xi ,\cdot )\). Now, we have
with the help of (B.3). Thus, for every \(\gamma \in X\) with \(\gamma =\gamma ^*\), we have
Using the Cauchy–Schwarz inequality and (B.4), we can also argue as follows whenever \(\gamma \in {\mathfrak {S}}_{1,1}\) with \(\gamma ^*=\gamma \):
Thus, for every \(\gamma \in {\mathfrak {S}}_{1,1}\) with \(\gamma ^*=\gamma \), since \(|D^0|^{-1}\in {\mathfrak {S}}_{\infty ,\infty }\), we have
Additionally, when \(\gamma \in X^2_1\) with \(\gamma =\gamma ^*\) (which includes the case \(\gamma \in X^2_\infty )\), one can prove that \(W_{\ge m,\gamma }\) belongs to \({\mathfrak {S}}_{2,\infty }\). Recall that \( \Vert W_{\ge m,\gamma }\Vert _{{\mathfrak {S}}_{2,p}}= \Vert W_{\ge m,\gamma }(\cdot ,\cdot )\Vert _{L^p(Q_\ell ^*;L^2(Q_\ell \times Q_\ell ))}\). The main ingredient is that \(\sqrt{\rho _{|\gamma |}}\in H^1(Q_\ell )\) when \(\gamma \in X^2_1\). Indeed, analogously to (B.8), for any \(\gamma \in X^2_1\), we have
with \(\left( v_{n}(\xi ,\cdot ),v_{m}(\xi ,\cdot )\right) _{L^2_\xi }=\delta _{m,n}\) and . Hence
with \(u_n(\xi ,\cdot )=|D_\xi |^{-1}v_n(\xi ,\cdot )\in H^1_\xi (Q_\ell )\), from which we deduce, adapting the proof in [8, Eq. (4.42)], that
since . For every \(\xi \in Q_\ell ^*\), the kernel of \(W_{\ge m, \gamma , \xi }\) writes
We now use the bound (B.1) on \(W_{\ge m,\ell }^\infty \) and split the kernel \(W_{\ge m, \gamma ,\xi }\) in two terms :
with \(f\in L^\infty (2Q_\ell ^*\times 2Q_\ell )\). The second term in (B.16) lies in \(L^\infty (Q_\ell ^*;L^2(Q_\ell \times Q_\ell ))\) as soon as \(\gamma \in {\mathfrak {S}}_{2,1}\). Therefore, the delicate contribution comes from the term involving the potential \(G_\ell \) that we bound from above by using (2.19):
Finally, using (B.14), we have
where C is a positive constant that depends only on m and \(\ell \).
We now study the contribution of the term involving \(W_{<m, \ell }^{\infty }\), that carries the singularities in the \(\eta \) variable. For this part of the proof, we only need that \(\gamma \in Y\). Then, we may write
with
Thus, for every \(\gamma \in Y\), \(W_{<m,\gamma }\in Y\) and
We now make a further assumption on \(\gamma \); namely, \(\gamma \in {\mathfrak {S}}_{1,\infty }\). (Actually, we need \(\gamma \in {\mathfrak {S}}_{1,4}\).) We first observe that
In particular, \(\rho _{\gamma _{\xi '+\frac{2k\pi }{\ell }}}=\rho _{\gamma _{\xi '}}\quad \text {for every }\xi '\in Q_\ell ^* \text { and } k\in {\mathbb {Z}}^3\), and the function of \(\xi \mapsto {\textrm{Tr}}_{L^2_{\xi }}(\gamma _{\xi })\) is \(Q_\ell ^*\)-periodic. Next, we write
where the last estimate follows from the Cauchy–Schwarz inequality. Here and below we use the fact that
since \((Q_\ell ^*+2\pi k/\ell )\cap (Q_\ell ^*+2\pi k'/\ell )=\emptyset \) whenever \(k,k'\in {\mathbb {Z}}^3\) with \(k\ne k'\). We focus on the quantity inside the brackets in the last inequality. By Hölder’s inequality, for \(\gamma \in {\mathfrak {S}}_{1,\infty }\), and for some constant \(C_{\le m,\ell }'\), we obtain
Following the lines of the proof of (B.21) and (B.22), we obtain, for every \(\xi \in Q_\ell ^*\),
and, by Young’s convolution inequality, the right-hand side belongs to \(L^p(Q_\ell ^*)\) as soon as \(\gamma \in {\mathfrak {S}}_{2,p}\) with \(1\le p\le +\infty \) (which is guaranteed whenever \(\gamma \in {\mathfrak {S}}_{1,p}\)), where C is a positive constant which depends only on m and \(\ell \). In particular, for every \(1\le p\le +\infty \),
Since \(\Vert \gamma \Vert _{{\mathfrak {S}}_{1,1}}\le \Vert \gamma \Vert _X\) and \(|D^{0}|^{-1/2}\le 1\), the statement of the lemma follows : From (B.9) and (B.18), we obtain (4.8); from (B.9) and (B.22), we obtain (4.9); from (B.11) and (B.18), we obtain (4.10). More precisely,
with
Proof of Lemma 4.7
Analogously to (B.6), we have
We can rewrite as \(W_{\ell }^\infty =W_{< m,\ell }^\infty +G_\ell + (W_{\ge m,\ell }^\infty -G_\ell )\). According to Proposition B.1 and (B.18), the terms associated to \(W_{< m,\ell }^\infty \) and \((W_{\ge m,\ell }^\infty -G_\ell )\) are easily bounded. The aim of this section is to get a better estimate on the following term:
From now on, for any function \(f\in L^2(Q_\ell ,{\mathbb {C}}^4)\), we denote \(f:=(f^\alpha )_{1\le \alpha \le 4}\). We use the decomposition (2.14) for \(\gamma \in {\mathfrak {S}}_{1,1}\cap Y\) such that \(\gamma =\gamma ^*\). Then as \(G(x)=G(-x)\), for almost every \(\xi \in Q_\ell ^*\), we may write
Estimate for (4.12). By Lemma A.1, we have
Thus according to the Cauchy–Schwarz inequality, we have
Substituting this inequality into (C.2) and using the decomposition (2.15), we get
from which we get
Combining (C.4) with Proposition B.1 and (B.18), we get for any \(\phi _\xi \in L^2_{\xi }\) and \(\psi _\xi \in H^1_{\xi }\),
hence (4.12) with
with \(C_\ell \) given in (B.25).
Estimate for (4.11). As \(\gamma \in Z\), we use the decomposition (B.8) for \(\gamma _{\xi }\). Analogously to (C.3), we also have
from which by the decomposition (B.8) we get
where the last inequality holds by using Lemma 4.1.
Combining (C.6) with Proposition B.1 and estimate (B.18), we get for any \(\phi _\xi \in L^2_{\xi }\) and \(\psi _\xi \in H^1_{\xi }\),
hence (4.12) and
Estimate for (4.13). Combining with Proposition B.1 and estimate (B.18), analogously to (C.6) it can be derived directly from:
using the decomposition (2.14) for \(\gamma _\xi \). Hence (4.13) and \(C_{EE}\).
Estimate for (4.14). Notice that \(|\gamma _{\xi '}(x,y)|\le \rho _{\gamma _{\xi '}}(x)^{1/2}\rho _{\gamma _{\xi '}}(y)^{1/2}\) since \(\gamma \ge 0\). Thus, according to Lemma A.1 and the Cauchy–Schwarz inequality,
Combining with Proposition B.1 and (B.18), we get
hence (4.13) and
Numerical Results About Constants
In this section, we will show the numerical results about the constants used in Remark 2.8 under the condition \(\ell =1000\). Next, we show that Assumption 2.5 is satisfied for \(q\le 17\) for the neutral systems.
We compute numerically the value of the bound of the potential \(G_\ell -\frac{1}{|x|}\). First of all, we calculate
Thus, \(C_0\approx 5.019\) and we can choose \(C_H= C_G\approx 2.011\). Concerning the estimates involving the potential \(W_\ell \), we set \(m=2\). When \(R\approx \frac{1}{2}\),
Thus, we get \(C_W\approx 2.042\), and \(C'_W\approx 2.042\). Then, \(C_{EE}\approx 2.052\), \(C_{EE}'\approx 2.052\) and \(C_{EE}''\approx 0.041\).
Finally, we estimate \(c^*(\lceil q\rceil )\) which is given by (2.9). Let \(u_{p,\xi }(x)=e^{i\left( \frac{2\pi p}{\ell } +\xi \right) \cdot x}\) with \(p\in {\mathbb {Z}}^3\). Then \((u_{p,\xi })_{p\in {\mathbb {Z}}^3}\) is an orthogonal basis on \(L^2_\xi (Q_\ell )\). Obviously, \((\Lambda ^+ u_{p,\xi })_p\) is also an orthogonal basis on \(L^2_\xi (Q_\ell )\). Let
Then
Now we can check Assumption 2.5 for \(z=q=17\). The calculation leads to \(a\approx 0.010\) and \(c^*(17)\le 1.006\). Thus, we have
-
\(\kappa +\frac{\alpha }{2} C_{EE}q^+\approx 0.631<1\),
-
\(2a\,\sqrt{\max \{(1-\kappa -\frac{\alpha }{2} C_{EE}q^+)^{-1}(1-\kappa )^{-1} c^*(\lceil q \rceil )q,1\}q^+}\le 0.973<1\).
Consequently, Assumption 2.5 is satisfied for \(z=q\le 17\) whenever \(\ell =1000\).
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Catto, I., Meng, L., Paturel, É. et al. Existence of Minimizers for the Dirac–Fock Model of Crystals. Arch Rational Mech Anal 248, 63 (2024). https://doi.org/10.1007/s00205-024-01988-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00205-024-01988-8