Piecewise Linearity and Spectroscopic Properties from Koopmans-Compliant Functionals

Dabo, Ismaila; Ferretti, Andrea; Marzari, Nicola

doi:10.1007/128_2013_504

Part of the book series: Topics in Current Chemistry ((TOPCURRCHEM,volume 347))

Abstract

Density-functional theory is an extremely powerful and widely used tool for quantum simulations. It reformulates the electronic-structure problem into a functional minimization with respect to the charge density of interacting electrons in an external potential. While exact in principle, it is approximate in practice, and even in its exact form it is meant to reproduce correctly only the total energy and its derivatives, such as forces, phonons, or dielectric properties. Quasiparticle levels are outside the scope of the theory, with the exception of the highest occupied state, since this is given by the derivative of the energy with respect to the number of electrons. A fundamental property of the exact energy functional is that of piecewise linearity at fractional occupations in between integer fillings, but common approximations do not follow such piecewise behavior, leading to a discrepancy between total and partial electron removal energies. Since the former are typically well described, and the latter provide, via Janak’s theorem, orbital energies, this discrepancy leads to a poor comparison between predicted and measured spectroscopic properties. We illustrate here the powerful consequences that arise from imposing the constraint of piecewise linearity to the total energy functional, leading to the emergence of orbital-density-dependent functionals that (1) closely satisfy a generalized Koopmans condition and (2) are able to describe with great accuracy spectroscopic properties.

1 Introduction

Optimizing the performance of materials involves understanding their properties as a function of structure and composition [1]. At the experimental level, some of the most powerful approaches are provided by spectroscopic techniques of increasing time and space resolution. However, as spectroscopy experiments become more detailed, the data they provide become more difficult to interpret. Therefore, computational methods that deliver insight into complex spectroscopic data become critical to characterize complex or novel materials. A number of electronic-structure methods [2] have been developed to address spectroscopic properties. These methods rely on solving the equations of quantum mechanics to capture the interactions of electrons with electromagnetic fields, but, due to the complexity of the many-electron Schrödinger problem, these equations must first be simplified before they can be solved computationally.

To break down this complexity, one general approach aims to map the total energy, in principle an expectation value over the very cumbersome N-electron wave function Ψ(r₁, r₂, …, r_N), onto simpler reduced variables, which encode the properties that are relevant to the physical phenomenon at hand. For instance, if one’s goal is to capture the energy of an electronic system, one can choose the reduced variable to be the ground-state electron density ρ(r). Then there exists a functional whose minimization with respect to ρ(r) yields the exact ground-state density and total energy of the system as a function of the atomic positions. This approach is referred to as density-functional theory (DFT); its proof was first established by Hohenberg and Kohn [3] and then extended to degenerate ground states and open systems using Legendre transform analysis [4, 5]. In addition to the energy, variations of the DFT energy functional with respect to any external variable are also reproduced correctly. As an example, the first derivatives of the DFT energy with respect to atomic coordinates provide atomic forces from which one can extract equilibrium geometries, and its second derivatives provide interatomic force constants, from which one can derive dynamical properties and vibrational spectra. In quantitative terms, existing local and semilocal approximations to density-functional theory enable one to predict vibrational spectra with a typical accuracy of 1–2% for systems containing hundreds of atoms. Density-functional calculations and related perturbation methods are reviewed in [6, 7].

DFT can also describe changes in energy with respect to the number of particles, and thus provides orbital levels either exactly [8] or accurately for the frontier valence shells [9]. In particular, exact Kohn–Sham (KS) DFT calculations yield the exact highest occupied orbital energies of many-electron systems and provide reasonable approximations to single-electron energies for the other valence states (for a discussion of the subtleties connected to the interpretation of Kohn–Sham eigenvalues see, e.g., [9, 10]). Approximate DFT calculations usually make matters worse, and in general are only poor predictors of electronic spectra, notwithstanding their very good performance in describing the thermodynamic and kinetic properties of molecular systems. For instance, local and semilocal KS-DFT overestimate occupied electron levels and underestimate unoccupied levels, causing band gaps^{Footnote 1} to be systematically underestimated, and thus providing a poor description of charged excitations, where an electron is removed or added to the system, as happens in photoemission experiments. Conversely, time-dependent extensions to DFT (TDDFT) [11, 12] have the power to describe correctly the optical response of materials (i.e., neutral excitations). However, TDDFT calculations based upon adiabatic local and semilocal approximations exhibit severe limitations in describing the optical response of extended systems [13] and in capturing charge-transfer excitations, whereby the absorption of a photon is accompanied by a significant displacement of the excited electrons [14–17].

To overcome these limitations, one approach consists of selecting reduced variables that encode more spectral information. In this vein, methods that rely on the Green’s function G(r,r′,ω) as the central variable (such as the quasiparticle GW approximation) have been very successful in predicting electronic spectra [18–23], and their extensions (such as the Bethe–Salpeter equation) have provided reliable optical spectra [13, 24–27]. Likewise, electronic-structure approaches that rely on the one-body density matrix γ(r,r′) (reduced density matrix functional theory) have shown great promise [28–31]. Nevertheless, due to the simplicity of DFT and the extensive experience gained over decades in building more predictive density functionals, the DFT approach remains conceptually and computationally appealing [32]. Hence, it is of interest to develop better approximations beyond conventional local and semilocal methods. To this end, successful hybrid DFT functionals, which include a fraction of Hartree–Fock exchange in a simple linear-admixture or more sophisticated range-separated fashion, have been developed [33, 34]. The state of the art of these methods is reviewed in [35], and other extensions have recently been proposed [36, 37].

In this work we review another route towards more accurate and efficient DFT or beyond-DFT methods. These functionals are obtained by imposing the condition of piecewise linearity of the energy on existing local and semilocal functionals, generalizing earlier suggestions for determining the strength of Hubbard corrections to DFT [38, 39] and correcting for self-interaction in DFT [40] beyond the case of localized d and f manifolds. The resulting functionals become orbital-density-dependent, but retain the conceptual simplicity of conventional DFT approximations while restoring important conditions connected to the description of electronic spectra, namely, the Koopmans compliance of orbital energies. The review is organized as follows. We first recall the essential features of DFT. We then explain the construction of Koopmans-compliant orbital-density-dependent (ODD) functionals and discuss their practical minimization. Finally, we present spectral predictions for a range of molecular systems to establish the predictive potential of Koopmans-compliant methods.

2 Methods

2.1 Functionals of the Total Density

Before presenting ODD Koopmans-compliant functionals, we outline in this section the main features of conventional DFT. In particular, we place emphasis on the analytical interpretation of calculated electronic spectra. To this end, we work within the independent-electron mapping of Kohn and Sham [41] generalized to fractional orbital occupations.

It is important to note that fractional orbital occupations are beyond the scope of the original Kohn–Sham framework. The introduction of fractional orbital occupations is generally attributed to Janak who first interpreted orbital energies as derivatives of the total energy with respect to these new electronic variables [42]. In the literature, the generalization of the Kohn–Sham functional to fractional occupations is termed the extended Kohn–Sham model [43]. Beyond its central importance to interpret orbital energies, the extended Kohn–Sham model is a powerful framework to construct robust energy minimization schemes. Examples of such algorithms are provided by the ensemble-DFT algorithm [44] and relaxed-constraint algorithm [45] that both rely on exploring fractionally occupied states to reduce the nonconvexity of the Kohn–Sham electronic-structure problem. Paradoxically, fractional orbital occupations appeared in the DFT literature even before fractional electron numbers were discussed physically by Perdew et al. [8] in terms of grand-canonical mixtures of pure states (and then by Yang et al. [46] in terms of pure states), and formalized mathematically by Lieb [5] using convex-envelope analysis. The theory of fractional orbital occupations and that of fractional electron numbers are nonetheless closely related (the Aufbau principle), and are both critical to understanding the failure of conventional Kohn–Sham DFT approximations in predicting electronic spectra. Therefore, both of these theories are central to the subsequent discussion.

To begin our discussion, let us recall that, within the Kohn–Sham theory, the ground-state energy E(N) of the N-electron system can be obtained by minimizing the energy functional [41]

$$ {E}_{\mathrm{KS}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]={\displaystyle \sum_{i=1}^{+\infty }{f}_i{\displaystyle \int {d}^3\mathbf{r}{\varphi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\widehat{h}}_0{\varphi}_i\left(\mathbf{r}\right)+{E}_{\mathrm{Hxc}}\left[\rho \right]}}, $$

(1)

which includes the nonlinear electron-interaction term E_Hxc[ρ] that depends on the total electron density

$$ \rho \left(\mathbf{r}\right)={\displaystyle \sum_{i=1}^{+\infty }{f}_i\left|{\varphi}_i\right|{}^2\left(\mathbf{r}\right)}, $$

(2)

where the f_i's and φ_i's denote the fractional occupations and wave functions of the fictitious independent-electron system, respectively (for simplicity, the spin index is omitted throughout). The linear part in (1) involves the Hamiltonian operator

$$ {\widehat{h}}_0=-\frac{1}{2}{\nabla}_{\mathbf{r}}^2+v\left(\mathbf{r}\right) $$

that is the sum of the one-electron kinetic operator and potential v(r) generated by the atomic nuclei and external contributions.

The total energy E(N) of the system in its ground state is obtained by performing the minimization

$$ E(N)=\underset{\begin{array}{l}{\sum}_{i=1}^{+\infty }{f}_i=N,\ 0\le {f}_i\le 1\\ {}\int {\varphi}_i^{\ast }{\varphi}_j={\delta}_{ij}\end{array}}{\min }\ \sum_{i=1}^{+\infty }{f}_i\int {d}^3\mathbf{r}\,{\varphi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\widehat{h}}_0{\varphi}_i\left(\mathbf{r}\right)+{E}_{\mathrm{Hxc}}\left[\sum_{i=1}^{+\infty }{f}_i\left|{\varphi}_i\right|{}^2\right], $$

(3)

where the occupations f_i of the orthonormal Kohn–Sham orbitals φ_i must add up to N and must obey the constraints 0 ≤ f_i ≤ 1.

To perform this minimization, we first focus on the orbital degrees of freedom (the minimization with respect to the occupations will be examined in a second step). We thus introduce the Lagrange functional

$$ \begin{array}{lll}{\mathcal{L}}_{\mathrm{KS}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & =\hfill & {\displaystyle \sum_{i=1}^{+\infty }{f}_i{\displaystyle \int {d}^3\mathbf{r}{\varphi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\widehat{h}}_0{\varphi}_i\left(\mathbf{r}\right)+{E}_{\mathrm{Hxc}}\left[{\displaystyle \sum_{i=1}^{+\infty }{f}_i\left|{\varphi}_i\right|{}^2}\right]}}\hfill \\ {}\hfill & \hfill & -{\displaystyle \sum_{i,j=1}^{+\infty }{\Lambda}_{ij}\left({\displaystyle \int {d}^3\mathbf{r}{\varphi}_i^{\ast}\left(\mathbf{r}\right){\varphi}_j\left(\mathbf{r}\right)-{\delta}_{ij}}\right)}\hfill \end{array}. $$

(4)

Variations of ℒ_KS with respect to the orbitals φ_i and their complex conjugates φ^∗_i yield a set of coupled one-electron equations:

$$ {f}_i\left({\displaystyle {\widehat{h}}_0}{\varphi}_i\left(\mathbf{r}\right)+{v}_{\mathrm{Hxc}}\left(\mathbf{r}\right){\varphi}_i\left(\mathbf{r}\right)\right)={\displaystyle \sum_{j=1}^{+\infty }{\Lambda}_{ij}{\varphi}_j\left(\mathbf{r}\right)}, $$

(5)

$$ {f}_i\left({\displaystyle {\widehat{h}}_0}{\varphi}_i\left(\mathbf{r}\right)+{v}_{\mathrm{Hxc}}\left(\mathbf{r}\right){\varphi}_i\left(\mathbf{r}\right)\right)={\displaystyle \sum_{j=1}^{+\infty }{\Lambda}_{ji}^{\ast }{\varphi}_j\left(\mathbf{r}\right)}, $$

(6)

where $ {v}_{\mathrm{Hxc}}\left(\mathbf{r}\right)=\frac{\updelta {E}_{\mathrm{Hxc}}\left[\rho \right]}{\updelta \rho \left(\mathbf{r}\right)} $ stands for the effective single-electron potential, which includes a classical electrostatic contribution (the Hartree potential) and quantum exchange-correlation interactions. Note that these equations must be solved self-consistently as v_Hxc(r) is a functional of ρ(r) that itself depends on the solution of the Kohn–Sham problem. Using orthonormality relations, it can be shown that the matrix of Lagrange multipliers fulfills the conditions

$$ {\Lambda}_{ij}={\Lambda}_{ji}=0, $$

(7)

whenever the state φ_i is not occupied (f_i = 0). Additionally, the Lagrange matrix is Hermitian:

$$ {\Lambda}_{ij}={\Lambda}_{ji}^{\ast }. $$

(8)

It should also be noted that the Λ_ij's can only couple orbitals φ_i and φ_j that have the same occupations (f_i = f_j). This condition can be derived from the relation

$$ \left({f}_i-{f}_j\right){\Lambda}_{ij}=0, $$

(9)

which implies that Λ_ij vanishes whenever f_i differs from f_j. As a result, the Λ_ij's form a block-diagonal matrix in which each block corresponds to orbitals that have the same occupations.

Now, bearing in mind that v _Hxc(r) is a functional of the density ρ(r) and invoking the invariance of ρ(r) with respect to block-diagonal unitary transformations, we can recast the self-consistent equations into

$$ {f}_i\left({\displaystyle {\widehat{h}}_0}{\psi}_i\left(\mathbf{r}\right)+{v}_{\mathrm{Hxc}}\left(\mathbf{r}\right){\psi}_i\left(\mathbf{r}\right)\right)={\lambda}_i{\psi}_i\left(\mathbf{r}\right), $$

(10)

where the coefficients λ_i are the eigenvalues of the Lagrange matrix and the orbitals ψ_i are related to the initial orbitals φ_i through the block-diagonal rotation U that diagonalizes the Lagrange matrix Λ of the same block-diagonal form:

$$ {\psi}_i\left(\mathbf{r}\right)={\displaystyle \sum_{j=1}^{N_{\mathrm{occ}}}{U}_{ji}^{\ast }{\varphi}_j\left(\mathbf{r}\right)}, $$

(11)

$$ {\Lambda}_{ij}={\displaystyle \sum_{k=1}^{N_{\mathrm{occ}}}{U}_{ik}{\lambda}_k{U}_{jk}^{\ast }}, $$

(12)

with $ {\displaystyle {\sum}_{k=1}^{N_{\mathrm{occ}}}}{U}_{ki}{U}_{kj}^{\ast }={\displaystyle {\sum}_{k=1}^{N_{\mathrm{occ}}}}{U}_{ik}^{\ast }{U}_{jk}={\delta}_{ij} $ and (f_i − f_j)U_ij = 0. For the moment, all the summations are restricted to the N_occ occupied states (the extension to unoccupied states is described below). We can then rewrite (10) in the canonical form

$$ {\displaystyle {\widehat{h}}_0}{\psi}_i\left(\mathbf{r}\right)+{v}_{\mathrm{Hxc}}\left(\mathbf{r}\right){\psi}_i\left(\mathbf{r}\right)={\varepsilon}_i{\psi}_i\left(\mathbf{r}\right), $$

(13)

where the eigenvalues of the self-consistent Kohn–Sham Hamiltonian $ {\displaystyle {\widehat{h}}_{\mathrm{KS}}}={\displaystyle {\widehat{h}}_0}+{v}_{\mathrm{Hxc}}\left(\mathbf{r}\right) $ and of the Lagrange matrix Λ are related through ε_i = λ_i/f_i.

We are now in a position to define the occupation-dependent energy

$$ E\left({f}_1,{f}_2,\dots \right)={E}_{\mathrm{KS}}\left[{f}_1,{f}_2,\dots, {\psi}_1,{\psi}_2,\dots \right], $$

(14)

where the ψ_i's stand for the canonical Kohn–Sham orbitals at self-consistency (13), sorted in ascending order of their eigenenergies, that is, ε₁ ≤ ε₂ ≤ …. The definition E(f₁, f₂, …) allows us to rewrite the ground-state energy in terms of a constrained minimization over the occupation numbers:

$$ E(N)=\underset{\begin{array}{l}{\displaystyle {\sum}_{i=1}^{+\infty }}{f}_i=N\\ {}0\le {f}_i\le 1\end{array}}{\min }E\left({f}_1,{f}_2,\dots \right). $$

(15)

From this definition, one can interpret the Kohn–Sham eigenvalues as the derivatives of the occupation-dependent energy including self-consistent orbital relaxation:

$$ \frac{\partial E\left({f}_1,{f}_2,\dots \right)}{\partial {f}_i}={\varepsilon}_i, $$

(16)

where use has been made of the relation

$$ {\displaystyle \int {d}^3\mathbf{r}\left(\updelta {\psi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\displaystyle {\widehat{h}}_{\mathrm{KS}}}{\psi}_j\left(\mathbf{r}\right)+{\psi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\displaystyle {\widehat{h}}_{\mathrm{KS}}}\updelta {\psi}_j\left(\mathbf{r}\right)\right)=0}, $$

(17)

that results from Kohn–Sham stationarity and orthonormality conditions. In the literature, (16) is referred to as Janak’s theorem [42]. The theorem also holds for unoccupied states upon extending the diagonalization of $ {\displaystyle {\widehat{h}}_{\mathrm{KS}}} $ to empty orbitals instead of only considering the N_occ occupied states. This straightforward extension does not affect the occupation-dependent energy while enabling us to define energy derivatives at f_i = 0⁺ and offering an analytical interpretation for the eigenenergies of the empty states.
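Janak’s relation (16) can be checked numerically on a toy model. The sketch below is only an illustration under strong assumptions: the orbitals are frozen, and the Kohn–Sham functional is replaced by a hypothetical quadratic form E(f) = Σ_i f_i h_i + ½ Σ_ij f_i f_j J_ij, where the parameters `h` and `J` are invented for the example and do not come from any real system. In this model the orbital energy is ε_i = h_i + Σ_j J_ij f_j, and a finite-difference derivative of E with respect to f_i should reproduce it.

```python
def total_energy(f, h, J):
    """Toy occupation-dependent energy with frozen orbitals.

    E(f) = sum_i f_i h_i + 1/2 sum_ij f_i f_j J_ij is a Hartree-like stand-in
    for E_KS; h and J are hypothetical model parameters (J symmetric).
    """
    n = len(f)
    linear = sum(f[i] * h[i] for i in range(n))
    interaction = 0.5 * sum(
        f[i] * f[j] * J[i][j] for i in range(n) for j in range(n)
    )
    return linear + interaction


def eigenvalue(i, f, h, J):
    """Model orbital energy eps_i = h_i + sum_j J_ij f_j."""
    return h[i] + sum(J[i][j] * f[j] for j in range(len(f)))


def energy_derivative(i, f, h, J, step=1e-5):
    """Central finite-difference estimate of dE/df_i.

    The toy polynomial is defined for any real f_i, so stepping slightly
    outside [0, 1] is harmless here.
    """
    up, down = list(f), list(f)
    up[i] += step
    down[i] -= step
    return (total_energy(up, h, J) - total_energy(down, h, J)) / (2 * step)


# Invented parameters: three levels, the last one empty.
h = [-1.0, -0.5, 0.2]
J = [[0.6, 0.3, 0.2], [0.3, 0.5, 0.25], [0.2, 0.25, 0.4]]
f = [1.0, 0.7, 0.0]

for i in range(len(f)):
    print(i, eigenvalue(i, f, h, J), energy_derivative(i, f, h, J))
```

The two printed columns should agree to within finite-difference noise, including for the empty level. In a self-consistent calculation the orbitals would also relax with the occupations; the toy model leaves that out entirely.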

A central consequence of Janak’s theorem is the Aufbau principle, which has been alluded to at the beginning of this section. In fact, from Janak’s theorem, one can infer that the system remains unstable as long as a state that has an energy ε_i strictly lower than the energy ε_ℋ of the highest occupied orbital is not entirely filled.^{Footnote 2} The Aufbau principle can also be extended to the case where the highest occupied level is degenerate, as expressed by the relation

$$ E(N)=E\left(1,1,\dots, 1,{f}_1^{\mathrm{\mathcal{H}}},\dots, {f}_d^{\mathrm{\mathcal{H}}},0,0,\dots \right). $$

(18)

Equation (18) indicates that the ground state of the N-electron system can be constructed by simply filling the Kohn–Sham levels in ascending order until reaching the highest occupied levels of degeneracy d and of fractional occupations f^ℋ₁, …, f^ℋ_d. Finally, using the Aufbau principle, it can be shown that

$$ \frac{\mathrm{d}E(N)}{\mathrm{d}N}=\frac{\partial E(N)}{\partial {f}_i^{\mathrm{\mathcal{H}}}}={\varepsilon}_{\mathrm{\mathcal{H}}}, $$

(19)

which reflects the fact that changes in the total electron number N can only occur through changes in the occupation numbers at the highest occupied level ε_ℋ. These important relations provide an analytical interpretation for the Kohn–Sham eigenenergies. They are exploited in the next section to construct ODD functionals beyond conventional DFT approximations.
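The Aufbau filling expressed by (18) can be sketched in a few lines. The function below is a minimal illustration, not a production occupation scheme: it takes a fixed list of eigenvalues (in a real calculation they change with the occupations and the filling is repeated at each self-consistency step), omits spin as in the text, and spreads the remaining electrons evenly over a degenerate highest occupied shell, which is one admissible choice among many. The name `aufbau_occupations` and the degeneracy tolerance `tol` are assumptions made for this example.

```python
def aufbau_occupations(eigenvalues, n_electrons, tol=1e-8):
    """Fill levels in ascending energy order; return occupations in input order.

    Levels closer than `tol` are treated as one degenerate shell and share
    the electrons left for that shell equally.
    """
    if not 0 <= n_electrons <= len(eigenvalues):
        raise ValueError("n_electrons must lie between 0 and the number of levels")

    order = sorted(range(len(eigenvalues)), key=lambda i: eigenvalues[i])
    occupations = [0.0] * len(eigenvalues)
    remaining = float(n_electrons)
    pos = 0
    while pos < len(order) and remaining > 1e-12:
        # Collect the degenerate shell that starts at position `pos`.
        shell = [order[pos]]
        while (
            pos + len(shell) < len(order)
            and abs(eigenvalues[order[pos + len(shell)]] - eigenvalues[order[pos]]) <= tol
        ):
            shell.append(order[pos + len(shell)])

        # Each level holds at most one electron (spin index omitted).
        filling = min(1.0, remaining / len(shell))
        for i in shell:
            occupations[i] = filling
        remaining -= filling * len(shell)
        pos += len(shell)
    return occupations


# 2.5 electrons over a nondegenerate level and a doubly degenerate shell.
print(aufbau_occupations([-1.0, -0.5, -0.5, 0.3], 2.5))
```

With these inputs the lowest level is filled completely and the degenerate pair shares the remaining 1.5 electrons, which mirrors the pattern (1, …, 1, f^ℋ₁, …, f^ℋ_d, 0, …) in (18).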

2.2 Functionals of the Orbital Densities

2.2.1 Charged Excitations

Spectroscopy experiments in the X-ray and ultraviolet wavelength ranges involve excitations whose energies are sufficiently high to modify the charge of the sample through the removal (or addition) of an electron. The description of charged excitations requires us to predict correctly the energy of the system as a function of a reaction coordinate that parametrizes the excitation process, e.g., the occupation of the ionized state. In particular, if one is interested in capturing the onset of ultraviolet photoemission, one must correctly describe the dependence of the energy on the occupations of the highest occupied orbitals f^ℋ_i or, equivalently, on the electron number N (see (19) and the related discussion). Beyond charged excitations, correctly predicting the analytical behavior of the energy is important to describe lower-energy neutral excitations. Indeed, the accuracy of adiabatic TDDFT approximations in predicting neutral excitations and related optical resonances depends in particular on the ability of the underlying DFT approximations to describe orbital energies, that is, the derivatives of the energy with respect to the occupation numbers [17].

However, conventional Kohn–Sham DFT approximations do not correctly describe charged excitations; typically, the energy E(N) exhibits a strong nonlinear dependence within local, semilocal, and hybrid approximations, whereas the exact behavior of E(N) is known to be a connection of straight line segments between integer electron numbers. In fact, at fractional electron number, the ground state can be expressed as a statistical mixture of at most two pure states and its total energy verifies the linearity relation

$$ E(N)=\left(1-\omega \right)E(M)+\omega E\left(M+1\right), $$

(20)

where M and ω are the integer and fractional parts of N, respectively.
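The linearity relation (20) is easy to state in code. The sketch below assumes that ground-state energies at integer electron numbers are already available; the dictionary `energies` holds invented values in arbitrary units, chosen only so that the slope changes at the integer, and the function names are hypothetical.

```python
import math


def piecewise_linear_energy(n, integer_energies):
    """Evaluate Eq. (20): interpolate linearly between E(M) and E(M+1).

    integer_energies[M] is the ground-state energy at integer electron number M.
    """
    m = math.floor(n)
    omega = n - m
    if omega == 0:
        return integer_energies[m]
    return (1 - omega) * integer_energies[m] + omega * integer_energies[m + 1]


def segment_slope(n, integer_energies):
    """Slope dE/dN on the segment [M, M+1) that contains n."""
    m = math.floor(n)
    return integer_energies[m + 1] - integer_energies[m]


# Invented integer-N energies (arbitrary units), for illustration only.
energies = {4: -10.0, 5: -14.0, 6: -15.0}

print(piecewise_linear_energy(4.25, energies))  # on the segment between N=4 and N=5
print(segment_slope(4.25, energies), segment_slope(5.5, energies))
```

The slope is constant within each segment and jumps at the integer, so dE/dN carries no dependence on the fractional part ω inside a segment. An approximate functional that yields a curved E(N) between integers violates exactly this property.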

The piecewise linearity of the total energy was first established by Perdew et al. [8] and is critical to describe a range of orbital properties. It was then suggested by Cococcioni and de Gironcoli that it could be used to determine the strength U of Hubbard corrections to DFT [38, 39]. The connection between self-interaction and the lack of piecewise linearity was first made by Kulik et al. [40], arguing that Hubbard corrections reduce the hybridization and delocalization of d or f orbitals, and thus improve self-interaction errors rather than correlations, and by Mori-Sánchez et al. [47], who introduced the related concept of many-electron self-interaction. In addition to the inaccurate prediction of orbital energy levels, the lack of piecewise linearity of conventional DFT approximations results in an incorrect description of orbital densities [48]; functionals for which the dependence of E(N) on N is convex tend to delocalize the orbital densities, whereas functionals for which E(N) is concave lead to overlocalization [49, 50].^{Footnote 3} Equivalently, imposing the piecewise linearity condition amounts, by definition, to cancelling many-electron self-interaction errors [51, 52] taking into account self-consistent electronic relaxation. In other words, the energy of the highest occupied orbital should not change as a function of its fractional occupations, that is, the orbital should not interact with itself:

$$ \frac{\partial {\varepsilon}_{\mathrm{\mathcal{H}}}}{\partial {f}_i^{\mathrm{\mathcal{H}}}}=0. $$

(21)

In the language of quantum chemistry, this condition is equivalent to the (generalized) Koopmans theorem,^{Footnote 4} whereby the energy of the highest occupied state equals the energy of the ionization from the (M + 1)-electron to M-electron ground state, including full orbital relaxation:

$$ {\varepsilon}_{\mathrm{\mathcal{H}}}(N)=E\left(M+1\right)-E(M). $$

(22)

At this stage, it is important to mention that different definitions of self-interaction correction exist in the literature (Fig. 1). The term self-interaction may refer to one-electron self-interaction or many-electron self-interaction, and the latter may correspond either to the frozen picture where orbitals are kept unchanged upon varying f^ℋ_i (the frozen-orbital many-electron self-interaction) or to the opposite situation where electrons are allowed to relax self-consistently (the relaxed-orbital many-electron self-interaction). We note in passing that there is no distinction between frozen-orbital self-interaction and relaxed-orbital self-interaction for one-electron systems as the two concepts are equivalent in that case. Identical hierarchies exist for piecewise linearity and Koopmans compliance (see the correspondences summarized in Fig. 1).

To make the different definitions clear, let us first recall that DFT functionals are said to be one-electron self-interaction-free when the nonlinear electron–electron contributions satisfy

$$ {E}_{\mathrm{Hxc}}\left[\rho \right]=0 $$

(23)

for any one-electron density ρ(r) = f₁|φ₁|²(r), whether the orbital is allowed to relax or not. Equation (23) is not fulfilled by conventional DFT approximations. For instance, the local (spin) density approximation (LDA) exhibits a strong nonlinear behavior with a well-known singularity in $ \frac{\mathrm{d}{\varepsilon}_1}{\mathrm{d}{f}_1} $ at f₁ = 0 that is due to the Slater exchange contribution to E_Hxc[ρ] [53]. A simple correction to one-electron self-interaction errors in approximate DFT functionals was first proposed by Fermi and Amaldi in the context of the Thomas–Fermi–Dirac theory [53]; for KS-DFT, the Fermi–Amaldi one-electron self-interaction correction reads

$$ {E}_{\mathrm{FA}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]={E}_{\mathrm{KS}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]-N{E}_{\mathrm{Hxc}}\left[\frac{\rho }{N}\right]. $$

(24)

This functional satisfies (23) for one-electron systems (N = 1) but it exhibits important errors when more electrons are present. In particular, it does not preserve the size-consistency of the underlying DFT functional and lessens the precision of total energy predictions in general [54, 55]. The one-electron self-interaction correction of Perdew and Zunger improves upon the Fermi–Amaldi correction by subtracting individual electron-interaction contributions to the total energy functional:

$$ \begin{array}{ll}{E}_{\mathrm{PZ}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & ={E}_{\mathrm{KS}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill \\ {}\hfill & -{\displaystyle \sum_{i=1}^{+\infty }{E}_{\mathrm{Hxc}}\left[{f}_i\left|{\varphi}_i\right|{}^2\right].}\hfill \end{array} $$

(25)

The Perdew–Zunger self-interaction-corrected functional fulfills (23) by construction while preserving size-consistency. However, in its simplest form, the predictive accuracy and practical usefulness of the Perdew–Zunger method is restricted to one-electron systems and isolated atoms; its precision deteriorates rapidly with the number of atoms in the system and it exhibits important many-electron self-interaction errors in both the frozen-orbital and relaxed-orbital approximations [51].

A more balanced correction of one-electron and frozen-orbital many-electron self-interaction errors is instead achieved by Hartree–Fock (HF) theory. In fact, it is well known that in the expression for the Hartree–Fock energy functional

$$ \begin{array}{ll}{E}_{\mathrm{HF}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & {\displaystyle =\sum_{i=1}^{+\infty }{f}_i\int {d}^3\mathbf{r}\,{\varphi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\widehat{h}}_0{\varphi}_i\left(\mathbf{r}\right)}\hfill \\ {}\hfill & {\displaystyle +\frac{1}{2}\sum_{i=1}^{+\infty }\sum_{j=1}^{+\infty }{f}_i{f}_j\int {d}^3\mathbf{r}\,{d}^3{\mathbf{r}}^{\prime}\,\frac{\left|{\varphi}_i\right|{}^2\left(\mathbf{r}\right)\left|{\varphi}_j\right|{}^2\left({\mathbf{r}}^{\prime}\right)}{\left|\mathbf{r}-{\mathbf{r}}^{\prime}\right|}}\hfill \\ {}\hfill & {\displaystyle -\frac{1}{2}\sum_{i=1}^{+\infty }\sum_{j=1}^{+\infty }{f}_i{f}_j\int {d}^3\mathbf{r}\,{d}^3{\mathbf{r}}^{\prime}\,\frac{{\varphi}_i^{\ast}\left(\mathbf{r}\right){\varphi}_j\left(\mathbf{r}\right){\varphi}_j^{\ast}\left({\mathbf{r}}^{\prime}\right){\varphi}_i\left({\mathbf{r}}^{\prime}\right)}{\left|\mathbf{r}-{\mathbf{r}}^{\prime}\right|}\,{\delta}_{\sigma_i{\sigma}_j},}\hfill \end{array} $$

(26)

the self-Hartree and self-exchange terms (that is, the terms corresponding to i = j in the double sums) cancel out. (In (26), σ_i denotes the spin of φ_i.) Consequently, the HF functional is one-electron self-interaction-free. Furthermore, due to the cancellation between Hartree and Fock contributions, it is quite straightforward to show that HF verifies (21) within the frozen-orbital approximation for many-electron systems. In other words, the HF method fulfills the restricted (original) Koopmans theorem (Fig. 2) in addition to one-electron Koopmans compliance (Fig. 3). The situation is completely different for relaxed orbitals. In fact, as illustrated in Fig. 4, all the approximations mentioned above, namely, the LDA, HF, and PZ formulations, predict the energy of the highest occupied state ε_ℋ to vary as a function of the occupation number f^ℋ, meaning that the energy of the ground state E(N) is not linear. In specific terms, the LDA ground-state energy is strongly convex since $ {\varepsilon}_{\mathrm{\mathcal{H}}}=\frac{\mathrm{d}E}{\mathrm{d}N} $ increases rapidly upon raising f^ℋ or N. The HF energy exhibits the opposite trend, whereas the PZ energy is seen to be mostly convex.

The lack of generalized Koopmans compliance of conventional quantum approximations degrades both the electronic-structure description of physical systems and the accuracy of spectroscopic predictions. The importance of the generalized Koopmans theorem lies in the fact that, if one could impose generalized Koopmans compliance, that is, self-consistent piecewise linearity, while preserving the precision of DFT energy predictions, one would automatically obtain accurate highest occupied levels (see (22)). Furthermore, in practice, imposing the generalized Koopmans theorem on the full electronic spectrum would enable one to inherit the established accuracy of finite-difference DFT energy predictions (the ΔSCF method) [56] in describing low- and high-energy charged excitations without requiring repeated calculations for the non-Aufbau ionized states.

Therefore, imposing generalized Koopmans compliance (that is, restoring self-consistent piecewise linearity and correcting relaxed-orbital many-electron self-interaction) is fundamental to the accuracy of calculated charged-excitation spectra. The importance of the generalized Koopmans theorem has been highlighted in a number of theoretical and computational studies [8, 35, 36, 49, 51, 52, 57–61]. In the next section, we present the Koopmans-compliant method [62–64] specifically devised to correct relaxed-orbital many-electron self-interaction errors and to restore the generalized Koopmans theorem for DFT approximations (Fig. 4).

2.2.2 Generalized Koopmans Compliance

To impose generalized Koopmans compliance in DFT calculations, the first important step is to provide a precise definition of the lack of piecewise linearity. A quantitative definition of deviations from Koopmans compliance is provided by the non-Koopmans energy first introduced by Perdew and Zunger. It is simply obtained by comparing the correct linear behavior imposed by Koopmans’ theorem and the incorrect nonlinear behavior of the approximate ground-state energy. Explicitly, the non-Koopmans energy can be expressed as

$$ {\Pi}_{\mathrm{\mathcal{H}}}\left(N;{\omega}_{\mathrm{ref}}\right)={\displaystyle {\int}_0^{\omega}\mathrm{d}{\omega}^{\prime}\left(\frac{\mathrm{d}E}{\mathrm{d}N}\left(M+{\omega}_{\mathrm{ref}}\right)-\frac{\mathrm{d}E}{\mathrm{d}N}\left(M+{\omega}^{\prime}\right)\right)} $$

(27)

$$ =E(M)-E\left(M+\omega \right)+\omega \frac{\mathrm{d}E}{\mathrm{d}N}\left(M+{\omega}_{\mathrm{ref}}\right), $$

(28)

where the reference fractional number ω _ref denotes the value of ω at which

$$ \frac{\mathrm{d}E}{\mathrm{d}N}\left(M+{\omega}_{\mathrm{ref}}\right)=E\left(M+1\right)-E(M). $$

(29)

In other words, Π_ℋ(N;ω _ref) can be written as

$$ {\Pi}_{\mathrm{\mathcal{H}}}\left(N;{\omega}_{\mathrm{ref}}\right)=E(M)-E\left(M+\omega \right)+\omega \left(E\left(M+1\right)-E(M)\right). $$

(30)

Then, making use of the fact that E(N) can be accurately approximated by a parabola between M and M + 1 in most practical situations, we can obtain a very close approximation to the non-Koopmans energy by setting $ {\omega}_{\mathrm{ref}}=\frac{1}{2} $ (the Slater one-half approximation) [48]:

$$ {\tilde{\Pi}}_{\mathrm{\mathcal{H}}}(N)={\Pi}_{\mathrm{\mathcal{H}}}\left(N;\frac{1}{2}\right)=E(M)-E\left(M+\omega \right)+\omega \frac{\mathrm{d}E}{\mathrm{d}N}\left(M+\frac{1}{2}\right). $$

(31)
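To make the Slater one-half approximation concrete, the following minimal Python sketch evaluates (28) and (30) for a model energy curve. The parabolic `energy` function and the coefficients `A`, `B`, `C` are purely illustrative assumptions, not data from any calculation; for such a parabola, the choice ω_ref = ½ reproduces (30) exactly.

```python
# Illustrative sketch only: the parabolic E(N) and its coefficients are
# hypothetical choices, not results from any DFT calculation.

M = 10                    # integer electron number (assumed)
A, B, C = -3.0, 0.8, 1.5  # hypothetical slope, curvature, and offset


def energy(n):
    """Model ground-state energy E(N), a parabola between M and M + 1."""
    x = n - M
    return C + A * x + B * x * x


def d_energy(n):
    """Analytical derivative dE/dN of the model energy."""
    return A + 2.0 * B * (n - M)


def non_koopmans(omega, omega_ref=0.5):
    """Eq. (28): E(M) - E(M + omega) + omega * dE/dN(M + omega_ref)."""
    return energy(M) - energy(M + omega) + omega * d_energy(M + omega_ref)


def non_koopmans_exact(omega):
    """Eq. (30): the slope is the finite difference E(M + 1) - E(M)."""
    return energy(M) - energy(M + omega) + omega * (energy(M + 1) - energy(M))


for omega in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(omega, non_koopmans(omega), non_koopmans_exact(omega))
```

For this model the two printed columns coincide, and both equal Bω(1 − ω), which vanishes at ω = 0 and ω = 1.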

Now we can rely on (31) to construct a correction to the lack of Koopmans compliance.^{Footnote 5} To this end, we must express the non-Koopmans energy as a functional of the φ _i's and f _i's. As a first attempt to write $ {\tilde{\Pi}}_{\mathrm{\mathcal{H}}}(N) $ explicitly, let us perform a Taylor series expansion. We assume for simplicity that the highest occupied state ψ _M + 1 is not degenerate, that is, ω ≡ f _M + 1 and ℋ ≡ M + 1. Hence, to second order around ω = 0, the expansion^{Footnote 6} reads

$$ \begin{array}{ll}{\tilde{\Pi}}_{M+1}(N)\hfill & =\frac{1}{2}{f}_{M+1}\left(1-{f}_{M+1}\right)\hfill \\ {}\hfill & \times {\displaystyle \int {d}^3\mathbf{r}{d}^3{\mathbf{r}}^{\mathbf{\prime}}{d}^3{\mathbf{r}}^{\mathbf{{\prime\prime}}}\left|{\psi}_{M+1}\right|{}^2\left(\mathbf{r}\right){\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right){f}_{\mathrm{Hxc}}\left({\mathbf{r}}^{\mathbf{\prime}},{\mathbf{r}}^{\mathbf{{\prime\prime}}}\right)\left|{\psi}_{M+1}\right|{}^2\left({\mathbf{r}}^{\mathbf{{\prime\prime}}}\right)+\cdots,}\hfill \end{array} $$

(32)

where $ {f}_{\mathrm{Hxc}}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right)=\frac{\updelta {v}_{\mathrm{Hxc}}\left(\mathbf{r}\right)}{\updelta \rho \left({\mathbf{r}}^{\mathbf{\prime}}\right)} $ stands for the Hartree and exchange-correlation kernel and $ {\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right)=\delta \left(\mathbf{r}-{\mathbf{r}}^{\mathbf{\prime}}\right)+\frac{\updelta {v}_{\mathrm{Hxc}}\left(\mathbf{r}\right)}{\updelta v\left({\mathbf{r}}^{\mathbf{\prime}}\right)} $ denotes a screening function of the Kohn–Sham system. Equation (32) underscores the main difficulty that arises in expressing the non-Koopmans energy explicitly: the computational challenge is to evaluate the Kohn–Sham nonlocal dielectric function $ {\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right) $, which captures the complex self-consistent response of the electrons to an external perturbation.

Nonetheless, (32) suggests that the expression of the non-Koopmans energy would be greatly simplified if self-consistent electronic relaxation were not present. This observation leads us to consider first the simpler case in which orbitals are frozen. Explicitly, we work within the approximation

$$ \updelta {\psi}_i=0,\kern2em \updelta {v}_{\mathrm{Hxc}}\left(\mathbf{r}\right)=0,\kern2em {\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right)=\delta \left(\mathbf{r}-{\mathbf{r}}^{\mathbf{\prime}}\right). $$

(33)

In this frozen orbital picture, expressing the non-Koopmans energy becomes straightforward; by evaluating each term in (31) for the fixed ψ _i's, we obtain

$$ \begin{array}{ll}{\tilde{\Pi}}_{M+1}^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\psi}_1,{\psi}_2,\dots \right]\hfill & ={E}_{\mathrm{Hxc}}\left[{\displaystyle \sum_{i=1}^M{f}_i\left|{\psi}_i\right|{}^2}\right]-{E}_{\mathrm{Hxc}}\left[{\displaystyle \sum_{i=1}^{M+1}{f}_i\left|{\psi}_i\right|{}^2}\right]\hfill \\ {}\hfill & +{f}_{M+1}{\displaystyle \int {d}^3\mathbf{r}\;{v}_{\mathrm{Hxc}}\left(\mathbf{r};\left[{\displaystyle \sum_{i=1}^M{f}_i\left|{\psi}_i\right|{}^2+{\scriptscriptstyle \frac{1}{2}}\left|{\psi}_{M+1}\right|{}^2}\right]\right){\left|{\psi}_{M+1}\right|}^2\left(\mathbf{r}\right),}\hfill \end{array} $$

(34)

where the superscript {⋅}^u indicates that orbitals are kept unrelaxed during the fictitious ionization process. We note in passing that all the linear contributions related to $ {\widehat{h}}_0 $ vanish in the frozen orbital picture.

With the explicit expression of the non-Koopmans contributions in hand, it is now possible to impose Koopmans compliance for frozen orbitals by defining

$$ \begin{array}{ll}{E}_{\mathrm{K}}^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & ={E}_{\mathrm{K}\mathrm{S}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill \\ {}\hfill & +{\tilde{\Pi}}_{M+1}^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right].\hfill \end{array} $$

(35)

Indeed, one can verify that the new functional $ {E}_{\mathrm{K}}^{\mathrm{u}} $ is exactly linear in the absence of relaxation due to the fact that

$$ \frac{\partial^2{\tilde{\Pi}}_{M+1}^{\mathrm{u}}}{\partial {f}_{M+1}^2}=-\frac{\partial^2{E}_{\mathrm{KS}}}{\partial {f}_{M+1}^2}, $$

(36)

while not altering the energy E(N) at integer occupations since the unrelaxed non-Koopmans energy

$$ \begin{array}{ll}{\tilde{\Pi}}_{M+1}^{\mathrm{u}}\hfill & =\frac{1}{2}{f}_{M+1}\left(1-{f}_{M+1}\right)\hfill \\ {}\hfill & \times {\displaystyle \int {d}^3\mathbf{r}{d}^3{\mathbf{r}}^{\mathbf{\prime}}{\left|{\varphi}_{M+1}\right|}^2}\left(\mathbf{r}\right){f}_{\mathrm{Hxc}}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right){\left|{\varphi}_{M+1}\right|}^2\left({\mathbf{r}}^{\mathbf{\prime}}\right)+\cdots, \hfill \end{array} $$

(37)

is nearly zero when φ _M + 1 is completely empty or filled.
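As a simple check of (34) and (37), and not as part of the original derivation, consider the Hartree contribution alone (in atomic units), for which $ {E}_{\mathrm{H}}\left[\rho \right] $ is exactly quadratic in the density. Writing $ {\rho}_M={\sum}_{i=1}^M{f}_i{\left|{\varphi}_i\right|}^2 $, $ n={\left|{\varphi}_{M+1}\right|}^2 $, and $ f={f}_{M+1} $ (shorthand introduced here for illustration), and using $ \int {d}^3\mathbf{r}\;{v}_{\mathrm{H}}\left(\mathbf{r};\left[n\right]\right)n\left(\mathbf{r}\right)=2{E}_{\mathrm{H}}\left[n\right] $, the terms of (34) become

$$ \begin{array}{ll}{E}_{\mathrm{H}}\left[{\rho}_M\right]-{E}_{\mathrm{H}}\left[{\rho}_M+fn\right]\hfill & =-f\int {d}^3\mathbf{r}\;{v}_{\mathrm{H}}\left(\mathbf{r};\left[{\rho}_M\right]\right)n\left(\mathbf{r}\right)-{f}^2{E}_{\mathrm{H}}\left[n\right],\hfill \\ {}f\int {d}^3\mathbf{r}\;{v}_{\mathrm{H}}\left(\mathbf{r};\left[{\rho}_M+\frac{1}{2}n\right]\right)n\left(\mathbf{r}\right)\hfill & =f\int {d}^3\mathbf{r}\;{v}_{\mathrm{H}}\left(\mathbf{r};\left[{\rho}_M\right]\right)n\left(\mathbf{r}\right)+f{E}_{\mathrm{H}}\left[n\right].\hfill \end{array} $$

Adding the two lines gives

$$ {\tilde{\Pi}}_{M+1}^{\mathrm{u},\mathrm{H}}=f\left(1-f\right){E}_{\mathrm{H}}\left[n\right]=\frac{1}{2}f\left(1-f\right)\int {d}^3\mathbf{r}{d}^3{\mathbf{r}}^{\mathbf{\prime}}\frac{n\left(\mathbf{r}\right)n\left({\mathbf{r}}^{\mathbf{\prime}}\right)}{\left|\mathbf{r}-{\mathbf{r}}^{\mathbf{\prime}}\right|}, $$

which coincides with the leading term of (37) when the kernel reduces to $ 1/\left|\mathbf{r}-{\mathbf{r}}^{\mathbf{\prime}}\right| $. In this special case there are no higher-order terms, the correction vanishes exactly at f = 0 and f = 1, and its second derivative $ -2{E}_{\mathrm{H}}\left[n\right] $ cancels the Hartree curvature of the frozen-orbital energy, in agreement with (36).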

Having imposed Koopmans compliance for frozen orbitals, it remains to include the effect of self-consistent orbital relaxation. To address this problem, let us use the expression of E ^u_K as a first approximation and monitor the analytical behavior of the ground-state energy. From these calculations, one can observe that the energy E(N), which is exactly linear in the frozen orbital approximation, becomes downward convex when orbitals relax self-consistently. This observation is in line with the intuition that neglecting screening contributions leads to an overestimation of the correction. In fact, it can be rigorously shown that any functional that fulfills the restricted Koopmans theorem leads to a downward convex dependence of the energy E(N) when relaxation is taken into account. A perfect illustration of this result is provided by the HF theory whose ground-state energy is piecewise linear for frozen orbitals (that is, the HF theory fulfills the restricted Koopmans theorem) and becomes piecewise concave for relaxed orbitals.

Since including self-consistent relaxation through the nonlocal electronic dielectric function $ {\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right) $ would be prohibitively expensive, we resort here to a much simpler correction, which consists of making the zeroth-order approximation

$$ {\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right)={\alpha}_{M+1}+\cdots, $$

(38)

to capture the self-consistent relaxation effects that take place upon ionizing φ _M + 1. Substituting (38) into (32) and comparing with the expansion of $ {\tilde{\Pi}}^{\mathrm{u}} $, we infer

$$ {\tilde{\Pi}}_{M+1}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]={\alpha}_{M+1}{\tilde{\Pi}}_{M+1}^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]+\cdots . $$

(39)

Hence, we arrive at the Koopmans-compliant functional

$$ \begin{array}{ll}{E}_{\mathrm{K}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & ={E}_{\mathrm{K}\mathrm{S}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill \\ {}\hfill & +{\alpha}_{M+1}{\tilde{\Pi}}_{M+1}^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right],\hfill \end{array} $$

(40)

which generalizes (35) by taking into account orbital screening through the uniform dielectric constant α _M + 1.^{Footnote 7}

In principle, the dielectric coefficient α _M + 1 should be calculated by averaging the nonlocal permittivity in some suitable system-dependent and orbital-dependent fashion.^{Footnote 8} However, it can also be obtained in a more pragmatic and efficient manner by directly imposing Koopmans compliance (22), thereby avoiding complex averaging procedures. In our calculations, we obtain α _M + 1 through the necessary condition:

$$ {\varepsilon}_{M+1}\left({M}^{+}\right)={\varepsilon}_{M+1}\left(M+{1}^{-}\right), $$

(41)

which reflects the fact that the electron affinity of the M-electron system $ {\mathcal{A}}_M=-{\varepsilon}_{M+1}\left({M}^{+}\right) $ should be equal to the ionization potential of the (M + 1)-electron system $ {\mathcal{I}}_{M+1}=-{\varepsilon}_{M+1}\left(M+{1}^{-}\right) $. Admittedly, (41) is not a sufficient condition for complete Koopmans compliance. However, it provides a very accurate correction of the lack of piecewise linearity of local and semilocal DFT approximations. In practice, the calculation of the dielectric screening coefficient can be performed using the secant-method recursion:

$$ \begin{array}{ll}{\alpha}_{M+1}^{\left(n+2\right)}\hfill & ={\alpha}_{M+1}^{(n)}\hfill \\ {}\hfill & +\frac{\left({\alpha}_{M+1}^{\left(n+1\right)}-{\alpha}_{M+1}^{(n)}\right)\left({\varepsilon}_{M+1}^{(n)}\left({M}^{+}\right)-{\varepsilon}_{M+1}^{(n)}\left(M+{1}^{-}\right)\right)}{\left({\varepsilon}_{M+1}^{(n)}\left({M}^{+}\right)-{\varepsilon}_{M+1}^{(n)}\left(M+{1}^{-}\right)\right)-\left({\varepsilon}_{M+1}^{\left(n+1\right)}\left({M}^{+}\right)-{\varepsilon}_{M+1}^{\left(n+1\right)}\left(M+{1}^{-}\right)\right)}.\hfill \end{array} $$

(42)

Note that, due to the almost linear behavior of the orbital-energy difference ε _M + 1(M ⁺) − ε _M + 1(M + 1⁻) as a function of α _M + 1, two recursions of (42) are most of the time sufficient in our experience to converge α _M + 1 and impose Koopmans’ condition [63].
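The recursion (42) can be sketched in a few lines of Python. Here `gap` is a hypothetical callable returning ε_{M+1}(M⁺) − ε_{M+1}(M + 1⁻) for a given α; in a real workflow each call would require self-consistent calculations, whereas `model_gap` below is an assumed toy function chosen only to mimic an almost linear dependence on α.

```python
def secant_alpha(gap, alpha_a, alpha_b, tol=1e-2, max_iter=20):
    """Solve gap(alpha) = 0 with the secant recursion of Eq. (42).

    `tol` is expressed in the same units as the values returned by `gap`.
    """
    gap_a, gap_b = gap(alpha_a), gap(alpha_b)
    for _ in range(max_iter):
        if abs(gap_b) < tol:
            return alpha_b
        if gap_a == gap_b:
            raise ZeroDivisionError("secant step undefined: equal gap values")
        # Eq. (42): alpha(n+2) = alpha(n) + (alpha(n+1) - alpha(n)) g(n) / (g(n) - g(n+1))
        alpha_new = alpha_a + (alpha_b - alpha_a) * gap_a / (gap_a - gap_b)
        alpha_a, gap_a = alpha_b, gap_b
        alpha_b, gap_b = alpha_new, gap(alpha_new)
    raise RuntimeError("secant recursion did not converge")


def model_gap(alpha):
    """Hypothetical, nearly linear orbital-energy mismatch (arbitrary units)."""
    return 1.2 - 2.0 * alpha + 0.1 * alpha * alpha


alpha = secant_alpha(model_gap, 0.0, 1.0, tol=1e-8)
print(alpha, model_gap(alpha))
```

For a strictly linear mismatch a single secant step lands on the root, which is consistent with the small number of recursions reported above for the nearly linear case.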

Finally, we emphasize that the correction described above restores the generalized Koopmans theorem for the highest occupied orbital but leaves the energies of the other states unchanged. However, imposing Koopmans’ theorem on the other states would clearly improve the description of the electronic spectrum by equating orbital energies with the accurate total energy differences [15, 56]. Although applying the correction to both the occupied and unoccupied manifolds is guided by practical considerations,^{Footnote 9} it will be shown that this extension provides accurate ionization potentials and electron affinities, often comparable to many-body predictions.

We thus define the non-Koopmans energy $ {\tilde{\Pi}}_i^{\mathrm{u}} $ associated with the removal or addition of φ _i using a straightforward generalization of (34):

$$ \begin{array}{ll}{\tilde{\Pi}}_i^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & ={E}_{\mathrm{Hxc}}\left[{\displaystyle \sum_{\begin{array}{c}j=1\\{}j\ne {\mathrm{i}}\end{array}}^{+\infty }{f}_j\left|{\varphi}_j\right|{}^2}\right]-{E}_{\mathrm{Hxc}}\left[{\displaystyle \sum_{j=1}^{+\infty }{f}_j\left|{\varphi}_j\right|{}^2}\right]\hfill \\ {}\hfill & +{f}_i{\displaystyle \int {d}^3\mathbf{r}\;{v}_{\mathrm{Hxc}}\left(\mathbf{r};\left[{\displaystyle \sum_{\begin{array}{c}j=1\\{}j\ne {\mathrm{i}}\end{array}}^{+\infty }{f}_j\left|{\varphi}_j\right|{}^2+{\scriptscriptstyle \frac{1}{2}}\left|{\varphi}_i\right|{}^2}\right]\right)\left|{\varphi}_i\right|{}^2\left(\mathbf{r}\right).}\hfill \end{array} $$

(43)

This definition allows us to define a Koopmans-compliant functional extended to the full spectrum:

$$ \begin{array}{ll}{E}_{\mathrm{K}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & ={E}_{\mathrm{K}\mathrm{S}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill \\ {}\hfill & +{\displaystyle \sum_{i=1}^{+\infty }{\alpha}_i{\tilde{\Pi}}_i^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right],}\hfill \end{array} $$

(44)

where a different dielectric screening constant α _i is introduced for each of the orbitals. However, in practical simulations, evaluating the α _i's would require a different calculation to impose Koopmans’ condition on each of the electronic states, thereby considerably increasing the computational burden. Fortunately, it is observed in practice that the α _i's vary in a narrow range of values so that approximating the α _i's to be all equal to a unique α that depends only on the system does not significantly alter the accuracy of electronic level predictions in most practical cases.^{Footnote 10} In explicit terms, the functional that we employ in our simulations reads

$$ \begin{array}{ll}{E}_{\mathrm{K}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & ={E}_{\mathrm{K}\mathrm{S}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill \\ {}\hfill & +\alpha {\displaystyle \sum_{i=1}^{+\infty }{\tilde{\Pi}}_i^{\mathrm{u}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right].}\hfill \end{array} $$

(45)

This completes the presentation of the Koopmans-compliant functional. In summary, imposing Koopmans compliance leads us to considering the ionization of individual Kohn–Sham orbitals, thereby defining a functional of the general ODD form

$$ \begin{array}{ll}{E}_{\mathrm{ODD}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]\hfill & ={\displaystyle \sum_{i=1}^{+\infty }{f}_i{\displaystyle \int {d}^3\mathbf{r}{\varphi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\widehat{h}}_0{\varphi}_i\left(\mathbf{r}\right)}}\hfill \\ {}\hfill & +{E}_{\mathrm{Hxc}}\left[{f}_1{\left|{\varphi}_1\right|}^2,{f}_2{\left|{\varphi}_2\right|}^2,\dots \right].\hfill \end{array} $$

(46)

Although other definitions of the non-Koopmans error could be envisioned, the procedure outlined above would always provide functionals of this form, i.e., with an exchange-correlation term $ {E}_{\mathrm{Hxc}}\left[{f}_1{\left|{\varphi}_1\right|}^2,{f}_2{\left|{\varphi}_2\right|}^2,\dots \right] $ that depends on the individual orbital densities instead of $ {E}_{\mathrm{Hxc}}\left[{\sum}_{i=1}^{+\infty }{f}_i{\left|{\varphi}_i\right|}^2\right] $ that depends on the total density. It is important to note that ODD functionals can still be regarded as implicit DFT functionals [65], yet defined in a non-Kohn–Sham framework, unless ad hoc optimized effective potential (OEP) techniques [66, 67] are adopted. Alternatively, a beyond-DFT perspective on ODD methods, based upon the local and frequency-dependent spectral-density potential introduced by Gatti et al. [68], is discussed elsewhere [69].

To close the presentation of orbital-dependent corrections, it should be said that orbital-independent methods have been proposed to reduce many-electron self-interaction errors in DFT approximations [64, 70]. These DFT approaches improve energy predictions for systems with fractional electron numbers. Nevertheless, they are not meant to improve the description of spectroscopic properties and charged excitations, except for the highest occupied orbital of the system. The advantage of the ODD approach in that regard will be discussed extensively in Sect. 4.

2.2.3 Energy Minimization

The minimization of ODD energy functionals deserves particular attention, and important aspects of it are presented in this section.

To minimize the ODD energy, we first write the Lagrange functional related to the orthonormality constraints on the orbitals:

$$ {\mathrm{\mathcal{L}}}_{\mathrm{ODD}}={E}_{\mathrm{ODD}}\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]-{\displaystyle \sum_{i,j=1}^{+\infty }{\Lambda}_{ij}\left({\displaystyle \int {d}^3\mathbf{r}{\varphi}_i^{\ast}\left(\mathbf{r}\right){\varphi}_j\left(\mathbf{r}\right)-{\delta}_{ij}}\right)}. $$

(47)

The corresponding stationary conditions can be written as

$$ \frac{\updelta {\mathrm{\mathcal{L}}}_{\mathrm{ODD}}}{\updelta {\varphi}_i\left(\mathbf{r}\right)}=0,\kern1em \frac{\updelta {\mathrm{\mathcal{L}}}_{\mathrm{ODD}}}{\updelta {\varphi}_i^{\ast}\left(\mathbf{r}\right)}=0. $$

(48)

This leads to a set of coupled self-consistent equations

$$ {f}_i\left({\widehat{h}}_0{\varphi}_i\left(\mathbf{r}\right)+{v}_{\mathrm{Hxc},i}\left(\mathbf{r}\right){\varphi}_i\left(\mathbf{r}\right)\right)={\displaystyle \sum_{j=1}^{+\infty }{\Lambda}_{ij}{\varphi}_j\left(\mathbf{r}\right)}={\displaystyle \sum_{j=1}^{+\infty }{\Lambda}_{ji}^{\ast }{\varphi}_j\left(\mathbf{r}\right)}, $$

(49)

in which the unique Kohn–Sham potential v _Hxc(r) of conventional DFT is replaced by a collection of potentials corresponding to the different orbitals of the system:

$$ {v}_{\mathrm{Hxc},i}\left(\mathbf{r}\right)=\frac{\updelta {E}_{\mathrm{Hxc}}\left[{\rho}_1,{\rho}_2,\dots \right]}{\updelta {\rho}_i\left(\mathbf{r}\right)},{\rho}_i\left(\mathbf{r}\right)={f}_i{\left|{\varphi}_i\right|}^2\left(\mathbf{r}\right). $$

(50)

Similarly to the original Kohn–Sham case, the Λ_ij's form a Hermitian matrix:

$$ \begin{array}{ll}{\Lambda}_{ij}\hfill & ={f}_i{\displaystyle \int {d}^3\mathbf{r}{\varphi}_j^{\ast}\left(\mathbf{r}\right)\left({\widehat{h}}_0{\varphi}_i\left(\mathbf{r}\right)+{v}_{\mathrm{Hxc},i}\left(\mathbf{r}\right){\varphi}_i\left(\mathbf{r}\right)\right)}\hfill \\ {}\hfill & ={f}_j{\displaystyle {\left({\displaystyle \int {d}^3\mathbf{r}{\varphi}_i^{\ast}\left(\mathbf{r}\right)\left({\displaystyle {\widehat{h}}_0}{\varphi}_j\left(\mathbf{r}\right)+{v}_{\mathrm{Hxc},j}\left(\mathbf{r}\right){\varphi}_j\left(\mathbf{r}\right)\right)}\right)}^{\ast }}={\Lambda}_{ji}^{\ast },\hfill \end{array} $$

(51)

which can be rewritten as

$$ {\displaystyle \int {d}^3\mathbf{r}{\varphi}_i^{\ast}\left(\mathbf{r}\right)\cdot \left({f}_i{\widehat{h}}_i-{f}_j{\widehat{h}}_j\right){\varphi}_j\left(\mathbf{r}\right)=0}, $$

(52)

with

$$ {\widehat{h}}_i={\widehat{h}}_0+{v}_{\mathrm{Hxc},i}. $$

(53)

Equation (52) is the ODD counterpart of (9) and is also known as the Pederson condition [71]. It highlights an important feature of ODD functionals; since those are not in general invariant under a unitary transformation U of the orbitals, the gradient of the ODD energy with respect to U is usually not zero. As shown in [72], the expression of such gradient is proportional to the left-hand side of (52) for f _i = f _j = 1. At the minimum, the Pederson condition determines the specific unitary rotation of the orbitals that makes the energy stationary. For certain ODD functionals, such as PZ, the minimizing orbitals are usually localized (and often similar to Wannier functions [73, 74]), so the Pederson condition can also be regarded as a localization condition; for Koopmans-compliant functionals, the driving force to localization changes depending on the functional chosen, but it is always present in the functionals described here.
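The Pederson condition can be monitored numerically as the deviation of Λ from Hermiticity. The sketch below is a minimal illustration under stated assumptions: real orbitals expanded in a small orthonormal basis, orbital-dependent Hamiltonians given as explicit symmetric matrices, and hypothetical numbers throughout. The function names are introduced here for illustration only.

```python
def matvec(matrix, vector):
    """Matrix-vector product for nested lists."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def dot(u, v):
    """Inner product of two real vectors."""
    return sum(a * b for a, b in zip(u, v))


def lambda_matrix(orbitals, hamiltonians, occupations):
    """Eq. (51) for real orbitals: Lambda_ij = f_i <phi_j| h_i |phi_i>."""
    size = len(orbitals)
    lam = [[0.0] * size for _ in range(size)]
    for i in range(size):
        h_phi_i = matvec(hamiltonians[i], orbitals[i])
        for j in range(size):
            lam[i][j] = occupations[i] * dot(orbitals[j], h_phi_i)
    return lam


def pederson_residual(lam):
    """Largest |Lambda_ij - Lambda_ji|; it is zero when Eq. (52) holds."""
    size = len(lam)
    return max(abs(lam[i][j] - lam[j][i])
               for i in range(size) for j in range(size))


# Hypothetical two-level model with fully occupied orbitals.
orbitals = [[1.0, 0.0], [0.0, 1.0]]
occupations = [1.0, 1.0]
h_shared = [[0.0, 0.2], [0.2, 1.0]]  # same operator for both orbitals
h_other = [[0.0, 0.5], [0.5, 1.0]]   # a different operator for orbital 2

residual_ks = pederson_residual(
    lambda_matrix(orbitals, [h_shared, h_shared], occupations))
residual_odd = pederson_residual(
    lambda_matrix(orbitals, [h_shared, h_other], occupations))
print(residual_ks, residual_odd)
```

With a single shared Hamiltonian the residual vanishes for any orthonormal set, as in the Kohn–Sham case; with orbital-dependent Hamiltonians it is generally nonzero until the orbitals have been rotated to the minimizing set.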

In general, when dealing with ODD functionals it is customary to consider two sets of orbitals, namely the minimizing orbitals discussed above and the so-called canonical orbitals, corresponding to the eigenvectors of the Λ matrix [69, 75]. This second set is introduced to define orbital ionization energies by identifying the Λ_ij's as being proportional to the coefficients of an effective Hamiltonian (see the following discussion for details). It is important to stress here that while the diagonalization of Λ is fully supported by Janak’s theorem in the KS-DFT framework (providing an interpretation to the eigenvalues of Λ), this is not the case for ODD, as stressed by Vydrov et al. [76]. In fact, when dealing with ODD methods, we have

$$ \frac{\partial {E}_{\mathrm{ODD}}}{\partial {f}_i}={\displaystyle \int {d}^3\mathbf{r}\;{\varphi}_i^{\ast}\left(\mathbf{r}\right)\cdot {\widehat{h}}_i{\varphi}_i\left(\mathbf{r}\right)=\frac{\Lambda_{ii}}{f_i}}. $$

(54)

Bearing in mind that the minimizing orbitals are typically localized, it is clear that the ODD Janak’s theorem (54) does not provide a physical definition of orbital energies. On the other hand, canonical orbitals are physical but not protected by a Janak-like theorem. This issue, still a very important open problem in the field, has been thoroughly discussed by Stengel and Spaldin [72]. In [72] the authors stressed that the origin of the breakdown of a Janak-like definition of orbital energies in ODD methods is to be found in the extension of ODD functionals to fractional occupations, suggesting that an alternative extension that also provides a proper Janak’s theorem would be very desirable and a major advancement.

Because of these issues in defining ODD functionals for fractional numbers of electrons, in the following we will consider only the case where we have two subspaces, the valence and the conduction manifold, separated by a gap and with occupations 1 and η respectively, where the limit η → 0 has to be taken. This construction allows one to define an effective Hamiltonian for both occupied and empty states. Within these definitions, at the minimum we obtain

$$ {\Lambda}_{ij}={\Lambda}_{ji}^{\ast}\sim \eta \to 0\kern1em \left(1\le i\le {N}_{\mathrm{occ}}\ \mathrm{and}\ {N}_{\mathrm{occ}}<j\right). $$

(55)

This leads us to the following set of equations:

$$ {\widehat{h}}_i{\varphi}_i\left(\mathbf{r}\right)={\displaystyle \sum_{j=1}^{N_{\mathrm{occ}}}{\Lambda}_{ij}{\varphi}_j\left(\mathbf{r}\right)}\kern1em \left(1\le i\le {N}_{\mathrm{occ}}\right), $$

(56)

$$ {\widehat{h}}_i{\varphi}_i\left(\mathbf{r}\right)={\displaystyle \sum_{j={N}_{\mathrm{occ}}+1}^{+\infty }{\tilde{\Lambda}}_{ij}{\varphi}_j\left(\mathbf{r}\right)}+{\displaystyle \sum_{j=1}^{N_{\mathrm{occ}}}{\tilde{\Lambda}}_{ij}{\varphi}_j\left(\mathbf{r}\right)}\kern1em \left({N}_{\mathrm{occ}}<i\right), $$

(57)

with

$$ {\tilde{\Lambda}}_{ij}=\frac{1}{\eta }{\Lambda}_{ij}. $$

(58)

While the equation for occupied orbitals (56) does not couple them to the empty manifold, the equation for the empty states (57) involves the occupied ones because of the orbital orthogonality constraint.

It is then useful to introduce projectors onto the occupied and empty manifolds and define projected Hamiltonians as

$$ {\displaystyle {\widehat{P}}_i}{\varphi}_j\left(\mathbf{r}\right)={\varphi}_i\left(\mathbf{r}\right){\displaystyle \int {d}^3{\mathbf{r}}^{\mathbf{\prime}}{\varphi}_i^{\ast}\left({\mathbf{r}}^{\mathbf{\prime}}\right){\varphi}_j\left({\mathbf{r}}^{\mathbf{\prime}}\right)}, $$

(59)

$$ \widehat{P}={\displaystyle \sum_{i=1}^{N_{\mathrm{occ}}}{\widehat{P}}_i}, $$

(60)

$$ \widehat{Q}=\widehat{I}-\widehat{P}, $$

(61)

$$ {\widehat{H}}_{\mathrm{V}}={\displaystyle \sum_{i=1}^{N_{\mathrm{occ}}}{\widehat{h}}_i{\widehat{P}}_i}, $$

(62)

$$ {\widehat{H}}_{\mathrm{C}}={\displaystyle \sum_{i={N}_{\mathrm{occ}}+1}^{+\infty }{\widehat{h}}_i{\widehat{P}}_i}. $$

(63)

By operating $ \widehat{P} $ and $ \widehat{Q} $ on both sides of (56) and (57) and using the above definitions, we obtain

$$ \widehat{P}{\widehat{H}}_{\mathrm{V}}\widehat{P}{\varphi}_i\left(\mathbf{r}\right)={\displaystyle \sum_{j=1}^{N_{\mathrm{occ}}}{\Lambda}_{ij}{\varphi}_j\left(\mathbf{r}\right)}\kern1em \left(1\le i\le {N}_{\mathrm{occ}}\right), $$

(64)

$$ \widehat{Q}{\widehat{H}}_{\mathrm{C}}\widehat{Q}{\varphi}_i\left(\mathbf{r}\right)={\displaystyle \sum_{j={N}_{\mathrm{occ}}+1}^{+\infty }{\tilde{\Lambda}}_{ij}{\varphi}_j\left(\mathbf{r}\right)}\kern1em \left({N}_{\mathrm{occ}}<i\right). $$

(65)

These expressions have the important merit of explicitly decoupling the valence and the conduction manifolds, thus suggesting the use of

$$ \widehat{H}=\widehat{P}{\widehat{H}}_{\mathrm{V}}\widehat{P}+\widehat{Q}{\widehat{H}}_{\mathrm{C}}\widehat{Q}, $$

(66)

as an effective (Hermitian) Hamiltonian for the system (which is equivalent to considering the canonical orbitals and the Λ eigenvalues to define the electronic structure of the system). The construction above is routinely used in interpreting eigenvalues of the Λ matrix provided by self-interaction corrections, and it is argued for in [69]. The practical performance of this approach is highlighted in the next section.
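The construction (59)–(66) can be illustrated in a finite basis. The sketch below is a minimal example under stated assumptions: real orbitals, a three-dimensional orthonormal basis that truncates the sum over empty states, and hypothetical symmetric matrices standing in for the orbital-dependent Hamiltonians. It builds the effective Hamiltonian and shows that the valence and conduction blocks are decoupled.

```python
def matmul(a, b):
    """Product of two matrices stored as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


def add(a, b):
    """Element-wise sum of two matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]


def projector(phi):
    """Eq. (59) for a real orbital: P_i = |phi_i><phi_i|."""
    return [[x * y for y in phi] for x in phi]


def effective_hamiltonian(orbitals, hamiltonians, n_occ):
    """Eq. (66): H = P H_V P + Q H_C Q in a finite (truncated) basis."""
    dim = len(orbitals[0])
    zero = [[0.0] * dim for _ in range(dim)]
    p_occ, h_val, h_cond = zero, zero, zero
    for index, (phi, h_i) in enumerate(zip(orbitals, hamiltonians)):
        p_i = projector(phi)
        if index < n_occ:
            p_occ = add(p_occ, p_i)                 # Eq. (60)
            h_val = add(h_val, matmul(h_i, p_i))    # Eq. (62)
        else:
            h_cond = add(h_cond, matmul(h_i, p_i))  # Eq. (63)
    q_emp = [[(1.0 if i == j else 0.0) - p_occ[i][j] for j in range(dim)]
             for i in range(dim)]                   # Eq. (61)
    return add(matmul(matmul(p_occ, h_val), p_occ),
               matmul(matmul(q_emp, h_cond), q_emp))


# Hypothetical model: two occupied orbitals and one empty orbital.
orbitals = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
h_1 = [[-2.0, 0.1, 0.3], [0.1, -1.0, 0.2], [0.3, 0.2, 1.0]]
h_2 = [[-2.0, 0.1, 0.4], [0.1, -1.0, 0.5], [0.4, 0.5, 1.0]]
h_3 = [[-2.0, 0.1, 0.6], [0.1, -1.0, 0.7], [0.6, 0.7, 1.5]]

H = effective_hamiltonian(orbitals, [h_1, h_2, h_3], n_occ=2)
for row in H:
    print(row)
```

In this toy model the occupied block is symmetric because the chosen matrices satisfy the Pederson condition within the valence manifold; the couplings between occupied and empty states, although present in each individual matrix, are removed by the projections. The eigenvalues of H could then be obtained with any standard symmetric eigensolver.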

Before closing this section, it is worth discussing the extension of the above ODD approaches to the solid limit. While this still represents an open problem, the localization properties of the minimizing orbitals may be exploited. As an example, let us focus first on the PZ case. The behavior of E _Hxc[ρ _i] as a function of the spread of ρ _i (modeled as a Gaussian distribution) has been studied recently [77]; it has been confirmed that the PZ-ODD corrections would vanish for fully delocalized orbitals. Focusing now on the K functional, a similar problem appears, since the $ {\tilde{\Pi}}_i^{\mathrm{u}} $ terms of (45) would become identically zero for extended orbitals (for the same reason the ΔSCF method fails for solids [78]). The role of localized minimizing orbitals appears then to be pivotal for the use of ODD methods on extended systems, since it would yield a non-zero orbital-dependent correction. A detailed analysis of the localization properties of the orbitals for PZ-ODD is given in [77]. Further numerical investigation along these lines is required, especially concerning the K functional, and will be the subject of a future publication.

3 Numerical Approach

In this section we present the analytical expressions of the Hamiltonians resulting from Koopmans-compliant functionals together with the implementation of the method. Full computational details are also provided.

3.1 Koopmans-Compliant Contributions

According to (49), (56), and (57), we need to evaluate the quantities

$$ {\widehat{h}}_i=\frac{\updelta {E}_{\mathrm{ODD}}}{\updelta {\rho}_i\left(\mathbf{r}\right)}={\widehat{h}}_0+\frac{\updelta {E}_{\mathrm{Hxc}}}{\updelta {\rho}_i\left(\mathbf{r}\right)}, $$

(67)

focusing on the case of the Koopmans-compliant functional E _ODD = E _K, as defined in (45). To this end, we have to compute the derivatives $ \frac{\updelta {\tilde{\Pi}}_j^{\mathrm{u}}}{\updelta {\rho}_i\left(\mathbf{r}\right)} $. As a first step, we evaluate the term corresponding to i = j:

$$ \frac{{\updelta \tilde{\Pi}}_j^{\mathrm{u}}}{\updelta {\rho}_i\left(\mathbf{r}\right)}={v}_{\mathrm{Hxc}}\left(\mathbf{r};\left[{\rho}_i^{\mathrm{ref}}\right]\right)-{v}_{\mathrm{Hxc}}\left(\mathbf{r};\left[\rho \right]\right)+{w}_i^{\mathrm{ref}}\left(\mathbf{r}\right), $$

(68)

$$ \begin{array}{ll}{w}_i^{\mathrm{ref}}\left(\mathbf{r}\right)\hfill & =\frac{1}{2}{\displaystyle \int {d}^3{\mathbf{r}}^{\mathbf{\prime}}{f}_{\mathrm{Hxc}}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}};\left[{\rho}_i^{\mathrm{ref}}\right]\right){n}_i\left({\mathbf{r}}^{\mathbf{\prime}}\right)}\hfill \\ {}\hfill & -\frac{1}{2}{\displaystyle \int {d}^3{\mathbf{r}}^{\mathbf{\prime}}{d}^3{\mathbf{r}}^{\mathbf{{\prime\prime}}}{f}_{\mathrm{Hxc}}\left({\mathbf{r}}^{\mathbf{\prime}},{\mathbf{r}}^{\mathbf{{\prime\prime}}};\left[{\rho}_i^{\mathrm{ref}}\right]\right){n}_i\left({\mathbf{r}}^{\mathbf{\prime}}\right){n}_i\left({\mathbf{r}}^{\mathbf{{\prime\prime}}}\right)}\hfill \end{array}, $$

(69)

where we have introduced the compact notations

$$ {n}_i\left(\mathbf{r}\right)=\left|{\varphi}_i\right|{}^2\left(\mathbf{r}\right), $$

(70)

$$ {\rho}_i\left(\mathbf{r}\right)={f}_i{n}_i\left(\mathbf{r}\right), $$

(71)

$$ {\rho}_i^{\mathrm{ref}}=\rho \left(\mathbf{r}\right)-{\rho}_i\left(\mathbf{r}\right)+\frac{1}{2}{n}_i\left(\mathbf{r}\right). $$

(72)
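The notations (70)–(72) translate directly into a pointwise operation on a real-space grid. The following sketch assumes a uniform one-dimensional grid and two hypothetical, normalized orbital densities; it is meant only to show how the reference density replaces the occupation of orbital i by one half.

```python
def reference_density(rho, n_i, f_i):
    """Eq. (72): rho_i^ref = rho - f_i n_i + n_i / 2, evaluated pointwise."""
    return [r - f_i * n + 0.5 * n for r, n in zip(rho, n_i)]


# Hypothetical uniform grid; each orbital density integrates to one electron.
dx = 0.5
n_1 = [0.5, 1.0, 0.5, 0.0]
n_2 = [0.0, 0.5, 1.0, 0.5]
f_1, f_2 = 1.0, 1.0
rho = [f_1 * a + f_2 * b for a, b in zip(n_1, n_2)]  # total density

rho_ref_1 = reference_density(rho, n_1, f_1)
electrons = sum(rho_ref_1) * dx
print(electrons)  # 1.5 electrons: orbital 1 is half filled in the reference
```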

Next, for i ≠ j, we have the following derivative expression:

$$ \frac{{\updelta \tilde{\Pi}}_j^{\mathrm{u}}}{\updelta {\rho}_i\left(\mathbf{r}\right)}={v}_{\mathrm{xc}}\left(\mathbf{r};\left[\rho -{\rho}_j\right]\right)-{v}_{\mathrm{xc}}\left(\mathbf{r};\left[\rho \right]\right)+{\displaystyle \int {d}^3{\mathbf{r}}^{\mathbf{\prime}}{f}_{\mathrm{xc}}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}};\left[{\rho}_j^{\mathrm{ref}}\right]\right){n}_j\left({\mathbf{r}}^{\mathbf{\prime}}\right)}. $$

(73)

Collecting the above contributions, the Koopmans-compliant Hamiltonian reads

$$ {\widehat{h}}_i=\widehat{h}\left[{\rho}_i^{\mathrm{ref}}\right]+{w}_i^{\mathrm{ref}}\left(\mathbf{r}\right)+{w}_i^{\mathrm{xd}}\left(\mathbf{r}\right), $$

(74)

where we have defined the cross-derivative (xd) term as

$$ {w}_i^{\mathrm{xd}}\left(\mathbf{r}\right)={\displaystyle \sum_{j\ne i}\frac{{\updelta \tilde{\Pi}}_j^{\mathrm{u}}}{\updelta {\rho}_i\left(\mathbf{r}\right)}}. $$

(75)

In a nutshell, the leading term of the Koopmans-compliant Hamiltonian in (74) is the original KS Hamiltonian evaluated at the reference density ρ ^ref_i , that is, the density where the occupation of the ith orbital has been replaced by $ \frac{1}{2} $ in the Slater transition-state spirit. Additionally, we have two variational terms, w ^ref_i and w ^xd_i . The latter comes from the interdependence of the corrective terms corresponding to different orbitals; it can be shown analytically and numerically to have little influence on ODD spectral predictions. In fact, by expanding f _xc in Taylor series, one can see that w ^xd_i does not contribute up to the second order (e.g., no Hartree term shows up) and its leading contribution comes from the third derivative of E _xc with respect to the density. Instead, the reference potential w ^ref_i comes from the dependence of ρ ^ref_i on the ith orbital density.

We note that this term preserves the expectation value of the orbital-dependent Hamiltonian $ {\widehat{h}}_i $ since ∫ d ³ r w ^ref_i (r)n _i(r) = 0. Nevertheless, w ^ref_i cancels long-range contributions (arising from the Hartree term) to the Hamiltonian and tends to reduce the localization of the orbitals. Thus, this term has to be considered as an unwanted by-product of variationality. Neglecting w ^ref_i means that the reference density is not updated during the minimization but only at the end of the calculation; the minimization is then repeated at fixed w ^ref_i until full self-consistency is reached. This approach (where the potential w ^xd_i is also omitted for simplicity) is termed the K ₀ method. Most of the results shown in the following have been produced by this version of the functional. We have checked numerically that K and K ₀ electronic-structure predictions do not differ significantly, except for quantities that are very sensitive to charge localization (such as polarizabilities).
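The vanishing expectation value of the reference potential follows from (69) together with the normalization of n_i, and it can be checked on a grid. In the sketch below the kernel, which in (69) is evaluated at the reference density, is replaced by a fixed soft-Coulomb model kernel; this kernel, the one-dimensional grid, and the Gaussian orbital density are all assumptions made for illustration.

```python
import math


def reference_potential(kernel, n_i, dx):
    """Eq. (69) on a uniform grid, with the kernel held fixed."""
    size = len(n_i)
    first = [0.5 * sum(kernel[a][b] * n_i[b] for b in range(size)) * dx
             for a in range(size)]
    second = 0.5 * sum(kernel[a][b] * n_i[a] * n_i[b]
                       for a in range(size) for b in range(size)) * dx * dx
    return [value - second for value in first]


# Hypothetical one-dimensional model.
dx = 0.2
grid = [dx * k for k in range(-25, 26)]
kernel = [[1.0 / math.sqrt((x - y) ** 2 + 1.0) for y in grid] for x in grid]
raw = [math.exp(-x * x) for x in grid]
norm = sum(raw) * dx
n_i = [value / norm for value in raw]  # normalized orbital density

w_ref = reference_potential(kernel, n_i, dx)
expectation = sum(w * n for w, n in zip(w_ref, n_i)) * dx
print(expectation)
```

The potential itself is not zero (it is positive near the orbital and negative far from it in this model), but its average over n_i vanishes to numerical precision, so it changes the shape of the orbital without shifting the expectation value of the orbital-dependent Hamiltonian.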

In fact, while corrections to orbital energies of the K and K ₀ functionals are usually rather similar, the K functional has a weak tendency to localization (i.e., it does not fully exploit the corrections to orbital energies to improve on the localization of the charge density). Ultimately this results in the K charge density being very similar to that of the LDA functional (at variance with K ₀ where strong Perdew–Zunger like localization occurs), such that the potential energy surfaces (PES) obtained by K are also only slightly changed with respect to the LDA ones. This happens because the Koopmans correction is first and foremost an approach that addresses particle exchanges with an external bath, hence charged excitations and photoemission. As such, it does not automatically improve on the total energies, and the interplay between corrected orbital energies and charge density localization is key to current efforts to further develop the functionals. To this end, different flavors of the Koopmans-compliant functionals have been introduced (e.g., building K on top of PZ); these approximations will be discussed elsewhere.

3.2 Computational Details

Atomic calculations have been performed using a modified version of the ld1 code from the QUANTUM-ESPRESSO distribution [79]. For each angular momentum channel, the orbital occupations are averaged among the m quantum numbers, leading to a spherically symmetric contribution to the charge density. The ODD energy functionals (either PZ or K) are minimized by optimizing the radial distribution of each orbital at fixed angular momentum. Orbitals with the same l quantum numbers but corresponding to different n are then not automatically orthogonal. The validity of this approximation is carefully discussed in [65]. The MOLPRO code has been employed for the atomic HF calculations using the def2-QZVPP basis set.

With the exception of atoms, all calculations have been performed via a modified version of the CP code of QUANTUM-ESPRESSO. This implementation exploits plane-wave basis sets and pseudopotentials. Periodic boundary conditions are implicitly assumed because of the basis set, and a Coulomb cutoff technique (based upon auxiliary regularization functions of the Coulomb kernel [80]) is adopted to compute the electrostatic contributions. The energy minimization is carried out by using either a fictitious damped dynamics on the electronic degrees of freedom or conjugate gradient steps. The convergence threshold in minimizing the energy is 10⁻⁷ Ha. In the following we have used real wave functions expanded into plane waves up to a kinetic energy cutoff of 60 Ry (reduced to 40 Ry for acenes and fullerenes which contain just C and H atoms). DFT calculations have been performed within the local density approximation (LDA) [65]. Unless otherwise specified, all Koopmans-compliant calculations are carried out with the K ₀[LDA] scheme where the same dielectric screening coefficient α is used for all the orbitals (see (45)) and computed by requiring the ionization potential at N electrons to be equal to the electron affinity at N − 1 electrons (within a tolerance of 0.01 eV) (41).
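The determination of α described above amounts to a one-dimensional root search. The following is a minimal sketch in Python, assuming two hypothetical callables, `ip_N(alpha)` and `ea_Nm1(alpha)`, that would each wrap a full self-consistent K ₀[LDA] calculation (neither is a QUANTUM-ESPRESSO routine) and assuming that their difference changes sign on the bracketing interval. Bisection is an illustrative choice here and not necessarily the procedure used to produce the results of this chapter.

```python
def find_alpha(ip_N, ea_Nm1, lo=0.0, hi=1.0, tol_ev=0.01, max_iter=60):
    """Bisect on alpha until |IP(N) - EA(N-1)| < tol_ev (energies in eV).

    ip_N and ea_Nm1 are hypothetical user-supplied callables returning the
    ionization potential of the N-electron system and the electron affinity
    of the (N-1)-electron system for a given screening coefficient alpha.
    """
    def mismatch(alpha):
        return ip_N(alpha) - ea_Nm1(alpha)

    f_lo, f_hi = mismatch(lo), mismatch(hi)
    if abs(f_lo) < tol_ev:
        return lo
    if abs(f_hi) < tol_ev:
        return hi
    if f_lo * f_hi > 0:
        raise ValueError("mismatch does not change sign on [lo, hi]")

    for _ in range(max_iter):
        mid = 0.5 * (lo + hi)
        f_mid = mismatch(mid)
        if abs(f_mid) < tol_ev:
            return mid
        if f_lo * f_mid < 0:
            hi = mid                # root lies in [lo, mid]
        else:
            lo, f_lo = mid, f_mid   # root lies in [mid, hi]
    raise RuntimeError("alpha search did not converge")


# Toy linear models with made-up numbers, for demonstration only.
def toy_ip(alpha):
    return 8.0 + 4.0 * alpha


def toy_ea(alpha):
    return 12.0 - 2.0 * alpha


alpha_star = find_alpha(toy_ip, toy_ea)
```

In an actual calculation each evaluation of the mismatch requires two self-consistent runs, so a search that needs few evaluations (or a secant update) would be preferable to plain bisection.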

4 Results

In this section, we review spectroscopic data for atoms and molecules, computed through standard KS-DFT, Hartree–Fock (HF), and orbital-density dependent (ODD) functionals, such as the Perdew–Zunger (PZ) and Koopmans-compliant methods. Theoretical estimates are compared with experimental data, when available. We mostly focus on ionization potentials (IPs), electron affinities (EAs), and energy levels (as obtained from photoemission experiments, for instance). In the case of molecules, all the energy transitions studied (including ionizations) have to be considered vertical, meaning that atomic relaxation is not allowed after the excitation.

4.1 Atoms

In Table 1 we report the ionization potentials for isolated atoms ranging from H to Kr computed at different levels of theory (LDA, HF, PZ, K[LDA], K ₀[LDA]) as (the opposite of) the topmost valence eigenvalue. In agreement with previous literature [65], the LDA HOMO (highest occupied molecular orbital) levels are not particularly accurate with an average error of 4.40 eV, and up to 9 eV for He. This reflects the intrinsic inaccuracy of LDA; in fact, exact DFT would provide exact ionization potentials for finite systems [81, 82]. It is interesting to note that the LDA functional systematically underestimates the IPs (HOMO levels are not sufficiently bound). Since the exponential decay of the total charge density [82] in the vacuum region is related to the IP I _N through

Table 1 Ionization potential of atoms


$$ \rho \left(\mathbf{r}\right)\sim {\mathrm{e}}^{-2\kappa \left|\mathbf{r}\right|}\kern2em \kappa =\sqrt{2{I}_N}, $$

(76)

underestimating the IP also leads to an overestimated delocalization of the charge density.
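To make the connection between the IP and the asymptotic decay in (76) concrete, the sketch below evaluates κ and the factor e^(−2κ|r|) in Hartree atomic units. The two IP values are hypothetical numbers chosen only to illustrate the trend and are not taken from Table 1; the function names are likewise illustrative.

```python
import math

HARTREE_EV = 27.211386  # 1 Ha expressed in eV


def decay_constant(ip_ev):
    """kappa = sqrt(2 I_N) in bohr^-1, with I_N converted from eV to Ha."""
    if ip_ev <= 0:
        raise ValueError("ionization potential must be positive")
    return math.sqrt(2.0 * ip_ev / HARTREE_EV)


def asymptotic_factor(ip_ev, r_bohr):
    """Exponential factor exp(-2 kappa r) of (76); the prefactor is omitted."""
    return math.exp(-2.0 * decay_constant(ip_ev) * r_bohr)


ip_reference = 12.0       # hypothetical reference IP (eV)
ip_underestimated = 8.0   # hypothetical underestimated IP (eV)

# A smaller IP gives a smaller kappa, hence a slower decay into the vacuum.
ratio = asymptotic_factor(ip_underestimated, 5.0) / asymptotic_factor(ip_reference, 5.0)
```

With these inputs `ratio` is larger than one, i.e., the density tail obtained from the underestimated IP extends further into the vacuum region.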

In contrast, despite its simplicity, the HF method gives rather accurate estimates for atomic IPs with an average error of 0.4 eV. A similar behavior is also shown by the ODD methods (PZ, K[LDA], K ₀[LDA]) with mean absolute deviations (MADs) ranging from 0.35 to 0.50 eV. However, this accuracy is not retained by all the functionals in predicting electron affinities, as discussed below. This fact is particularly apparent for PZ, which leaves the LDA empty states unaffected up to changes arising from the orthogonalization with respect to the occupied manifold (in the specific case of atoms where orthogonality is not imposed, PZ and LDA energy levels for empty states are identical by construction).

Before turning to molecules, we also analyze the performance of the above functionals in describing the deeper valence energy levels of atoms and compare the theoretical estimates with X-ray photoemission (XPS) results (Table 2). In agreement with previous data, LDA energy levels are the least accurate with a mean relative error of about 27%. Nevertheless, it has to be stressed that, at variance with the IP case, this error is not totally due to the LDA approximation but, as is well known, it is also inherent in the Kohn–Sham scheme itself [9, 13, 82]. For a detailed discussion of the accuracy of the KS-DFT scheme to describe charged excitations, we refer the reader to [9, 10]. The HF method (that can be formally viewed as the simplest approximation to a self-energy) works appreciably better (MAD of 6.6%) than LDA. In this context, the ODD methods provide the best accuracy with an average error of about 3.5%. The lowest MAD (ca. 3.3%) is found for K ₀[LDA]. Along with HF, these ODD methods do not fit into the standard KS scheme (where exchange and correlation effects are described by a simple local potential) but go beyond it, having a more general structure with local but orbital-specific potentials. The properties of these theoretical schemes are discussed in more detail in [69] with particular emphasis on the description of energy levels and spectroscopic information.

Table 2 LDA, HF, PZ, K[LDA], and K ₀[LDA] orbital energies of He, Be, Ne, Mg, Ar, and Ca compared with experimental photoemission energies


4.2 Molecules

We now turn to assessing the predictive performance of ODD functionals for molecules.

4.2.1 Ionization Potentials

We have used four different sets of molecules in our benchmark. The first set includes 17 small molecules (H₂ to C₂H₄) from the G2 set. Computed and experimental ionization potentials are reported in Fig. 5a. The second set is taken from [83] and contains larger aromatic molecules (such as PTCDA, porphyrins, and phthalocyanines). Results are shown in Fig. 5b. The third and fourth sets are molecules from the acene (benzene to hexacene) and fullerene (C₂₀ to C₈₀) families (Fig. 5c). In all graphs, LDA (black), HF or GW (blue), PZ (orange), and K ₀[LDA] (red) results are compared with experimental ionization potentials (green). Vertical IPs are considered when available.

Confirming the trends observed for atoms, LDA IPs exhibit the largest error (25–35% depending on the molecular set) and systematically underestimate molecular IPs. On the other hand, unscaled PZ results show the opposite behavior, consistently yielding IPs appreciably larger than the reference data with a relative error in the range 13–18%. As for atoms, the K approach performs extremely well (with an error of 1.7–2.5%) and its accuracy is comparable to high-level GW calculations. This level of accuracy is remarkable considering the reduced computational load of K relative to GW. Moreover, the predictive precision of K ₀[LDA] remains more or less constant through a large variety of systems, ranging from atoms to small and more extended molecules.

Concerning the GW method, we note that several practical schemes have been proposed in the literature to include partial or full self-consistency or to go beyond the random phase approximation (RPA) to screening. For each molecular set, we have chosen the most accurate GW results. This may explain the fluctuating accuracy of the GW calculations reported in Fig. 5. For instance, GW reference data [83] (Fig. 5b) have been obtained by starting from LDA eigenvalues and eigenvectors and performing a GW calculation with self-consistency on eigenvalues. On the other hand, GW data for fullerenes [84] come from a fully self-consistent GW calculation where the adiabatic TDDFT kernel based upon LDA is used in the calculation of W to improve upon the RPA polarizability. GW calculations for acenes are instead performed at the G₀W₀ (LDA) level. Benzene, naphthalene, and anthracene data are from [85], while results for tetracene and pentacene are from [83] (taken at the G₀W₀ (LDA) level for consistency).

4.2.2 Electron Affinities

Table 3 reports electron affinities (EAs) for the same molecule sets. Data computed at the LDA, PZ, and K ₀[LDA] levels of theory are compared to GW results from the literature and experiments. As expected, the LDA EAs are systematically too large (LUMO levels excessively bound). This is due both to the intrinsic inaccuracy of the LDA approximation and to the missing derivative discontinuity required by the KS scheme. As for atoms, the PZ scheme does not correct the empty states because both the energy contributions and the potential corrections are zero for those levels. K ₀[LDA] instead shows a clear trend of systematic correction of the LDA results, with a residual error of about 0.5 eV in the most problematic cases (as compared to experimental data when available or GW otherwise). The accuracy of the K[LDA] results tends to improve when strong acceptors are considered, as is the case for fullerenes. While it is clear from the theoretical description of the K functionals that the accuracy of the method is connected to that of the ΔSCF approach [56], the ability to compute empty states using the K method highlights its important advantages over ΔSCF. In fact, it is known that the LDA functional (as well as other approximate DFT functionals) would not be able to bind the extra electron for most anions. Instead, the K formulation overcomes this difficulty and leads to a quantitatively reliable description of EAs.

Table 3 Electron affinities (EAs) for selected molecules


4.2.3 Energy Levels

After we have carefully analyzed the accuracy of the K[LDA] method in predicting IPs and EAs, we now study full electronic spectra using ultraviolet photoemission spectroscopy (UPS) data as references. The peaks in these spectra correspond to the charged excitations of the system and are usually described in terms of main peaks (more or less sharp features with a finite width bearing most of the spectral weight) and satellites (shallow structures). For a full discussion see, e.g., [13, 86, 87]. In the present treatment, we will consider only the main peak structures, which we will refer to as orbital energies. Moreover, we will mostly compare the computed density of states (DOS) with the UPS spectra without including any transition matrix elements. Therefore, our analysis will not address UPS intensities, but only peak positions.

We report a detailed analysis for the case of four molecules, namely furan, pyrrole, anthracene, and tetracene. Data for the first two molecules are shown in Fig. 6 while data for the acenes are given in Fig. 7. In each panel we report the computed LDA, HF, PZ, and K ₀[LDA] DOS together with the UPS intensities at the top. A Gaussian broadening of 0.2 eV has been included in the DOS as a guide for the eye while the theoretical orbital energies (eigenvalues) are reported as vertical bars. We have also highlighted the most evident experimental features by dashed vertical lines. The energy scale corresponds to negative binding energies, the zero being the vacuum level. The spectral features at the highest energy (the smallest binding energy) correspond to the HOMO levels (negative ionization potentials) already described in the previous paragraphs. LUMO (lowest unoccupied molecular orbital) levels are not shown (when bound, out of the graphs at higher frequencies).
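The broadened DOS described above can be sketched as a sum of normalized Gaussians centered on the orbital energies. In the example below the 0.2 eV broadening is assumed to be the Gaussian standard deviation (the text does not specify whether it denotes a standard deviation or a full width), and the eigenvalues are made-up numbers rather than computed orbital energies.

```python
import math


def gaussian_dos(eigenvalues_ev, energy_grid_ev, sigma_ev=0.2):
    """Sum of unit-area Gaussians, one per orbital energy (states per eV)."""
    norm = 1.0 / (sigma_ev * math.sqrt(2.0 * math.pi))
    dos = []
    for energy in energy_grid_ev:
        total = 0.0
        for eps in eigenvalues_ev:
            x = (energy - eps) / sigma_ev
            total += norm * math.exp(-0.5 * x * x)
        dos.append(total)
    return dos


# Made-up orbital energies (eV, relative to the vacuum level), illustration only.
levels = [-12.0, -10.5, -9.0]
step = 0.01
grid = [-15.0 + step * k for k in range(901)]  # from -15 eV to -6 eV

dos = gaussian_dos(levels, grid)
area = sum(dos) * step  # integrates to the number of levels on a wide grid
```

Because each Gaussian has unit area, the integrated DOS counts the orbitals; no transition matrix elements enter, so only peak positions (not intensities) can be compared with UPS data.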

In the case of furan and pyrrole, the electronic structure spectra are rather simple and orbital energy patterns can be easily followed, moving across different theoretical methods. In both cases, the LDA DOS shows systematic errors in predicting the HOMO position (about 3–4 eV above the experimental peak). The spectrum thus appears overall shifted to higher energies. The same holds for anthracene and tetracene, though the energy error on the HOMO position is smaller. As we have already discussed, this error can be totally attributed to the quality of the functional approximation. Moreover, by examining deeper energy levels, one can also note a slight shrinking of the energy scale with respect to experiment. This feature can also be perceived in the case of acenes. Besides the quality of the functional, here the use of a KS-DFT Hamiltonian (characterized by a local potential instead of a nonlocal and dynamical self-energy) is also expected to contribute to the error. For further details, see [69].

The onsets of photoemission in the HF spectra (the HOMO levels) are definitely more accurate than their LDA counterparts, showing a remarkable agreement for furan and pyrrole (with an error of 0.3 eV), as well as acenes (with an error of 0.4 eV). Despite the accuracy of the HOMO levels, deeper valence states are strongly over-bound, the spectrum being overall stretched towards negative energies. This behavior can be ascribed to missing correlation contributions in the HF method. Comparing with the GW theory [20, 88], one can show that correlation contributions would show up at first as static and dynamical screening effects. The systematic over-binding of HF can then be considered to be related to the absence of screening contributions.

The spectra obtained through the PZ method display an even more bound position of the HOMO levels, showing appreciable errors with respect to experiment (1.5–2 eV). At variance with the HF method, where the valence band width is strongly enhanced, PZ band widths are slightly more extended than the LDA ones, as shown in Table 4. This can be understood in terms of the correction to the potential provided by PZ. Assuming that the ODD minimizing orbitals are localized (as is typically the case with closed shell covalently bound systems), the representation of the Hamiltonian on this basis can be read in a tight-binding picture. Neglecting self-consistency effects, the PZ correction on LDA acts only on on-site matrix elements of the Hamiltonian, providing no correction for the off-diagonal (hopping) elements, which govern the band widths. When including self-consistency, PZ tends to provide a decoupling force that reduces the hybridization of the orbital. Such a conclusion is also supported by noting that the patterns of the PZ energy levels for acenes are rather different from that corresponding to the other methods and experiment. This observation does not extend to HF, which tends to increase the band width of covalent systems relative to LDA because the nonlocality of the exchange potential leads to larger hopping [89].
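The tight-binding argument above can be illustrated with the simplest possible model: an open chain of identical sites with nearest-neighbor hopping, whose eigenvalues are known in closed form. This is only a sketch under strong assumptions (a uniform on-site shift standing in for the on-site correction, and made-up parameters in eV); it is not a model of the actual PZ or HF Hamiltonians.

```python
import math


def chain_levels(n_sites, onsite, hopping):
    """Exact levels of an open nearest-neighbor chain:
    eps_k = onsite + 2 * hopping * cos(k * pi / (n_sites + 1)), k = 1..n_sites.
    """
    return sorted(
        onsite + 2.0 * hopping * math.cos(k * math.pi / (n_sites + 1))
        for k in range(1, n_sites + 1)
    )


def band_width(levels):
    """Energy difference between the highest and lowest level."""
    return max(levels) - min(levels)


# Made-up parameters (eV), for illustration only.
reference = chain_levels(6, onsite=-5.0, hopping=-1.0)
onsite_shifted = chain_levels(6, onsite=-7.0, hopping=-1.0)   # on-site correction only
larger_hopping = chain_levels(6, onsite=-5.0, hopping=-1.5)   # enhanced hopping
```

Shifting the on-site energy moves every level rigidly and leaves the band width unchanged, whereas increasing the magnitude of the hopping stretches the spectrum, in line with the qualitative picture given for the on-site correction and for HF, respectively.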

Table 4 Highest occupied orbital energy ε _ℋ and valence band width Δε of furan, pyrrole, anthracene, and tetracene computed at the LDA, HF, PZ, and K ₀ levels of theory


Finally, K ₀[LDA] spectra are in remarkable agreement with experimental data in terms of orbital energies over a wide energy range. We stress that no semiempirical shift or ad hoc alignment has been performed here. For furan and pyrrole, where deep valence states are very evident in the experimental data, the agreement with theory holds for states as deep as 25 eV. Even if the experimental data do not cover the full valence energy range, this comparison confirms that K ₀[LDA] band widths are very accurate and can be taken as a reference in assessing the accuracy of other theoretical methods (Table 4).

These results establish the precision of Koopmans-compliant methods in describing full electronic structures for representative families of systems, ranging from isolated atoms to large π-conjugated molecules. Extending this accuracy to the solid limit will be the subject of separate work.

5 Conclusions

In summary, we argued that the generalized Koopmans condition, i.e., the piecewise linearity of the self-consistent ground-state energy E(N) as a function of the electron number N, is critical to describe charged excitations in molecular systems and related spectroscopic properties. We have shown that imposing the generalized Koopmans theorem is equivalent to canceling relaxed-orbital many-electron self-interaction. We have also emphasized the distinction between the generalized Koopmans theorem, which is fulfilled by exact KS-DFT, and the restricted (original) Koopmans theorem, which is satisfied by the HF method (owing to the known cancellation of the self-Hartree and self-exchange terms) but not by exact DFT. To impose Koopmans compliance, we have constructed an ODD functional working first in the frozen-orbital approximation where self-consistent orbital relaxation and dielectric screening are neglected. Within this approximation, deviations from Koopmans compliance can be precisely quantified in terms of a non-Koopmans energy $ {\tilde{\Pi}}_i $. This definition has then been extended to relaxed orbitals by approximating the nonlocal dielectric function $ {\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right) $ as an orbital-independent uniform coefficient α that can be determined in a fully nonempirical fashion. The accuracy of the method has been demonstrated by computing the ionization potentials, electron affinities, and the full electronic structures of a wide range of atomic and molecular systems. Quantitative results from the Koopmans-compliant method have been shown to be comparable to those of high-level many-body perturbation theory methods such as GW at a fraction of the computational cost.
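The link between piecewise linearity and orbital energies summarized above can be illustrated with a toy model. The sketch below assumes, purely for illustration, that the energy between N − 1 and N electrons is a quadratic function of the fractional occupation f of the highest orbital, E(f) = E(N − 1) + a f + (c/2) f ², where a and c are made-up parameters. This curvature-based mismatch is a simplified stand-in and is not the non-Koopmans energy defined in the text. By Janak's theorem the orbital energy is dE/df, so for c ≠ 0 the eigenvalue at f = 1 differs from the total energy difference E(N) − E(N − 1) by c/2.

```python
def energy(f, e_nm1, a, c):
    """Toy total energy for fractional occupation f in [0, 1] (eV)."""
    return e_nm1 + a * f + 0.5 * c * f * f


def orbital_energy(f, a, c):
    """Janak's theorem applied to the toy model: eps(f) = dE/df."""
    return a + c * f


def koopmans_mismatch(a, c):
    """Eigenvalue at full occupation minus the total energy difference."""
    delta_scf = energy(1.0, 0.0, a, c) - energy(0.0, 0.0, a, c)
    return orbital_energy(1.0, a, c) - delta_scf


# Made-up parameters (eV): a convex curve versus a piecewise-linear one.
convex = koopmans_mismatch(a=-12.0, c=6.0)
linear = koopmans_mismatch(a=-9.0, c=0.0)
```

For the convex curve the eigenvalue lies above the total energy difference, so the IP read from the eigenvalue is too small; for the linear curve the two coincide, which is the behavior a Koopmans-compliant functional is designed to restore.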

In introducing the ODD Koopmans-compliant functional, we have discussed one definition, based on the Slater one-half construction [63]. We have also developed alternative definitions of the non-Koopmans energy, leading to different flavors of Koopmans-compliant functionals and subtle differences in performance that will be discussed elsewhere. In addition, the uniform screening approximation that we have employed in constructing the functional could be refined by restoring the orbital dependence or non-locality of the dielectric function. Refinements in the description of electronic spectra through the prediction of photoemission amplitudes are underway. The study of extended systems and the prediction of optical spectra represent other exciting lines of development. A systematic and critical assessment of the accuracy of thermodynamic and kinetic properties within the different Koopmans-compliant functionals is also important; essentially, Koopmans-compliant functionals restore the correct energy levels, and can provide the foundations for other strategies aimed at reducing the delocalization tendency of common functionals. In fact, one of the most notable features of Koopmans-compliant methods is the fact that they can lead to spontaneous localization of the minimizing orbitals, suggesting that they would be ideally suited to basis-set reduction and linear-scaling techniques for the study of large-scale systems. These directions represent promising opportunities to extend the scope of current quantum simulations.

Notes

1.
Note that besides functional approximations, the KS-DFT empty states need to be corrected for the derivative discontinuity of the potential upon infinitesimal electron addition. Such derivative discontinuity is usually neglected by approximate functionals which also tend to downshift further the orbital energies of empty states.
2.
Otherwise, an infinitesimal transfer of charge δf > 0 from the highest occupied orbital to a state of energy ε _i < ε _ℋ would decrease the total energy by an amount (ε _i – ε _ℋ)δf < 0.
3.
Reference [49] also highlights the limitations of conventional DFT approximations in capturing static correlation in spin-degenerate systems (the H₂ dissociation problem). Self-interaction errors arising from fractional occupations are nevertheless distinct from static correlation errors arising from fractional spins. In this work, only the self-interaction problem is addressed.
4.
Koopmans’ theorem was originally proven for the HF method considering frozen orbitals [109]. Here we refer to this case as the restricted Koopmans theorem. The generalized version of the theorem was introduced later [110] in order to include orbital relaxation. We note in passing that the generalized Koopmans theorem is a property of the exact many-body Green’s function G. The performance of GW approximations in this regard has been recently discussed by Bruneval [48]. In fact, when adopting the Lehmann representation, the poles of G, playing the role of (Dyson) orbital energies, are exactly given by total energy differences corresponding to many-body states with different numbers of particles (with one electron added or removed).
5.
One could rely on other definitions to measure the lack of Koopmans compliance. In particular, (30) has recently been exploited in [64] within the frozen orbital approximation. The comparative assessment of these closely related definitions is beyond the scope of this introductory review and will be discussed in detail elsewhere.
6.
It is very instructive to note that the linear-response DFT + U method of Cococcioni and de Gironcoli [57] is obtained from a similar expansion to evaluate the U parameters for the N _I preselected orbitals χ _Ii of the Ith atom. In fact, in its simplest form, the nonlinearity correction reads
$$ {E}_U\left[{f}_1,{f}_2,\dots, {\varphi}_1,{\varphi}_2,\dots \right]={\displaystyle \sum_{I=1}^{N_{\mathrm{atom}}}{\displaystyle \sum_{i=1}^{N_I}{\scriptscriptstyle \frac{U_{Ii}}{2}}{n}_{Ii}\left(1-{n}_{Ii}\right)}} $$
with
$$ {U}_{Ii}={\displaystyle \int {d}^3\mathbf{r}{d}^3{\mathbf{r}}^{\mathbf{\prime}}{d}^3{\mathbf{r}}^{\mathbf{{\prime\prime}}}\left|{\chi}_{Ii}\right|{}^2\left(\mathbf{r}\right){\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right){f}_{\mathrm{Hxc}}\left({\mathbf{r}}^{\mathbf{\prime}},{\mathbf{r}}^{\mathbf{{\prime\prime}}}\right)\left|{\chi}_{Ii}\right|{}^2\left({\mathbf{r}}^{\mathbf{{\prime\prime}}}\right)}\kern1em \mathrm{and}\kern1em {n}_{Ii}={\displaystyle \sum_{j=1}^{+\infty }{f}_j\left|\left\langle {\chi}_{Ii}\Big|{\varphi}_j\right\rangle \right|{}^2.} $$

The spirit of the Koopmans-compliant correction is identical with the advantage of not requiring preselected atomic orbitals.
7.
We note that in Figs. 2 and 4 we have used the Koopmans-compliant functional defined in (40), where the α screening coefficient has been included. We have adopted the same value for α in both figures. If no α were used [(35)], the K panel in Fig. 2 would show a flat curve, while that of Fig. 4 would display a negative slope, as in the HF method.
8.
For instance, one could compute the average dielectric screening coefficient related to the orbital ψ _i through
$$ {\alpha}_i=\frac{{\displaystyle \int {d}^3\mathbf{r}{d}^3{\mathbf{r}}^{\mathbf{\prime}}{d}^3{\mathbf{r}}^{\mathbf{{\prime\prime}}}\left|{\psi}_i\right|{}^2\left(\mathbf{r}\right){\tilde{\varepsilon}}^{-1}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right){f}_{\mathrm{Hxc}}\left({\mathbf{r}}^{\mathbf{\prime}},{\mathbf{r}}^{\mathbf{{\prime\prime}}}\right)\left|{\psi}_i\right|{}^2\left({\mathbf{r}}^{\mathbf{{\prime\prime}}}\right)}}{{\displaystyle \int {d}^3\mathbf{r}{d}^3{\mathbf{r}}^{\mathbf{\prime}}\left|{\psi}_i\right|{}^2\left(\mathbf{r}\right){f}_{\mathrm{Hxc}}\left(\mathbf{r},{\mathbf{r}}^{\mathbf{\prime}}\right)\left|{\psi}_i\right|{}^2\left({\mathbf{r}}^{\mathbf{\prime}}\right)}}+\cdots, $$
where it is understood that each quantity that appears in the integrals must be calculated self-consistently.
9.
The same approach is adopted when computing virtual orbital levels and band gaps within, e.g., hybrid DFT and DFT+U approximations.
10.
A detailed sensitivity analysis of this approximation is presented in [62].

References

Allen SM, Thomas EL (1999) The structure of materials, MIT series in materials science and engineering. Wiley, New York
Martin RM (2008) Electronic structure: basic theory and practical methods. Cambridge University Press, Cambridge
Hohenberg P, Kohn W (1964) Inhomogeneous electron gas. Phys Rev 136(3B):B864–B871. doi:10.1103/PhysRev.136.B864
Eschrig H (2003) The fundamentals of density functional theory. Edition am Gutenbergplatz, Leipzig
Lieb EH (1983) Density functionals for coulomb systems. Int J Quant Chem 24(3):243–277. doi:10.1002/qua.560240302
Baroni S, de Gironcoli S, Dal Corso A (2001) Phonons and related crystal properties from density-functional perturbation theory. Rev Mod Phys 73:515–562. doi:10.1103/RevModPhys.73.515
Payne MC, Arias TA, Joannopoulos JD (1992) Iterative minimization techniques for ab initio total-energy calculations: molecular dynamics and conjugate gradients. Rev Mod Phys 64(4):1045–1097. doi:10.1103/RevModPhys.64.1045
Perdew JP, Levy M, Balduz JL (1982) Density-functional theory for fractional particle number: derivative discontinuities of the energy. Phys Rev Lett 49(23):1691–1694. doi:10.1103/PhysRevLett.49.1691
Chong DP, Gritsenko OV, Baerends EJ (2002) Interpretation of the Kohn–Sham orbital energies as approximate vertical ionization potentials. J Chem Phys 116(5):1760. doi:10.1063/1.1430255
Casida M (1995) Generalization of the optimized-effective-potential model to include electron correlation – a variational derivation of the Sham–Schluter equation for the exact exchange-correlation potential. Phys Rev A 51(3):2005–2013
Casida M, Huix-Rotllant M (2012) Progress in time-dependent density-functional theory. Annu Rev Phys Chem 63(1):287–323. doi:10.1146/annurev-physchem-032511-143803
Runge E, Gross EKU (1984) Density-functional theory for time-dependent systems. Phys Rev Lett 52(12):997–1000. doi:10.1103/PhysRevLett.52.997
Onida G, Reining L, Rubio A (2002) Electronic excitations: density-functional versus many-body Green’s-function approaches. Rev Mod Phys 74(2):601–659. doi:10.1103/RevModPhys.74.601
Dreuw A, Weisman JL, Head-Gordon M (2003) Long-range charge-transfer excited states in time-dependent density functional theory require non-local exchange. J Chem Phys 119(6):2943. doi:10.1063/1.1590951
Himmetoglu B, Marchenko A, Dabo I, Cococcioni M (2012) Role of electronic localization in the phosphorescence of iridium sensitizing dyes. J Chem Phys 137(15):154309. doi:10.1063/1.4757286
Maitra NT (2005) Undoing static correlation: long-range charge transfer in time-dependent density-functional theory. J Chem Phys 122(23):234104. doi:10.1063/1.1924599
Tozer DJ (2003) Relationship between long-range charge-transfer excitation energy error and integer discontinuity in Kohn–Sham theory. J Chem Phys 119(24):12697. doi:10.1063/1.1633756
Faleev S, van Schilfgaarde M, Kotani T (2004) All-electron self-consistent GW approximation: application to Si, MnO, and NiO. Phys Rev Lett 93(12):126406. doi:10.1103/PhysRevLett.93.126406
Godby R, Schlüter M, Sham L (1988) Self-energy operators and exchange-correlation potentials in semiconductors. Phys Rev B 37(17):10159–10175. doi:10.1103/PhysRevB.37.10159
Hedin L (1965) New method for calculating the one-particle Green’s function with application to the electron-gas problem. Phys Rev 139(3A):A796–A823. doi:10.1103/PhysRev.139.A796
Hybertsen M, Louie S (1986) Electron correlation in semiconductors and insulators: band gaps and quasiparticle energies. Phys Rev B 34(8):5390–5413. doi:10.1103/PhysRevB.34.5390
van Schilfgaarde M, Kotani T, Faleev S (2006) Quasiparticle self-consistent GW theory. Phys Rev Lett 96(22):226402. doi:10.1103/PhysRevLett.96.226402
Zakharov O, Rubio A, Blase X, Cohen M, Louie S (1994) Quasiparticle band structures of six II-VI compounds: ZnS, ZnSe, ZnTe, CdS, CdSe, and CdTe. Phys Rev B 50(15):10780–10787. doi:10.1103/PhysRevB.50.10780
Albrecht S, Onida G, Reining L (1997) Ab initio calculation of the quasiparticle spectrum and excitonic effects in Li₂O. Phys Rev B 55(16):10278–10281. doi:10.1103/PhysRevB.55.10278
Albrecht S, Reining L, Del Sole R, Onida G (1998) Ab initio calculation of excitonic effects in the optical spectra of semiconductors. Phys Rev Lett 80(20):4510–4513. doi:10.1103/PhysRevLett.80.4510
Rohlfing M, Louie S (1998) Excitonic effects and the optical absorption spectrum of hydrogenated Si clusters. Phys Rev Lett 80(15):3320–3323. doi:10.1103/PhysRevLett.80.3320
Tiago M, Northrup J, Louie S (2003) Ab initio calculation of the electronic and optical properties of solid pentacene. Phys Rev B 67(11):115212. doi:10.1103/PhysRevB.67.115212
Donnelly RA, Parr RG (1978) Elementary properties of an energy functional of the first-order reduced density matrix. J Chem Phys 69(10):4431. doi:10.1063/1.436433
Gilbert T (1975) Hohenberg–Kohn theorem for nonlocal external potentials. Phys Rev B 12(6):2111–2120. doi:10.1103/PhysRevB.12.2111
Lathiotakis NN, Sharma S, Helbig N, Dewhurst JK, Marques MAL, Eich F, Baldsiefen T, Zacarias A, Gross EKU (2010) Discontinuities of the chemical potential in reduced density matrix functional theory. Z Phys Chem 224(3–4):467–480. doi:10.1524/zpch.2010.6118
Sharma S, Dewhurst JK, Shallcross S, Gross EKU (2013) Spectral density and metal-insulator phase transition in Mott insulators within reduced density matrix functional theory. Phys Rev Lett 110(11):116403. doi:10.1103/PhysRevLett.110.116403
Burke K (2012) Perspective on density functional theory. J Chem Phys 136(15):150901. doi:10.1063/1.4704546
Becke AD (1993) A new mixing of Hartree–Fock and local density-functional theories. J Chem Phys 98(2):1372. doi:10.1063/1.464304
Livshits E, Baer R (2007) A well-tempered density functional theory of electrons in molecules. Phys Chem Chem Phys 9(23):2932. doi:10.1039/b617919c
Kümmel S, Kronik L (2008) Orbital-dependent density functional: theory and applications. Rev Mod Phys 80(1):3–60. doi:10.1103/RevModPhys.80.3
Baer R, Livshits E, Salzner U (2010) Tuned range-separated hybrids in density functional theory. Annu Rev Phys Chem 61(1):85–109. doi:10.1146/annurev.physchem.012809.103321
Refaely-Abramson S, Baer R, Kronik L (2011) Fundamental and excitation gaps in molecules of relevance for organic photovoltaics from an optimally tuned range-separated hybrid functional. Phys Rev B 84(7):075144. doi:10.1103/PhysRevB.84.075144
Cococcioni M (2002) A LDA + U study of selected iron compounds. PhD thesis, SISSA, Trieste http://www.sissa.it/cm
Cococcioni M, de Gironcoli S (2005) Linear response approach to the calculation of the effective interaction parameters in the LDA+U method. Phys Rev B 71(3):035105. doi:10.1103/PhysRevB.71.035105
Kulik H, Cococcioni M, Scherlis D, Marzari N (2006) Density functional theory in transition-metal chemistry: a self-consistent Hubbard U approach. Phys Rev Lett 97(10):103001. doi:10.1103/PhysRevLett.97.103001
Kohn W, Sham LJ (1965) Self-consistent equations including exchange and correlation effects. Phys Rev 140(4A):A1133–A1138. doi:10.1103/PhysRev.140.A1133
Janak J (1978) Proof that ∂E/∂n_i = ε_i in density-functional theory. Phys Rev B 18(12):7165–7168. doi:10.1103/PhysRevB.18.7165
Cancès E (2001) Self-consistent field algorithms for Kohn Sham models with fractional occupation numbers. J Chem Phys 114(24):10616. doi:10.1063/1.1373430
Marzari N, Vanderbilt D, Payne M (1997) Ensemble density-functional theory for ab initio molecular dynamics of metals and finite-temperature insulators. Phys Rev Lett 79(7):1337–1340. doi:10.1103/PhysRevLett.79.1337
Cancès E, Le Bris C (2000) Can we outperform the DIIS approach for electronic structure calculations? Int J Quant Chem 79(2):82–90
Yang W, Zhang Y, Ayers PW (2000) Degenerate ground states and a fractional number of electrons in density and reduced density matrix functional theory. Phys Rev Lett 84(22):5172
Mori-Sánchez P, Cohen A, Yang W (2006) Many-electron self-interaction error in approximate density functionals. J Chem Phys 125:201102
Bruneval F (2009) GW approximation of the many-body problem and changes in the particle number. Phys Rev Lett 103(17):176403. doi:10.1103/PhysRevLett.103.176403
Cohen AJ, Mori-Sánchez P, Yang W (2008) Insights into current limitations of density functional theory. Science 321(5890):792–794. doi:10.1126/science.1158722
Mori-Sánchez P, Cohen A, Yang W (2008) Localization and delocalization errors in density functional theory and implications for band-gap prediction. Phys Rev Lett 100(14):146401. doi:10.1103/PhysRevLett.100.146401
Mori-Sánchez P, Cohen AJ, Yang W (2006) Many-electron self-interaction error in approximate density functionals. J Chem Phys 125(20):201102. doi:10.1063/1.2403848
Ruzsinszky A, Perdew JP, Csonka GI, Vydrov OA, Scuseria GE (2006) Spurious fractional charge on dissociated atoms: pervasive and resilient self-interaction error of common density functionals. J Chem Phys 125(19):194112. doi:10.1063/1.2387954
Parr RG, Yang W (1989) Density-functional theory of atoms and molecules. Oxford University Press, New York
Ayers PW, Morrison RC, Parr RG (2005) Fermi–Amaldi model for exchange-correlation: atomic excitation energies from orbital energy differences. Mol Phys 103(15–16):2061–2072. doi:10.1080/00268970500130183
Perdew J (1990) Size-consistency, self-interaction correction, and derivative discontinuity in density functional theory. In: Advances in quantum chemistry, vol 21. Elsevier, San Diego, California. pp 113–134
Kowalczyk T, Yost SR, Van Voorhis T (2011) Assessment of the ΔSCF density functional theory approach for electronic excitations in organic dyes. J Chem Phys 134(5):054128. doi:10.1063/1.3530801
Cococcioni M, de Gironcoli S (2005) Linear response approach to the calculation of the effective interaction parameters in the LDA + U method. Phys Rev B 71(3):035105. doi:10.1103/PhysRevB.71.035105
Cohen AJ, Mori-Sánchez P, Yang W (2012) Challenges for density functional theory. Chem Rev 112(1):289–320. doi:10.1021/cr200107z
Hachmann J, Olivares-Amaya R, Atahan-Evrenk S, Amador-Bedolla C, Sanchez-Carrera RS, Gold-Parker A, Vogt L, Brockway AM, Aspuru-Guzik A (2011) The Harvard clean energy project: large-scale computational screening and design of organic photovoltaics on the world community grid. J Phys Chem Lett 2(17):2241–2251. doi:10.1021/jz200866s
Lany S, Zunger A (2010) Generalized Koopmans density functional calculations reveal the deep acceptor state of N_O in ZnO. Phys Rev B 81(20):205209. doi:10.1103/PhysRevB.81.205209
Salzner U, Baer R (2009) Koopmans springs to life. J Chem Phys 131(23):231101. doi:10.1063/1.3269030
Dabo I, Ferretti A, Park CH, Poilvert N, Li Y, Cococcioni M, Marzari N (2013) Donor and acceptor levels of organic photovoltaic compounds from first principles. Phys Chem Chem Phys 15:685. doi:10.1039/c2cp43491a
Dabo I, Ferretti A, Poilvert N, Li Y, Marzari N, Cococcioni M (2010) Koopmans condition for density-functional theory. Phys Rev B 82(11):115121. doi:10.1103/PhysRevB.82.115121
Kraisler E, Kronik L (2013) Piecewise linearity of approximate density functionals revisited: implications for frontier orbital energies. Phys Rev Lett 110(12):126403. doi:10.1103/PhysRevLett.110.126403
Perdew JP, Zunger A (1981) Self-interaction correction to density-functional approximations for many-electron systems. Phys Rev B 23(10):5048–5079. doi:10.1103/PhysRevB.23.5048
Körzdörfer T, Kümmel S, Mundt M (2008) Self-interaction correction and the optimized effective potential. J Chem Phys 129(1):014110. doi:10.1063/1.2944272
Krieger J, Li Y, Iafrate G (1992) Construction and application of an accurate local spin-polarized Kohn–Sham potential with integer discontinuity: exchange-only theory. Phys Rev A 45(1):101–126. doi:10.1103/PhysRevA.45.101
Gatti M, Olevano V, Reining L, Tokatly IV (2007) Transforming nonlocality into a frequency dependence: a shortcut to spectroscopy. Phys Rev Lett 99(5):057401
Ferretti A, Cococcioni M, Marzari N (2013) Submitted
Cohen AJ, Mori-Sánchez P, Yang W (2007) Development of exchange-correlation functionals with minimal many-electron self-interaction error. J Chem Phys 126(19):191109. doi:10.1063/1.2741248
Pederson M, Heaton R, Lin C (1984) Local density Hartree–Fock theory of electronic states of molecules with self interaction correction. J Chem Phys 80:1972
Stengel M, Spaldin N (2008) Self-interaction correction with Wannier functions. Phys Rev B 77(15):155106
Marzari N, Vanderbilt D (1997) Maximally localized generalized Wannier functions for composite energy bands. Phys Rev B 56(20):12847–12865
Wannier GH (1937) The structure of electronic excitation levels in insulating crystals. Phys Rev 52:191–197
Messud J, Dinh P, Reinhard PG, Suraud E (2009) On the exact treatment of time-dependent self-interaction correction. Ann Phys 324:955–976
Vydrov O, Scuseria G, Perdew J (2007) Tests of functionals for systems with fractional electron number. J Chem Phys 126:154109
Körzdörfer T (2011) On the relation between orbital-localization and self-interaction errors in the density functional theory treatment of organic semiconductors. J Chem Phys 134:094111
Chan M, Ceder G (2010) Efficient band gap prediction for solids. Phys Rev Lett 105(19):196403. doi:10.1103/PhysRevLett.105.196403
Giannozzi P, Baroni S, Bonini N, Calandra M, Car R, Cavazzoni C, Ceresoli D, Chiarotti GL, Cococcioni M, Dabo I, Dal Corso A, de Gironcoli S, Fabris S, Fratesi G, Gebauer R, Gerstmann U, Gougoussis C, Kokalj A, Lazzeri M, Martin-Samos L, Marzari N, Mauri F, Mazzarello R, Paolini S, Pasquarello A, Paulatto L, Sbraccia C, Scandolo S, Sclauzero G, Seitsonen AP, Smogunov A, Umari P, Wentzcovitch RM (2009) QUANTUM ESPRESSO: a modular and open-source software project for quantum simulations of materials. J Phys Condens Mat 21(39):395502
Li Y, Dabo I (2011) Electronic levels and electrical response of periodic molecular structures from plane-wave orbital-dependent calculations. Phys Rev B 84(15):155127
Perdew J, Levy M (1983) Physical content of the exact Kohn–Sham orbital energies - band-gaps and derivative discontinuities. Phys Rev Lett 51(20):1884–1887
Perdew J, Levy M (1997) Comment on “significance of the highest occupied Kohn–Sham eigenvalue”. Phys Rev B 56(24):16021–16028
Blase X, Attaccalite C, Olevano V (2011) First-principles GW calculations for fullerenes, porphyrins, phthalocyanine, and other molecules of interest for organic photovoltaic applications. Phys Rev B 83(11):115103. doi:10.1103/PhysRevB.83.115103
Tiago ML, Kent PRC, Hood RQ, Reboredo FA (2008) Neutral and charged excitations in carbon fullerenes from first-principles many-body theories. J Chem Phys 129(8):084311
Foerster D, Koval P, Sánchez-Portal D (2011) An O(N³) implementation of Hedin’s GW approximation for molecules. J Chem Phys 135:074105
Pines D (1963) Elementary excitations in solids. W.A. Benjamin, New York
Pines D, Nozières P (1989) The theory of quantum liquids. Addison-Wesley, New York
Onida G, Reining L, Rubio A (2002) Electronic excitations: density-functional versus many-body Green’s-function approaches. Rev Mod Phys 74(2):601–659
Ferretti A, Mallia G, Martin-Samos L, Bussi G, Ruini A, Montanari B, Harrison N (2012) Ab initio complex band structure of conjugated polymers: effects of hybrid density functional theory and GW schemes. Phys Rev B 85(23):235105. doi:10.1103/PhysRevB.85.235105
Curtiss L, Raghavachari K, Redfern P, Pople J (1997) Assessment of Gaussian-2 and density functional theories for the computation of enthalpies of formation. J Chem Phys 106(3):1063–1079
National Institute of Standards and Technology (NIST) (2013). Computational chemistry comparison and benchmark database, http://cccbdb.nist.gov
Kadantsev ES, Stott MJ, Rubio A (2006) Electronic structure and excitations in oligoacenes from ab initio calculations. J Chem Phys 124(13):134901
Piancastelli M, Kelly M, Chang Y, McKinley J, Margaritondo G (1987) Benzene adsorption on low-temperature silicon: a synchrotron-radiation photoemission study of valence and core states. Phys Rev B 35(17):9218–9221. doi:10.1103/PhysRevB.35.9218
Trofimov AB, Zaitseva IL, Moskovskaya TE, Vitkovskaya NM (2008) Theoretical investigation of photoelectron spectra of furan, pyrrole, thiophene, and selenole. Chem Heterocycl Comp 44(9):1101–1112. doi:10.1007/s10593-008-0159-5
Coropceanu V, Malagoli M, da Silva D, Gruhn N, Bill T, Bredas J (2002) Hole- and electron-vibrational couplings in oligoacene crystals: intramolecular contributions. Phys Rev Lett 89(27):275503. doi:10.1103/PhysRevLett.89.275503
CRC (2009) CRC handbook of chemistry and physics. CRC, Boca Raton
Kramida A, Ralchenko Y, Reader J, NIST ASD Team (2013) NIST atomic spectra database
Mehlhorn W, Breuckmann B, Hausamann D (1977) Electron spectra of free metal atoms. Phys Scrip 16(5–6):177
Shirley D, Martin R, Kowalczyk S, McFeely F, Ley L (1977) Core-electron binding energies of the first thirty elements. Phys Rev B 15(2):544
Banna M, Wallbank B, Frost D, McDowell C, Perera J (1978) Free atom core binding energies from X-ray photoelectron spectroscopy. II. Na, K, Rb, Cs, and Mg. J Chem Phys 68:5459
Perera J, Frost D, McDowell C, Ewig C, Key R, Banna M (1982) Atomic and ionic core binding energies of selected levels in the alkaline earths from X-ray photoelectron spectroscopy and Dirac–Fock calculations. J Chem Phys 77:3308
Chen H, Pan Y, Groh S, Hagan T, Ridge D (1991) Gas-phase charge-transfer reactions and electron affinities of macrocyclic, anionic nickel complexes: Ni(salen), Ni(tetraphenylporphyrin), and derivatives. J Am Chem Soc 113(7):2766–2767
Schiedt J, Weinkauf R (1997) Photodetachment photoelectron spectroscopy of mass selected anions: anthracene and the anthracene-H₂O cluster. Chem Phys Lett 266(1):201–205
Crocker L, Wang T, Kebarle P (1993) Electron affinities of some polycyclic aromatic hydrocarbons, obtained from electron-transfer equilibria. J Am Chem Soc 115(17):7818–7822. doi:10.1021/ja00070a030
Prinzbach H, Weller A, Landenberger P, Wahl F, Worth J, Scott L, Gelmont M, Olevano D, von Issendorff B (2000) Gas-phase production and photoelectron spectroscopy of the smallest fullerene, C-20. Nature 407(6800):60–63
Yang S, Pettiette C, Conceicao J, Cheshnovsky O, Smalley R (1987) UPS of buckminsterfullerene and other large clusters of carbon. Chem Phys Lett 139(3):233–238
Wang XB, Ding CF, Wang LS (1999) High resolution photoelectron spectroscopy of C₆₀. J Chem Phys 110:8217–8220
Wang XB, Woo HK, Huang X, Kappes M, Wang LS (2006) Direct experimental probe of the on-site Coulomb repulsion in the doubly charged fullerene anion C₇₀²⁻. Phys Rev Lett 96(14):143002. doi:10.1103/PhysRevLett.96.143002
Szabo A, Ostlund NS (1996) Modern quantum chemistry: introduction to advanced electronic structure theory. Dover, New York
Phillips J (1961) Generalized Koopmans theorem. Phys Rev 123(2):420

Acknowledgements

The authors are indebted to M. Cococcioni, N. Poilvert, G. Borghi, N. L. Nguyen, C.-H. Park, M. Marqués, E. K. U. Gross, S. de Gironcoli, and S. Baroni for valuable discussions and relevant suggestions. ID acknowledges partial support from the French National Research Agency through Grant ANR 12-BS04-0001 PANELS (Photovoltaics from Ab-initio Novel Electronic-structure Simulations). AF acknowledges partial support from Italian MIUR through Grant FIRB-RBFR08FOAL_001.

Author information

Authors and Affiliations

Department of Materials Science and Engineering, Materials Research Institute, The Pennsylvania State University, University Park, PA, 16802, USA
Ismaila Dabo
Penn State Institutes of Energy and the Environment, The Pennsylvania State University, University Park, PA, 16802, USA
Ismaila Dabo
Centro S3, CNR–Istituto Nanoscienze, 41125, Modena, Italy
Andrea Ferretti
Theory and Simulations of Materials (THEOS), École Polytechnique Fédérale de Lausanne, 1015, Lausanne, Switzerland
Nicola Marzari

Corresponding author

Correspondence to Ismaila Dabo.

Editor information

Editors and Affiliations

Dipartimento di Scienza dei Materiali, Università di Milano-Bicocca, via Cozzi 55 20125, Milano, Italy
Cristiana Di Valentin
UMR5306 Université Lyon 1-CNRS Université Lyon, Institut Lumière Matière and ETSF, Villeurbanne, France
Silvana Botti
École polytechnique fédérale de Lausanne, Institute of Materials, Lausanne, Switzerland
Matteo Cococcioni

About this chapter

Cite this chapter

Dabo, I., Ferretti, A., Marzari, N. (2014). Piecewise Linearity and Spectroscopic Properties from Koopmans-Compliant Functionals. In: Di Valentin, C., Botti, S., Cococcioni, M. (eds) First Principles Approaches to Spectroscopic Properties of Complex Materials. Topics in Current Chemistry, vol 347. Springer, Berlin, Heidelberg. https://doi.org/10.1007/128_2013_504

DOI: https://doi.org/10.1007/128_2013_504
Published: 15 February 2014
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-55067-6
Online ISBN: 978-3-642-55068-3
eBook Packages: Chemistry and Materials Science, Chemistry and Material Science (R0)
