1 Introduction

Electron spectroscopy for chemical analysis (ESCA), more commonly referred to as X-ray photoelectron spectroscopy (XPS), is an experimental technique widely used in many materials and surface science laboratories and facilities, either in research or in applied industry. XPS is broadly used for the elemental analysis of bulk materials, especially for surfaces, given its surface sensitivity [1, 2], but it has also been applied to the detection of gas phase molecules [3]. Furthermore, XPS is currently used to in situ observe the evolution of an heterogeneously catalyzed reaction, allowing for the characterization of reactants, intermediates and products, thus serving as a powerful tool to determine reaction mechanisms [46].

In part, the XPS usefulness hangs on the elemental analysis by measurements of core-level electron binding energies (BEs). However, the BEs are not only characteristic of a given element, but, more specifically, they are characteristic of a given element in a given chemical environment and electronic state. Thus, BEs provide qualitative information of the elements present in a sample, quantitative information of their concentration, plus qualitative/quantitative information of their different chemical environments and electronic states [16]. Such a detailed information enables BEs from XPS experiments to be used as chemical fingerprints in condensed phase systems, reflecting the chemical properties and bonding between the species in it [7]. The small variations for a given element in different chemical environments are often characterized by means of core-level BEs shifts (ΔBEs) and normally allow for distinguishing different structural as well as the oxidation state of a given atom [8].

The assignment of a given XPS peak to a given chemical element is straightforward and constitutes a useful property to benchmark ab initio methods. In fact, core-level BEs and their ΔBEs have been accurately predicted from ab initio calculations at the well-known Hartree–Fock (HF) level of theory [911]. Nevertheless, electron correlation effects are missing in HF, and explicitly correlated methods usually referred to as post-HF rapidly become computationally expensive. On the other hand, electron correlation is implicitly included in density functional theory (DFT)-based methods, which indeed have a better scaling with respect to the number of atoms and electrons than post-HF ones [12]. Not surprisingly, DFT-based methods are becoming the default choice.

The performance of DFT methods in predicting structure and reactivity of molecular systems is well documented [13], but, except for the series of works of Chong et al. [1417], the information regarding their performance on predicting core-level BEs is missing and, if present, refers mainly to functionals in the lower rungs of Jacob’s ladder of accuracy for DFT functionals. In one of these works, Takahata and Chong [17] studied 35 small organic molecules containing B → F atoms and considered a total of 59 1s core electron BEs, tackling the effect of using increasingly large basis sets and a sequence of 21 different DFT exchange–correlation (xc) functionals, either within the local density approximation (LDA) or the generalized gradient approximation (GGA), but also considering some earlier versions of meta-GGA xc functionals. Among all the studied functionals, the Voorhis-Scuseria (VS98) [18], meta-GGA and the Becke–Lee–Yang–Parr (BLYP) [19, 20] GGA were found to be the most accurate for computing BEs, with mean absolute errors (MEA) of ~0.2 and ~0.3 eV, respectively.

Unfortunately, this previous study [17] did not include the type of hybrid functionals which are nowadays among the most used and successful main-group element molecules [13]. These functionals incorporate a percentage of HF exchange such as B3LYP [21], probably among the most used methods in molecular quantum chemistry. The success of B3LYP is the significant improvement in the description of the thermochemistry of main-group molecules, relative to GGA and meta-GGA functionals [13]. Another family of meta-GGA and hybrid xc functionals with excellent performance in the thermochemistry of main-group elements is the one developed by Truhlar’s group and often referred to as Minnesota functionals [22].

To assess the performance of these new functionals in predicting 1s core-level BEs of main-group element containing molecules, the present work describes a systematic study for the series of molecules included in the study of Takahata and Chong [17]. In particular, we chose the M06 [23] and M11 [24] hybrids and added the M06-L and M11-L local meta-GGA functionals [25, 26], given that such xc has proven to deliver an accurate description even in very complex systems [27]. In fact, these methods are able to solve the well-known CO adsorption on Pt(111) puzzle [28]. In addition, their local character and concomitant low computational cost are particularly well suited for periodic DFT calculations [29]. For comparison purposes, HF and B3LYP methods have also been tested.

2 Computational details

Core-level BEs can be rather accurately predicted from ab initio calculations—such as HF- or DFT-based methods—[911, 1417, 31, 32] via the difference between the total energies of the neutral state and the same system with a core–hole configuration generated by subtracting one electron in the desired core level, as shown in Eq. 1.

$${\text{BE}} = E\left( {\text{Core hole state}} \right) - E({\text{Neutral state}})$$
(1)

A possible way to obtain the energy values in Eq. 1 is to make use of separate self-consistent field (SCF) calculations. The resulting procedure is usually referred as ΔSCF [9, 11], and Eq. 1 is usually rewritten as

$${\text{BE}}_{i} = E_{i}^{N - 1} \left( {\text{SCF}} \right) - E^{N} ({\text{SCF}})$$
(2)

where the subindex i in BE i indicates the ionized core, whereas \(E^{N} \left( {\text{SCF}} \right)\) and \(E_{i}^{N - 1} \left( {\text{SCF}} \right)\) are the variationally optimized energy for the initial system with N electrons and the final systems with (N-1) electrons and the corresponding i core–hole.

The ΔSCF calculations have been performed on the molecular set proposed by Takahata and Chong [17], who assessed the performance of some gradient corrected functionals on a series of organic molecules containing B → F elements. Note, however, that here B is not considered, since only BF3 result is available, and would deliver only one point for B-based molecules, thus not allowing for a statistical analysis. Therefore, only 20 molecules have been contemplated, which involve 39 core levels—see the full list in “Appendix.” The ΔBEs are, for each element C → F level, computed with respect to a given simple reference molecule: methane (CH4), ammonia (NH3), water (H2O), and fluoromethane (CH3F) for C → F, respectively.

The calculations have been carried out using a large fully uncontracted basis set near the HF limit, ensuring an accurate and well-defined description of both neutral and ionized states. Specifically, for C → F, an uncontracted Partridge (14s,9p) set augmented by a d function taken from the polarized valence triple zeta (pVTZ) set was used. For H atoms, we used an uncontracted basis set (5s) taken from the VTZ basis set augmented with a p function [30]. All BEs and ΔBEs have been obtained using the M06, M06-L, M11, and M11-L methods, which can be taken as representative of the Minnesota series of functionals, as well as with HF, a well-defined reference method, and B3LYP, one of the most broadly used DFT-based methods. In particular, B3LYP has been recently found to be acutely suited in estimating BEs for a series of N-containing molecules in gas phase [31, 32].

For any of the explored methods, the equilibrium geometry of each molecule is optimized for the neutral molecule. Frequency analyses have been carried out on each studied system, ensuring that they represent a minimum in the corresponding potential energy hypersurface. Note also that ΔSCF calculations aimed to obtain BEs and ΔBEs values are carried out at the geometry of the neutral molecule, i.e., vertical transitions, which is a reasonable choice given the timescale of core-level ionization in the XPS experiments. For the core–hole state, the occupied orbitals are selected using an overlap criterion instead of the usual Aufbau approach. In order to properly account for spin symmetry, calculations at the Hartree–Fock level employ the restricted open-shell (ROHF) formalism. A restricted formalism has also been adopted for the DFT-based calculations, thus avoiding spin contamination and artificial spin polarization in valence electron in the state with a core–hole, the need for spin projection [33] and, at the same time, providing appropriate comparison with those obtained at the HF level. All calculations have been carried out using the GAMESS program [34, 35].

Note by passing by that relativistic effects, not included in the previously described calculations, are different for distinct core levels and increase with the atomic number of the ionized atom. In order to discuss the accuracy of the different methods in predicting BEs, it is convenient to have a reliable estimate of the contribution of the relativistic effects. To this end, results from relativistic and non-relativistic calculations for the C → F isolated atoms at the HF level of theory provided by Bagus [36] have been used. These relativistic calculations calculations were carried out with the DIRAC program [37], and the non-relativistic calculations were carried out with the CLIPS code [38]. The wave functions were based on the average of configurations and do not take into account the multiplet splitting effects for these open-shell atoms [39]. We compared fully relativistic four-component Dirac HF wave functions and energies with non-relativistic HF wave functions and energies for the C → F isolated atoms. The basis sets used for these calculations were the same as for the non-relativistic calculations. It is worth pointing out that previous works [31, 32] validated the GAMESS results for the core–hole states by comparing with results obtained with CLIPS.

3 Results and discussion

Let us first analyze the absolute BEs—BE(ΔSCF)—results, which are encompassed in Table 1. At a first glance, any functional explored and also HF are targeting well the experimental BE. Indeed, the statistical analysis, evaluated in terms of mean error (ME), MAE and the mean absolute percentage error (MAPE), consistently shows it. Hence, any functional here explored (also HF) excellently targets the experimental value with a deviation below 0.17 %. When one goes into the fine detail, HF tends to underestimate the experimental BEs by ~0.3 eV, due to the neglect of electron correlation which is larger in the ground state than in the core–hole containing cation. Note, however, that B3LYP performs in a similar way even if here correlation effects are accounted for through the xc potential.

Table 1 BE(ΔSCF) results for the 1s core orbitals analyzed, in underlined bold font, of the molecules described in the overall set

A different panorama emerges when analyzing results predicted by the Minnesota functionals. In fact, the M06 hybrid xc is found to underestimate BEs by ~0.7 eV and the related meta-GGA M06-L functional also underestimates the BEs by ~0.3 eV. Thus, the underestimation seems to be residing in the functional by construction, although relativistic effects must also be considered, as shall be commented below. On the other hand, the M11 and M11-L reformatted xc functionals tend to overestimate the BEs by ~0.3 and ~0.7 eV, respectively. In any case, it is clear that by adding the corresponding % HF to M06-L or M11-L meta-GGA xc functionals, the BEs lower by ~0.45 eV, which is not so easy to interpret. By further analyzing the data using MAE, one can remove sign cancellation errors; in this way, M11 hybrid and M06-L meta-GGA seem to be the best xc functionals to predict absolute BE values, but with a mean error for both near ~0.3 eV. Interestingly, this is closely followed by B3LYP, with a mean absolute error of ~0.4 eV.

An elemental analysis for BEs of C → F is visually presented in Fig. 1, where excellent linear relationships are drawn for calculated BEs versus the experimental ones. Several conclusions can be withdrawn from this analysis. The first one is that C1s are rather accurately described by any of the inspected methods, with just very slight deviations from ideality. Thus, the over/underestimations commented in Table 1 arise from N → F elements. Indeed, for N1s orbitals, the BEs of M11-L—the xc functional with the largest overestimation, as seen in Table 1—display a positive offset with respect to the experimental values, whereas HF and M06 display a similar negative offset, which seems to be physically more correct. These deviations seem to be progressively acute when going to O1s and F1s core levels.

Fig. 1
figure 1

Calculated, BE(ΔSCF), versus experimental, BE(Exp.), core-level BEs for C, N, O and F elements. All values are given in eV. Black lines represent the ideal experimental matching

Aside, the B3LYP displays a less markedly negative offset, only significant for O1s and F1s . Last but not least, M06-L meta-GGA and M11 hybrid display very little deviations with respect to ideality. It is remarkable that for O1s and F1s a trend of underestimation of HF > M06 > B3LYP > M06-L > M11 > M11-L is obtained, regarding that such underestimation turns to be overestimation in the M11-L case. Last but not least, the above-commented rising of BEs when adding HF exchange to M06-L and M11-L meta-GGAs, i.e., when using M06 and M11, respectively, is clearly seen in N → F panels.

Let us pay attention to ΔBEs, listed in Table 2, instead of absolute BE values. The overall statistical analysis reveals that, as expected, the above-commented over/underestimations cancel when comparing BE shifts, and so, the overall deviations for all the tested methods sensibly drop: The Minnesota functionals and B3LYP have MAE values of ~0.2 eV, whereas HF displays a larger value of ~0.4 eV. Aside the marked underestimation observed for HF, B3LYP, and M06 and M06-L vanish. It is noteworthy how by estimating BE shifts the mean error on DFT xc functionals approach to the XPS chemical accuracy of 0.1 eV. This is a clear indication of systematic errors in the functionals which could in this way be mended.

Table 2 ΔBE(ΔSCF) results for the analyzed 1s core orbitals of atoms in molecules described in the overall set and highlighted in underscored bold font

This agreement is also reflected when plotting, itemized for the different studied elements, the estimated ΔBEs with respect to the experimental ones, as shown in Fig. 2. The agreement between methods is remarkable, and barely one can only highlight a sensibly larger slope for HF, most acute for C1s and O1s cases, slightly followed by M11 and faint value dispersion for F1s cases. Actually, one can inspect the data gained from the linear fittings in this matter, as shown in Table 3. It is clear that linear fitting is excellent for ΔBEs, with R regression coefficients larger than 0.99. One can estimate, for each inspected element and method, the mean offset for the obtained range of ΔBEs. By doing so, it is clear that the larger the slope, i.e., the more deviation with the ideal value of 1.0, the larger the offset, as observed for C1s and N1s ΔBEs, with slopes of 1.12 and 1.27, and offsets of 0.67 and 0.87 eV, respectively. This is also observed for M11 but to a lower degree, e.g., for N1s , with a slope of 1.11 and an offset of 0.49 eV. Aside from these two cases, the overall offsets range 0–0.2 eV, really close to the ESCA chemical accuracy of 0.1 eV. It is worth highlighting that the lowest offset deviation is found for M06-L meta-GGA, below 0.1 eV regardless of the studied method.

Fig. 2
figure 2

Calculated, ΔBE(ΔSCF), versus experimental, ΔBE(Exp.), core-level ΔBEs for C, N, O, and F elements in the molecular data set. All values are given in eV. Black lines represent the ideal experimental matching

Table 3 Summary of the regression adjustments of the 1s core-level BEs and ΔBEs for each element and the overall set for the different explored methods, including the slope and the linear equation regression coefficient, R

The offset analysis discussed above can also be applied to BEs. Indeed, Table 3 contains the linear fittings for the BEs plots in Fig. 1. Again, the linear adjustments are excellent, with regression coefficients above 0.99. Here, however, the offsets can be larger despite a slope near one, reflecting the over/underestimation of the chosen xc functional or HF. Indeed, the deviations can be as high as 1 eV, as observed for F1s cases.

The analysis of results presented so far refers to non-relativistic calculations. We indeed considered the influence of relativistic effect on the 1s BEs on the studied elements. Relativistic (Rel.) and non-relativistic (Non-Rel.) calculations have been carried out for the C → F atoms, and the relativistic contributions and the comparison with the non-relativistic ones are given in Table 4. The relativistic change in the final state, including the relaxation of the electrons with a core–hole, Diff., is defined as:

$${\text{Diff = BE}}\left( {\Delta {\text{SCF,Rel}}} \right)-{\text{ BE}}\left( {\Delta {\text{SCF,Non-Rel}}} \right) .$$
(3)
Table 4 Relativistic contributions (Rel.) and non-relativistic (Non-Rel.) results for BE(ΔSCF) B → F calculations

It has been reported [32] that the relativistic changes on the initial and final states are very close to each other, which means that the relativistic changes are dominant, already at the initial state. Since we want to establish the relativistic contribution to the absolute core-level BEs, the changes were studied for the BE(ΔSCF) values. The Diff. values in Table 4 show that the relativistic 1s core BEs are larger than the non-relativistic values, although the relativistic effects lead to a very small increase in the core-level BEs for these light atoms as expected, from 0.13 eV for B to 0.75 eV for F, thus increasing along the C → F series, as expected.

When properly accounting for the relativistic effects on 1s core orbitals, a clear distinction in between HF and the other xc meta-GGA and hybrid functionals pops up. HF overestimates C1s and N1s levels by ~0.65 eV, whereas it underestimates O1s and F1 s by ~0.45 eV. Such trend behavior is mimicked by B3LYP, but with smaller values of around 0.2 eV, implying that this functional seems to efficiently capture the correlation effects missing in HF. When dealing with Minnesota xc hybrid functionals, M06 is consistently, for C → F elements, underestimating BEs by ~0.3 to 0.4 eV, while M11 consistently overestimates by ~0.6 to 0.8 eV. The meta-GGA M11-L xc functional only worsens the situation, with an overestimation of more than 1 eV. However, despite the overall poor description of Minnesota functionals in estimating 1s core-level BEs, the M06-L meta-GGA performs excellently. By including the relativistic contributions to properly compare with experiment, the deviation with respect to the experimental values is, in average terms, below 0.1 eV, and this is within the XPS chemical accuracy. Thus, M06-L, known to be a rather accurate xc meta-GGA functional, even suited to simulate particularly complicated systems [27, 28], is here reinforced in the sense that is also well suited in obtaining an excellent description of 1s core electron energies of main-group elements.

In light of the above-presented discussion, the obtained results for the six explored methods, with and without relativistic corrections, are overall good when computing ΔBEs of main-group organic molecules, and predict absolute core-level BEs with a small percentage deviation. However, when relativistic effects are considered, the meta-GGA M06-L xc functional stands out in between its fellows, showing chemical accuracy in both BEs and ΔBEs, with the concomitant advantage of being computationally more affordable than other tested hybrid functionals, and so poses itself as excellent for describing at the same time the thermochemistry of organic molecules and their core state energies as well as its capability to be used in periodic DFT calculations with plane wave basis sets [29].

4 Conclusions

Here we have explored the performance of M06, M06-L, M11 and M11-L Minnesota functionals in predicting 1s core-level BEs of a set of 20 molecules (39 core levels explored in total) extracted from a previous study [17] containing samples of molecules containing C → F main-group elements. Results for HF and B3LYP have been included for comparison. The obtained results using ΔSCF methodology have been compared to the reported experimental references. This has been carried out in a non-relativistic fashion, yet the relativistic effects have been explicitly considered on isolated atoms and since are known to be independent of the particular chemical environment, added a posteriori on the obtained estimates.

The analysis yields that computed absolute core-level BEs are overall, and regardless of the method, close to the reported experimental values, with small percentage deviations, but sensible when considered in an absolute deviation, due to the inherent different construction of the explored functionals and methods. The BEs values have also been analyzed in terms of shifts, ΔBEs, where such building differences are cancelled, showing in overall terms a good performance; actually B3LYP and M06 hybrids, or M06-L and M11-L meta-GGAs are recommended for ΔBEs since their deviations, of the order of 0.2–0.3 eV, are close to the XPS chemical resolution of 0.1 eV.

However, when relativistic effects are included, all methods show strong over/underestimations of the BEs, which can be as large as 1 eV. There is, however, one exception to this unexpected behavior. The M06-L meta-GGA Minnesota xc functional, which after applying the pertinent relativistic corrections, displays an overall mean deviation of the estimated BEs with respect to the experimental measurements below 0.1 eV, and this is well within the XPS chemical resolution and so, poses itself as an excellent choice when modeling core-level BEs of main-group organic molecules, plus being computationally less demanding than hybrid functionals, which in turn display a lower accuracy in BEs.