1 Introduction

Representative test sets consisting of a few small systems are very important to gauge the accuracy of computational methods. Moreover, small representative test sets are often used to adjust empirical parameters of new approximate methods. The test sets AE6 (six atomization energies) and BH6 (six barrier heights) proposed by Lynch and Truhlar [1] are supposed to be representative sets for the 109 atomization energies and 44 barrier heights of the Database/3 [2] data set. Unfortunately, they involve experimental reference values that have to be corrected for certain contributions (entropic, enthalpic, and zero-point vibrational contributions) that are usually not easily accessible in a new computational method. These corrections add to the uncertainty of the reference values. A very attractive alternative is to obtain highly accurate reference values from computational chemistry. Established protocols yielding very high accuracy are the W4 method [3] from the Wn family of methods [36] and the HEAT project [7]. Both involve highly expensive calculations and basis set extrapolations.

The aim of the present study is to propose an alternative computational protocol that comes close to the accuracy of the W4 method but which is computationally less expensive and does not involve basis set extrapolations. This new computational protocol employs explicitly correlated coupled-cluster theory including iterative single and double excitations as well as perturbative triple excitations, CCSD(T)(F12) [819], augmented with higher-order corrections. The design of our protocol is inspired by prior assessments of the performance of explicitly correlated methods for a variety of systems [2025]. We present results of our new composite method for the AE6 and BH6 test sets. We note that the highly accurate W4 method has already been applied to the BH6 test set [26] and to five of the six molecules of the AE6 test set [27]. We therefore compare our new results not only to the currently used experimental reference values but also to the values obtained from the W4 method.

Our computational approach allows us to provide relativistic as well as nonrelativistic and frozen-core reference values. We also provide error estimates to gauge the accuracy of the theoretical reference values obtained from our computational protocol. More approximate methods can be compared to these more suitable reference values.

2 Methods

The computational protocol proposed here avoids basis set extrapolations by usage of the explicit treatment of electron correlation by means of the CCSD(T)(F12) method. This yields the CCSD correlation energy very close to the basis set limit and offers a basis set incompleteness correction for the Hartree--Fock energy (CABS singles). We augment the part covered by CCSD(T)(F12) with corrections for higher excitations and core/core-valence correlation. We also propose a method/basis set combination to account for scalar-relativistic effects. If full-relativistic reference values are desired, we suggest to use the experimental fine structure to extract first-order spin-orbit coupling terms.

2.1 Explicitly correlated calculations

Frozen-core CCSD(T)(F12) and core/core-valence correlation energies were calculated with the Turbomole [28] module ricc2 [29] using reference wave functions from the module dscf [30]. Open-shell species were calculated via unrestricted formalisms. The CCSD(F12) part of the calculations used variant B, ansatz 2, where the strong orthogonality projector

$$ {\hat{Q}}_{12} = (1 - {\hat{O}}_1) (1 - {\hat{O}}_2) - {\hat{V}}_1 {\hat{V}}_2, $$
(1)
$$ \begin{aligned}\hat{O}_\mu=&\sum_i \left| \phi_i(\mu)\rangle\langle\phi_i(\mu)\right| ,\qquad \hat{V}_\mu = \sum_a \left|\phi_a(\mu)\rangle \langle \phi_a(\mu)\right| , \\\hbox{with occupied orbitals}\;\phi_i\,\hbox{and virtual orbitals}\; \phi_a\\\end{aligned} $$

is used, with sp cusp conditions [18] to predetermine the geminal amplitudes and with spin-flipped [19] geminals that are constructed from a linear combination of six Gaussian functions using recommended exponents [31]. Explicitly correlated calculations employed the cc-pVXZ-F12 series of orbital basis sets [32] with recommended auxiliary basis sets [33] and CABS basis sets [34] from the Turbomole basis set library. The perturbative CABS singles correction (E CABS) was taken into account [15].

2.2 Conventional calculations

Conventional AE-CCSD(T) calculations employing the cc-pwCVQZ and cc-pwCV5Z basis sets [35] in conjunction with recommended RI basis sets [33] were used to estimate the core/core-valence correlation energies. Conventional CCSD(T) calculations for error estimates employing the cc-pVXZ and cc-pV(X + d)Z [3639] series of basis sets were also performed using the Turbomole program package.

Higher excitation (difference between frozen-core CCSDT(Q) and CCSD(T) calculations) contributions and scalar-relativistic effects (AE-CCSD(T)) were calculated using the programs CFOUR [40] and MRCC [41, 42]. Estimation of higher excitation energies employed the cc-pVDZ and cc-pVTZ basis sets [36] in the case of H, C, and O, while the cc-pV(D + d)Z and cc-pV(T + d)Z basis sets [37] were used in the case of Si and S. The general coupled-cluster code MRCC was used for all CCSDT(Q) calculations. The calculation of scalar-relativistic effects (mass-velocity and Darwin terms) [43, 44] employed the cc-pwCVTZ and cc-pwCVQZ basis sets [35].

3 Results and discussion

3.1 Atomization energies and barrier heights of the AE6 and BH6 test sets

The AE6 test set consists of the atomization energies of the six molecules SiH4, SiO, S2, propyne (C3H4), glyoxal (C2O2H2), and cyclobutane (C4H8). The BH6 test set consists of the forward and reverse barrier heights of the three hydrogen transfer reactions \({\rm OH} + {\rm CH}_4\rightarrow {\rm CH}_3 + {\rm H}_2{\rm O}, {\rm H} + {\rm OH}\rightarrow {\rm O} + {\rm H}_2\), and \({\rm H} + {\rm H}_2{\rm S} \rightarrow {\rm H}_2 + {\rm SH}\). We use geometries at the QCISD/MG3 level of theory as supplied by Truhlar et al. [45]. We present atomization energies and barrier heights for the AE6 and BH6 test sets from conventional CCSD(T) calculations using the basis sets cc-pVDZ-F12, cc-pVTZ-F12, cc-pVQZ-F12, cc-pV(Q + d)Z, cc-pV(5 + d)Z, cc-pV(6 + d)Z, and CBS-extrapolated CCSD(T) as well as explicitly correlated CCSD(T)(F12) calculations using the basis sets cc-pVDZ-F12, cc-pVTZ-F12, and cc-pVQZ-F12. We use CBS extrapolations only to obtain error estimates for individual contributions. For the CBS extrapolation of the Hartree--Fock energy [46, 47], we use

$$ E^{\rm HF}(X) = E^{\rm HF}({\rm CBS}) + B {\rm e}^{-A X^{1/2}}, $$
(2)

fitting E HF(CBS), A, and B to energies E HF(X) from calculations with cc-pV(X + d)Z basis sets where X = Q, 5, 6. The CBS limit correlation energies were estimated by the two-point formula

$$ E^{\rm corr.}(X) = E^{\rm corr.}({\rm CBS}) + C X^{-3}, $$
(3)

fitting E corr.(CBS) and C to energies E corr.(X) using cc-pV(X  + d)Z basis sets where X = 5, 6 for CCSD and perturbative (T) correlation energies [48]. Additionally, we also show values for both test sets using the CCSD(T*)(F12) [26] method employing the basis sets cc-pVDZ-F12, cc-pVTZ-F12, and cc-pVQZ-F12. The CCSD(T*)(F12) method scales the perturbative triples of CCSD(T)(F12) with the ratio of the MP2-F12 and MP2 correlation energies,

$$ E^{\rm corr.}_{({\rm T}^*)} = E^{\rm corr.}_{\rm (T)} \frac{E^{\rm corr.}_{\mathrm{\rm{MP2}\rm{-}F12}}}{E^{\rm corr.}_{\rm MP2}}. $$
(4)

We also computed corrections for neglected core/core-valence correlation (δC/CV),

$$ \delta_{\rm C/CV} = E^{\rm corr.}_{\mathrm{\rm{AE}\rm{-}CCSD(T)/cc\rm{-}pwCVQZ}} - E^{\rm corr.}_{\mathrm{\rm{FC}\rm{-}CCSD(T)/cc\rm{-}pwCVQZ}}, $$
(5)

higher excitations (δHE),

$$ \delta_{\rm HE} = E^{\rm corr.}_{\mathrm{\rm {FC}\rm{-}CCSDT(Q)/cc\rm{-}pV(D+d)Z}} - E^{\rm corr.}_{\mathrm{\rm{FC}\rm{-}CCSD(T)/cc\rm{-}pV(D+d)Z}}, $$
(6)

and scalar-relativistic effects (δMVD, mass-velocity as well as one- and two-electron Darwin terms), at the AE-CCSD(T)/cc-pwCVTZ level of theory. The latter are used to obtain nonrelativistic experimental reference values while the corrections δC/CV and δHE are added to the calculated atomization energies and barrier heights. Table 1 compiles the atomization energies of the different quantum chemical methods and experimental reference values for the AE6 test set, while Table 2 presents the barrier heights for the BH6 test set. Here, δrel = δSO + δMVD, where δSO is a correction for spin-orbit effects taken from Ref. [49]. The W4 values were taken from Refs. [26] and [27]. Please note that slightly different geometries were used in Ref. [27]. All three corrections δHE, δC/CV, and δMVD are of rather small magnitude (about 1 kJ/mol) for all reactions of the BH6 test set and cancel out in parts, but they are much more important for the atomization energies of the AE6 test set. For C4H8, δHE, and δC/CV amount to somewhat more than 20 kJ/mol and for C3H4 and C2O2H2, they slightly exceed 17 kJ/mol. Explicitly treated electron correlation shows more benefits in the case of atomization energies than for barrier heights.

Table 1 Atomization energies for the molecules of the AE6 test set in kJ/mol
Table 2 Barrier heights for the reactions of the BH6 test set in kJ/mol

The atomization energies calculated from FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV and FC-CCSD(T*)(F12)/cc-pVQZ-F12 + δHE + δC/CV are in between the values obtained from FC-CCSD(T)/cc-pV(6 + d)Z + δHE + δC/CV and FC-CCSD(T)/CBS + δHE + δC/CV and very close to the highly accurate W4 values. Already, the atomization energies from FC-CCSD(T)(F12)/cc-pVDZ-F12 + δHE + δC/CV and FC-CCSD(T*)(F12)/cc-pVDZ-F12 + δHE + δC/CV are closer to the CBS estimates than the values from FC-CCSD(T)/cc-pV(Q + d)Z + δHE + δC/CV and FC-CCSD(T)/cc-pVQZ-F12 + δHE + δC/CV. The atomization energies from FC-CCSD(T*)(F12)/cc-pVXZ-F12 + δHE + δC/CV are surprisingly close to the values from FC-CCSD(T)(F12)/cc-pV(X + 1)Z-F12 + δHE + δC/CV. This suggests that FC-CCSD(T*)(F12)/cc-pVXZ-F12 can be used to approximate FC-CCSD(T)(F12)/cc-pV(X + 1)Z-F12 values when the latter method is computationally too expensive. However, FC-CCSD(T*)(F12)/cc-pVQZ-F12 + δHE + δC/CV overestimates the atomization energies on average compared to the nonrelativistic experimental values (Exp−δrel) and W4. It is easy to explain why the FC-CCSD(T*)(F12) methods seem to work well for some cases, but not for other systems. The perturbative triples contribution converges faster than the MP2 correlation energy. Hence, Eq. 4 will become less useful in larger basis sets. Any FC-CCSD(T*)(F12) method should only be used with extreme caution and only when none of the more accurate methods is feasible anymore due to system size.

Overall, the atomization energies and barrier heights obtained from FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV agree best with the currently adopted nonrelativistic experimental values and the highly accurate W4 values. This makes FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV a very attractive candidate for efficient and highly accurate computations of atomization energies and barrier heights when nondynamical correlation plays a minor role. However, one should expect a larger uncertainty of the experimental values for SiH4 and SiO as they have been obtained using an enthalpy of formation for the silicon atom with a rather large uncertainty of 7.95 kJ/mol [50, 51]. More recent estimates [54] suggest a slightly higher enthalpy of formation (by about 2 kJ/mol) for the silicon atom, which would result in a slightly higher atomization energy of silicon compounds. Also, the experimental reference value of the atomization energy of S2 has been revised to 426.3 kJ/mol (432.4 kJ/mol after subtraction of relativistic effects) [53]. The revised atomization energy of S2 is closer to the values from W4 and FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV than the previous experimental value.

We did not find revised experimental values for the barrier heights, but we expect them to be slightly less accurate than the values obtained from W4 and FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV. All experimental hydrogen transfer barrier heights are likely to be somewhat tainted by tunnelling effects. We expect this to be the case for the barriers of the reaction \({\rm H} + {\rm H}_2{\rm S} \rightarrow {\rm H}_2 + {\rm SH}\) because the experimental reference value is significantly lower than the one from W4 theory for forward and reverse directions. We expect that the disagreement of the barrier heights of the reaction \({\rm OH} + {\rm CH}_4 \rightarrow {\rm CH}_3 + {\rm H}_2{\rm O}\) is most likely due to uncertainties of the experimental values, while experimental values and W4 theory are in good agreement for the barrier heights of the reaction \({\rm H} + {\rm OH} \rightarrow {\rm O} + {\rm H}_2\). In the present study, we would like to establish a protocol with a similar accuracy as W4, which can also be used to treat larger systems (currently up to ten nonhydrogen atoms). Thus, we recommend FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV as a robust method to calculate atomization energies, barrier heights, and reaction energies. Relativistic reference values are easily derived by addition of δrel, and frozen-core reference values are available by subtraction of δC/CV.

3.2 Error estimates

Table 3 presents estimated errors for the FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV level concerning the atomization energies and barrier heights of the AE6 and BH6 test sets. We observe only a very small error in the Hartree--Fock energy, which is much improved by the CABS singles correction of F12 theory. The estimated errors are smaller for barrier heights than for atomization energies. The errors of higher excitations and perturbative triples have opposite signs for atomization energies. This error cancellation is systematic for the atomization energies of the AE6 test set. Thus, we use a slightly different formula for error propagation for atomization energies, as explained in footnote g of Table 3. The error in perturbative triples is greatly reduced by the CCSD(T*)(F12) method. However, the error in higher excitations is not affected by CCSD(T*)(F12), and therefore, the error cancellation between perturbative triples and higher excitations is diminished by the CCSD(T*)(F12) method. This explains why FC-CCSD(T*)(F12)/cc-pVQZ-F12 + δHE + δC/CV overestimates atomization energies. This error cancellation does not exist for the barrier heights of the BH6 test set. Instead, errors of core/core-valence treatment and perturbative triples have opposite signs for the barrier heights, and thus, we here observe a cancellation of errors for FC-CCSD(T)(F12) + δHE + δC/CV. Again, FC-CCSD(T*)(F12)/cc-pVQZ-F12 + δHE + δC/CV lessens this error cancellation. Therefore, we recommend to use the CCSD(T*)(F12) method only for small and medium basis set sizes (cc-pVDZ-F12 and cc-pVTZ-F12), while the CCSD(T)(F12) method should be preferred for accurate calculations using cc-pVQZ-F12 and larger basis sets. Effects of excitations higher than perturbative triples for core/core-valence correlation and scalar-relativistic effects as well as perturbative quadruples for higher excitation contributions are not covered by this error analysis but are expected to be negligible [27, 54]. This expectation is also supported by our comparison to W4 values in Tables 1 and 2.

Table 3 Error estimates (in kJ/mol) for the FC-CCSD(T)(F12)/cc-pVQZ-F12 + δHE + δC/CV level with respect to the AE6 and BH6 test sets

3.3 Recommended reference values

Table 4 summarizes our theoretical reference values obtained from CCSD(T)(F12)/cc-pVQZ-F12 calculations corrected for higher excitations (iterative triples and perturbative quadruple contributions), core/core-valence correlation, and relativistic effects (both at the CCSD(T) level of theory). They possess a similar accuracy as the values from W4 theory. Reference values suitable for comparison with frozen-core, all-electron, scalar-relativistic, and full-relativistic methods are presented separately. In Table 4, we also supply error estimates for each proposed reference value. The error estimates are derived from the CBS values.

Table 4 Proposed theoretical reference values for the test sets AE6 and BH6 in kJ/mol with error estimates

4 Conclusions

In summary, we have proposed a new protocol to predict very accurate atomization energies, barrier heights, and reaction energies without basis set extrapolations. The largest basis set employed is of quadruple-\(\zeta\) size, while other high-accuracy protocols include basis set sizes up to sextuple-\(\zeta\) and basis set extrapolations. We have also presented theoretical reference values for the AE6 and BH6 test sets. They are highly accurate as shown by error estimates that were calculated from CBS-extrapolated values and comparison with values from the highly accurate W4 theory. Our error estimates from error propagation show that the uncertainty of the reference values for atomization energies and barrier heights obtained from our new protocol is well below 1 kcal/mol (chemical accuracy). For barrier heights, our error estimates even suggest a sub-kJ/mol accuracy. Besides being consistently of very high accuracy, the new reference values have other important advantages over the experimental ones: Certain effects (core/core-valence correlation and relativistic effects) can be excluded. Thus, more appropriate reference values for comparison with more approximate methods (frozen-core and/or nonrelativistic methods) are available. We plan to apply the proposed computational protocol to larger test sets where doubt has been casted on currently used reference values.