Introduction

Iodine is an element that belongs to group 17 of the periodic table [1]. Its compounds can be widely applied [2, 3], with emphasis on the medical-pharmaceutical area: antiseptic/disinfectant (iodine tincture, polyvinylpyrrolidone); expectorant (potassium iodide solution, KI). Potassium iodate (KIO3) is prescribed to people affected by radiation to prevent or reduce the radioactive iodine absorption by the thyroid [2]. Another great application is in the analytical area, where iodine compounds are used to determine the oils and fats unsaturation level, as one of the reagents of the Karl Fischer method or in qualitative and quantitative analyses and organic syntheses [3]. In addition, iodine has also been used in the development of new materials, such as polarizing film for liquid–crystal display (LCD) and electrolytes for dye-sensitized solar panels [1].

However, theoretical studies involving high-level calculations to obtain the thermochemical properties of iodine compounds have been little explored mainly due to computational difficulties, even though there are reliable experimental data on their compounds in the gas phase.

Recently, Leal and Custodio formulated the G3(MP2)//B3-SBK theory [4], combining the CEP pseudopotential and the respective 31G basis set developed by Stevens, Basch, and Krauss to replace the P31G basis functions used in the G3(MP2)//B3-CEP method. The main differences are in the fact that the G3(MP2)//B3-CEP theory adapts the pseudopotential to the basis set used in the corresponding original method that considers all electrons (CEP-P31G, it was the notation given to the basis function adapted in reference to Pople basis functions) and the G3(MP2)//B3-SBK theory uses the basis set developed for the respective pseudopotential (SBK or CEP-31G). Furthermore, each method has its values for the reoptimized parameters of the s, p, and sp functions in the large basis set of carbon, nitrogen, oxygen, fluorine, phosphorus, and chlorine atoms, as well as for the parameters used in the calculation of the high-level empirical correction. The G3(MP2)//B3-SBK theory was initially tested, according to the respective corresponding versions G3(MP2)//B3 [5] and G3(MP2)//B3-CEP [6], for a set of 446 systems containing representative elements up to the 4th period [H-Kr] of the periodic table. The best combination of adjusting the G3(MP2)//B3-SBK theory produced a mean absolute error for all tested properties of 1.43 kcal mol−1. This deviation is essentially equal to that presented by the all-electron theory G3(MP2)//B3 (1.41 kcal mol−1) with savings of up to 45% CPU time and it has better accuracy than the corresponding theory that uses adapted pseudopotential G3(MP2)//B3-CEP (1.60 kcal mol−1) with equivalent CPU time. Unlike the G3(MP2)//B3 and G3(MP2)//B3-CEP theories, the G3(MP2)//B3-SBK theory allows the calculation of thermochemical properties of compounds for some elements of the 5th line of the periodic table, like Sn and Sb. Thus, the idea of expanding the application domain of this method for calculations of compounds containing heavier elements, such as iodine, arose due to its relevance in many chemical and biological processes.

Other papers can be used as a reference to demonstrate the efficiency of using composite methods. As an example, Silva [7, 8] recently used the composite methods EnAt1, EnAt2, G3X-CEP, G3X(CCSD)-CEP, and G4 [9] to determine the enthalpies of formation of 29 aluminum compounds. The Stuttgart/Dresden pseudopotential was used in the G3(MP2,CSSD,rel) theory for 20 compounds containing transition elements from scandium (Sc) to zinc (Zn) [10]. The G4(CEP) theory [11, 12] was used to accurately determine the pKa of 22 monocarboxylic acids. Other examples of applications can be found in the study of Diels–Alder reaction mechanisms [13], phenol nitration [14], rotation barriers [15], and among others.

The objective of this research was to expand the application domain of the G3(MP2)//B3-SBK theory [4] to calculate enthalpies of formation of iodine-containing compounds. The calculations efficiency was performed comparatively to accurate experimental data and calculated with the G2 theory [16, 17]. Other conventional methods (e.g., DFT, MPn) were combined with different basis sets (e.g., Pople, Dunning) for a more global assessment of errors produced by conventional calculations regarding experimental data available in the literature.

Computational methods

All calculations were performed using the Gaussian 09 software [18]. The enthalpies of formation were calculated, at 298.15 K and 1 atm, according to the standard procedure established in the literature [19]: (a) The first step is to calculate the enthalpies of formation \(({\Delta H}_{f}^{0} \left(M,0 K\right)\)) of the molecule \(\left(M\right)\) at 0 K from the molecular atomization energy \(\left(\sum {D}_{0}(M)\right)\) and heats of formation of the atoms \((\sum_{atoms}x{\Delta H}_{f}^{0} \left(X, 0 K\right))\) at 0 K:

$${\Delta H}_{f}^{0} \left(M,0 K\right)= \sum_{atoms}x{\Delta H}_{f}^{0} \left(X, 0 K\right)-\sum {D}_{0} \left(M\right) ,\mathrm{ where}:$$
$$\sum {D}_{0}\left(M\right)= \sum_{atoms}x{\varepsilon }_{0} \left(X\right)-{\varepsilon }_{0} \left(M\right)-{\varepsilon }_{ZPE}(M)$$

where \(\sum_{atoms}x{\varepsilon }_{0} \left(X\right)\) is the sum of the energies of the constituent atoms, \({\varepsilon }_{0} \left(M\right)\) is the total energy, and \({\varepsilon }_{ZPE}(M)\) is the zero-point energy of the molecule. \(X\) is each element which makes up M, and \(x\) is the number of atoms of \(X\) in the molecule \(M\). (b) The second step is to calculate the enthalpies of formation \(({\Delta H}_{f}^{0} \left(M, 298 K\right))\) at 298 K:

$${\Delta H}_{f}^{0} \left(M, 298 K\right)={\Delta H}_{f}^{0} \left(M,0 K\right)+{(H}_{M}^{0} \left(298 K\right)-{H}_{M}^{0} \left(0 K\right))-\sum_{atoms}x({H}_{X}^{0}\left(298 K\right)- {H}_{X}^{0} \left(0 K\right))$$

where \({(H}_{M}^{0} \left(298 K\right)-{H}_{M}^{0} \left(0 K\right))\) is the molecular enthalpy correction and \(\sum_{atoms}x({H}_{X}^{0}\left(298 K\right)- {H}_{X}^{0} \left(0 K\right))\) is the sum of the enthalpy corrections of the atomic elements. Table 1 shows values used for enthalpies of formation of the elements at 0 K and respective thermal corrections (\({\mathrm{H}}^{0} (298\mathrm{ K})-{\mathrm{H}}^{0} (0\mathrm{ K})\)), at 298.15 K (25 °C) and 0.1 MPa (1 bar), both were taken from references [8, 19,20,21].

Table 1 Enthalpies of formation at 0 K for gaseous atoms and enthalpy corrections of the atomic elements in their standard states from experiment. Values in kcal.mol–1

Knowing that the SBK or CEP–31G basis functions [22, 23] were not developed with polarization functions, similar to the procedure adopted by Leal and Custodio to include tin (Sn) and antimony (Sb) atoms [4], the adjustment of the polarization function for iodine was performed while minimizing the energies of a subset of 14 molecules through the optimization at B3LYP/CEP–31G(d) level. For this adjustment, systems available at WebNIST [24] were selected which presented experimental values of standard enthalpy of formation in the gas phase with an uncertainty of the order of ± 1 kcal mol−1: HI, CH3I, C2H5I, CIN, CH3IS, C3H5I, C2H5IS, C4H9I, C2H4ICl, C2H3IO, C4H7IO, C7H5IO, CF3I, and C2H2F3I. The obtained value of 0.270 for the polarization function exponent of the iodine atom is similar to the corresponding values of 0.266 (G2 theory, all-electron) [17], 0.279 (ECP(HW), the shape-consistent orbital-adjusted ECPs of Hay and Wadt) [17, 25], 0.267 (ECP(S), energy-adjusted ECPs of the Stuttgart group (S)) [17, 26] that are the values of energy-optimized exponents of polarization d functions for augmentation of the valence basis sets in all-electron (AE) and effective core potential (ECP) calculations iodine-containing species. The values of the polarization functions’ exponents, used together with the CEP-31G set, for the other elements were taken from the Leal and Custodio work [4].

The energy G3(MP2)//B3–SBK [4] is determined through the equation:

$${\mathrm{E}}_{\mathrm{G}3(\mathrm{MP}2)//\mathrm{B}3-\mathrm{SBK}}=\mathrm{E}\left[\mathrm{QCISD}(\mathrm{T})/\mathrm{CEP}-31\mathrm{G}\left(\mathrm{d}\right)\right]+\Delta {\mathrm{E}}_{\mathrm{G}3\mathrm{MP}2\mathrm{large}}+{\mathrm{E}}_{\mathrm{SO}}+{\mathrm{E}}_{\mathrm{HLC}}+{\mathrm{E}}_{\mathrm{therm}}$$

in which, E[QCISD(T)/CEP-31G(d)] is the reference energy; ΔEG3MP2large = E[MP2/CEP–G3MP2large] – E[MP2/CEP–31G(d)] is the energy component calculated with the basis function CEP-G3MP2large, included to correct the effects of extending the basis sets; ESO is the spin–orbit correction for atoms whose values were extracted from the literature [9, 17]; EHLC is an empirical correction represented by EHLC =  − Anβ − B(nα − nβ) for molecules (being A = 8.849 and B = 4.495, mHartree) and EHLC =  − Cnβ − D(nα − nβ) for atoms and atomic ions (being C = 9.436 and D = 1.586, mHartree), in which nα and nβ are the number of valence electrons with alpha and beta spins, respectively; Etherm is the energy component that contains the zero point correction and thermal effects related to translational, rotational, and vibrational motions. All vibrational frequencies were scaled by a factor of 0.96. The cc-pV5z-PP basis set [27] was considered the CEP-G3MP2large for the iodine atom.

The energy G2 [16, 17] is obtained through an improvement of the G1 energy [28] stated by:

$${\mathrm{E}}_{0}\left(\mathrm{G}2\right)={\mathrm{E}}_{0}\left(\mathrm{G}1\right)+\Delta +1.14\mathrm{ npair}$$

in which, E0(G1) is the energy obtained through the G1 theory; Δ = E[MP2/6–311 + G(3df,2p)] – E[MP2/6-311G(2df,p)] – E[MP2/6–311 + G(d,p) + E[MP2/6-311G(d,p)] and npair is the number of valence electron pairs. The detailed step-by-step to obtain energy E0(G1) can be directly verified in Pople et al. [28]. To obtain the G2 energy of compounds involving the iodine element, the protocol established by Glukhovtsev et al. [17] was followed.

In addition to the G3(MP2)//B3–SBK and G2 composite methods, the performance of three popular functionals B3LYP [29, 30], M06-2X [31], WB97XD [32], and the MP2 method [33] with the basis sets 6-311G(d,p) [17, 34, 35], LANL2DZ [25, 36], jorge-ADZP [37, 38], and CEP-31G(d) were also evaluated as formulated in this research, since these methodologies also allow direct application to systems containing iodine, which contributes to a global assessment.

Results and discussion

Table 2 shows the errors set (ΔH0f (exp) – ΔH0f (calc)) calculated concerning experimental values of standard enthalpy of formation (high precision in the gas phase) available in WebNIST [24] for 41 iodine compounds with the G2 and G3(MP2)//B3-SBK theories. Besides, mean absolute deviation (MAD) and standard deviation (SD) values are also reported.

Table 2 Experimental enthalpies of formation and calculated errors with G2 and G3(MP2)//B3-SBK theories concerning the experimental data. Values in kcal mol−1

The G3(MP2)//B3-SBK method presented a mean absolute deviation equal to 3.7 kcal mol−1 and a standard deviation equal to 6.0 kcal mol−1, while the G2 method presented values of 3.1 kcal mol−1 and 4.9 kcal mol−1 respectively. The deviations obtained with the G3(MP2)//B3-SBK theory are greater when compared to those obtained with the G2 theory, but achieving the G3(MP2)//B3-SBK energy involves fewer steps in addition to the smallest computational cost. There is a balance in the individual performance between these two composite methods, and there are 17 systems that present errors (regarding the experimental data) outside the range of ± 2 kcal mol−1 when obtained with the G2 theory, against 19 systems when calculated with the G3(MP2)//B3-SBK theory.

According to the literature [4, 6, 42], obtaining thermochemical or spectroscopic properties of compounds containing fluorine atoms usually exhibits unusual behavior and it is still a challenge to predict these properties with high accuracy, even with the use of composite methods. Even the G3(MP2)//B3-SBK method with the scaling of experimental atomization energies still showed high deviations for enthalpies of formation C2F4 (4.6 kcal mol−1), PF3 (− 7.0 kcal mol−1), SF6 (− 8.7 kcal mol−1), ClFO3 (− 18.7 kcal mol−1), and ionization energies B2F4 (8.7 kcal mol−1), BF3 (− 5.2 kcal mol−1), and CH3F (− 5.0 kcal mol−1) of fluorinated molecules. In this work, high deviations respectively with the G2 and G3(MP2)//B3-SBK theories were also observed for the C2H3F2I (5.6 and 4.3 kcal mol−1), C2H2F3I (6.9 and 4.8 kcal mol−1), CF3I (4.9 and 3.3 kcal mol−1), F5I (5.1 and 17.0 kcal mol−1), and F7I (20.8 and 25.9 kcal mol−1) systems. Due to the low uncertainty associated with experimental measurements, for example ΔH0f (exp): C2H2F3I =  − 155.0 ± 0.9 kcal mol−1, ΔH0f (exp): CF3I =  − 140.8 ± 0.1 kcal mol−1 [39, 40], ΔH0f (exp) F5I =  − 200.8 ± 0.4 kcal mol−1 [41], ΔH0f (exp): F7I =  − 229.7 ± 0.5 kcal mol−1 [41], higher level calculations associated with the use of more extended basis sets are welcome in an attempt to better reproduce the experimental behavior of these molecules. However, the high computational cost often limits the use of even more rigorous composite methods. In addition, more than one experimental information can often be found for the same system, making it difficult to choose a reference standard (for example: most recent measurement, measurement with lower uncertainty, measurement leading to a lower calculated error, among others). In this work, the measure associated with the lowest experimental uncertainty was chosen as the reference standard.

Recently, from a modification of the G4MP2 theory [43], E. Sookhaki and M. Namazian developed the G4MP2-ECP method [44] specifically to allow calculations involving the iodine atom. Among the various tested properties, they evaluated the performance of the G4MP2-ECP method for standard enthalpy of formation of a set of 19 iodinated organic compounds. Table 3 shows the performance of G2 and G3(MP2)//B3-SBK theories compared to experimental data and the G4MP2-ECP method. The best performance was achieved with the G2 theory (MAD = 0.8 kcal mol−1 and SD = 1.1 kcal mol−1), followed by the G3(MP2)//B3-SBK theory (MAD = 1.0 kcal mol−1 and SD = 1.4 kcal mol−1) and finally G4MP2-ECP (MAD = 1.6 kcal mol−1 and SD = 2.0 kcal mol−1).

Table 3 Standard enthalpy of formation (kcal mol−1) calculated with G4(MP2)-ECP [44], G2 and G3(MP2)//B3-SBK theories. The values in parenthesis are the deviations from experimental data

Regarding the functional/basis set combination (Appendices Tables 4 and 5), the smallest deviations were achieved by combining the different functionals to the CEP-31G(d) basis set, with M06-2X (MAD = 17.3 kcal mol−1 and SD = 23.0 kcal mol−1) as the functional with the best performance against the experimental results, followed by WB97XD (MAD = 20.1 kcal mol−1 and SD = 28.0 kcal mol−1) and B3LYP (MAD = 24.5 kcal mol−1 and SD = 29.9 kcal mol−1). Functional M06-2X stands out above all for the low deviations achieved in fluorinated systems such as C6F5I (7.8 kcal mol−1), F5I (− 6.4 kcal mol−1), F7I (− 20.8 kcal mol−1), and FI (0.7 kcal mol−1) compared to B3LYP and WB97XD functionals. It is also noteworthy that the M06-2X and WB97XD functionals combined with the CEP-31G(d) set produced the smallest deviation (only, − 5.1 kcal mol−1), among all the tested methodologies, for the C2H3IN2O4 system. This evaluation is particularly useful as it can direct and drive the development of an alternative composite method using the M06-2X functional combined with the CEP-31G(d) set.

Regarding the combination of the MP2 method (Appendix Table 5), with the four tested basis sets, the smallest deviations (MAD = 12.9 kcal mol−1 and SD = 18.7 kcal mol−1) were achieved when the 6-311G(d,p) set was used. This is mainly due to low errors, including for (INO), (CIN3O6), and (C2H3IN2O4) systems that showed a divergence of only − 6.8; 4.8 and 9.0 kcal mol−1 concerning the experimental data, respectively. However, systems like (F5I) e (F7I) still showed high deviations regarding the experimental data, − 50.5 and − 57.0 kcal mol−1 respectively. The second smallest deviation (MAD = 21.0 kcal mol−1 and SD = 25.5 kcal mol−1) was achieved by combining MP2/jorge-ADZP, followed by MP2/CEP-31G(d) (MAD = 25.7 kcal mol−1 and SD = 29.8 kcal mol−1) and, finally, MP2/LANL2DZ (MAD = 53.6 kcal mol−1 and SD = 63.8 kcal mol−1).

Figure 1 shows the mean absolute deviation (MAD) for all methodologies tested considering the set of enthalpies of formation of the forty-one iodine compounds studied. Appendix Table 6 summarizes the energy of atoms for each methodology evaluated in this research.

Fig. 1
figure 1

Mean absolute deviation (MAD) for all methodologies tested considering the set of enthalpies of formation of the forty-one iodine compounds studied: a B3LYP, b M06-2X, c WB97XD, and d MP2

Conclusion

In this paper, the domain of application of the G3(MP2)//B3-SBK theory was extended to study the enthalpies of formation of 41 iodine compounds. For the first time for this method and for the iodine element, was added a polarization function in the CEP–31G basis set, and adjusting its exponent, the value of 0.270 was obtained, which is in good agreement with other corresponding values found in the literature. The G3(MP2)//B3-SBK method showed a mean absolute deviation equal to 3.7 kcal mol−1 and a standard deviation equal to 6.0 kcal mol−1, while the G2 method presented values of 3.1 kcal mol−1 and 4.9 kcal mol−1 respectively. The deviations obtained with the G3(MP2)//B3-SBK theory are greater when compared to those obtained with the G2 theory, but achieving the G3(MP2)//B3-SBK energy involves fewer steps in addition to the smaller computational cost. There is a balance in the individual performance between these two composite methods, there are 17 systems that present errors (regarding the experimental data) outside the range of ± 2 kcal mol−1 when obtained with the G2 theory, against 19 systems when calculated with the G3(MP2)//B3-SBK theory.

Regarding the standard enthalpy of formation of the nineteen iodized organic compounds set, previously studied by E. Sookhaki and M. Namazian with the G4MP2-ECP method (MAD = 1.6 kcal mol−1 and SD = 2.0 kcal mol−1), it was found that the G2 (MAD = 0.8 kcal mol−1 and SD = 1.1 kcal mol−1) and G3(MP2)//B3-SBK (MAD = 1.0 kcal mol−1 and SD = 1.4 kcal mol−1) theories showed smallest deviations with respect to the experimental data.

Concerning the performed additional tests combining different functionals B3LYP, M06-2X, WB97XD, and the MP2 method with different basis sets 6-311G(d,p), LANL2DZ, jorge-ADZP, and CEP-31G(d), the best combination functional/basis set was obtained with M06-2X/CEP-31G(d) producing a MAD = 17.3 kcal mol−1 and SD = 23.0 kcal mol−1. The best performance with the MP2 method was achieved with Pople 6-311G(d,p) basis generating a MAD = 12.9 kcal mol−1 and SD = 18.7 kcal mol−1. These additional tests mainly serve to reaffirm the efficiency of the composite methods in the accurate determination of thermochemical properties, including now the case of the standard enthalpy of formation of iodine compounds.