1 Introduction

In the last decade, the interest in photophysics and photochemistry processes has dramatically grown [2]. To date, fluorescence techniques exhibit several industrial and biomedical applications, such as photocatalytic [2], photoinduced isomerization of cis–trans [3], white organic light emitted dye [4] and can be used as fluorescent probes for image diagnosis [5]. This outlook is currently also reinforced by the large number of fluorescent compounds with biological and technological interest. In this context, one important class of compounds that exhibit fluorescent properties are the naphthoquinone derivatives [6]. These compounds of natural origin can be considered antitumor, antifungal, and antimalarial drugs [7]. Naphthoquinone derivatives have also been tested by many research groups [6, 8] as fluorescent probes for breast and buccal cancer with promising results [6, 9].

It is well known that structural modifications, such as a halogen addition, of potential fluorescent probe candidates can provide dramatic changes in their spectroscopic properties, exhibiting new absorption and emission regions, which can be appreciated for the generation of more efficient fluorescent probes [10]. In this context, halogen derivatives of naphthoquinones were studied by fluorescent spectroscopy [11], Raman [12] and UV–VIS techniques [3] for evaluation of their absorption and emission parameters [11]. Recently, theoretical calculations [13] have also been employed for the study of the Homo–Lumo barrier and other electronic properties [11] of this class of compounds.

Nevertheless, one of the greatest challenges is to investigate the spectroscopic parameters of halogen derivatives due to several electronic and structural effects that can modulate the excited-state properties of those compounds [11]. Despite several efforts, surprisingly little detailed theoretical work has appeared on the analysis of the different fluorescence mechanisms for naphthoquinone derivatives as well as the development of computational strategies, including relativistic effects, able to accurately reproduce spectroscopic parameters of halogen derivatives.

Different theoretical methodologies such as the complete active space self-consistent field (CASSCF), complete active space with second-order perturbation theory (CASPT2), multireference methods and TDDFT techniques can be used for analysis of emission and absorption parameters [3, 14]. It should be kept in mind, however, that the use of time dependent density functional theory (TD-DFT) has provided a promising and accurate alternative for the study of spectroscopic properties, such as vertical energy values, fluorescent parameters or excitation properties of molecules in condensed phase [1517].

In this context, the accurate description of electronic properties of each atom has great impact on the spectroscopic parameters of the molecule [18]. Among the various parameters and effects that can modulate the electronic properties of heavy atoms, relativistic effects play a crucial role on the spectroscopic properties [18]. However, these effects are complicated to model and generally neglected. In fact, various aspects of the emission and absorption process of organic compounds containing bromide and iodide atoms reveal a great influence of relativistic effects [19].

Currently, relativistic effects are in general employed for theoretical calculations of gold [19], platinum [20], and gadolinium [21] atoms as well as metallic complexes with heavy atoms. Even though this kind of effect in organic systems has been widely neglected spin–orbit coupling, mass/velocity and Darwin effects can strongly influence spectroscopic properties, such as emission and absorption parameters of organic molecules in condensed phase [19, 22].

One way of including the relativistic effects is to apply the relativistic Hamiltonian, which involves the resolution of Dirac’s equation [23, 24]. The Douglas–Kroll–Hess (DKH) approximation, which transforms the Dirac’s Hamiltonian of four components into two components, is one of the most used methods to access the relativistic effects [25]. Another method extensively cited in the literature is the zero-order of regular approximation (ZORA) [26], employing the Pauli’s approximation for the resolution of Dirac’s equation for chemical compounds [27, 28]. This method points out an alternative to solve the Dirac’s equation for the non-relativistic limit and ignores the contributions of the first order within the Pauli method [25, 29].

Besides modifying the Hamiltonian [30] of the system to take basis sets that include relativistic correction is another way of introducing the relativistic effects in the calculations [28]. The basic sets simulate the relativistic corrections implemented by the Dirac equation, by incorporating the relativistic correction for electronic properties. Many research groups have developed basis sets that have generated similar results to the ZORA [31] and DKH approximations [32]. For instance, the Ahlrichs TZV basis set [3133] has some basis set with relativistic approximation, like TZV_DKH or TZV_ZORA [32]. Finally, relativistic effects can also be included into the system by the introduction of spin–orbit coupling. The spin–orbit coupling is related to the splitting of the energy levels for the heavy atoms. In many cases, this splitting can influence electronic properties of the compounds [34]. The spin–orbit coupling can be investigated using approaches, such as effective nuclear charge (SOC-ENC) or effective nuclear charge with mean-field approach (SOC-ENC-MFA) [32, 35, 36].

Since many theoretical strategies exist to incorporate the relativistic effects on the electronic structure calculation, to select one specific approach is a difficult task and a critical evaluation is necessary. This outlook is aggravated in the case of fluorescent probes that contain heavy atoms, because the impact of relativistic effects on spectroscopic properties of those organic molecules has been little explored in literature and can be significant.

Keeping this in view, the Hamiltonian used, the incorporation or not of spin–orbit coupling and relativistic corrections for basis sets are the three main parameters in relativistic calculations, which in turn can provide vital clues about the type of modifications that might be desirable to increase the accuracy.

Chemometric techniques such as the experimental design or factorial model can assist in the choosing the best conditions and parameter calculation evaluations [37]. Actually, the factorial model is a statistical approach involving the building of the factor combinations selecting those that are significant from the parameters under study [38, 39]. The factorial model can be modeled in different ways employing two or three levels for each parameter (2K, 3K…), and in each level employing two (22, 23,…), three (32, 33,…) or a larger number of parameters in order to obtain the correct interpretation of the results [40].

In the present work, the 33 factorial model, the Hamiltonian, the spin–orbit coupling and the basis sets including relativistic corrections have been applied for the evaluation of the relativistic methods effect on the absorption process of halogen derivatives of 2-amino-1,4-naphthoquinones. The energy barriers involved in the excited-state intramolecular proton transfer (ESIPT) process for these halogen derivatives employing the selected methodology were evaluated afterward.

2 Computational details

The theoretical calculations were performed in the ORCA program [41]. The calculations for the ground state (S 0) and eight excited states were carried out at DFT and TDDFT level of theory using the B3LYP functional [42], respectively. The basis set employed in the study is described in Table 1 as the parameter B. The halogen derivatives for the 2-amino-3-halo-1,4-naphthoquinone were evaluated for the four halogens, F, Cl, Br and I [43] according to Fig. 1. In order to take into account environment effect methanol was taken, as there are experimental results available for this solvent [43, 44]. The COSMO approximation [11, 28, 35] was used in the absorption and fluorescence properties calculations. The Tamm–Dancoff approximation (TDA) [45] method was employed for simulation of the relaxation effect of the solvent in the excited state within the TD-DFT method.

Table 1 Levels and parameters for the 33 factorial model
Fig. 1
figure 1

Chemical structures of 3-halogen derivatives of 2-amino-1,4-naphthoquinone: a AFNQ, b ACNQ, c ABNQ and d AINQ

The factorial model calculations were performed in the Statistica® program [46]. The Factorial model employed the 3K method called the Box–Behnken design [40]. The parameters and levels of the model are reported in Table 1. The first factorial model parameter was the presence of the spin–orbit coupling, in which we evaluate the influence of the spin–orbit coupling on the system, by means of the method spin–orbit coupling effective charge nuclear (SOC-ECN) [41] as well as employing the most complete method spin–orbit coupling effective charge nuclear with mean-field approach (SOC-ECN-MFA) [41] and finally removing the SOC effect. The second parameter in the factorial model was the basis set, and its levels were a non-relativistic basis set was employed (TVZ) [31, 32], the basis set with ZORA implementation (TVZ_ZORA) [41] was used as a second level of a basis set and the third level was a DKH basis set (TVZ_DKH) [41] implemented in the ORCA program. The third parameter of the factorial model was the Hamiltonian of the system; thus, the non-relativistic Hamiltonian was the first level, the ZORA Hamiltonian the second and the DKH Hamiltonian was established as the third level [47]. The Box–Behnken design [40, 48, 49] was developed as the twenty-seven (27) cases involving the three parameters and the three levels, which are reported in Table 2. In all the cases, the parameters were evaluated using the ANOVA analysis [50, 51] (Supplementary Material Table S2A), and the acceptance criteria was the Student’s t test [3840].

Table 2 33 Factorial model listing the levels for the twenty-seven cases and the results (wavelength in nm) for the four compounds

The study of the ESIPT process was based on the absorption energy (E 1), the proton transfer energy (E 2), as well as the difference in the excited-state (S 1) energies between the keto and enol forms of the compounds. Finally, the fluorescence energy (F 2 or F 1), see Fig. 2, for the energy difference between the excited state (S 1) and ground state (S 0) was also evaluated. Those calculations were carried out at the DFT and TD-DFT levels for ground and excited states, respectively.

Fig. 2
figure 2

Energy diagram of the excited-state intramolecular proton transfer (ESIPT) for the four halogen derivative compounds for the 2-amino-1,4-naphthoquinone: a the diagram for the ANQ and AFNQ; b the diagram for the ACNQ, ABNQ and AINQ

3 Results and discussion

3.1 The Box–Behnken design

To our knowledge, there are only two compounds, 2-amino-3-chloro-1,4-naphthoquinone (ACNQ) [44], and 2-amino-3-Bromo-1,4-naphthoquinone (ABNQ) [43], for which experimental values of absorption have been reported, which can be used to develop our statistical model based on the Box–Behnken design [40]. Thus, the other two compounds used in this paper, 2-amino-3-fluor-1,4-naphthoquinone (AFNQ) and 2-amino-3-Iodo-1,4-naphthoquinone (AINQ), will have their energy barriers for the ESIPT process theoretically estimated. The Box–Behnken design for the four compounds, see Fig. 1, was evaluated according to the combination of the levels reported in Table 1, whose results are described in Table 2. Through the factorial model, three parameters have been investigated: (1) spin–orbit coupling (parameter A), (2) the basis set used (parameter B), and (3) the relativistic correction for the Hamiltonian of the system (parameter C). The lower level of each parameter was estimated without relativistic effects (no spin–orbit coupling, no relativistic basis set and no relativistic Hamiltonian), the medium level includes spin–orbit coupling SOC-ECN, ZORA basis set and ZORA Hamiltonian and at the high level spin–orbit coupling SOC-ENC-MFA, DKH basis set and DKH2 Hamiltonian. These levels are called levels −1, 0 and +1 on Tables 1 and 2, respectively. It is important to keep in mind that the factorial model 33 was developed with twenty-seven combinations from the three levels with three parameters and their interactions. The TD-DFT method and the COSMO model were employed to calculate the absorption and emission values for the four compounds as well as the values used for the statistical calculations. The selected model for the system was the quadratic model, and the significant effects selected were evaluated for statistical analyses, such as variance analysis (ANOVA), see Supplementary Material Table S2A. It is well known that with this model can get a better description of the interactions and the significance for the studied parameters [38].

3.2 The factorial model for the ACNQ and ABNQ

The experimental values of the absorption wavelength for ACNQ and ABQN are 476 nm [44] and 452 nm [43], respectively. The construction of the factorial design was performed using the error values from the difference between the experimental and theoretical values. According to ANOVA analysis [10, 52] reported in Supplementary Material (Table S2A), only the B and C parameters were significant for the factorial model for ACNQ, i.e., the basis set and Hamiltonian.

The variance analysis (ANOVA) is a statistical model used to employ the differences among group means and their associated methods [46, 49, 53]. In the ANOVA setting, the variance in a variable is divided into parts attributable to different fonts of variation. In other words, ANOVA can be useful for comparing (testing) three or more means (groups or variables) for statistical significance. In this context, Eqs. 1 and 2 represent statistical models to describe the dependence of a factor on the parameters A, B and C.

$$R = 9.35 - 0.23B - 0.08B^{2} + 0.10C - 0.18C^{2} + 0.15B*C$$
(1)
$$R = 5.20 - 0.70B - 0.95B^{2} - 3.53C + 1.70C^{2} - 2.85B*C$$
(2)

The value 9.35 is the average population of all factorial design responses in Eq. 1 and 5.20 is the value of the average population in Eq. 2. For the B parameter, when the nonrelativistic basis set (TVZ) is modified to the relativistic basis set (TVZ_DKH2), the error value decreases around 0.46 % according to Eq. 1. For the C parameter, the error value increases 0.10 % with the change of the Hamiltonian of the system ongoing from the nonrelativistic to the relativistic Hamiltonian (ZORA or DKH2). The factorial model equation exhibits significant value for the interaction between B and C parameters for ACNQ. An increase in the error value to 0.15 % when the lower level is modified to the high level was observed.

The factorial model for ABNQ showed that only the C parameter was significant. According to ANOVA analysis (see Supplementary Material (Table S2A)) when the level from lower to the higher was modified, the error value decreases 3.53 %, see Table 3. This feature showed that the relativistic Hamiltonian was essential for the absorption calculation for the bromide derivatives. Interestingly, the relativistic effects were more significant for the bromide derivatives than chloride derivatives. The B parameter was not relevant for the factorial model analysis for ABNQ compounds. However, it was important for the interaction between B and C parameters. Going on from the lowest level (−1) to the highest (+1), the error value decreases 2.85 % as reported in Table 3, indicating that the interaction was significant for the absorption calculation for ABNQ and ACNQ compounds. The equation generated for the ABNQ factorial model is described in Eq. 2. The Pareto graph shown in Supplementary Material Figure S1 exhibits the significant parameters according to Student’s t test. The analysis of those graphics indicates that in all cases the B and C parameters are always significant and for two compounds (ABNQ and AINQ), the interaction between parameter B and C was significant as well. The factorial model generated optimized values for the parameter B (TVZ_DKH) and C (DKH2 Hamiltonian). The parameters described will be used for the wavelength and the energy calculation of the ESIPT process in the third step.

Table 3 Estimate of the effects, standard error (Std.Err.), the t analysis [t(17)], and the coefficient limit for the 33 factorial model for the four compounds

3.3 The factorial model for AFNQ and AINQ

So far, experimental results for AFNQ and AINQ have still not been reported, for this reason, the factorial model used in these cases was developed in comparison with ACNQ and ABNQ. For AFNQ, the ANOVA analysis (in Supplementary Material Table S2A) showed that the B parameter is significant for the absorption calculation, but the C parameter is not. This feature indicated that the relativistic effects were not important for the fluoride derivatives. In fact, since fluoride is a small atom, the relativistic effects can be neglected for fluoride derivate compounds. Surprisingly, the C2 quadratic parameter is not neglected in this model. Comparing to the other two models, the relativistic contribution was important to the B parameter. In this case, ongoing from the lower level to the higher level, the wavelength value increases 5.02 %, (see Table 3). The B2 and C2 quadratic values increase the wavelength value of 4.83 and 2.90 %, respectively. Equation 3 shows the factorial model for AFNQ.

Just as the Eqs. 1 and 2, Eqs. 3 and 4 represent statistical models to describe the dependence of a factor on the parameters A, B and C.

$$R = 488.03 + 0.30B + 0.15B^{2} + 0.25C^{2}$$
(3)
$$R = 461.58 - 1.16C - 1.08C^{2} - 2.35B*C$$
(4)

In the factorial model described by Eq. 3, the interaction between B and C is not significant for the absorption calculation. This feature implies that only the isolated parameter was significant for the AFNQ compound. However, for other three models, the interaction between B and C parameters was significant.

For AINQ, for which experimental values are not available, the ANOVA analysis showed that the C parameter is significant according to Eq. 4. Ongoing from the lower level to the medium or higher level, the wavelength decreases 1.16 %. Comparatively, the ABNQ factorial model had a decrease in the wavelength value when going from the lower to the higher level, showing less error for ABNQ. In this way, the AINQ factorial model showed a lesser value for the wavelength and for comparison the lesser error when the DKH2 Hamiltonian is used. The bromide, chloride and iodine atoms have high influence on the relativistic Hamiltonian [35, 54]. The factorial model reinforced this finding. It is important note that the statistical models did not show that the basis set has a significant value for the three compounds; actually, only fluoride was significant. Therefore, the interaction between B and C was significant for ABNQ and AINQ. The interaction between B and C parameters decreased the wavelength values 2.85 and 2.35 %, respectively. ABNQ exhibits the lowest error values ongoing from the lower to the higher level. AINQ displays comparatively the same behavior. In this way, when the level goes from lower to the medium level, the AINQ factorial model decreases 2.35 %. In the other hand, going from the medium to the higher level, the value decreases another 2.35 %. All ANOVA analyses are shown in Supplementary Material Table S2A for the four compounds regarding the 33 factorial model.

3.4 Theoretical calculations for the ESIPT process

The factorial model confirmed that the ideal parameters for spectroscopic properties are the DKH2 Hamiltonian calculation and the relativistic basis set (TVZ_DKH). The spin–orbit coupling is not required. In this regard, we have applied this selected methodology and investigated the ESIPT process for all previous studied compounds and their halogen derivatives. The energy analysis considers three different energies: (1) the energy required for the keto form to reach the excited state, the absorption energy (E 1); (2) the energy needed for the conversion from the keto to enol form of the compounds in the excited state, the proton transfer energy (E 2); (3) the energy emitted when the enol form decays from the excited state to the ground state, emission (fluorescence) energy (F 2). The energy values are reported in Table 4. When the E 2 energy was positive, the enol form was more energetic than the keto form and the ESIPT process cannot occur. In this way, the compounds ACNQ, ABNQ and AINQ cannot exhibit the ESIPT process and the fluorescence occurs by the decaying of the keto form, emitting F1 energy. But, when the E 2 energy is negative or close to zero, the ESIPT process can occur emitting F 2 energy, the emission energy from the enol form is observed for ANQ and AFNQ. The two possibilities for the emission are showed in Fig. 2 where the first diagram shows the ESIPT process in which the keto form in the excited state is converted to an enol form before decaying to the ground state. In the second diagram, the conversion was not possible and the emission occurs in the keto form.

Table 4 Absorption values (E 1), proton transfer (E 2) and emission (F 1 and F 2) energy, fluorescence wavelength (λ 1) and emission color for the four compounds

The absorption wavelength calculated showed lower error for ABNQ compared to ANQ, this feature indicated that the model is more accurate for the heavy atoms. The emission wavelength values (λ1) observed refers to emission energy (F1 or F 2, respectively) of each compound, and the value decreases with the increase in substituent mass, see Table 4. These results provide evidence that the increase in mass dispersed more energy into the environment promoting a bathochromic shift. The heavy atoms promoted better dispersion of the electronic density, and this decreases the possibility of the ESIPT process amino group less susceptible to transfer the proton. ANQ absorbed at 445 nm (the experimental value for ANQ is 471 nm) emitting a red color at 663.4 nm. AFNQ absorbed at 488 nm and ACNQ absorbed at 467 nm (the experimental value for ACNQ is 476 nm [44]). The blue emission was observed for ABNQ, which absorbed at 453 nm (the experimental value for ABNQ is 452 nm [43]) and AINQ absorbed at 460 nm.

4 Conclusion

The 33 factorial model used for the study of the influence of the relativistic effects on spectroscopic parameters showed adequate for the halogen derivatives of ANQ. The experimental design was adequate to evaluate the best theoretical strategy to be employed in the investigation of relativistic effects on spectroscopic properties. The model pointed out that DKH2, the relativistic basis set, as well as the interaction between them, are important parameters for the study. The study showed that the presence of the halogen atoms Cl, Br and I does not favor the ESIPT process. The two compounds exhibit the ESIPT process, ANQ and AFNQ, emitting the red and violet color, respectively. Other compounds do not exhibit ESIPT process, but emit violet (ACNQ) and blue color (ABNQ and AINQ) fluorescence for the keto decay. We believe that the current study can assist in understanding the ESIPT behavior of ANQ derivatives and why the relativistic effects affect the process, as well as what theoretical strategy for the theoretical calculations of this class of compounds is the most appropriate.