Introduction

Coumarine (2H-1-benzopyran-2-one) is a naturally occurring substance found in essential oils [1], sweet clover, and woodruff of cassia [2]. The significant amounts can enter human body through dietary exposure, because these compounds are found in fruits [3], vegetables [4, 5], seeds [6], trees [7], coffee, and vine [8]. The function of coumarins in plants is still a matter of dispute, although it is suggested that they can be involved in growth regulation, photosynthesis, and control of respiration [9], or behave as fungistats and bacteriostats [10].

The structure of coumarine (Fig. 1) contains fused pyrone and benzene rings along with the carboxyl group on the pyrone ring. The large group of coumarine derivatives can be divided into four sections: simple coumarins, pyranocoumarins, furanocoumarins, and the pyrone-substituted coumarins [8]. There are numerous techniques for the synthesis of the novel coumarine derivatives, that have been found to be applicable in medicine and pharmacy [11]. Coumarine derivatives can be used as antibreast cancer substances [11, 12], anti-HIV agents [6, 7], neuroprotectors, antivirals, anticonvulsants, antioxidants [9, 13, 14], antifungals [15], antibacterials [13], and antimutagens [16]. The most important use of coumarine derivatives in clinical practice is their application as anticoagulant agents [17, 18]. The application of coumarine and its derivatives also includes fluorescent probes [19,20,21], dyes [22, 23], and food additives [24]. There are also negative effects reported for the consumption of these compounds, including toxic and carcinogenic behavior [25, 26].

Fig. 1
figure 1

Structure of 2H-1-benzopyran-2-one

Various computational methods have been employed for the investigation of the activity of coumarine and its derivatives. Quantitative structure–activity relationship (QSAR) method is very popular for the elucidation of the structure–activity relationship towards different radicals [27,28,29]. QSAR and density functional theory (DFT) was used on the series of 4-hydroxychromene-2-one derivatives to evaluate their antiradical activity [30]. Molecular docking studies have proved as very important for the interactions of coumarine derivatives with biologically important proteins [12, 31,32,33,34].

Two cartilage proteins, fibrillin and chondroadherin were chosen for molecular docking study because of their biological importance. Fibrillin appears in cartilage as early as 20th week of fetal gestation. The loose bundles of microfibrils are formed until early adolescence, after which broad banded fibers can be found accumulated pericellulary within cartilage. These fibers can be extracted using dissociative conditions and it has been postulated that they are laterally packed and crosslinked microfibrils. The growth-regulating function of Fib-1 may reside persistently within the perichondrium. The accumulation of special laterally crosslinked Fib-1 microfibrils around chondrocytes during late adolescence led to the conclusion that growth-regulating activities may also be performed by Fib-1 at these sites [35]. The other docked protein, chondroadherin is a cartilage matrix protein. It is thought that this protein mediates adhesion of isolated chondrocytes. Chondroadherin is consisting of 11 leucine-rich repeats flanked by cysteine-rich regions. The RNA messenger for chondroadherin messenger RNA is present in chondrocytes at all ages. Further research is needed for determination which of its structural features responsible for cell binding and matrix-binding properties. The interactions of this protein with matrix components in vivo also require further investigation. The role in mediating cell–matrix interactions makes chondroadherin a potentially important regulator of chondrocyte organization within cartilage. Therefore, it can be speculated that abundance or deficiency of this protein cartilage function [36].

In this contribution, the novel coumarine derivate 3-[1-(3-hydroxypropylamino)ethylidene]chroman-2,4-dione (1), is investigated for the reactivity toward cartilages proteins by the means of the Natural Bond Orbital (NBO), Quantum Theory of Atoms in Molecules (QTAIM), and Molecular Docking analysis. The reactivity indices for specific atoms are also discussed.

Results and discussion

The optimized geometries in water and benzene of 1 are given in Fig. 2 (coordinates given in Tables S1 and S2; supplementary material). From the structural point of view there are no significant differences in these two geometries in solvents. The coumarine part and double bond C=N are planar. This is expected because the starting structure is very rigid. The optimized structure in water is more stable by 10.25 kJ mol−1 when energies with zero point corrections are compared. The enthalpy and Gibbs free energy in water is lower for 10.79 and 5.40 kJ mol−1, respectively. The energy difference is attributed to the interaction of polar bonds (N–H and O–H) with the polarizable continuum of solvent.

Fig. 2
figure 2

Optimized geometries of the coumarine-derived 1 obtained at the B3LYP-D3BJ/6–311+G(d,p) level of theory, in water (a) and benzene (b)

The HOMO–LUMO gap [(ΔE), Fig. 3] determines chemical reactivity and molecular stability. ΔE is directly related to the easiness of electron excitation of investigated molecule. Data from Fig. 3 for ΔE suggest that 1 has higher chemical activity in benzene than in water, since it has the slightly lower value of the energy gap (4.56 eV compared to 4.63 eV). From the visualized orbitals, it is possible to conclude that the orbitals are localized on the polar bonds and electronegative atoms in molecule. This leads to the higher stabilization of the orbitals as reflected in their energy. When structures in water and benzene are compared, the HOMO orbital is more stabilized in polar solvent than LUMO orbital, 0.15 and 0.08 eV, respectively, therefore the energy difference reflects the variation in electron density for specific orbitals with solvent polarity.

Fig. 3
figure 3

Presentation of the energy levels, HOMO–LUMO gap and orbital composition distribution of the HOMO and LUMO for 1

The overall reactivity of the molecule can be complemented with the reactivity of the specific sites for different types of reactions. To evaluate the condensed Fukui functions, single point calculations have been performed for the systems N (neutral molecule), N1 (radical cation), and N + 1 (radical anion) electrons, over the optimized neutral geometries using the unrestricted B3LYP formalism for radical species, and restricted B3LYP for neutral molecule in both solvents. Therefore, charge of 0, +1, and −1 is distributed over all of the atoms based on their electronegativity and position in molecule. The condensed Fukui functions were estimated from the NBO and AIM charges employing the Eq. (5). These two methods are often used for the calculation of Fukui functions (f) [37]. Table 1 gives the most probable sites for the electrophilic (f ), nucleophilic (f +), and radical attack (f 0) in water and benzene. The complete list of values for CFF in two solvents is given in the supplementary material. The numeration follows the positions from Fig. 2.

Table 1 The condensed Fukui function for electrophilic attack (f ), nucleophilic attack (f +) and radical attack (f 0) for ligand 1 at B3LYP-D3BJ/6–311+G(d,p) level of theory, in water and benzene, respectively, calculated for NBO and AIM

The highest value of f should mark the most probable site for the selected type of attack. Calculated values for the Fukui functions f, when two different methods are used, show notable differences in value. This is expected because the charges are obtained using different approaches. Generally speaking higher values for Natural charges are obtained, while values for AIM charges are lower. This is unexpected result, because the values of AIM charges are usually overestimated [38]. The authors have to note that the calculated values are dependent on the functional and basis set used, especially when NBO charges are concerned [39]. What is important to observe is that the atoms for electrophilic, nucleophilic, and radical attack are the same, with few exceptions, for both methods. The relative order shows resemblance but cannot be taken as absolute. The following discussion is based on the comparison of the four atoms with highest values of f in each column (also made bold in Table 1). The C4, C1′, O4, and C7 are the most reactive atoms for electrophilic attack in both solvents, with the values that are very similar. The NBO predicts that the C4 atom is the most reactive site in water while AIM predicts O4. On the other hand C1′ and O4 are the most reactive centers in benzene by NBO and AIM charges, respectively.

As for the nucleophilic attack, on the basis of the value of the NBO charges, three positions favored are C3, O2, and N1 in both solvents, while values of CFF calculated by AIM approach predict C3, O2, and C4 and C3, O2, and N1 in water and benzene, respectively. According to the NBO values C3 is the most reactive atom, while AIM predict O2 as the most reactive position in both solvents. It should be noted that the value of Fukui function on C3 (0.314 and 0.254) is significantly higher than for the other atoms thus proving the high reactivity in both solvents. The Fukui parameters calculated by NBO for the free-radical attack are within 0.5 units which proves the competitiveness of various positions, although C3 can be considered as the most reactive one. On the other hand, the values obtained using AIM charges predict that the most reactive atoms for the reaction with free-radical are carbonyl groups. The O2 and O4 are the most reactive in water while O2 is in benzene. It is obvious that the NBO determines the most reactive positions for BDE and/or SPLET mechanisms, while AIM predicts most reactive positions for RAF mechanism.

When different solvents are investigated and CFFs compared, the values of f depend on the polarity through the separation of charges. In the most cases the values of Fukui functions are lower in benzene. The reactive sites are the same as mentioned earlier, which shows that these predictions are not solvent-dependent. There are just few examples when the four most probable positions are not the same for two solvents—for example, C3 in water is excluded in benzene, while N1 is the group of possible positions for radical attack in benzene. The values of f functions between C3 and N1 for radical attack are not significantly different; therefore, it is ambiguous to distinguish which one of them is more probable position for attack. The influence of solvent polarity is more prominent for the nucleophilic attack then for electrophilic, which is in good accordance with the findings from the ref. [40].

As it can be observed in Tables 2 and S3–S4 (Supplementary material), some of the CFF have negative values. In the paper by Bultinck and coworkers [41] it is discussed that the negative values of CFF might indicate the short inter-atomic distances between atoms inside molecule. The values of CFF depend on the distance between atoms, as shown by Senthilkumar and Kolandaivel [38], and they tend towards negative as distance decreases.

Table 2 Estimated free energy of binding (ΔG bind/kJ mol−1), estimated inhibition constant (K i /μM) of different poses of ligand against fibrillin protein

To evaluate the inhibitory nature of 1 against human cartilage proteins (fibrillin and chondroadherin) molecular docking was performed. Protein–ligand binding energy and identification of the potential ligand binding sites were determined from this study as well. The ligand conformation which showed the lowest binding energy (best position) was determined based on the ligand docking results. The position and orientation of ligand inside protein receptor and the interactions with amino acids which bound to the ligand were analyzed and visualized with Discovery Studio 4.0 and AutoDockTools.

In Tables 2 and S5 (Supplementary material) values of estimated free energy of binding and inhibition constant (K i ) for the investigated ligand in ten different conformations are given. Lower value of K i indicates better inhibition.

The lowest values of ΔG bind and K i are found for conformation 1 (Table 2; Table S5). Analyzing position of active amino acids, it can be concluded that ligand binds at the catalytic site of substrates by weak non-covalent interactions. The most prominent are H-bonds, alkyl–π, π–π and π–amide interactions. The atoms that interact with amino acids are O2, O4, and N1 as most potent sites for reaction proved by the Fukui indices. Glycine and aspartic acid in positions 15 and 110 in primary structures of chain A have predominant role as active sites of fibrillin and chondroadherin proteins for inhibition action, regardless of the conformation of investigated ligand (Fig. 4). GLY15 forms two H-bonds (2.07 and 2.32 Å length) with O–H and amino group of the ligand, while ASN14 forms H-bond of 2.90 Å length with C=O group of the ligand (Fig. 4). On other hand, ASP110 and HYS108 form hydrogen bonds of 2.12 and 2.30 Å length with O–H and C=O groups of ligand. LYS134 forms weak alkyl–π interaction with benzene ring of the ligand (Fig. 4). This analysis is good accordance with the previous discussion, proving that the Fukui parameters can be used for the prediction of the active sites for biologically active molecules.

Fig. 4
figure 4

Picture showing interaction between ligand (conformation 1, the lowest K i ) and amino acids in human cartilage proteins (left fibrillin and right chondroadherin)

Conclusion

In present work the reactivity parameters for molecule 3-[1-(3-hydroxypropylamino)ethylidene]chroman-2,4-dione (1) were calculated at the B3LYP-D3BJ/6–311+G(d,p) level of theory in the aqueous (polar solvent) and benzene (non-polar solvent) phases. The HOMO–LUMO gap (ΔE) suggested that 1 has higher chemical activity in benzene than in water, because it has slightly lower value in non-polar solvent.

The condensed Fukui indices were calculated for all atoms in molecule 1 by the means of Natural and Atoms in Molecules charges. Based on these parameters, the most probable reactions sites for nucleophilic, electrophilic, and radical attack were determined. The C4, C7, O4, and C1 atoms are favorable sites for electrophilic attack, while for nucleophilic attack the C3, O2, and N1 atoms are suitable. For free-radical attack, the C3, O2, and N1 atoms are possible positions for reaction.

The title ligand, according to the results of the molecular docking study, forms a stable complex with human cartilage proteins as evident from the binding energy (ΔG bind in kJ mol−1). The most important interactions are H-bonds, alkyl–π, π–π and π–amide. Atoms forming bonds are predicted as the most reactive sites by the Fukui indices. These preliminary results suggest that the investigated ligand 1 might exhibit inhibitory activity against fibrillin and chondroadherin proteins.

Methods

Chemical reactivity of molecule

The molecular structure and the electronic parameters, such as the energies of highest occupied molecular orbital (HOMO) and lowest unoccupied molecular orbital (LUMO) and the HOMO–LUMO gap (∆E = E LUMOE HOMO) can be obtained through theoretical calculations [42,43,44,45,46,47,48].

Fukui functions

Fukui function, proposed by Parr and Yang in 1984 [48,49,50], describes the differential change of the electron density induced by the change in total number of electrons. It is mathematically defined by the following equation:

$$f(r) = \left[ {\frac{\delta \mu }{\delta \upsilon (r)}} \right]_{N} = \left[ {\frac{\partial \rho (r)}{\partial N}} \right]_{\upsilon (r)}.$$
(1)

In the previous equation µ and N are the chemical potential and the number of electrons in the system, υ(r) is the external potential. The Fukui function is often calculated approximatively as follows:

$$\begin{aligned} f^{ - } (r) & = \rho_{N} (r) - \rho_{N - 1} (r) \approx \rho^{\text{HOMO}} (r) \\ f^{ + } (r) & = \rho_{N + 1} (r) - \rho_{N} (r) \approx \rho^{\text{LUMO}} (r) \\ f^{0} (r) & = [f^{ - } (r) + f^{ + } (r)]/2 \approx [\rho^{\text{HOMO}} (r) + \rho^{\text{LUMO}} (r)]/2. \\ \end{aligned}$$
(2)

The evaluation of these equations can be very complicated due to the electron density term [47]. In Eq. (2), the frozen-orbital approximation is assumed. ρ N (r), ρ N−1(r), and ρ N+1(r) represent separately the electron densities of the system with N (neutral molecule), N − 1 (radical cation), and N + 1 (radical anion) electrons. The values of ρ HOMO(r) and ρ LUMO(r) indicate the electron densities of the HOMO and LUMO, respectively.

If Eq. (2) is integrated for individual atoms in a molecule, one obtains the so-called condensed Fukui function (CFF) [51], which provide a more convenient way to predict the reaction site in a molecule. The definition of condensed Fukui function for an atom, noted as A, can be written as:

$$f_{\text{A}}^{ - } = p_{N}^{\text{A}} - p_{N - 1}^{\text{A}}$$
(3)
$$f_{\text{A}}^{\text{ + }} = p_{N + 1}^{\text{A}} - p_{N}^{\text{A}}$$
(4)

where \(p_{N}^{\text{A}}\) is the electron population number of atom A. Since atomic charge is defined as q A = Z Ap A, where Z is the charge of atomic nucleus, f and f + can be expressed as the difference of atomic charges in two states (note that the two Z terms are canceled). By analogous treatment, one can easily formulate the CFFs for an atom A in Eq. (5):

$$\begin{aligned} & {\text{Electrophilic attack}}{:}\quad f_{\text{A}}^{\text{ - }} = q_{N - 1}^{\text{A}} - q_{N}^{\text{A}} \\ & {\text{Nucleophilic attack}}{:}\quad f_{\text{A}}^{ + } = q_{N}^{\text{A}} - q_{N + 1}^{\text{A}} \\ & {\text{Radical attack}}{:}\quad f_{\text{A}}^{\text{0}} = [f_{\text{A}}^{{ + }} + f_{\text{A}}^{ - } ]/2 = [q_{N - 1}^{\text{A}} - q_{N + 1}^{\text{A}} ]/2 \\ \end{aligned}$$
(5)

where \(q_{N}^{\text{A}}\), \(q_{N - 1}^{\text{A}}\), and \(q_{N + 1}^{\text{A}}\) are the charges at atom A of the neutral, anionic, and cationic species, respectively [51].

The three formulas in Eq. (5) are exploited in this work to describe the electrophilic, nucleophilic and free-radical attack. In general, the larger the value of the condensed Fukui function at a reaction site, the greater the reactivity of that corresponding site towards reactive species. There are various methods for the calculation of the atomic charges, many of which were employed for the Fukui indices [51,52,53]. In this paper the atoms in molecules (AIM) and natural charges were used for calculations of the CFF.

Computational methods

The geometries of neutral molecules (Fig. 2) were optimized by the density functional theory (DFT) using global hybrid generalized gradient approximation (GGA) functional B3LYP [54, 55] with empirical dispersion corrections D3BJ (with Becke and Johnson damping) [54,55,56,57,58,59] in combination with the 6–311+G(d,p) basis set. Many authors employed the B3LYP-D3BJ functional successfully for description of similar systems [57,58,59]. B3LYP-D3BJ was selected as a widely applicable method that proved to describe inter-atomic interactions, at short and medium distances (≤5 Å), more accurately and reliably than traditional DFT methods. Hybrid GGA B3LYP-D3BJ includes an empirical correction term proposed by Grimme [57]. The geometry optimization was performed in Gaussian 09 program package [60]. The nature of the stationary points was determined by performing frequency analysis: the equilibrium geometries had no imaginary vibrations. The solvent effect was taken into account in geometry optimization and energy calculation using the SMD implicit solvation model [61]. The applied SMD continuum solvation model is based on interaction between the charge density of solute molecule with dielectric medium. The geometry was optimized in water (polar solvent) and benzene (non-polar solvent) to investigate the effect of solvent polarity on reactivity parameters. The NBO analysis was performed in the NBO 5.9 software [62, 63]. Quantum theory of atoms in molecules (QTAIM) calculations was done based on the wfn files produced by the Gaussian and analyzed in the Multiwfn Package [64] for the atomic charges. The analysis of atomic basins was performed at the high quality grid with spacing of 0.06 B. The molecular docking simulation was carried out using the AutoDock 4.0 software [65]. The three-dimensional crystal structures of human fibrillin and chondroadherin were obtained from the Protein Data Bank (PDB IDs: 2W86 and 5LFN) [66, 67]. The Discovery Studio 4.0 [68] was used for preparation of protein for docking by removing the co-crystallized ligand, water molecules and co-factors. To calculate Kollman charges and to add polar hydrogen the AutoDockTools (ADT) graphical user interface was used. Title molecule 1 was prepared for docking by minimizing its energy at B3LYP-D3BJ/6–311+G(d,p) level of theory. The flexibility of the ligands is considered, while the protein or biomolecules remained as rigid structure in the ADT. All bonds of 1 were set to be rotatable. The Geistenger method for calculation of partial charges was employed. All calculations for protein—ligand flexible docking were done using the Lamarckian genetic algorithm (LGA) method. The grid boxes with dimensions 17.442 × 28.297 × 58.056 of fibrillin and 7.214 × 59.888 × 95.047 of chondroadherin were used to cover the protein binding site and accommodate ligand to move freely. Inhibition potency of title molecule was investigated and discussed.