Introduction

Chalcones (or 1,3-diaryl-2-propen-1-one, IUPAC name) are obtained from either natural sources or synthetic procedures and correspond to the main group of bioactive compounds because these have several pharmacological applications, such as anti-inflammatory, anti-pyretic, anti-fungal, anti-bacterial, anti-mutagenic, anti-leishmanial, anti-cancer, anti-HIV, and anti-oxidant properties (Anto et al. 1994; Kamal et al. 2011; Luo et al. 2012; Pilatova et al. 2010; Alegaon et al. 2014; Zhuang et al. 2017). These compounds can be synthesized through the condensation of acetophenone (1) with benzaldehyde (2) in the presence of a base or an acid to form the respective chalcone (4) with high chemoselectivity (see Scheme 1). This reaction is generally known as Claisen–Schmidt condensation, and therefore, this a specific type of aldolic condensation and in the reaction mechanism, an intermediary product known as aldol (3) is formed. To obtain bioactive chalcone derivatives, usually alkaline conditions are more appropriated and efficient. (Akansksha, 2007; Bhagat et al. 2006; Jeon et al. 2012; Li et al. 2010; Liu et al. 2011; Narender and Reddy 2007; Patil and Bhanage 2013; Petrov et al. 2008; Sivakumar et al. 2001; Sugamoto et al. 2011; Narender et al. 2011). Likewise, it is well known that the substitution patterns of both 1 and/or 2 are important to achieve in a successful way this reaction, since which can influence the yield of this transformation.

Scheme 1
scheme 1

General Claisen–Schmidt reaction

On the other hand, several computational methodologies have been used to study the mechanism of reactions. Some analysis of structural relationships such as quantitative structure–property relationships or quantitative structure–reactivity relationships (QSPR and QSRR, respectively) have emerged as a useful tool to find relevant physicochemical descriptors to get some insight into the underlying mechanisms of reaction. These mathematical models are based on Hansch’s analysis and on the assumption that structurally similar compounds have similar activities or properties (Yee and Wei 2012). The QSPR and QSRR can also be used in the analysis of structural descriptors that capture different constitutional, topological, geometrical, or electronic features of molecular structures in relation to properties of interest (Duchowicz and Castro 2009; Yee and Wei 2012). These molecular descriptors can be readily calculated through mathematical formulae derived from several theories, e.g., quantum mechanics and other similar theories, which have been proven to function quite well for the wide spectra of properties/activities (Duchowicz and Castro 2009). In this context, density functional theory (DFT) is an important method in designing bioactive compounds, obtaining satisfactory results based on accuracy and reliability (Carloni et al. 2006). Furthermore, this methodology has been used in understanding chemical system reactivity (De Vleeschouwer et al. 2007; Moens et al. 2007). In fact, through DFT, it is possible to calculate different reactivity descriptors, such as electronic chemical potential (μ), Pearson’s hardness and softness (η and S, respectively), and global Electrophilicity Index (ω), all of which led to understanding reactivity in organic molecules (Chermette 1999; Li et al. 2010).

Herein, a quantitative structure–reactivity relationship (QSRR) model was developed to explain the different yields observed in Claisen–Schmidt condensation to obtain several chalcones, in alkaline conditions. The uses of this methodology should be useful for other similar reactions.

Experimental

General

Melting Point was measured in Fischer Scientific apparatus. Infrared spectra were recorded in Buck Scientific M500. Recorded range 600 cm−1–4000 cm−1, all samples were registered on ATR system (attenuated total reflectance). 1H NMR, 13C NMR, 2D-HSQC and 2D-HMBC spectra were recorded on a Bruker Avance 400 Digital NMR spectrometer, operating at 400.13 MHz for 1H and 100.6 MHz for 13C, respectively. Chemical shifts are reported in δ (ppm downfield from the TMS resonance) and coupling constants (J) are given in Hz. GC–MS was carried out Agilent Technologies 6890 model with automatic ALS, mass detector HP MD 5973 in spitless mode.

Chemicals

To a stirred solution of 4-methoxyacetophenone (5) (250 mg, 2.08 mmol) and the appropriate benzaldehydes (6a–w; 2.29 mmol) in EtOH (2.5 mL) was added saturated ethanolic NaOH (20 mL). The mixture was stirred for 48 h at room temperature. After reaction time, the reaction mixture was neutralized with HCl 5% solution until pH ≈ 7 and extracted with ethyl acetate (3 × 50 mL). Later, the organic phase was dried, concentrated and purified by flash column chromatography using EtOAc: Hexane mix (0:1–1:1) as eluent, obtaining the corresponding chalcones (7a–w).

Computational details

All compounds considered in this study (acetophenone (5), benzaldehydes (6a–w) and aldol-intermediate structures), were subjected to the following computational procedure. Full unconstrained geometry optimizations of these compounds were carried out using Gaussian 03 program (Frisch et al. 2004). The most widely used exchange–correlation function suggested exchange potential by Becke with gradient-corrected correlation provided by Lee, Yang and Parr (B3LYP) with Gaussian basis set triple-Z 6-311G double-polarized and d orbital expansion was necesary for obtained chemical properties of all compounds. Optimized geometries were verified by frequency calculations and characterized as minimum structure (no imaginary frequency) in their potential energy surfaces. The reactivity descriptors: Mülliken’s Charge on carbon and oxygen of carbonyl benzaldehyde, highest-occupied molecular orbital (HOMO), lowest unoccupied molecular orbital (LUMO), electronic chemical potential (µ), hardness (η), softness (S) and Electrophilicity Global Index (ω) were calculated with the following equations:

$$\mu = \frac{{\left( {E_{\text{LUMO}} + E_{\text{HOMO}} } \right)}}{2}$$
(1)
$$\eta = \frac{{\left( {E_{\text{LUMO}} - E_{\text{HOMO}} } \right)}}{2};$$
(2)
$$S = \frac{1}{2\eta }$$
(3)
$$\omega = \frac{{\mu^{2} }}{2\eta }$$
(4)

Hansch’s analysis

Hansch’s analysis was performed using multiple linear regressions, which has been attempted to relate the structural features of these benzaldehydes on the yield of reactions. In this analysis, different descriptors previously calculated were selected: HOMO and LUMO energies, energy difference between LUMO and HOMO (ΔL − H), electronic chemistry potential (µ), hardness (η), softness (S), global electrophilicity (ω), Mulliken’s charge (on CO and O of carbonyl groups) of benzaldehydes. Statistica 7.0 Package Software carried out model optimization.

Cross-validation QSRR model

Cross-validation of QSRR model was carried out by Golbraikh method (Golbraikh and Alexander 2002). Acceptable q 2 values are equal or greater than 0.5. q 2 values are obtained by the following formula:

$$q^{2} = 1 - \frac{{\mathop \sum \nolimits (y_{\text{obs}} - y_{\text{calc}} )^{2} }}{{\mathop \sum \nolimits (y_{\text{obs}} - y_{\text{ave}} )^{2} }}$$
(5)

where y obs is pYield observed, y calc is pYield calculated by QSPR model and y ave is pYield average of all compounds in the training set.

Results and discussion

Chemistry

The preparation of the chalcones 7a–w is described in Scheme 2. 4-methoxyacetophenone (5) was treated with different benzaldehydes (6a–w) in ethanolic NaOH to give 23 chalcones (7a–w), with medium to excellent yields (40–98%).

Scheme 2
scheme 2

Synthesis of chalcone 7a–w by Claisen–Schmidt reaction

The difference in the yields in this reaction is probably related to the substitution patterns of benzaldehydes, in terms of structural features and position of these groups. If considered 7a as a reference compound to evaluate the effect of chemical modifications over Claisen–Schmidt reaction, some tendency is possibly observed. One of these is the acidic hydrogen of hydroxyl–benzaldehydes (OH group) generated decreased yield (except 7q, 80% yield) (see Scheme 2). On the other hand, strong electron donor π groups on R3’ position (7g and 7h) decrease the chalcone yield, however, a few electronic donor as methyl group generate to increase the chalcone yield. Likewise, when the replacement of the hydrogen atom is in R4’ positions of benzaldehydes, is observed the best results for each group in particular (7j–l), except to 7n. In the case of the poly-substitution (7o–w), the behavior is unclear, because of the yield of this reaction depended on both, the position and feature of these groups. However, a Hansch’s analysis model gives an even more precise explanation of the yield differences of the studied compounds, which was developed in the next section.

The chemical structures of the target compounds were established based on their spectral properties (IR, 1H NMR, 13C NMR and MS, see the Experimental part and Supplementary Material). Infrared spectra of all obtained chalcones (7a–w) showed absorption peaks characteristic of conjugated carbonyl groups (υ = 1665 cm−1) by resonance effects (Silverstein et al. 2005). Furthermore, these peaks are characteristic of aromatic systems, with angle deformations of υ ≈ 1600, 1550, and 1500 cm−1.

The 1H-NMR spectra recorded in deuterated chloroform (CDCl3) of all compounds shows two coupled hydrogens with geometry trans to the down field (δ ≈ 7.88 and 7.56 ppm, J ≈ 15.6 Hz), corresponding to α and β hydrogens, respectively. 13C-NMR and 2D-HSQC spectra show two carbons correlated to hydrogens α (δ ≈ 122 ppm) and β (δ ≈ 143 ppm). Finally, with 2D-HMBC experiments, quaternary carbons that link the two aryl systems (see Scheme 3a) were determined. The EIMS of all compounds showed three key fragments: M+-15 (lost CH3 from methoxy group), M+-131 (lost cinnamic fragment from α carbonyl rupture), and M+-103 (lost styrene fragment from α carbonyl rupture) (see Scheme 3b).

Scheme 3
scheme 3

a Most important 2D-HMBC correlations for chalcone 7m. b Common fragmentation pattern observed for chalcones 7a–w

Hansch’s analysis

Quantum chemical calculations are an attractive source for new molecular descriptors, which can express all of the electronic and geometric properties of molecules and their interactions. Indeed, many recent Hansch’s analysis (QSAR/QSRR/QSPR studies) have employed quantum chemical descriptors alone or in combination with conventional descriptors, such as atomic charges, orbital electron density, dipole moments, and polarity indices, among others (Karelson and Lobanov 1996).

There are several descriptors to understand chemical reactivity. The most widely used are that of the molecular orbital (MO), e.g., highest-occupied molecular orbital (HOMO) and lowest unoccupied molecular orbital (LUMO), equivalent to Lewis base and acid, respectively (Ege 2000). HOMO and LUMO values can be used to calculate other reactivity descriptors, such as ΔLH (LUMO–HOMO), electronic chemical potential (µ), Pearson’s hardness and softness (η and S, respectively) and global Electrophilicity Index (ω). Electronic chemical potential (µ) is, at the physical level, electron donation capacity from HOMO to LUMO. Hardness can be understood as resistance to charge transfer. In addition, the global Electrophilicity Index (ω) is a concept of measure for energy stabilization, such as when systems receive additional electronic charges from underground (Lopez et al. 2013; Pearson 1993).

According to previously mentioned descriptors, the values of these parameters were calculated for benzaldehydes under investigation and those that elicited a statistical significance (p ≤ 0.05), then were correlated with the respective reaction yields (pYield). The descriptors considered were: Mülliken charge of the carbon in the carbonyl group (CO), LUMO, electronic chemical potential (µ), and global Electrophilicity Index (ω). However, when finished Hansch’s analysis statistical depuration, only LUMO and ω were the descriptors that can explain reactivity differences with 99.99% of certainity (p < 0.0001) (see Eq. 6 and Table 1 Supplementary Material).

$$\begin{aligned} p\text{Yield} = \, - 0. 6 3 9\left( { \pm 0. 3 2 9} \right) \, - { 336}\left( { \pm 4 6. 7} \right)\text{LUMO} - { 174}\left( { \pm 2 4. 8} \right)\omega \hfill \\ n = { 15}; \, r = 0. 9 1 8; \, r^{ 2} = \, 0. 8 4 3;{\text{ SD}} = \, 0.0 6 3; \, F = { 29}. 5; \, q^{ 2} = \, 0. 8 2 1\hfill \\ \end{aligned}$$
(6)

The first model (Eq. 6) shows that Claisen–Schmidt condensation is related to LUMO and global Electrophilicity Index (ω) values of the benzaldehydes, which could be connected to their capacities to accept two electrons (Lopez et al. 2013; Parr et al. 1999; Pearson 1993). This point is pivotal in the first step in the Claisen–Schmidt reaction mechanism, which involves nucleophilic attack from enolate through benzaldehyde, to obtain the respective addition intermediate (see Scheme 4).

Scheme 4
scheme 4

The pivotal step in Claisen–Schmidt reaction for some benzaldehydes, observed from Hansch’s analysis

Furthermore, in optimizing this Hansch’s analysis model, eight outlier compounds with no significance in this model were identified. All outlier compounds have a common feature: OH or O-alkyl groups, in positions 2 and/or 4, which could influence the stereo-electronic properties of these benzaldehydes. For explaining different features of outlier compounds and chalcone yield relationship, all descriptors calculated were correlated with pYield again, found a significant correlation (p < 0.05) with HOMO, ΔLH, η, S and µ descriptors. Furthermore, a new Hansch’s analysis statistical depuration showed that HOMO and ΔLH are the only descriptors that can explain reactivity differences with 99.99% of certainity (p < 0.0001). In second, the Hanch’s analysis (outlier model), two non-correlated compounds were found (6q and 6t). Despite efforts by reactivity understanding of this compounds using Hansch’s analysis, into two models have no contribution statistical. Variables involved in the Hanch’s analysis (outlier model) were thus HOMO and ΔLH (see Eq. 7 and Table 2 supplementary material), which correspond to reactivity descriptors related to electron-donating capacity (Lopez et al. 2013; Pearson 1993).

$$\begin{aligned} p\text{Yield}_{{\text{out}}} = \, - 4.0 3\left( { \pm 0. 2 6 4} \right) \, - { 11}. 5\left( { \pm 2. 5 5} \right)\text{HOMO} - { 2}. 8\left( { \pm 2. 6 4} \right){\Delta LH} \hfill \\ n = { 6}; \, r = 0. 9 8 2; \, r^{ 2} = \, 0. 9 6 4;{\text{ SD}} = \, 0.0 30; \, F = { 4}0. 1; \, q^{ 2} = \, 0. 9 9 9\hfill \\ \end{aligned}$$
(7)

Equation 7 shows that HOMO is the most important descriptor (up to four times more important than ΔLH), related to electron-donating capacity or the behavior as Lewis base of the outliner benzaldehydes (Ege 2000; Lopez et al. 2013; Pearson 1993). If Claisen–Schmidt reaction mechanism is considered, these electronic features are important for the second step of the reaction, the trans-elimination (see Scheme 5). To confirm this hypothesis, HOMO and ΔLH values of aldols were calculated and the values of these descriptors were correlated with the corresponding benzaldehydes, which shows r 2 ≥ 0.9. These results could indicate that in the Claisen–Schmidt reaction mechanism, the reactivity of outlier compounds was determined by the second step of the reaction, the trans-elimination (see Scheme 5).

Scheme 5
scheme 5

The pivotal step in Claisen–Schmidt reaction for outliner benzaldehydes, observed from Hansch’s analysis

Figure 1 shows HOMO diagrams for aldol intermediates of 7a and 7c. Figure 1b demonstrates that 2′-OH substitution increases electronic population in the ring near OH-aldol, behavior not observed for aldol intermediate of 7a. This effect producing smaller stability on the addition intermediate, which can be related to the HOMO values in Hansch’s analysis outlier model (All HOMO plots of aldols outlier model are in Fig. 1 Supplementary Material).

Fig. 1
figure 1

HOMO diagrams for aldol intermediates: a 7a and b 7c. DFT-B3LYP-6-311G ++(d) calculated. Isovalue: 0.035; Density: 0.0004

Conclusions

In summary, a series of 23 chalcones were synthesized through Claisen–Schmidt condensation using alkaline media and structural characterization several spectroscopic techniques (IR, NMR, and MS). Through DFT calculus with B3LYP exchange-correlated basis set and 6-311G ++(d) calculus level, was possible obtained quantum descriptors for Hansch’s analysis. The Hansch’s analysis indicated that, for 15/23 compounds (65%), the yield to obtain the respective chalcone, were related to the nucleophilic attack from enolate to the carbonyl group of the respective benzaldehyde. On the other hand, for 6/23 compounds (26%), the yield of this reaction was determined for aldol dehydration process. In two models, developed descriptors were correlated with 99.9% certainty (p < 0.001). Finally, this is the first report of Hansch’s analysis use to explain different yields in Claisen–Schmidt reaction and would be used in other chemical reactions to explain the same experimental aspects.