Introduction

Imidazo[2,1-b][1,3,4]thiadiazoles exhibit different kinds of biological activities. 2,6-Disubstituted imidazothiadiazole derivatives are described as antifungal, antibacterial, and antitubercular agents [13], while other analogs display anti-inflammatory activity or inhibit cyclooxygenase-2 [4, 5]. Several imidazo[2,1-b][1,3,4]thiadiazole analogs have been evaluated as potential anticancer agents [612] and some inhibited proliferation in a wide panel of cancer cell lines [7, 13, 14] with nM IC\(_{50}\) potency [6, 8]. Molecular mechanism studies have shown that most imidazo[2,1-b][1,3,4]thiadiazoles induce apoptosis in cancer cell lines [8, 10, 15, 16].

2D and 3D quantitative structure activity relationship (QSAR) models concerning anticancer agents are well known [1728]. In many cases, they use the Hansch equation taking into account lipophilicity as well as electronic and steric parameters [21, 22]. A multiple linear regression analysis of a set of indane carbocyclic nucleosides with antiproliferative potency against L1210 and CEM cell lines gave a QSAR model including the lowest unoccupied molecular orbital energy \((E_{\mathrm{lumo}})\) and the solvent accessible-hydrophobic surface area [23].

Elucidation of the structure-antiproliferative activity of 2- and 6-substituted-5,8-dimethoxy-1,4-naphthoquinones against L1210 cells showed that the most important factor is hydrophobicity [24]. These results were confirmed by Mekapati et al. who analyzed leukemia cells using bis(heterocyclic-carboxamides) [25]. A 3D QSAR study on the antiproliferative activity of 1,2,4,5-tetraoxane derivatives showed that hydrophobicity and hydrogen bond donor features are the main factors affecting antiproliferative activity of the studied analogs against the HeLa (cervix carcinoma) human cancer cell lines [27]. The established QSAR models were usually estimated by leave-one-out cross-validation and tested through the use of external test sets of compounds and characterized by good predictability [20, 29, 30].

In this paper, QSAR models for a set of imidazo[2,1-b][1,3,4]thiadiazoles possessing antiproliferative activity against three cancer cell lines were built and discussed. The biological data used for this study came from a set of compounds described previously [6, 8]. For the building of our QSAR models lipophilicity, electronic, and steric parameters were taken into account. The obtained models could explain if cell membrane permeability has an impact on the biological potency of the compounds, provide information about critical ligand–target interactions [24], and indicate differences in the structure of compounds that are crucial to antiproliferative activity.

Results and discussion

Dataset

Two different sets of imidazo[2,1-b][1,3,4]thiadiazoles were the object of our QSAR studies. The first group includes 15 analogs of 2-bromo-N-[3-(imidazo[2,1-b][1,3,4]thiadiazol-6-yl)phenyl]acrylamide modified in position 2 of the heterocyclic ring described by Romagnoli and co-workers (Table 1) [6]. The second set consists of 27 compounds, 5,6-disubstituted 2-(4-chlorobenzyl)imidazo[2,1-b][1,3,4]thiadiazoles presented by Kumar and co-workers (Table 1) [8]. The antiproliferative potency of both groups of compounds was evaluated against the same cancer cell lines using the MTT assay. In the case of these QSAR studies, the IC\(_{50}\) values were converted to the logarithm of IC\(_{50}\) (Table 1).

Descriptors

The QSAR model construction was based on lipophilicity, electronic, and steric descriptors obtained by computational methods: the lipophilicity of compounds was expressed as log P values estimated according to three different approaches (Clog P, Mlog P, log P) [31, 32] and is one of the most important QSAR model descriptors. Electronic parameters include the atomic partial charges of some atoms of heterocyclic skeleton and the Highest Occupied Molecular Orbital \((E_{\mathrm{homo}})\) and the Lowest Unoccupied Molecular Orbital \((E_{\mathrm{lumo}})\) were taken into account since they are associated with the nucleophilic and electrophilic properties of a molecule. Based on the energies of frontier orbitals, the hardness (\(\upeta )\) descriptor was obtained using the equation \(\upeta = (E_{\mathrm{lumo}}-E_{\mathrm{homo}})\)/2 [33]. Polar surface area (PSA) was also taken into account. Molar refractivity (MR, CMR determined according two different algorithms), volume, ovality, and the surface area of a molecule were used as steric parameters (supplementary materials). The correlation matrix between the parameters used in the QSAR model equations is presented in Table 2.

Table 1 Structure and antiproliferative potency (observed and calculated) of imidazo[2,1-b][1,3,4]thiadiazoles against murine leukemia (L1210), human T-lymphocyte (CEM), and human cervix carcinoma (HeLa) cells
Table 2 Correlation coefficient matrix (\(r^{2})\) for the descriptors used in the QSAR models

QAAR studies

To compare sensitivity of the individual cancer cell lines for the compounds under consideration, the quantitative–activity–activity relationships (QAAR) were studied and the results are presented in Table 3. They indicate that the activities of compounds against the studied cell lines are highly correlated with r\(^{2}\) values in the range 0.771–0.924. The best relationship was obtained comparing the antiproliferative potency of compounds against human HeLa and CEM cells which is described by the equation

$$\begin{aligned} \hbox {log IC}_{50(\mathrm{HeLa})}= & {} 0.907\hbox { log IC}_{50(\mathrm{CEM})} + 0.085 \end{aligned}$$
(1)

n = 37; \(r^{2}\) = 0.943; s = 0.161; F = 575.7

The highest correlations can be explained by the same origin of both lines (human cells).

QSAR analyses

The dataset was divided into training (26–28 compounds) and test (8 compounds) sets by the Kennard Stone method. Using Multiple Linear Regression (MLR) and setting up the selection criteria for descriptors, three-variable QSAR equations were constructed. QAAR studies showed that activities of the studied compounds against the considered cell lines are highly correlated; therefore, similar QSAR models were expected to be obtained.

Table 3 Correlation coefficient matrix \((r^{2})\) between the antiproliferative activities of compounds against the studied cancer cell lines

The best QSAR model of antiproliferative potency of imidazo[2,1-b][1,3,4]thiadiazoles against the L1210 cell line is described by the following equation:

$$\begin{aligned} \hbox {log IC}_{50(\mathrm{L}1210) }= & {} 0.861 (\pm 0.067) \hbox { log }P + 1.409(\pm 0.227) E_{\mathrm{Lumo}}\nonumber \\&- 0.444 (\pm 0.0764) \hbox {CMR} + 2.149 (\pm 1.065)\nonumber \\ \end{aligned}$$
(2)

\(n = 27; r = 0.960; r^{2}_{\mathrm{adj}} = 0.911; s = 0.266; F = 90.1; p < 0.0001; Q^{2} = 0.896; \hbox {Press }= 1.63; (\hbox {compound 22} \hbox {outlier})\)

Other statistical metrics are presented in Table 4. The calculated parameters are in the range recommended in the literature about the QSAR models validation: \(R^{2}_{\mathrm{pred}}> 0.6; Q^{2} > 0.5; \overline{{r_m^2}}_{(\mathrm{pred})}\) (scaled) > 0.5; \(\Delta r^{2}_{\mathrm{m}(\mathrm{pred})}\) (scaled) > 0.2; k and \(k^\prime \) in the range of 0.85–1.15 and \({\vert }R^{2}_{0}-{R^\prime }^{2}_{0}{\vert } < 0.3\). The statistical quantities for chance correlation (\(R_r^2 \), \({}^{\mathrm{c}} {R_{p}^{2}} )\) as well as \({r_m^2}_{(\mathrm{LOO})}\) and \({r_{m}^{2}}_{(\mathrm{overall})}\) confirm external prediction ability of the obtained models (Table 4) [3438].

Model Eq. (2) is the Hansch equation including lipophilicity as well as electronic and steric parameters. The equation could estimate 91 % variance in the observed activity. The model gives a leave-one-out cross-validation of 90 %. Other models were obtained similar to Eq. (2) which included \(E_{\mathrm{Lumo}}\), log P, and another steric descriptor: MR, surface area, volume, or ovality that are highly correlated with CMR. The predicted log \(\hbox {IC}_{50(\mathrm{L1210})}\) values based on Eq. (2) and the residues between the observed and predicted activities are presented in Table 1 and graphically in Fig. 1.

Table 4 Statistical metrics and validation parameters of the models

The QSAR model similar to that in Eq. (2) was obtained for the CEM cells, expressed by the following equation:

$$\begin{aligned} \hbox {log IC}_{50(\mathrm{CEM}) }&{=}&0.646 (\pm 0.060) \hbox {log} P {-} 0.0278(\pm 0.009) \hbox {MR}\nonumber \\&+ 1.017(\pm 0.245) {E}_{\mathrm{Lumo}} {+} 1.191 (\pm 1.335)\nonumber \\ \end{aligned}$$
(3)

\((n = 25; r = 0.942; r^{2}_\mathrm{adj} = 0.872; s = 0.265; F = 57.6; p < 0.0001; Q^{2} = 0.842\); Press = 1.555 (compound 27 outlier)

A slightly better model was obtained including lipophilicity (Mlog P) and two electronic parameters

$$\begin{aligned} \hbox {log IC}_{50(\mathrm{CEM})}= & {} 2.454(\pm 0.578) q_{(\mathrm{C2})} - 6.855(\pm 2.346) q_{(\mathrm{N7})}\nonumber \\&+\, 0.848(\pm 0.085) \hbox {Mlog }P 6.996(\pm 1.404)\nonumber \\ \end{aligned}$$
(4)

\(n = 26; r = 0.942; r^{2}_{\mathrm{adj}} = 0.872; s = 0.241; F = 58.03; p < 0.0001; Q^{2} = 0.850; \hbox {Press} = 1.544\) (compound 27 outlier).

The other statistical quantities are presented in Table 4. They confirm the predictability of the obtained models. The log \(\hbox {IC}_{50(\mathrm{CEM})}\) values predicted based on Eq. (4) and the residues between the calculated and observed activities are presented in Table 1.

In the case of HeLa cells, the best model also includes lipophilicity as well as electronic and steric factors and is expressed by

$$\begin{aligned} \hbox {log IC}_{50(\mathrm{HeLa})}= & {} 0.5378(\pm 0.048)\hbox { log } P + 1.095(\pm 0.1929)E_{\mathrm{lumo}}\nonumber \\&- 0.486 (\pm 0.068) \hbox {CMR} + 4.345(\pm 0.954)\nonumber \\ \end{aligned}$$
(5)

\(n = 27; r = 0.961; r^{2}_{\mathrm{adj}} = 0.917; s = 0.221; F = 97.2; p < 0.0001; Q^{2} = 0.904; \hbox {Press} = 1.129\) (compound 27 is outlier).

The above equation shows that the parameters log P, \(E_{\mathrm{lumo}}\), and CMR play a significant role in explaining the variance (92 %) in the activity against the HeLa cells.

Another effective model is expressed by

$$\begin{aligned} \hbox {log IC}_{50(\mathrm{HeLa})}= & {} 3.050(\pm 0.562) q_{(\hbox {C2})} - 7.891(\pm 3.117) q_{(\hbox {N3})} \nonumber \\&- 0.033(\pm 0.005)\hbox {PSA} -3.598(\pm 1.863)\nonumber \\ \end{aligned}$$
(6)

\(n = 29; r = 0.906; r^{2}_{\mathrm{adj}} = 0.799; s = 0.342; F = 38.24; p < 0.0001; Q^{2} = 0.705; \hbox {Press} = 2.921\) (compound 27 is outlier).

The log \(\hbox {IC}_{50(\mathrm{HeLa})}\) values predicted from Eq. (5) and the residues between the calculated and observed activities are presented in Table 1.

Fig. 1
figure 1

Log IC\(_{50(L1210)}\) observed versus predicted from. Eq. (2). Compound 22 is the outlier

The results showed that the best models, including all studied cell lines, were obtained when lipophilicity parameters were taken into account in the equations. Lipophilicity determines a compound passage through cell membranes. Biological potency of compounds is a parabolic function of lipophilicity in its wide range [39, 40]. Lipophilicity of the studied compounds is limited as reflected by log P since values are in the 4.86–8.08 range. It can be assumed that the considered compounds represent only one ram of parabola. In that case, the antiproliferative activity is enhanced with a decrease of compound lipophilicity.

The biological potency of the compounds is also connected to the binding force of a ligand with a biomolecule which is associated to the charge distribution in a molecule. The charge of nitrogens N3 and N7 as well as that of carbon C-2 of the imidazo[2,1-b][1,3,4]thiadiazole ring is a statistically significant descriptor in the generated models. The obtained results showed that low negative charge of nitrogen \((q_{(\hbox {N3})}\), \(q_{(\mathrm{N7})})\) and low positive charge of carbon \((q_{\hbox {C}(2)})\) atoms contribute to antiproliferative potency. The generated models indicate the important role of the nitrogen atoms in the imidazothiadiazole ring for interactions with a potential molecular target. \(E_{\mathrm{lumo}}\) as an electronic factor that influences biological potency and activity increased with a decreasing \(E_{\mathrm{lumo}}\). The antiproliferative activity of 1,3,4-thiadiazoles was also found to be a function of \(E_{\mathrm{Lumo}}\) in a molecule [21]. The activity of the considered compounds is additionally enhanced by the high MR parameter.

Conclusions

Using combinatorial protocols in multiple linear regression (CP MLR), statistically significant QSAR models were obtained for all studied cell lines. For all systems three-variable equations were obtained. As the activities of the studied compounds against the individual cell lines were highly correlated, similar dependences were obtained. The constructed models gave high square correlation coefficient values and exhibited good predictive accuracy confirmed by internal and external validations as well as by a randomization procedure.

Taking into account the descriptors of model equations, there can be drawn some conclusions about the antiproliferative potency of imidazo[2,1-b][1,3,4]thiadiazoles: (1) compounds should have suitable lipophilicity–hydrophobicity character which determines cell membrane penetration; (2) models indicate a significant role of nitrogen atoms which can interact with a potential molecular target; (3) \(E_{\mathrm{lumo}}\) influences the biological potency of the compounds; and (4) activity promotes high MR of the molecule.

These conclusions will be the guidance for the design and syntheses of novel compounds that could express significant potency against the presented cancer cells.

Experimental section

Antiproliferative activity

The compounds presented in Table 1 were analyzed. Their antiproliferative potency in vitro against the following cancer cell lines was assessed: murine leukemia (L1210), human T-lymphocyte (CEM), and human cervix carcinoma (HeLa) cells [6, 8]. Activity of compounds was expressed as \(\hbox {IC}_{50}\) values \((\upmu \hbox {M})\), concentrations inducing a 50 % inhibition of cells compared to the control. The cytotoxicity was evaluated using the colorimetric 3-(4,5-dimethyl-2-thiazolyl)-2,5-diphenyl-2H-tetrazolium bromide (MTT) assay.

Descriptors

The compounds were built with a standard bond length and angles using the PC SPARTAN Pro Ver. 1.08 molecular modeling program [41]. The energy of a molecule was minimized using molecular mechanics methods followed by the semi-empirical PM3 method and used for the electronic properties determination. Charge of atoms was determined (C, N, and S) from the potential distribution. Other descriptors were calculated from the ChemSketch 11.02 and MedChem Designer 3.0 programs [42, 43].

Methodology of QSAR studies

Combinatorial protocols in multiple linear regression (CP MLR) was used to build the QSAR models [44, 45]. The strategy combines the MLR procedure and properly set-up criteria for the selection of descriptors and equations: the descriptors in terms of interparameter correlation cut off limits in the subset regressions (0.79); t values of the regression coefficients (2.0); square-root of adjusted multiple correlation coefficient, r bar (0.71); the external consistency, \(R^{2}_{\mathrm{CV}}\) (0.3 \(\le \) \(R^{2}_{\mathrm{CV}} \le \) 1.0) [28]. There was used the outlier criterion: \(\vert \hbox {Yobs}.-\hbox { Ycalc}. \vert >2\) st. dev. The model selection and statistics were made using the BuildQSAR version 2.1.0.0 and Statistica version 7.1 software packages [4648]. Statistical significance of the regression equation was tested by the correlation coefficient (r), the adjusted r square coefficient \((r^{2}_{\mathrm{adj}})\), the standard error of estimate (s), and the variance ratio (F). The leave-one-out cross-validation (LOO CV) algorithm was applied to estimate the quality of the obtained equations \((Q^{2})\). The statistical quantities of external validation as well as randomization test: \(R_r^{2}\), \(^{\mathrm{c}} R_p^{2}\) were calculated. Additionally, \({r_{m}^{2}} _{(\mathrm{overall})}\) was determined [3438].