Introduction

Polarizability (α) is described as the linear response of electronic charge distribution with respect to an external applied electric field [1, 2]. It is an electronic effect which can be explored further. It is considered as a conversion element between the induced dipole moment (μ) and an applied electric field (ɛ).

$$ \mu = \alpha .\varepsilon $$
(1)

As majority of the molecules are asymmetric, polarizability is primarily a three-dimensional tensor. Nevertheless, the average of the tensor’s diagonal elements is sufficient for nearly all purposes:

$$ \alpha = \alpha = \frac{1}{3}\left( {\alpha_{xx} + \alpha_{yy} + \alpha_{zz} } \right) $$
(2)

Recently, an effective model of atomic polarizability, an important Conceptual Density Functional Theory (CDFT)-based descriptor, was proposed by Tandon et al. in terms of absolute radius (r) and electronegativity (χ) [3]:

$$ \alpha \propto \frac{{r^{2} }}{\chi } $$
(3)

The model uses an empirical approach which is simple and integrates relativistic effects. All the sine qua non of a periodic descriptor is followed excellently. Additionally, molecular polarizability is computed through property of additivity in their study.

Since the atomic polarizabilities of Tandon et al. are accurate to maintain periodicity trend [3], in the present study we have applied them to determine molecular polarizabilities. Molecular polarizability is an important electronic parameter and has a strong impact on chemical–biological interactions. It plays a crucial role in elucidating such interactions through Structure–Activity Relationships (SAR). In the present work, with the intention of exploring the efficacy of newly proposed polarizability by Tandon et al. in the real field, we have applied it to compute molecular polarizability to study chemical–biological interactions. The study supports validating the model as prescribed by Tandon et al. to compute polarizability in a molecular framework and not to develop the finest Quantitative Structure–Activity Relationship (QSAR). For future prospects, it is suggested that the descriptor polarizability should be contrasted with the use of other drug-like descriptors.

In these contemporary times, computational workers have overlooked the research of twentieth century on the importance of electronic interactions between a receptor and a ligand with the employment of combinatorial chemistry and in silico approaches. Since electronic interactions are a consequence of specific structural features and chemical traits, SAR analyses are indisputably of huge value in the present chemistry and biochemistry. The idea is to use chemical knowledge and perception to alter exploration for compounds with requisite properties into a mathematically quantified and programmed form. When a relationship is accomplished linking structure and activity, a great deal of compounds, inclusive of hypothetical, can be effortlessly analysed by computer in order to select structures with the necessary characters. Thus, it makes the selection of potential compounds simplistic, which can then be developed and examined in the laboratory. Hence, SAR methodology preserves resources and accelerates the route of development of new molecules to be utilized as drugs, anaesthetics, additives and other diverse materials.

Presently, our biodatabase contains a large number of QSAR equations with different parameters to describe distinct electronic effects. However, most of such equations do not encompass polarizability, which plays a crucial role in such relationships. A number of descriptors have been proved to be helpful in relating and accounting for numerous radical reactions [4,5,6]. These kinds of reactions are imperative in toxic conditions such as smoking or in the case of some environmental chemicals. Employing well-known parameters is the only means to perform comparative QSAR, which is the route towards a science of chemical–biological interactions through which one can approximate the biological activity of compounds that have not been investigated so far. The most essential facet of the theoretical methodology is to realize how living organisms (or their components such as DNA, protein, enzymes, and so on, and organisms ranging from bacteria to human) are affected by chemicals, for which mechanistic chemistry together with biology is crucial, in addition to the proficiency to carry out significant modelling with the ‘proper’ system. Electronic, hydrophobic and steric interactions constitute the chief interactions, and they must be suitably considered while modelling.

Conventionally, Lorentz–Lorenz equation is defined as [7],

$$ {\text{MR}} = \left( {n^{2} - \frac{1}{{n^{2} }} + 2} \right)\frac{{{\text{MW}}}}{d} $$
(4)

In this equation, n refers to the refractive index, MW indicates the molecular weight and d depicts the density of the substance. The refractive index (MR) is a quantification of the interaction of light with the electrons in a molecule. The term MW/d represents the volume. As there is a minor variation in n, MR depends on MW/d. The initial attempt to apply molecular refractivity in terms of the Lorentz–Lorenz equation to biological processes was made by Pauling and Pressman [8]. Their study was based on hapten antibody interactions, which considered polarizability and involved steric factors [9]. Agin et al. made a remarkable investigation of the ability of diverse chemicals to obstruct the sartorius muscle of the frog incorporating two parameters: α (polarizability) and Ip (ionization potential) [10]. One more case of the minimum blocking concentration of various drugs on frog muscle results from a study by Kamlet and others [11]. In another investigation, Hahin and his coworkers examined the action of alcohols on frog nerves [12]. The pharmacological activities of a number of phenyl alkane p-ω-bis(trialkylammonium) compounds were analysed by Wien and Mason [13]. Minimum blocking concentration to reduce action potential in central nerve cord from American cockroach was evaluated by Nishimura et al. [14]. Recently, García-Jacas et al. explored the aptness of alignment-free geometric molecular descriptors based on N-linear algebraic maps, commonly known as QuBiLS-MIDAS, for extracting structural information of the molecules [15]. A comparison of different QSAR methodologies for investigating their predictive abilities was also presented by their group [16]. Various studies have been carried out to study the mechanism of different biological activities such as blocking potency, toxicity, anticonvulsant activity, neuromuscular activity, neurophysiological activities, to mention some [17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34]. The history of such works is vast and will not be discussed here.

The mechanism of action of local anaesthetics, narcotics and drugs is in general one of the fundamental difficulties in the study of cell membranes [10]. There are two basic characteristics of this problem, which have puzzled and at the same time fascinated researchers: first of all, there is a broad range of chemical structures, varying from the inert gases to complex molecules, for example tetrodotoxin, which can stop electrical activity reversibly; secondly, a lot more is known about the characteristics of any of these chemical structures than regarding the membrane with which they presumably interact. The foremost complication in earlier times has been the absence of suitable quantitative parameters, mainly those pertaining to active and inactive molecular structures.

Numerous forms of biological activities rest on the pharmacokinetics, circulation and distribution of chemicals within the organism. Such processes are suggested to be greatly controlled by polarizability. Binding of chemicals with body fluids or cells, their passage and various other interactions depend on the polarizability of the chemical as well as the system. However, regardless of its potential, most of the QSAR studies do not employ this elementary parameter. Thus, the objective of this study is to present the potential of Tandon et al.’s polarizability in computing molecular polarizability to use as an electronic interaction parameter for chemical–biological systems and to suggest the probable effects and action mechanism. It is believed that the study of the action mechanism of a variety of molecules would reveal important properties of the biological structures and eventually lead to an understanding of the mechanism of anaesthesia, narcotics, toxicants and other similar chemicals.

Method of computation

The purpose of the present study is to assess the applicability of Tandon et al.’s polarizability [3] in computing molecular polarizability and further exploring its potential in the real field. For this purpose, varied biological activities of different compounds have been modelled using polarizability with the aim of explicating their course of action. In our analysis, we have empirically computed a CDFT-based reactivity descriptor, viz. polarizability (α), for 312 compounds through the property of additivity [35,36,37] using atomic polarizability data tabulated by Tandon et al. [3]:

$$ \alpha_{m} \approx \mathop \sum \limits_{i} \alpha_{i} $$
(5)

We have determined the molecular polarizabilities for a variety of molecules, viz. anaesthetics, miscellaneous drugs, phenyl-substituted and unsubstituted n-alkanols, phenyl alkane p-ω-bis(trialkylammonium) compounds, substituted benzyl pyrethroids, miscellaneous aliphatic alcohols and esters and 3-amidinophenylalanine derivatives. The data can be found in Online Resource 1.

The selected compounds have been categorized into seven sets on the basis of their biological activities, viz. minimum concentration of varied anaesthetics required for complete block of excitability in sartorius muscle of the frog, minimum concentration of miscellaneous drugs for blocking electrical activity in frog muscle, minimum blocking concentration of miscellaneous alkanols to reduce action potential of frog nerves by 50%, relative biological response for neuromuscular blocking activity of aromatic bis-quaternary compounds on rabbit phrenic nerve diaphragm, minimum blocking concentration of pyrethroids to suppress action potential of central nerve cord of cockroach, 50% inhibitory growth concentration of miscellaneous aliphatic alcohols and esters to present toxicological activity in Tetrahymena pyriformis and binding affinity of 3-amidinophenylalanine derivatives towards thrombin. The biological activity data for these compounds have been taken from literature [10,11,12,13,14, 38, 39]. Regression analysis has been carried out to correlate descriptor values with observed activities to build a scrupulous and realistic QSAR model. In the study, the DFT-based reactivity descriptor, polarizability, is adopted as an independent variable, while log of inverse of minimum blocking concentration (MBC), log of relative biological response (RBR), log of inverse of inhibitory growth concentration (IGC) or log of inverse of binding affinity (Ki) is considered a dependent variable to establish QSAR model to predict the biological activity:

$$ {\log}\left( {\frac{1}{{{\text{MBC}}}}} \right) \;{\text{or}}\; \left( {{\text{RBR}}} \right) \;{\text{or}}\; \left( {\frac{1}{{{\text{IGC}}_{50} }}} \right) = a\alpha + b $$
(6)

and

$$ {\log}\left( {\frac{1}{{K_{i} }}} \right) = a\alpha + b\beta + c $$
(7)

In these expressions, log (1/MBC), (RBR), (1/IGC50) or (1/Ki) implies predicted biological activity, α indicates polarizability of the compound, while ‘a’, ‘b’ and ‘c’ represent constants calculated for respective set of compounds. β signifies the compressibility of the molecule. The total number of compounds in every set (n) is split into training and test set to develop the model and assess its predictive power, respectively. The division is made such that both the sets contain similar functional group as far as possible. Goodness-of-fit of the developed model is determined by coefficient of determination (R2), and robustness is evaluated by cross-validation coefficient (using leave-one-out method) (R2CV). Further, predicted residual error sum of squares and error bar analysis is used to test the uniformity and significance of the data. All the OECD principles have been followed; namely endpoint has been defined, an unambiguous algorithm is developed, applicability domain is defined, measures of goodness-of-fit, robustness and predictivity have been provided and finally, a mechanistic interpretation is also presented. The computations have been carried out using Minitab including the development, testing and validation of the developed regression models [40].

Results and discussion

The study describes the application of Tandon et al.’s polarizability in the real field as a predictor of biological activity for the chosen 312 compounds. Observed activity data (1/MBC or RBR or 1/IGC50 or 1/Ki) of the selected compounds are taken as the dependent variable, whereas polarizability (α) is considered as an independent variable in developing the regression models. As mentioned in the above section, the study has been performed by differentiating the selected compounds into seven categories. Discussion for each of the above-mentioned sets is presented in this part.

Complete block of excitability in terms of minimum blocking concentration for sartorius muscle of the frog

The first set of compounds presents the potential of a broad variety of anaesthetics to block the sartorius muscle of the frog. Such blocking is possibly a result of nerve inhibition. The QSAR model obtained for the minimum blocking concentration using polarizability is represented by Eq. (8):

$$ \begin{gathered} \log \left( {\frac{1}{{{\text{MBC}}}}} \right) = \left( {0.019} \right)\alpha - \left( {0.596} \right) \hfill \\ n{ } = { }39,{ }R^{2} { } = { }0.950,{ }R^{2} _{\text{CV }} = { }0.938 \hfill \\ \end{gathered} $$
(8)

The predicted and observed biological activities (in mM) of the anaesthetics are presented in Table 1. It is seen that the molecules in the set possess heterogeneity in their chemical structures and yet show a close conformity with Eq. (8). High values of R2 and R2CV signify a very impressive correlation between the data sets. Predicted residual error sum of squares and standard error bar analysis presents uniformity and significance of the data. The molecules used in the construction of the model are the different from the ones used for validation purpose. Internal validation by the test set transpires the predictability of the model. It appears that anaesthetics or other simple molecules interact with electrically excitable membranes by a same mechanism. Hence, it is unacceptable to provide a dissimilar and specific interaction mechanism for each and every anaesthetic molecule based on its chemical configuration or structural relations to known biochemical systems. However, in reality, not every molecule behaves as an anaesthetic. The reason for this may be related to polarizability, water (lipid) solubility and other interactions such as hydrogen bonding taking place with molecules. If a group with lower polarizability and higher lipophilicity, for instance, an aromatic ring, is attached to a molecule such as sugar, the newly formed molecule will show anaesthetic action, i.e. block excitability, because it will allow an easy passage of chemicals through membranes which are less polar but more lipophilic. Thus, there will be a greater distribution of the chemicals to hydrophobic regions of the bioorganism consequently demonstrating an anaesthetic or excitability blocking activity, and the minimum blocking concentration will be in accordance with the polarizability of that group. On the other hand, if a polar group enters into an organism, its solubility in water (hydrophilicity) will increase since polarity of water is higher in comparison with lipids and proteins. As a result, there will be a slow passage of chemicals through membranes, which will favour excretion of chemicals out of the organism through various ways like urine, perspiration, etc. Therefore, the tendency to accumulate chemicals in high concentrations will decrease and no or very less anaesthetic action will be observed. The exact course of action of anaesthetics, nevertheless, remains vague. Even though the correlation reported here is fairly remarkable, which lets us predict the minimum blocking concentration of some molecules with significant precision, the theoretical opinions in reference to the involved physical parameters perhaps need to be investigated further.

Table 1 Minimum concentration of varied anaesthetics required for complete block of excitability in sartorius muscle of the frog (in mM)

Blocking of electrical activity in terms of minimum blocking concentration for frog muscle

The second set of compounds presents the ability of miscellaneous drugs to block electrical activity in frog muscle. The QSAR model for computing the minimum blocking concentration using polarizability is given in Eq. (9):

$$ \begin{gathered} \log \left( {\frac{1}{{{\text{MBC}}}}} \right) = \left( {0.023} \right)\alpha - \left( {1.121} \right) \hfill \\ n{ } = { }21,{ }R^{2} { } = { }0.977,{ }R^{2}_{\text{CV }} = { }0.968 \hfill \\ {\text{Outliers}}:{\text{ Chloroform}},{\text{ Nitrobenzene}} \hfill \\ \end{gathered} $$
(9)

The predicted and observed biological activities (in mM) of the drugs are listed in Table 2. It is noted that the molecules in the given set present a close agreement with Eq. (9). High values of R2 and R2CV present the goodness-of-fit and robustness, which in turn suggest a remarkable association amongst the activity data sets. Standard error bar analysis presents uniformity in data, while predicted residual error sum of squares indicates superiority of the model. The molecules used in the construction of the model are the different from the ones used for validation purpose. Internal validation by the test set signifies excellent predictability by the model. From a mechanistic point of view, an increase in the polarizability of drug may result in dipole–dipole or charge–dipole interactions between high-polarity drug molecules and chemicals associated with the electrical conduction of nerve impulses across nerve membranes such as zwitter ionic lecithin and/or ionic acetylcholine. As a result of such interaction, a hindrance is produced in the conduction of nerve impulses. Thus, it follows when interactions take place between extremely polar or charged species; the contribution of polarizability to blocking of electrical activity effect is highest. Undoubtedly, this indicates a definite relationship amongst highly polar solutes and certain molecular entities contributing to the nerve impulse transmission where lecithin and acetylcholine appear probable candidates.

Table 2 Minimum concentration of miscellaneous drugs for blocking electrical activity in frog muscle (in mM)

Reduction in action potential in terms of minimum blocking concentration for frog nerves by 50%

The third set is comprised of a variety of alkanols using which QSAR relationship is constructed for the computation of the minimum blocking concentration to reduce action potential of frogs. The model, displayed by Eq. (10), employs polarizability as a theoretical parameter:

$$ \begin{gathered} {\text{log }}\left( {\frac{1}{{{\text{MBC}}}}} \right) = \left( {0.021} \right)\alpha - \left( {1.139} \right) \hfill \\ n \, = { 11}, \, R^{{2}} = \, 0.{988}, \, R^{{2}}_{{{\text{CV}}}} = \, 0.{972} \hfill \\ \end{gathered} $$
(10)

The predicted and observed biological activities (in mM) of compounds are arranged in Table 3. An excellent correlation is noted for the selected compounds with Eq. (10). High R2 and R2cv values imply a significant relationship between the predicted and observed activities. Predicted residual error sum of squares presents superior predictability by the model. The molecules used in the construction of the model are the different from the ones used for validation purpose. As per error bar analysis, there is a slight variation in the selected data. However, internal validation suggests the good predictability by the model. The variable, viz. polarizability, adopted for the prediction performs a crucial role in binding interactions between membrane proteins and molecules (like alkanols). An increase in alkanol potency with phenyl substitution is represented by Eq. (10). Although the addition of phenyl moiety causes an increase in potency, a decrease can be seen with increasing chain length. This suggests greater efficiency of increase in chain length over addition of phenyl group. The explanation for such behaviour is possibly correlated with polarizability and hydrogen bond acceptor basicity. As the phenyl addition occurs, an increase in the hydrogen bond acceptor basicity of the molecule takes place, which is attributed to the polarization of a specific part of the molecule. Thus, the formation of hydrogen bonds with other donor molecules becomes facile. Since Na+ channels are known to play a crucial role in mechanism of action potential, obstruction of these channels will result in action potential block. Now, when a high-polarity (and the hydrogen bond acceptor basicity) chemical, i.e. alkanol, surrounds the Na+ channel, there will be an increment in the binding of donor hydrogen bond sites of the Na+ channel to alkanol molecules, thereby resulting in effective action potential block [41].

Table 3 Minimum blocking concentration of miscellaneous alkanols to reduce action potential of frog nerves by 50% (in mM)

Neuromuscular blocking activity in terms of relative biological response for phrenic nerve diaphragm of rabbit

The fourth set presents the practicability of polarizability to predict relative biological response for neuromuscular blocking activity on rabbit nerves. The QSAR model obtained for the biological activity is represented by Eq. (11):

$$ \begin{gathered} {\log} \left( {{\text{RBR}}} \right) = \left( {0.009} \right)\alpha - \left( {3.681} \right) \hfill \\ n \, = { 17}, \, R^{{2}} = \, 0.{825}, \, R^{{2}}_{{{\text{CV}}}} = \, 0.{78}0 \hfill \\ \end{gathered} $$
(11)

The predicted and observed biological activities (in mM) of the aromatic bis-quaternary compounds are presented in Table 4. The compounds selected for the investigation are in good accord with Eq. (11). The value of R2 and R2CV justifies the dependence of the neuromuscular blocking activity on polarizability. Analysis of error bars presents consistency in the data but predicted residual error sum of squares is high implying average predictability. The molecules used in the construction of the model are different from the ones used for validation purpose. The result of internal validation for predictability of the model is satisfactory. The present phenyl alkane p-ω-bis(trialkylammonium) series offers several suitable candidates of pharmacological importance, such as phenyl hexane p-ω-bis-(trimethylammonium iodide) and p-ω-bis(triethylammonium iodide). It is suggested that the block through such compounds occurs via depolarization, which alters the distribution of electric charge in the nerve cells as it enters or binds to the nerve cell. As a result, the transmission of signals is obstructed due to reversal of charges leading to neuromuscular block. However, there might be a possibility of competitive block too. It has also been observed that the replacement of methyl with ethyl groups in the series, in general, enhanced the neuromuscular blocking property. This may be elucidated by the fact that there is an increase in the polarizability on substitution of methyl with ethyl as indicated by our theoretical values. Accordingly, phenyl hexane p-ω-bis-(triethylammonium iodide) may act as a potential neuromuscular blocking drug.

Table 4 Relative biological response for neuromuscular blocking activity of phenyl alkane p-ω-bis(trialkylammonium) compounds to rabbit nerves (in mM)
Table 5 Minimum blocking concentration of pyrethroids to suppress action potential of central nerve cord of cockroach (in M)

Suppression in action potential in terms of minimum blocking concentration for cockroach central nerve cord

In this set, the ability to suppress action potential is analysed using a physicochemical parameter, polarizability, of substituted benzyl pyrethroids in the excised central nerve cord of American cockroach according to Eq. (6). The regression analysis yielded Eq. (12):

$$ \begin{gathered} {\log} \left( {\frac{1}{{{\text{MBC}}}}} \right) = \left( {0.006} \right)\alpha - \left( {4.523} \right) \hfill \\ n \, = { 14}, \, R^{{2}} = \, 0.{896}, \, R^{{2}}_{{{\text{CV}}}} = \, 0.{836} \hfill \\ {\text{Outliers}}:{\text{ NO}}_{{2}} ,{\text{SO}}_{{2}} {\text{Me}} \hfill \\ \end{gathered} $$
(12)

The predicted and observed biological activities (in M) of the pyrethroid derivatives are recorded in Table 5. A nice agreement is highlighted by the series of compounds with Eq. (12). The obtained R2 and R2CV validate the effective correlation between the two sets of data. Lower values of predicted residual error sum of square and uniformity in error bars imply good predictability by the model. The molecules used in the construction of the model are different from the ones used for validation purpose. Internal validation is also carried out by validating the test set which too transpires the predictability. It is well known that every cell in a living tissue is electrically polarized, i.e. maintains a potential difference across its plasma membrane. This potential difference is the effect of an association between ion pumps and ion channels (mainly Na+ and K+). When this membrane potential goes through a rapid rise and fall, an action potential is generated which depolarizes the nerve cell following which other nerve cells also get depolarized. Now, when a highly polar pyrethroid enters the living system, it interacts with the Na+ channel consequently disturbing its mechanism which eventually results in the suppression of action potential. Thus, it is transparent that polarizability plays a very important role in the transmission as well as blocking of nerve impulse.

Toxicological activity in terms of 50% inhibitory growth concentration for T. pyriformis

The sixth set of compounds presents the potential of a broad variety of miscellaneous aliphatic alcohols and esters to display toxicological effect in T. pyriformis. The QSAR model obtained for the 50% inhibitory growth concentration using polarizability is represented by Eq. (13):

$$ \begin{gathered} {\log} \left( {\frac{1}{{{\text{IGC}}_{50} }}} \right) = \left( {0.018} \right)\alpha - \left( {3.379} \right) \hfill \\ n \, = { 122}, \, R^{{2}} = \, 0.{845}, \, R^{{2}}_{{{\text{CV}}}} = \, 0.{838} \hfill \\ \end{gathered} $$
(13)

The predicted and observed biological activities (in mg/l) of the miscellaneous aliphatic alcohols and esters are presented in Table 6. It is seen that the molecules in the set possess heterogeneity in their chemical structures and yet show a close conformity with Eq. (13). High values of R2 and R2CV signify a very impressive correlation between the data sets. Predicted residual error sum of squares and standard error bar analysis presents uniformity and significance of the data. The molecules used in the construction of the model are different from the ones used for validation purpose. Internal validation by the test set transpires the predictability of the model. It is widely acknowledged that toxicity is a consequence of electronic interactions amongst the atoms/molecules of the toxicant and reactive site. Given that polarizability is a property of electronic distribution, it is assumed to play a major role in understanding diverse interactions including toxic interactions. From the analysis, it is observed that higher polarizability leads to lower toxicological activity. It is also evident that bulky molecules are more toxic as compared to the lower counterparts. Further, other factors such as solubility, medium and concentration also influence the extent of toxicity. Nevertheless, it must be noted that not all molecules are toxic. Although the correlation presented here is quite noteworthy which allows prediction of the inhibitory growth concentration of some alcohols and esters with significant accuracy, the theoretical observations with respect to the involved parameter perhaps require to be probed further.

Table 6 Inhibitory growth concentration of miscellaneous alcohols and esters to inhibit toxicity in T. pyriformis by 50% (in mg/l)

Selectivity in terms of binding affinity for the enzyme thrombin

The last set presents the potential of 3-amidinophenylalanine-derived inhibitors to bind with thrombin in order to elucidate their selectivity. The binding affinity data used in this set are taken from reference [39] and are a well-known benchmark data set [42]. The QSAR model developed for the binding affinity using polarizability (α) together with compressibility (β), another important atomic and molecular descriptor [34], is represented by Eq. (14):

$$ \begin{gathered} {\log} \left( {\frac{1}{{K_{i} }}} \right) = \left( {0.019} \right)\alpha + \left( {0.006} \right)\beta - \left( {0.00001} \right)\alpha \beta - 4.65 \hfill \\ n \, = { 88}, \, R^{{2}} = \, 0.{645}, \, R^{{2}}_{{{\text{CV}}}} = \, 0.{623} \hfill \\ {\text{Outliers}}{:}{\text{ Compound 53}},{\text{ Compound 58}},{\text{ Compound 69}},{\text{ Compound 88}} \hfill \\ \end{gathered} $$
(14)

The predicted and observed biological activities (in mol/l) of various 3-amidinophenylalanine-derived inhibitors are presented in Table 7. A more or less acceptable conformity with Eq. (14) is highlighted by the series of compounds. The obtained R2 and R2CV are satisfactory for validating the useful correlation between the two sets of data. Lower predicted residual error sum of squares and standard error bar analysis presents reliability and significance of the data to some extent. The molecules used in the construction of the model are different from the ones used for validation purpose. Internal validation by the test set transpires the predictability of the model. It appears that an increase in polarizability leads to an increase in the binding affinity of inhibitors to thrombin, in general. This is acceptable since higher polarizability favours interaction with potential molecules. Further, it is expected that replacement of simple groups by sterically extensive groups should enhance affinity.

Table 7 Binding affinity of 3-amidinophenylalanine derivatives towards thrombin to predict selectivity differences (in mol/l)

In order to establish the potential of polarizability in explaining chemical–biological interactions, another model has been constructed for the same set which considers only compressibility and excludes polarizability. The model is represented by Eq. (15):

$$ \begin{gathered} {\log} \left( {\frac{1}{{K_{i} }}} \right) = \left( {0.004} \right)\beta + 1.722 \hfill \\ n = {88}, \, R^{{2}} = 0.{534}, \, R^{{2}}_{{{\text{CV}}}} = 0.{5}0{8} \hfill \\ {\text{Outliers}}{:}{\text{ Compound 53}},{\text{ Compound 58}},{\text{ Compound 88}}. \hfill \\ \end{gathered} $$
(15)

The predicted binding affinities (in mol/l) of various 3-amidinophenylalanine-derived inhibitors based on compressibility are also presented in Table 7. The molecules used in the construction of the model are the different from the ones used for validation purpose. On comparing the models based on polarizability/compressibility (Eq. 14) and just compressibility (Eq. 15), it is observed that the polarizability/compressibility-based model is more potent. Comparatively higher values of R2 and R2CV are evident for polarizability-dependent model signifying a superior correlation between the data sets than the model excluding it. Further, predicted residual error sum of squares and standard error bar analysis transpires fine uniformity and significance of the data in the case of multi-descriptor model. Similarly, internal validation for the two models also justifies the ability of polarizability to act as a suitable descriptor for chemical–biological interaction modelling. It is recommended that multi-descriptor models are constructed as they offer accurate explanation of the occurring phenomenon due to the incorporation of wide range of effects. For this purpose, reference [43] can be used to calculate a number of descriptors.

A comparison of the predictive potential (R2pred) of the present polarizability-based MLR methodology with the predictive potentials of 14 QSAR methodologies for thrombin models reported in the literature is presented in Table 8 [15, 16, 42, 44,45,46]. It is evident from Table 8 that the present MLR model based on polarizability is satisfactory, although not the finest, amongst the data sets considered. However, owing to the simplicity, cost-effectiveness and hastiness of the present model, it is clear that the obtained results are rather useful to get a preview of structure and activity relationships.

Table 8 Comparison of the predictive potential (R2pred) of the present polarizability-based methodology for thrombin models with respect to those obtained by 14 QSAR methodologies established in the literature

The outcomes of this investigation undoubtedly reveal the importance of Tandon et al.’s model of polarizability in the real field. A striking correlation is presented between the predicted and observed biological activities. The reliability and predictability of the newly designed QSAR model are apparent from the statistics for every set. Moreover, it is heartening to note that the predicted biological activities are numerically comparable to those approximated by other complex techniques. Nevertheless, the above QSAR corresponds to only a small fraction of QSARs. An amazing element of our analysis is the diversity of the groups of chemicals studied.

Conclusion

A study is performed to investigate the application of recently proposed model of polarizability by Tandon et al. in the real field by determining molecular polarizability through additive property. A comprehensive chemical–biological interaction analysis has been carried out for preferred 312 compounds using molecular polarizability as a predictor to estimate their biological activity. An exceptional resemblance is found between the predicted and observed activities. The usefulness of this descriptor to predict the biological activity is revealed by superior correlations. All the principles of OECD are nicely followed in our study. Therefore, it is clear from the results that Tandon et al.’s polarizability can be suitably implemented as a molecular descriptor in real fields such as biological activity estimation for anaesthetics, narcotics, neurophysiological and other similar drugs. Species possessing lone pair of electrons offer future prospects for polarizability. Thus, further investigations based on such potent compounds must be carried out as they hold immense significance in this direction. Finally, it is suggested for future explorations that the use of descriptor polarizability should be contrasted with some other drug-like descriptors as well.