Introduction

Proto-oncogene serine/threonine-protein kinase (Pim-1) is an enzyme that is encoded by the Pim-1 gene [1, 2]. Pim proteins are of three forms Pim-1, Pim-2, and Pim-3, which regulate various signaling pathways in the development of cancer. [3] Pim kinases are often activated in various downstream substrates that are thought to contribute to tumor growth and survival in both hematologic and solid cancers [4]. The Pim-1 oncogene is identified as being the locus most frequently triggered by the Moloney murine leukemia virus in relation to murine T cell lymphomas [5].

Pim isoforms have different levels of expression and distinct roles. All three Pim genes in cancer, there is compelling evidence of a compensatory mechanism [6,7,8,9].

Adenosine triphosphate-competitive Pim inhibitors, SGI-1776, AZD1208, TP-36544, and LGH447 are in clinical trials for the treatment of acute prostate cancer, myelogenous leukemia, and lymphoma (Fig. 1) [10].

Indole derivatives are widely studied for their kinase inhibitory activity as reported in literature. [11,12,13,14] The novel series of meridianin C derivatives reported in the literature [15] and have shown strong inhibitory activities against Pim-1 and Pim-3 kinases that encouraged us to study 3D QSAR of these compounds.

3, 5-disubstituted indole derivatives as Pim kinase inhibitors were selected for 3D QSAR study because authors noted that the electronic properties of the substituents on the phenyl ring did not have any significant effect on the inhibitory function of compounds [15] and the 3D QSAR model can reveal features influencing the activity. It was also stated by the author that the inhibitory activity, in particular for Pim-2 kinase, depended on the position of the substituent [15].

Indole derivatives, specifically 3,5 substituted and meridianin C derivatives have been in focus as Pim kinase inhibitors [11, 16,17,18]. The development of the QSAR model can direct further lead optimization in Pim Kinase inhibitor’s discovery by studying their features viz. steric, electrostatic, hydrophobic, H-bond Acceptor, and H-bond Donor in three-dimensional space. The aim of the study was to disclose the minimum structural requirements for Pim kinase inhibitors based on the indole nucleus that may guide the rational design of potent compounds.

In the present work, the pharmacophore and 3D-QSAR models of 3, 5-disubstituted indole derivatives were generated and validated using parameters like R2, Q2, RMSE, Pearson R, and F value. Also ‘Drug likenesses and ADME properties of molecules were evaluated.

Materials and method

All the molecular modeling work was carried out using the Schrodinger Suite [19, 20].

Selection of ligand library for developing 3D-QSAR studies

All the selected 3,5-disubstituted indole derivatives were taken from the literature [15] (Table 1). The structures of ligands were drawn using 2D sketcher of Schrödinger maestro 2019–3. Structures were optimized for using LigPrep modulea where the structures converted to 3D and energy-optimized using OPLS2005. Least energy conformation was generated for ligands at pH 7.2 using Epik, in the LigPrep module with all other default options.

Fig. 1
figure 1

Pim kinase inhibitors in clinical trials [10]

Table 1 Structure and their ‘drug-likeness’ properties

Data set for modeling the atom-based and field based 3D QSAR studies

For developing an atom and field-based QSAR model, the data set comprised of 40 compounds was selected. All the models were developed through a random selection of training and test set into ratio, 70:30%, respectively, by PHASE moduleb.

Pharmacophore modeling

A minimum of four to a maximum of five pharmacophore features were selected for the development of the pharmacophore hypothesis. A hypothesis difference criterion was kept at 0.50 defaults. There were 8 actives and 32 inactive. Among the hypotheses developed, the top-ranked hypothesis was selected on the basis of survival score.

3D QSAR

All ligands were superimposed upon each other by utilizing the Flexible ligand alignment from the maestro. The assignment of training and test set molecules was totally performed by random splitting into a ratio of 70:30% by software, i.e., 28 training set and 12 test set molecules. The QSAR models were developed using PLS factor of 4 for atom-based and field-based QSAR models development. Grids spacing were kept as 1 Å. Gaussian based field style was utilized for field-based models which include five features like Gaussian Steric, Gaussian Electrostatic, Gaussian Hydrophobic, Gaussian H-bond Acceptor, and Gaussian H-bond Donor. In the building of field-based models, the steric force field, and electrostatic force fields were kept at 30.0 kcal/mol.

Absorption, distribution, metabolism, and excretion (ADME)

ADME predictions were carried out using the QikProp module.c The pharmacologically relevant characteristics of many organic molecules were predicted by the QikProp module.c ADME properties like CYP2C19 inhibition, CaCo2, CYP2C9 inhibition, CYP3A4 inhibition, CYP3A4 substrate, and percent Plasma protein binding were investigated in PreADMET online server (https://preadmet.bmdrc.kr/) (Scheme 1).

Scheme 1
scheme 1

Flowchart for the strategy used in the study.

Fig. 2
figure 2

Pharmacophore model DHRRR_1 for PIM kinase inhibitors a: Model with distances among the features, b1: most active ligand 40, and b2: least active ligand 37. H-bond donors (D)- Blue, hydrophobic (H)-Green, and aromatic rings (R)-Orange

Result and discussion

Determination of pharmacophore

The pharmacophore models were developed on energy optimized and aligned structures of molecules. A common pharmacophore models were generated for Pim-1 and Pim-3 kinase inhibitors by selecting top 10 most active molecules based Pim-1 and Pim-3 kinase inhibitory potential and out of those molecules, 8 compounds (3, 12, 17, 31, 32, 35, 38, and 40) were found common in both the selected sets. These 8 molecules were set as active in the generation of pharmacophore models. Pharmacophore models were developed and hypotheses were studied. The selected top-ranked five-point hypothesis DHRRR_1 based on a survival score of 5.6678, site score of 0.7699 (Fig. 2). All the active molecules were aligned on selected pharmacophore DHRRR_1 (Fig. 3).

Fig. 3
figure 3

Markush structure of compounds

Structures and their drug-likeness properties

All the molecules were subjected to in silico ‘drug-likeness’ analysis as suggested by CA Lipinski [21]. Almost all molecules were found within acceptable ‘drug-likeness’ parameters or ‘Rule of five’ except few molecules which are failed to satisfy drug-likeness because of their predicted logP values (Table 1).

3D QSAR

Predictive 3D QSAR models were developed and validated by various software-generated statistical parameters. Standard Deviation (SD) should not be more than the standard deviation of actual activities. The model to considered predictive when R2 for the regression (the coefficient of determination) is above 0.6 and Q2 (Q-squared Value is directly analogous to R-squared, but based on the test set predictions) is above 0.5. The difference between R2 and Q2 is accepted when it is not more than 0.3. F that is the ratio of the model variance to the observed activity variance value should be large. P (probability of error) should be as low as possible. RMSE (Root-mean-square error in the test set predictions) should be close to zero for a model to be predictive. Pearson R (value for the correlation between the predicted and observed activity for the test set) with a value close to 1 indicates less scattered predicted activities in the graph of actual vs. predicted activities [22,23,24].

The significance of each influencing factor was studied by evaluating their contribution as mentioned in contribution charts. Plots of actual vs. Predicted activities reflect the predictivity of the models (Fig. 4).

Fig. 4
figure 4

Actual vs. predicted activity of atom-based QSAR model for Pim-1 kinase inhibition ((PLS 1)

Fig. 5
figure 5

Atom-based 3D QSAR maps for Pim-1 kinase inhibition

Atom-based QSAR for Pim-1 kinase inhibitors

The atom-based 3D QSAR model was developed in order to study structural requirements for Pim-1 kinase inhibition (Fig. 5). The atom-based QSAR, the maps of occlusion, are either blue or red or both in the form of color cubes. The increases in activity were well associated with blue occlusion cubes in the atom-based models; while the red cubes associated with decrease in activtty. It was clearly observed that the region associated with phenyl ring directly attached with indole shows the blue colored cubes indicating that electron-withdrawing functionalities favor the activities where the region of phenyl ring attached to pyrazine shows both red and blue maps indicating that the further electron-withdrawing substituents may favor/disfavor the activity (1A and 1B). The region around the substituents of phenyl ring directly attached with indole shows the blue color for the hydrogen bond donor region which indicates favorable for activity (2A and 2B). The hydrophobic substituents were well associated with the decrement in activity as seen in the red contours around the phenyl ring directly attached to indole and incremental biological activity with longer alkyl chains on phenyl ring attached to pyrazine (3A and 3B) (Table 2, 3, and 4).

Fig. 6
figure 6

Actual vs. predicted activity of atom-based QSAR model for Pim-3 kinase inhibition (PLS 1)

Fig. 7
figure 7

Atom-based 3D QSAR maps for Pim-3 kinase inhibition

Table 2 Actual and predicted activities for atom-based QSAR for Pim-1 kinase inhibitors
Table 3 Statistical parameters of atom-based QSAR model for Pim-1 kinase inhibition
Table 4 Contribution factors of atom-based QSAR model for Pim-1 kinase inhibition

Atom-based QSAR for PIM 3 kinase inhibitors

The atom-based 3D QSAR model was developed in order to study structural requirements for Pim-1 kinase inhibition (Figs. 6 and 7). It was clearly observed that the region associated with phenyl ring directly attached with indole shows the blue colored cubes indicating that electron-withdrawing functionalities favor the activities where the region of phenyl ring attached to pyrazine shows both red and blue maps indicating that the further electron-withdrawing substituents may favor/disfavor the activity and the electron-withdrawing group on alkyl chain favors the activity (4A and 4B). The region around the substituents of phenyl ring attached with pyrazine shows mixed region blue/red color for the hydrogen bond donor region which indicates favorable/disfavourable for activity (5A and 5B) (Fig. 8). The hydrophobic substituents were well associated with the decrement in biological activity as seen in the red contours around the phenyl ring directly attached to indole and mixed biological activity for substituents with phenyl ring attached to pyrazine (6A and 6C) (Table 5, 6 and 7).

Fig. 8
figure 8

Actual vs predicted activity of field-based QSAR model for Pim-3 kinase inhibition

Table 5 Actual and predicted activities for atom-based QSAR for Pim-3 kinase inhibitors
Table 6 Statistical parameters of atom-based QSAR model for Pim-3 kinase inhibition
Table 7 Contribution factors of atom-based QSAR model for Pim-3 kinase inhibition
Table 8 Actual and predicted activities for Field-based QSAR for Pim-1 kinase inhibitors

Field-based QSAR of Pim-1 kinase inhibitors

The field-based 3D QSAR model was developed in order to study structural requirements for Pim-1 kinase inhibition (Fig. 9). The electrostatic contours (1P) associated with Gaussian electrostatic fields around shows mixed blue/red colored region around the substituents of attached to pyrazine ring may favors/disfavours the activity. The substituent region shown by red color around phenyl ring attached to indole moiety disfavours the activity with electrostatic groups. H-bond acceptors around phenyl ring attached to indole ring shown with red color suggest favored electropositive features and the magenta region around substituents close to pyrazine indicated disfavoured region (1Q). The contours for hydrogen bond donor shows cyan region around the phenyl ring attached to indole rings indicated disfavourable for activity and substituents close to pyrazine ring with violet-colored polyhedron indicated favorable for activity, also it should be taken into consideration that H-bond donor contribution in this model is minor (1R). The green contour for steric groups indicated favorable at the terminus substituents attached to pyrazine ring but also show disfavouable around either side of substituents on phenyl ring attached to the indole structure (1 T). The yellow contour for hydrophobic groups indicated favorable at the terminus and close to pyrazine ring but also show disfavouable for substituents on phenyl ring attached to indole structure (1S). The contribution chart indicated the high impact of steric and hydrophobic regions. The statistical parameters and field fractions for the developed field-based models were tabulated and also revealed good results for Q2 and R2 (Table 8, 9, and 10).

Fig. 9
figure 9

Field-based for Pim-1 kinase activity

Table 9 Statistical parameters of field-based QSAR model of Pim-1 inhibition

Field-based for Pim-3 kinase inhibitors

The field-based 3D QSAR model was developed in order to study structural requirements for Pim-3 kinase inhibition (Figs. 10 and 11). The contours associated with Gaussian electrostatic fields around shown with blue/red region around the phenyl ring attached to indole suggests favorable/disfavourable for the activity. Terminus and close proximity substituent at nitrogen attached to pyrazine ring showed by blue color contour suggest favorable region for the activity with electrostatic groups where red contour suggests disfavourable for activity (2P). For Hydrogen bond acceptors magenta colored contour around close proximity of pyrazine ring disfavors the activity where the red colored polyhedrons around terminus substituents attached to pyrazine and phenyl ring attached to indole are suggesting favored region (2Q). The contours for hydrogen bond donor shown violet colored region around the phenyl ring attached to indole ring suggests favorable for activity. The contours for hydrogen bond donor shown cyan/violet colored counters around the pyrazine ring suggests favorable/disfavourable for activity (2R). Contours for steric groups suggest disfavourable for activity around phenyl ring attached to indole and substituents close to pyrazine ring. The contour for steric groups found favorable around terminus substituents of pyrazine ring (2 T).The yellow contour of hydrophobic groups close to pyrazine ring favors activity. The white colored contour of hydrophobic groups around phenyl ring attached to indole and terminus region of pyrazine substituents suggest disfavor to activity (2S). The contribution chart suggests the high impact of steric regions also considerable impact on the hydrophobic and electrostatic region (Table 13). The statistical parameters and field fractions for the developed field-based models were tabulated and also revealed good results for Q2 and R2 (Table 11, 12, and 13).

Fig. 10
figure 10

Actual vs predicted activity of field-based QSAR model for Pim-3 kinase inhibition

Fig. 11
figure 11

Field-based for Pim-3 kinase activity

Table 10 Contribution of factors in field-based QSAR model for Pim-1 inhibition
Table 11 Actual and predicted activities for Field-based QSAR for Pim-3 kinase inhibitors
Table 12 Statistical parameters of field-based QSAR model of Pim-3 inhibition
Table 13 Contribution of factors in field-based QSAR model for Pim-3 inhibition
Table 14 Statistical quality parameter for biological activity distribution in test and training set

Statistical quality of models

The QSAR models are considered predictive when R2 for the regression (the coefficient of determination) is above 0.6 and RMSE should be close to zero or less than 0.5. All the models have shown RMSE value much below 0.5 and almost all the models have shown r2 values above 0.6 that indicates the models are highly predictive (Table 14).

Biological activity distribution in test and training sets

The minimum Pim-1 and Pim-3 inhibition (Biological activity) of the test set were greater than or equal to the minimum activity of the training set, and the maximum activity of the test set was less than the maximum activity of the training set for all the developed models. This indicates that the test sets are within the activity domain of the training set. A higher mean value of the test set than of the training set indicates the presence of relatively more potent compounds, as compared to inactive ones (Table 15) [23].

Table 15 Statistical parameter for biological activity distribution in test and training sets

In silico ADMET

CaCo2 (gut-blood barrier) and Madin-Darby Canine Kidney (MDCK) cell permeability is considered Low if the value was less than 4, average permeability if the value was within 4~70 and High permeability if the value was more than 70 and almost all the compounds found Highly permeable. BBB is a blood-brain barrier permeability for drugs and acceptable Compounds considered CNS active if the value of BBB is more than 1. All Compounds were found CNS inactive as per In Silico predictions. Also predicted Percent Human Oral Absorption found 65–100% for all the compounds. Percent of human intestinal absorption (% HIA) also found excellent which was around 88–94% (Table 16).

Table 16 ADME parameter of compounds

Conclusion

Developed field-based and atom-based QSAR models for 3, 5-disubstituted indole derivatives were found predictive for their Pim kinase inhibitory activities. The developed models were validated by using software-generated parameter like R2, Q2, RMSE and Pearson R. The results of the ligand-based pharmacophore hypothesis and atom/field-based 3D-QSAR revealed detailed structural insights of the novel of 3,5-disubstituted indole derivatives as Pim kinase inhibitors which could be guidance for the rational design of novel potent indole derivatives for PIM kinase inhibitory activity.

Notes

aLigPrep, Schrödinger, LLC, New York, NY, 2017.

bPHASE, Schrödinger, LLC, New York, NY, 2017.

cQikProp, Schrödinger, LLC, New York, NY, 2017.